AI coding model comparisons
Every popular "X vs Y" matchup, compared on SWE-bench, HumanEval, MMLU, context window, and price — with a clear verdict on each. Want a matchup that isn't here? Build any comparison live.
GPT-5.1 vs Gemini 3 Pro
Benchmarks, context window, and price compared.
GPT-5.1 vs GPT-5.1 Codex
Benchmarks, context window, and price compared.
GPT-5.1 vs GPT-5.5
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Gemini 3 Pro
Benchmarks, context window, and price compared.
Gemini 3 Pro vs GPT-5.5
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs GPT-5.5
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs GPT-5.1
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Gemini 3 Pro
Benchmarks, context window, and price compared.
GPT-5.1 vs Grok 5
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Grok 5
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs GPT-5.1 Codex
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs GPT-5.5
Benchmarks, context window, and price compared.
GPT-5.1 vs o3
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Grok 5
Benchmarks, context window, and price compared.
Gemini 3 Pro vs o3
Benchmarks, context window, and price compared.
GPT-5.1 vs Kimi K3
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs o3
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Kimi K3
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Kimi K3
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Grok 5
Benchmarks, context window, and price compared.
GPT-5.1 vs Qwen 3 Max
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Qwen 3 Max
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs o3
Benchmarks, context window, and price compared.
GPT-5.1 vs Mistral Large 4
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Qwen 3 Max
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Mistral Large 4
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Kimi K3
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Mistral Large 4
Benchmarks, context window, and price compared.
GPT-5.1 vs Grok Code Fast 2
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Grok Code Fast 2
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs GPT-5.1
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Gemini 3 Pro
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Qwen 3 Max
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Grok Code Fast 2
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs GPT-5.1 Codex
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs GPT-5.5
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Mistral Large 4
Benchmarks, context window, and price compared.
GPT-5.1 vs Gemini 2.5 Pro
Benchmarks, context window, and price compared.
GPT-5.1 vs DeepSeek V4
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Gemini 2.5 Pro
Benchmarks, context window, and price compared.
Gemini 3 Pro vs DeepSeek V4
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Gemini 2.5 Pro
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs DeepSeek V4
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Grok Code Fast 2
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Claude Opus 4.7
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Grok 5
Benchmarks, context window, and price compared.
GPT-5.1 vs Qwen 3 Coder Next
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Qwen 3 Coder Next
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs o3
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Gemini 2.5 Pro
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs DeepSeek V4
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Qwen 3 Coder Next
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Kimi K3
Benchmarks, context window, and price compared.
GPT-5.1 vs GLM 5.1
Benchmarks, context window, and price compared.
Gemini 3 Pro vs GLM 5.1
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs GLM 5.1
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Qwen 3 Max
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Qwen 3 Coder Next
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Mistral Large 4
Benchmarks, context window, and price compared.
GPT-5.1 vs Claude Haiku 4.5 (Fast)
Benchmarks, context window, and price compared.
GPT-5.1 vs DeepSeek R2
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Claude Haiku 4.5 (Fast)
Benchmarks, context window, and price compared.
Gemini 3 Pro vs DeepSeek R2
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs GLM 5.1
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Claude Haiku 4.5 (Fast)
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs DeepSeek R2
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Grok Code Fast 2
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Gemini 2.5 Pro
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs DeepSeek V4
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Claude Haiku 4.5 (Fast)
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs DeepSeek R2
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Qwen 3 Coder Next
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs GLM 5.1
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Claude Haiku 4.5 (Fast)
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs DeepSeek R2
Benchmarks, context window, and price compared.
GPT-5.1 vs Gemini 3 Flash
Benchmarks, context window, and price compared.
Gemini 3 Pro vs Gemini 3 Flash
Benchmarks, context window, and price compared.
GPT-5.1 Codex vs Gemini 3 Flash
Benchmarks, context window, and price compared.
Claude Opus 4.7 vs Gemini 3 Flash
Benchmarks, context window, and price compared.
Claude Sonnet 4.6 vs Gemini 3 Flash
Benchmarks, context window, and price compared.
Prefer a ranked view? See every model on one leaderboard.
Open the leaderboard →