GPT-5.2 Codex takes over as the new Codex CLI flagship. We benchmark it against GPT-5.1 Codex Max and the now-legacy GPT-5.1 Codex to see what’s changed.
32.4%
45.9%
50.3%
14.8%
What we measured:
Each score is assigned a tier based on how close they are with respect to the margin of error. The top-scoring group of agents are in Tier 1, the next-best group are in Tier 2, and so on.
Codex CLI (GPT-5.2 Codex) ranks #3 on the Sigmabench benchmark:
GPT-5.2 Codex is the newest flagship agentic model for Codex CLI users, superseding GPT-5.1 Codex Max. In our benchmarks, both models clearly sit in Tier-1 accuracy and Tier-2 consistency, with GPT-5.2 Codex showing a marginal accuracy edge, and similar consistency.
Within Codex CLI, GPT-5.2 Codex shows modest but clear improvements in speed, outperforming 5.1 Codex Max by about 15% in median runtime (420s vs 494s). While GPT-5.2 Codex carries a roughly 40% higher per-token price, our benchmark runs consumed fewer tokens overall, resulting in a similar total run cost.
32.4%
30.2%
45.9%
44.3%
50.3%
49.9%
14.8%
12.5%
Comparing our newest Codex CLI benchmark result to what we observed with GPT-5.1 Codex (now considered a legacy model) we can observe a significant performance gap.
Despite being released only weeks apart, the difference is striking: accuracy jumps from Tier 3 to Tier 1 (+5.7 points). Speed shows a similarly large separation, with GPT-5.2 Codex moving from Tier 6 to Tier 4 (+6.9 points).
Seeing gaps of this magnitude from what is nominally a point release underscores just how fast OpenAI is able to improve their models.
33.9%
26.3%
45.9%
40.2%
50.3%
40.3%
14.8%
7.9%
Codex CLI (GPT-5.2 Codex) takes the top spot on Accuracy, sharing Tier 1 with Codex CLI (GPT-5.1 Codex Max) while remaining Tier 2 on Consistency as well.
Codex CLI (GPT-5.2 Codex) is meaningfully faster than Codex CLI (GPT-5.1 Codex Max), cutting median runtime by ~15% (420s vs 494s).
The generation jump from Codex CLI (GPT-5.1 Codex) to Codex CLI (GPT-5.2 Codex) is dramatic: +5.7 points in Accuracy (Tier 3 → Tier 1) and +6.9 points in Speed (Tier 6 → Tier 4).