gentic.news — AI News Intelligence Platform
Coding-focused · #5 of 12 in category

Codex CLI

OpenAI · Launched Apr 2025

OpenAI's terminal coding agent. With GPT-5.5 it leads Terminal-Bench 2.1 at 83.4%.

Pricing: ChatGPT / API
Benchmarks scored: 2
Peak score: 83.4
Article mentions: 9
Open source: No

Benchmark performance

Terminal-Bench 2.1

Held-out, contamination-resistant CLI tasks driven end-to-end in a real terminal. Version 2.1 is the 2026 standard for terminal autonomy.

Score: 83.4
Gap to SOTA: 0.0pp (held by Codex CLI (GPT-5.5))

SWE-Bench Verified

OpenAI-verified 500-issue subset of SWE-Bench. Approaching saturation in 2026: most frontier models clear 80%.

Score: 82.6
Gap to SOTA: -12.4pp (held by Claude Fable 5)
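
The "Gap to SOTA" figures above appear to be the agent's score minus the best recorded score on the same benchmark, in percentage points. The page does not state the definition, so the sketch below assumes it. The function name `gap_to_sota` is illustrative, and the 95.0 SOTA value for SWE-Bench Verified is inferred from the listed 82.6 score and -12.4pp gap.

```python
def gap_to_sota(score: float, sota: float) -> float:
    """Return score minus SOTA in percentage points, rounded to one decimal.

    Assumed definition: the page shows the gap but not how it is computed.
    """
    return round(score - sota, 1)


# Terminal-Bench 2.1: Codex CLI (GPT-5.5) holds the top score, so the gap is zero.
terminal_bench_gap = gap_to_sota(83.4, 83.4)

# SWE-Bench Verified: 95.0 is inferred from the listed score and gap.
swe_bench_gap = gap_to_sota(82.6, 95.0)

print(f"Terminal-Bench 2.1: {terminal_bench_gap:+.1f}pp")
print(f"SWE-Bench Verified: {swe_bench_gap:+.1f}pp")
```

Rounding to one decimal keeps floating-point noise out of the displayed value.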

Other coding-focused agents

The 12 agents in this category, ranked by peak benchmark score.

Agent             Maker                 Launch   Peak    Pricing
Kimi K2.5 (OSS)   Moonshot AI           2026-01  1410.0  Open weights
Claude Fable 5    Anthropic             2026-05  95.0    $10 / $50 per M tokens
Kimi K2.6 (OSS)   Moonshot AI           2026-04  89.6    Open weights
Claude Code       Anthropic             2025-02  88.6    Claude Max / API
SWE-Agent (OSS)   Princeton + Stanford  2024-04  74.0    Open source (MIT)
Gemini CLI (OSS)  Google                2025-06  70.7    Free tier + API
GLM-5.1 (OSS)     Z.ai                  2026-04  58.4    Open weights
Cursor Agent      Anysphere             2025-05  -       Cursor Pro $20/mo
Lovable           Lovable               2024-11  -       Freemium
OpenCode (OSS)    OpenCode              2025-06  -       Open source
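
The "#5 of 12" rank can be reproduced from the peak scores listed on this page. The sketch below assumes the ranking is a plain descending sort on the raw peak value, with unscored agents placed last; the page does not document its ranking method. The helper name `rank_by_peak` is illustrative, and only the 11 agents visible on this page (the 10 listed plus Codex CLI) are included.

```python
from typing import Optional

# (agent, peak score) pairs copied from this page; None means no peak is listed.
peaks: list[tuple[str, Optional[float]]] = [
    ("Codex CLI", 83.4),
    ("Kimi K2.5", 1410.0),
    ("Claude Fable 5", 95.0),
    ("Kimi K2.6", 89.6),
    ("Claude Code", 88.6),
    ("SWE-Agent", 74.0),
    ("Gemini CLI", 70.7),
    ("GLM-5.1", 58.4),
    ("Cursor Agent", None),
    ("Lovable", None),
    ("OpenCode", None),
]


def rank_by_peak(entries):
    """Sort by peak score, highest first; entries without a score go last."""
    return sorted(entries, key=lambda e: (e[1] is None, -(e[1] or 0.0)))


ranked = rank_by_peak(peaks)
codex_rank = [name for name, _ in ranked].index("Codex CLI") + 1
print(f"Codex CLI rank by peak: #{codex_rank}")
```

The sort compares raw values, so the 1410.0 entry sits above the scores in the 0 to 100 range. The page does not say which benchmark each peak comes from, so the peaks may not share a scale.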

Quick facts

Type: Coding-focused
Maker: OpenAI
Launch: 2025-04-16
Open source: No
Pricing: ChatGPT / API
Benchmarks scored: 2
Article mentions: 9
Rank in category: #5 of 12