Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…
Coding-focused#4 of 12 in category

Claude Code

Anthropic · Launched Feb 2025

Anthropic's terminal-native coding agent. With Opus 4.8 it scores Terminal-Bench 2.1 78.9%, SWE-bench Pro 69.2%, SWE-bench Verified 88.6%.

Visit Claude CodeClaude Max / API
3
Benchmarks scored
88.6
Peak score
741
Article mentions
No
Open source

Benchmark performance

SWE-Bench Verified

OpenAI-verified 500-issue subset of SWE-Bench. Approaching saturation in 2026 - most frontier models clear 80%+.

88.6
Gap to SOTA: -6.4pp (held by Claude Fable 5)Benchmark docs →
Terminal-Bench 2.1

Held-out, contamination-resistant CLI tasks driven end-to-end in a real terminal. Version 2.1 is the 2026 standard for terminal autonomy.

78.9
Gap to SOTA: -4.5pp (held by Codex CLI (GPT-5.5))Benchmark docs →
SWE-Bench Pro

Harder, contamination-resistant successor to SWE-Bench Verified: real GitHub issues with held-out tests. Where coding headroom remains.

69.2
Gap to SOTA: -0.0pp (held by Claude Opus 4.8)Benchmark docs →

Other coding-focused agents

The 12 agents in this category, ranked by peak benchmark.

AgentMakerLaunchPeakPricing
Kimi K2.5OSSMoonshot AI2026-011410.0Open weights
Claude Fable 5Anthropic2026-0595.0$10 / $50 per M tokens
Kimi K2.6OSSMoonshot AI2026-0489.6Open weights
Codex CLIOpenAI2025-0483.4ChatGPT / API
SWE-AgentOSSPrinceton + Stanford2024-0474.0Open source (MIT)
Gemini CLIOSSGoogle2025-0670.7Free tier + API
GLM-5.1OSSZ.ai2026-0458.4Open weights
Cursor AgentAnysphere2025-05Cursor Pro $20/mo
LovableLovable2024-11Freemium
OpenCodeOSSOpenCode2025-06Open source

Recent coverage

Quick facts

Type
Coding-focused
Maker
Anthropic
Launch
2025-02-24
Open source
No
Pricing
Claude Max / API
Benchmarks scored
3
Article mentions
741
Rank in category
#4 of 12