Claude Code
Anthropic · Launched Feb 2025
Anthropic's terminal-native coding agent. With Opus 4.8 it scores Terminal-Bench 2.1 78.9%, SWE-bench Pro 69.2%, SWE-bench Verified 88.6%.
Benchmark performance
OpenAI-verified 500-issue subset of SWE-Bench. Approaching saturation in 2026 - most frontier models clear 80%+.
Held-out, contamination-resistant CLI tasks driven end-to-end in a real terminal. Version 2.1 is the 2026 standard for terminal autonomy.
Harder, contamination-resistant successor to SWE-Bench Verified: real GitHub issues with held-out tests. Where coding headroom remains.
Other coding-focused agents
The 12 agents in this category, ranked by peak benchmark.
| Agent | Maker | Launch | Peak | Pricing |
|---|---|---|---|---|
| Kimi K2.5OSS | Moonshot AI | 2026-01 | 1410.0 | Open weights |
| Claude Fable 5 | Anthropic | 2026-05 | 95.0 | $10 / $50 per M tokens |
| Kimi K2.6OSS | Moonshot AI | 2026-04 | 89.6 | Open weights |
| Codex CLI | OpenAI | 2025-04 | 83.4 | ChatGPT / API |
| SWE-AgentOSS | Princeton + Stanford | 2024-04 | 74.0 | Open source (MIT) |
| Gemini CLIOSS | 2025-06 | 70.7 | Free tier + API | |
| GLM-5.1OSS | Z.ai | 2026-04 | 58.4 | Open weights |
| Cursor Agent | Anysphere | 2025-05 | — | Cursor Pro $20/mo |
| Lovable | Lovable | 2024-11 | — | Freemium |
| OpenCodeOSS | OpenCode | 2025-06 | — | Open source |
Recent coverage
2026-06-11
OpenAI Acquires Cloud Startup Ona to Power Agent Infrastructure
2026-06-10
Claude Code Digest — Jun 07–Jun 10
2026-06-09
Dual-Track Development: How Claude Code Teams Ship 3x Faster with
2026-06-09
/loop in Claude Code: How to Build Multi-Agent Workflows Without Leaving
2026-06-09
Stop Telling Claude to 'Be Careful' — 3 Hook Tools That Fix What Prompts Can't
2026-06-08
How to Sandbox Claude Code with BitLocker+VMs for Secure Enterprise Use
2026-06-08
Claude Code Generates Production Lottie Animations via Show HN
2026-06-08
Build Durable Jira Automation with MCP + Temporal
Quick facts
- Type
- Coding-focused
- Maker
- Anthropic
- Launch
- 2025-02-24
- Open source
- No
- Pricing
- Claude Max / API
- Benchmarks scored
- 3
- Article mentions
- 741
- Rank in category
- #4 of 12