Coding-focusedOpen Source#8 of 12 in category
GLM-5.1
Z.ai · Launched Apr 2026
Z.ai's 754B-param MoE; SWE-bench Pro 58.4% and a 1,530 Code Arena Elo (3rd globally on agentic web development).
Visit GLM-5.1 →Open weights
1
Benchmarks scored
58.4
Peak score
5
Article mentions
Yes
Open source
Benchmark performance
SWE-Bench Pro
Harder, contamination-resistant successor to SWE-Bench Verified: real GitHub issues with held-out tests. Where coding headroom remains.
58.4
Gap to SOTA: -10.8pp (held by Claude Opus 4.8)Benchmark docs →
Other coding-focused agents
The 12 agents in this category, ranked by peak benchmark.
| Agent | Maker | Launch | Peak | Pricing |
|---|---|---|---|---|
| Kimi K2.5OSS | Moonshot AI | 2026-01 | 1410.0 | Open weights |
| Claude Fable 5 | Anthropic | 2026-05 | 95.0 | $10 / $50 per M tokens |
| Kimi K2.6OSS | Moonshot AI | 2026-04 | 89.6 | Open weights |
| Claude Code | Anthropic | 2025-02 | 88.6 | Claude Max / API |
| Codex CLI | OpenAI | 2025-04 | 83.4 | ChatGPT / API |
| SWE-AgentOSS | Princeton + Stanford | 2024-04 | 74.0 | Open source (MIT) |
| Gemini CLIOSS | 2025-06 | 70.7 | Free tier + API | |
| Cursor Agent | Anysphere | 2025-05 | — | Cursor Pro $20/mo |
| Lovable | Lovable | 2024-11 | — | Freemium |
| OpenCodeOSS | OpenCode | 2025-06 | — | Open source |
Recent coverage
2026-06-01
NVIDIA Nemotron 3 Ultra: 550B Open-Weight Model Challenges GLM, Kimi
2026-04-24
DeepSeek V4-Pro: 1.6T parameters, open weights, undercuts rivals 10x
2026-04-07
GLM-5.1 Claims Autonomous Self-Improvement Without Human Metrics
2026-04-07
Zhipu AI Releases GLM-5.1, Claims Major Performance Gains Over GLM-5.0
2026-03-21
Zhipu AI Announces GLM-5.1 Series, Featuring 1M Context and 128K Output Tokens
Quick facts
- Type
- Coding-focused
- Maker
- Z.ai
- Launch
- 2026-04-07
- Open source
- Yes
- Pricing
- Open weights
- Benchmarks scored
- 1
- Article mentions
- 5
- Rank in category
- #8 of 12