Coding-focusedOpen Source#8 of 12 in category

GLM-5.1

Z.ai · Launched Apr 2026

Z.ai's 754B-param MoE; SWE-bench Pro 58.4% and a 1,530 Code Arena Elo (3rd globally on agentic web development).

Benchmarks scored

58.4

Peak score

Article mentions

Yes

Open source

Benchmark performance

Harder, contamination-resistant successor to SWE-Bench Verified: real GitHub issues with held-out tests. Where coding headroom remains.

58.4

Gap to SOTA: -10.8pp (held by Claude Opus 4.8)Benchmark docs →

The 12 agents in this category, ranked by peak benchmark.

Agent	Maker	Launch	Peak	Pricing
Kimi K2.5OSS	Moonshot AI	2026-01	1410.0	Open weights
Claude Fable 5	Anthropic	2026-05	95.0	$10 / $50 per M tokens
Kimi K2.6OSS	Moonshot AI	2026-04	89.6	Open weights
Claude Code	Anthropic	2025-02	88.6	Claude Max / API
Codex CLI	OpenAI	2025-04	83.4	ChatGPT / API
SWE-AgentOSS	Princeton + Stanford	2024-04	74.0	Open source (MIT)
Gemini CLIOSS	Google	2025-06	70.7	Free tier + API
Cursor Agent	Anysphere	2025-05	—	Cursor Pro $20/mo
Lovable	Lovable	2024-11	—	Freemium
OpenCodeOSS	OpenCode	2025-06	—	Open source

2026-06-01

NVIDIA Nemotron 3 Ultra: 550B Open-Weight Model Challenges GLM, Kimi

2026-04-24

DeepSeek V4-Pro: 1.6T parameters, open weights, undercuts rivals 10x

2026-04-07

GLM-5.1 Claims Autonomous Self-Improvement Without Human Metrics

2026-04-07

Zhipu AI Releases GLM-5.1, Claims Major Performance Gains Over GLM-5.0

2026-03-21

Zhipu AI Announces GLM-5.1 Series, Featuring 1M Context and 128K Output Tokens