Kimi K2.5
Moonshot AI · Launched Jan 2026
Moonshot's January 2026 open visual agentic model; OSWorld-Verified 63.3%.
Benchmark performance
369 real tasks on a live Ubuntu desktop VM: file I/O, spreadsheets, creative apps, settings. The July 2025 Verified rebuild moved to AWS (50x parallel) and fixed 300+ task bugs. The flagship computer-use benchmark.
Other coding-focused agents
The 12 agents in this category, ranked by peak benchmark.
| Agent | Maker | Launch | Peak | Pricing |
|---|---|---|---|---|
| Claude Fable 5 | Anthropic | 2026-05 | 95.0 | $10 / $50 per M tokens |
| Kimi K2.6OSS | Moonshot AI | 2026-04 | 89.6 | Open weights |
| Claude Code | Anthropic | 2025-02 | 88.6 | Claude Max / API |
| Codex CLI | OpenAI | 2025-04 | 83.4 | ChatGPT / API |
| SWE-AgentOSS | Princeton + Stanford | 2024-04 | 74.0 | Open source (MIT) |
| Gemini CLIOSS | 2025-06 | 70.7 | Free tier + API | |
| GLM-5.1OSS | Z.ai | 2026-04 | 58.4 | Open weights |
| Cursor Agent | Anysphere | 2025-05 | — | Cursor Pro $20/mo |
| Lovable | Lovable | 2024-11 | — | Freemium |
| OpenCodeOSS | OpenCode | 2025-06 | — | Open source |
Recent coverage
2026-05-18
Cursor's Composer 2.5 matches Opus 4.7, GPT-5.5 at fraction of cost
2026-04-22
Free-Claude-Code Proxy Routes Anthropic API to Free NVIDIA NIM Models
2026-04-20
Moonshot AI's Kimi K2.6 Hits 58.6% on SWE-Bench Pro, Leads Open-Source Coding
2026-04-04
Alibaba's Qwen3.6-Plus Reportedly Under Half the Size of Kimi K2.5, Nears Claude Opus 4.5 Performance
2026-03-24
Kimi 2.5's 1T Parameter MoE Model Runs on 96GB Mac Hardware via SSD Streaming
2026-03-24
Step-3.5-Flash: 196B Open-Source MoE Model Activates Only 11B Parameters, Outperforms Kimi K2.5 and Claude Opus 4.5 on Key Benchmarks
2026-03-06
Cursor AI Meets Kimi K2.5: The Rapid Prototyping Revolution in Software Development
2026-03-04
Alibaba Cloud's $3 Coding Plan Disrupts AI Development Market
Quick facts
- Type
- Coding-focused
- Maker
- Moonshot AI
- Launch
- 2026-01-27
- Open source
- Yes
- Pricing
- Open weights
- Benchmarks scored
- 2
- Article mentions
- 8
- Rank in category
- #1 of 12