Kimi K2.5
Moonshot AI · Launched Jan 2026
Kimi K2.5 is an open-source, multimodal AI model from Moonshot AI, featuring 1 trillion parameters, vision capabilities, and Agent Swarm technology for complex task orchestration.
Benchmark performance
Real desktop workflows across browser, files, office apps. 369 tasks (361 without Google Drive). Human expert baseline 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.
Other screen-level os control agents
The 15 agents in this category, ranked by peak benchmark.
| Agent | Maker | Launch | Peak | Pricing |
|---|---|---|---|---|
| Claude Sonnet 4.6 | Anthropic | 2026-02 | 1470.0 | API: 3/15 per M tokens |
| Claude Computer Use | Anthropic | 2024-10 | 92.1 | Claude API — input $5/M, output $25/M |
| Kimi K2.6OSS | Moonshot AI | 2026-04 | 89.6 | API: 0.60/2.75 per M tokens |
| Holo3-35B-A3B | H Company | 2026-04 | 80.4 | H Company enterprise |
| Claude Sonnet 4.5 | Anthropic | 2025-09 | 62.9 | Legacy Anthropic API |
| Seed-1.8 | ByteDance Seed | 2025-12 | 61.9 | Doubao ecosystem |
| EvoCUA-20260105 | Meituan LongCat | 2026-01 | 56.7 | Research |
| GUI-Owl-1.5 32BOSS | Alibaba Tongyi Lab | 2026-03 | 55.4 | Free (OSS) |
| DeepMiner-Mano-72B | Mininglamp Technology | 2025-10 | 53.9 | Research |
| UI-TARS-2 | ByteDance Seed | 2025-10 | 53.1 | ByteDance ecosystem |
Recent coverage
2026-04-22
Free-Claude-Code Proxy Routes Anthropic API to Free NVIDIA NIM Models
2026-04-20
Moonshot AI's Kimi K2.6 Hits 58.6% on SWE-Bench Pro, Leads Open-Source Coding
2026-04-04
Alibaba's Qwen3.6-Plus Reportedly Under Half the Size of Kimi K2.5, Nears Claude Opus 4.5 Performance
2026-03-24
Kimi 2.5's 1T Parameter MoE Model Runs on 96GB Mac Hardware via SSD Streaming
2026-03-24
Step-3.5-Flash: 196B Open-Source MoE Model Activates Only 11B Parameters, Outperforms Kimi K2.5 and Claude Opus 4.5 on Key Benchmarks
2026-03-06
Cursor AI Meets Kimi K2.5: The Rapid Prototyping Revolution in Software Development
2026-03-04
Alibaba Cloud's $3 Coding Plan Disrupts AI Development Market
Quick facts
- Type
- Screen-level OS control
- Maker
- Moonshot AI
- Launch
- 2026-01-31
- Open source
- Yes
- Pricing
- API pay-as-you-go
- Benchmarks scored
- 2
- Article mentions
- 7
- Rank in category
- #2 of 15