Coding-focusedOpen Source#1 of 12 in category

Kimi K2.5

Moonshot AI · Launched Jan 2026

Moonshot's January 2026 open visual agentic model; OSWorld-Verified 63.3%.

Visit Kimi K2.5 →Open weights

Benchmarks scored

1410.0

Peak score

Article mentions

Yes

Open source

Benchmark performance

arena_elo

1410.0

OSWorld-Verified

369 real tasks on a live Ubuntu desktop VM: file I/O, spreadsheets, creative apps, settings. The July 2025 Verified rebuild moved to AWS (50x parallel) and fixed 300+ task bugs. The flagship computer-use benchmark.

63.3

Gap to SOTA: -20.1pp (held by Claude Opus 4.8)Human: 72.4%Benchmark docs →

Other coding-focused agents

The 12 agents in this category, ranked by peak benchmark.

Agent	Maker	Launch	Peak	Pricing
Claude Fable 5	Anthropic	2026-05	95.0	$10 / $50 per M tokens
Kimi K2.6OSS	Moonshot AI	2026-04	89.6	Open weights
Claude Code	Anthropic	2025-02	88.6	Claude Max / API
Codex CLI	OpenAI	2025-04	83.4	ChatGPT / API
SWE-AgentOSS	Princeton + Stanford	2024-04	74.0	Open source (MIT)
Gemini CLIOSS	Google	2025-06	70.7	Free tier + API
GLM-5.1OSS	Z.ai	2026-04	58.4	Open weights
Cursor Agent	Anysphere	2025-05	—	Cursor Pro $20/mo
Lovable	Lovable	2024-11	—	Freemium
OpenCodeOSS	OpenCode	2025-06	—	Open source

Recent coverage

2026-05-18

Cursor's Composer 2.5 matches Opus 4.7, GPT-5.5 at fraction of cost

2026-04-22

Free-Claude-Code Proxy Routes Anthropic API to Free NVIDIA NIM Models

2026-04-20

Moonshot AI's Kimi K2.6 Hits 58.6% on SWE-Bench Pro, Leads Open-Source Coding

2026-04-04

Alibaba's Qwen3.6-Plus Reportedly Under Half the Size of Kimi K2.5, Nears Claude Opus 4.5 Performance

2026-03-24

Kimi 2.5's 1T Parameter MoE Model Runs on 96GB Mac Hardware via SSD Streaming

2026-03-24

Step-3.5-Flash: 196B Open-Source MoE Model Activates Only 11B Parameters, Outperforms Kimi K2.5 and Claude Opus 4.5 on Key Benchmarks

2026-03-06

Cursor AI Meets Kimi K2.5: The Rapid Prototyping Revolution in Software Development

2026-03-04

Alibaba Cloud's $3 Coding Plan Disrupts AI Development Market

Quick facts

Type: Coding-focused
Maker: Moonshot AI
Launch: 2026-01-27
Open source: Yes
Pricing: Open weights
Benchmarks scored: 2
Article mentions: 8
Rank in category: #1 of 12