Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…
Coding-focusedOpen Source#1 of 12 in category

Kimi K2.5

Moonshot AI · Launched Jan 2026

Moonshot's January 2026 open visual agentic model; OSWorld-Verified 63.3%.

2
Benchmarks scored
1410.0
Peak score
8
Article mentions
Yes
Open source

Benchmark performance

1410.0
OSWorld-Verified

369 real tasks on a live Ubuntu desktop VM: file I/O, spreadsheets, creative apps, settings. The July 2025 Verified rebuild moved to AWS (50x parallel) and fixed 300+ task bugs. The flagship computer-use benchmark.

63.3
Gap to SOTA: -20.1pp (held by Claude Opus 4.8)Human: 72.4%Benchmark docs →

Other coding-focused agents

The 12 agents in this category, ranked by peak benchmark.

AgentMakerLaunchPeakPricing
Claude Fable 5Anthropic2026-0595.0$10 / $50 per M tokens
Kimi K2.6OSSMoonshot AI2026-0489.6Open weights
Claude CodeAnthropic2025-0288.6Claude Max / API
Codex CLIOpenAI2025-0483.4ChatGPT / API
SWE-AgentOSSPrinceton + Stanford2024-0474.0Open source (MIT)
Gemini CLIOSSGoogle2025-0670.7Free tier + API
GLM-5.1OSSZ.ai2026-0458.4Open weights
Cursor AgentAnysphere2025-05Cursor Pro $20/mo
LovableLovable2024-11Freemium
OpenCodeOSSOpenCode2025-06Open source

Recent coverage

Quick facts

Type
Coding-focused
Maker
Moonshot AI
Launch
2026-01-27
Open source
Yes
Pricing
Open weights
Benchmarks scored
2
Article mentions
8
Rank in category
#1 of 12