Coding-focused · Open Source · #4 of 9 in category

Kimi K2.6

Moonshot AI · Launched Apr 2026

Moonshot's open-weight agentic model. Scores: SWE-Bench Verified 80.2%, SWE-Bench Pro 58.6%, Terminal-Bench 2.0 66.7%. Sustains 4,000+ tool calls over 13-hour sessions.

Open weights

Benchmarks scored: 4
Peak score: 80.2
Article mentions: 4
Open source: Yes

Benchmark performance

SWE-Bench Verified

OpenAI-verified 500-issue subset of SWE-Bench. Approaching saturation in 2026 - most frontier models clear 80%+.

80.2

Gap to SOTA: -14.8pp (held by Claude Fable 5)

OSWorld-Verified

73.1

Terminal-Bench 2.1

Held-out, contamination-resistant CLI tasks driven end-to-end in a real terminal. Version 2.1 is the 2026 standard for terminal autonomy.

66.7

Gap to SOTA: -16.7pp (held by Codex CLI (GPT-5.5))

SWE-Bench Pro

Harder, contamination-resistant successor to SWE-Bench Verified: real GitHub issues with held-out tests. Where coding headroom remains.

58.6

Gap to SOTA: -10.6pp (held by Claude Opus 4.8)
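
The "Gap to SOTA" figures can be read as simple percentage-point arithmetic. The sketch below assumes the gap is defined as this model's score minus the leader's score (which would explain the negative sign); that definition is an assumption, not something the page states, and the function name is illustrative only.

```python
# Scores and reported gaps copied from the benchmark entries above.
# Assumption: gap_pp = model_score - leader_score, in percentage points.
REPORTED = {
    "SWE-Bench Verified": (80.2, -14.8),
    "Terminal-Bench": (66.7, -16.7),
    "SWE-Bench Pro": (58.6, -10.6),
}


def implied_sota(score: float, gap_pp: float) -> float:
    """Return the leader's score implied by a score and its gap to SOTA."""
    # Rearranging gap = score - sota gives sota = score - gap.
    # Rounding to one decimal hides floating-point noise.
    return round(score - gap_pp, 1)


for name, (score, gap_pp) in REPORTED.items():
    print(f"{name}: implied SOTA {implied_sota(score, gap_pp):.1f}")
```

Under that assumption, the Terminal-Bench gap implies a leading score of 83.4, which agrees with the peak listed for Codex CLI in the category table.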

Other coding-focused agents

The other 8 agents in this category, ranked by peak benchmark score.

Agent	Maker	Launch	Peak	Pricing
Kimi K2.5 (OSS)	Moonshot AI	2026-01	1410.0	Open weights
Claude Code	Anthropic	2025-02	88.6	Claude Max / API
Codex CLI	OpenAI	2025-04	83.4	ChatGPT / API
SWE-Agent (OSS)	Princeton + Stanford	2024-04	74.0	Open source (MIT)
Gemini CLI (OSS)	Google	2025-06	70.7	Free tier + API
GLM-5.1 (OSS)	Z.ai	2026-04	58.4	Open weights
OpenCode (OSS)	OpenCode	2025-06	—	Open source
Aider (OSS)	Aider	2023-05	—	Open source (Apache-2)
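
The "#4 of 9" rank can be reproduced from the peak scores listed on this page. This is a minimal sketch, assuming agents are sorted by peak in descending order with unscored agents placed last; the page does not document its actual ranking rule, so `rank_by_peak` is a hypothetical helper, and the listed values are taken at face value.

```python
from typing import Optional

# (agent, peak) pairs copied from this page; None marks an agent with no listed peak.
PEAKS: list[tuple[str, Optional[float]]] = [
    ("Kimi K2.6", 80.2),
    ("Kimi K2.5", 1410.0),
    ("Claude Code", 88.6),
    ("Codex CLI", 83.4),
    ("SWE-Agent", 74.0),
    ("Gemini CLI", 70.7),
    ("GLM-5.1", 58.4),
    ("OpenCode", None),
    ("Aider", None),
]


def rank_by_peak(rows: list[tuple[str, Optional[float]]]) -> list[tuple[str, Optional[float]]]:
    """Sort scored agents by peak (highest first); unscored agents go last."""
    # The key is (is_unscored, -peak). sorted() is stable, so unscored
    # agents keep their input order relative to each other.
    return sorted(rows, key=lambda row: (row[1] is None, -(row[1] or 0.0)))


ranked = rank_by_peak(PEAKS)
position = [name for name, _ in ranked].index("Kimi K2.6") + 1
print(f"Kimi K2.6: #{position} of {len(ranked)}")  # prints "Kimi K2.6: #4 of 9"
```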

Recent coverage

2026-06-01

NVIDIA Nemotron 3 Ultra: 550B Open-Weight Model Challenges GLM, Kimi

2026-05-23

Cerebras Hits 981 Tokens/sec on 1T-Parameter Kimi K2.6, Claims 6.7× GPU Cloud Speedup

2026-05-11

CoreWeave Tops Kimi K2.6 Inference Speed

2026-04-24

DeepSeek V4-Pro: 1.6T parameters, open weights, undercuts rivals 10x

Quick facts

Type: Coding-focused
Maker: Moonshot AI
Launch: 2026-04-01
Open source: Yes
Pricing: Open weights
Benchmarks scored: 4
Article mentions: 4
Rank in category: #4 of 9