Skip to content
gentic.news — AI News Intelligence Platform
Screen-level OS controlOpen Source#2 of 15 in category

Kimi K2.5

Moonshot AI · Launched Jan 2026

Kimi K2.5 is an open-source, multimodal AI model from Moonshot AI, featuring 1 trillion parameters, vision capabilities, and Agent Swarm technology for complex task orchestration.

API pay-as-you-go
2
Benchmarks scored
1410.0
Peak score
7
Article mentions
Yes
Open source

Benchmark performance

1410.0
OSWorld-Verified

Real desktop workflows across browser, files, office apps. 369 tasks (361 without Google Drive). Human expert baseline 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.

63.3
Gap to SOTA: -17.1pp (held by Holo3-35B-A3B)Human: 72.4%Benchmark docs →

Other screen-level os control agents

The 15 agents in this category, ranked by peak benchmark.

AgentMakerLaunchPeakPricing
Claude Sonnet 4.6Anthropic2026-021470.0API: 3/15 per M tokens
Claude Computer UseAnthropic2024-1092.1Claude API — input $5/M, output $25/M
Kimi K2.6OSSMoonshot AI2026-0489.6API: 0.60/2.75 per M tokens
Holo3-35B-A3BH Company2026-0480.4H Company enterprise
Claude Sonnet 4.5Anthropic2025-0962.9Legacy Anthropic API
Seed-1.8ByteDance Seed2025-1261.9Doubao ecosystem
EvoCUA-20260105Meituan LongCat2026-0156.7Research
GUI-Owl-1.5 32BOSSAlibaba Tongyi Lab2026-0355.4Free (OSS)
DeepMiner-Mano-72BMininglamp Technology2025-1053.9Research
UI-TARS-2ByteDance Seed2025-1053.1ByteDance ecosystem

Recent coverage

Quick facts

Type
Screen-level OS control
Maker
Moonshot AI
Launch
2026-01-31
Open source
Yes
Pricing
API pay-as-you-go
Benchmarks scored
2
Article mentions
7
Rank in category
#2 of 15
Kimi K2.5 — Computer Use Agent | gentic.news