Skip to content
gentic.news — AI News Intelligence Platform
Screen-level OS control#1 of 15 in category

Claude Sonnet 4.6

Anthropic · Launched Feb 2026

Claude Sonnet 4.6 is a multimodal large language model developed by Anthropic and released on February 25, 2026. Its documented performance includes an MMLU Pro score of 85.0, an Arena Elo rating of 1470, and a SWE-bench verified score of 79.6, positioning it as a competitive reasoning and coding model. The model was commercially priced at $3 per million input tokens and $15 per million output tokens. Claude Sonnet 4.6 matters in early 2026 as a benchmarked, mid-tier AI model demonstrating specific capabilities in reasoning and coding, providing a verifiable performance and pricing point within the competitive landscape of enterprise-focused language models, directly challenging similar offerings from OpenAI and Google.

API: 3/15 per M tokens
4
Benchmarks scored
1470.0
Peak score
22
Article mentions
No
Open source

Benchmark performance

1470.0
OSWorld-Verified

Real desktop workflows across browser, files, office apps. 369 tasks (361 without Google Drive). Human expert baseline 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.

72.1
Gap to SOTA: -8.3pp (held by Holo3-35B-A3B)Human: 72.4%Benchmark docs →

Other screen-level os control agents

The 15 agents in this category, ranked by peak benchmark.

AgentMakerLaunchPeakPricing
Kimi K2.5OSSMoonshot AI2026-011410.0API pay-as-you-go
Claude Computer UseAnthropic2024-1092.1Claude API — input $5/M, output $25/M
Kimi K2.6OSSMoonshot AI2026-0489.6API: 0.60/2.75 per M tokens
Holo3-35B-A3BH Company2026-0480.4H Company enterprise
Claude Sonnet 4.5Anthropic2025-0962.9Legacy Anthropic API
Seed-1.8ByteDance Seed2025-1261.9Doubao ecosystem
EvoCUA-20260105Meituan LongCat2026-0156.7Research
GUI-Owl-1.5 32BOSSAlibaba Tongyi Lab2026-0355.4Free (OSS)
DeepMiner-Mano-72BMininglamp Technology2025-1053.9Research
UI-TARS-2ByteDance Seed2025-1053.1ByteDance ecosystem

Recent coverage

Quick facts

Type
Screen-level OS control
Maker
Anthropic
Launch
2026-02-23
Open source
No
Pricing
API: 3/15 per M tokens
Benchmarks scored
4
Article mentions
22
Rank in category
#1 of 15
Claude Sonnet 4.6 — Computer Use Agent | gentic.news