Skip to content
gentic.news — AI News Intelligence Platform
Screen-level OS control#13 of 15 in category

Claude Computer Use Preview

Anthropic · Launched Oct 2024

Anthropic's original computer-use preview from Oct 2024, scored at 31.3% on OSWorld-Verified with max 50 steps. Proved the paradigm; since superseded by Sonnet 4.5/4.6.

Anthropic API
1
Benchmarks scored
31.3
Peak score
0
Article mentions
No
Open source

Benchmark performance

OSWorld-Verified

Real desktop workflows across browser, files, office apps. 369 tasks (361 without Google Drive). Human expert baseline 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.

31.3
Gap to SOTA: -49.1pp (held by Holo3-35B-A3B)Human: 72.4%Benchmark docs →

Other screen-level os control agents

The 15 agents in this category, ranked by peak benchmark.

AgentMakerLaunchPeakPricing
Claude Sonnet 4.6Anthropic2026-021470.0API: 3/15 per M tokens
Kimi K2.5OSSMoonshot AI2026-011410.0API pay-as-you-go
Claude Computer UseAnthropic2024-1092.1Claude API — input $5/M, output $25/M
Kimi K2.6OSSMoonshot AI2026-0489.6API: 0.60/2.75 per M tokens
Holo3-35B-A3BH Company2026-0480.4H Company enterprise
Claude Sonnet 4.5Anthropic2025-0962.9Legacy Anthropic API
Seed-1.8ByteDance Seed2025-1261.9Doubao ecosystem
EvoCUA-20260105Meituan LongCat2026-0156.7Research
GUI-Owl-1.5 32BOSSAlibaba Tongyi Lab2026-0355.4Free (OSS)
DeepMiner-Mano-72BMininglamp Technology2025-1053.9Research

Quick facts

Type
Screen-level OS control
Maker
Anthropic
Launch
2024-10-22
Open source
No
Pricing
Anthropic API
Benchmarks scored
1
Article mentions
0
Rank in category
#13 of 15
Claude Computer Use Preview — Computer Use Agent | gentic.news