Screen-level OS control#3 of 15 in category
Claude Computer Use
Anthropic · Launched Oct 2024
The first public screen-level OS control API. Powered by Claude 3.5 → 4.x → Opus 4.7. Takes screenshots, identifies UI elements, emits raw coordinate clicks and keystrokes. Production in Microsoft Copilot Studio.
Visit Claude Computer Use →Claude API — input $5/M, output $25/M
2
Benchmarks scored
92.1
Peak score
0
Article mentions
No
Open source
Benchmark performance
92.1
OSWorld-Verified
Real desktop workflows across browser, files, office apps. 369 tasks (361 without Google Drive). Human expert baseline 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.
72.1
Other screen-level os control agents
The 15 agents in this category, ranked by peak benchmark.
| Agent | Maker | Launch | Peak | Pricing |
|---|---|---|---|---|
| Claude Sonnet 4.6 | Anthropic | 2026-02 | 1470.0 | API: 3/15 per M tokens |
| Kimi K2.5OSS | Moonshot AI | 2026-01 | 1410.0 | API pay-as-you-go |
| Kimi K2.6OSS | Moonshot AI | 2026-04 | 89.6 | API: 0.60/2.75 per M tokens |
| Holo3-35B-A3B | H Company | 2026-04 | 80.4 | H Company enterprise |
| Claude Sonnet 4.5 | Anthropic | 2025-09 | 62.9 | Legacy Anthropic API |
| Seed-1.8 | ByteDance Seed | 2025-12 | 61.9 | Doubao ecosystem |
| EvoCUA-20260105 | Meituan LongCat | 2026-01 | 56.7 | Research |
| GUI-Owl-1.5 32BOSS | Alibaba Tongyi Lab | 2026-03 | 55.4 | Free (OSS) |
| DeepMiner-Mano-72B | Mininglamp Technology | 2025-10 | 53.9 | Research |
| UI-TARS-2 | ByteDance Seed | 2025-10 | 53.1 | ByteDance ecosystem |
Quick facts
- Type
- Screen-level OS control
- Maker
- Anthropic
- Launch
- 2024-10-22
- Open source
- No
- Pricing
- Claude API — input $5/M, output $25/M
- Benchmarks scored
- 2
- Article mentions
- 0
- Rank in category
- #3 of 15