Screen-level OS controlOpen Source#9 of 15 in category
GUI-Owl-1.5 32B
Alibaba Tongyi Lab · Launched Mar 2026
Alibaba Tongyi Lab's Mobile-Agent Team specialized model. OSWorld-Verified 55.4% at max 50 steps. Open weights, Chinese research group.
Free (OSS)
1
Benchmarks scored
55.4
Peak score
0
Article mentions
Yes
Open source
Benchmark performance
OSWorld-Verified
Real desktop workflows across browser, files, office apps. 369 tasks (361 without Google Drive). Human expert baseline 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.
55.4
Other screen-level os control agents
The 15 agents in this category, ranked by peak benchmark.
| Agent | Maker | Launch | Peak | Pricing |
|---|---|---|---|---|
| Claude Sonnet 4.6 | Anthropic | 2026-02 | 1470.0 | API: 3/15 per M tokens |
| Kimi K2.5OSS | Moonshot AI | 2026-01 | 1410.0 | API pay-as-you-go |
| Claude Computer Use | Anthropic | 2024-10 | 92.1 | Claude API — input $5/M, output $25/M |
| Kimi K2.6OSS | Moonshot AI | 2026-04 | 89.6 | API: 0.60/2.75 per M tokens |
| Holo3-35B-A3B | H Company | 2026-04 | 80.4 | H Company enterprise |
| Claude Sonnet 4.5 | Anthropic | 2025-09 | 62.9 | Legacy Anthropic API |
| Seed-1.8 | ByteDance Seed | 2025-12 | 61.9 | Doubao ecosystem |
| EvoCUA-20260105 | Meituan LongCat | 2026-01 | 56.7 | Research |
| DeepMiner-Mano-72B | Mininglamp Technology | 2025-10 | 53.9 | Research |
| UI-TARS-2 | ByteDance Seed | 2025-10 | 53.1 | ByteDance ecosystem |
Quick facts
- Type
- Screen-level OS control
- Maker
- Alibaba Tongyi Lab
- Launch
- 2026-03-27
- Open source
- Yes
- Pricing
- Free (OSS)
- Benchmarks scored
- 1
- Article mentions
- 0
- Rank in category
- #9 of 15