gentic.news — AI News Intelligence Platform
Screen-level OS control · #11 of 15 in category

UI-TARS-2

ByteDance Seed · Launched Oct 2025

ByteDance Seed's second-generation UI-TARS, a specialized GUI agent. It scores 53.1% on OSWorld-Verified with a maximum of 100 steps.

Pricing: ByteDance ecosystem
Benchmarks scored: 1
Peak score: 53.1
Article mentions: 0
Open source: No

Benchmark performance

OSWorld-Verified

Real desktop workflows across browser, files, and office apps. 369 tasks (361 excluding the Google Drive tasks). Human expert baseline: 72.4%. Current SOTA: Holo3-35B-A3B (H Company) at 80.4%.

Score: 53.1
Gap to SOTA: -27.3pp (held by Holo3-35B-A3B) · Human: 72.4%
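
The gap figure above is a plain difference in percentage points. A minimal sketch of that arithmetic follows, using only the scores quoted on this page. The helper name `gap_pp` is hypothetical, and the gap to the human baseline is not stated on the page; it is derived here from the same numbers.

```python
def gap_pp(score: float, reference: float) -> float:
    """Signed gap in percentage points, rounded to one decimal place."""
    return round(score - reference, 1)


# Scores as quoted on this page (OSWorld-Verified, percent).
ui_tars_2 = 53.1  # UI-TARS-2
sota = 80.4       # Holo3-35B-A3B (H Company)
human = 72.4      # human expert baseline

gap_to_sota = gap_pp(ui_tars_2, sota)    # matches the -27.3pp shown above
gap_to_human = gap_pp(ui_tars_2, human)  # derived here, not stated on the page

print(f"Gap to SOTA:  {gap_to_sota:+.1f}pp")
print(f"Gap to human: {gap_to_human:+.1f}pp")
```

A negative value means the agent scores below the reference.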

Other screen-level OS control agents

Ten of the 15 agents in this category, ranked by peak benchmark score.

Agent | Maker | Launch | Peak | Pricing
Claude Sonnet 4.6 | Anthropic | 2026-02 | 1470.0 | API: 3/15 per M tokens
Kimi K2.5 (OSS) | Moonshot AI | 2026-01 | 1410.0 | API pay-as-you-go
Claude Computer Use | Anthropic | 2024-10 | 92.1 | Claude API: input $5/M, output $25/M
Kimi K2.6 (OSS) | Moonshot AI | 2026-04 | 89.6 | API: 0.60/2.75 per M tokens
Holo3-35B-A3B | H Company | 2026-04 | 80.4 | H Company enterprise
Claude Sonnet 4.5 | Anthropic | 2025-09 | 62.9 | Legacy Anthropic API
Seed-1.8 | ByteDance Seed | 2025-12 | 61.9 | Doubao ecosystem
EvoCUA-20260105 | Meituan LongCat | 2026-01 | 56.7 | Research
GUI-Owl-1.5 32B (OSS) | Alibaba Tongyi Lab | 2026-03 | 55.4 | Free (OSS)
DeepMiner-Mano-72B | Mininglamp Technology | 2025-10 | 53.9 | Research

Quick facts

Type: Screen-level OS control
Maker: ByteDance Seed
Launch: 2025-10-14
Open source: No
Pricing: ByteDance ecosystem
Benchmarks scored: 1
Article mentions: 0
Rank in category: #11 of 15