Coverage (30d)
3vs6
This Week
1vs1
Evidence
2 articlesRelationships
0Timeline
Claude Mythos Preview2026-06-04
METR found Claude Mythos Preview could work 16+ hours autonomously
Claude Opus 4.72026-05-17
Autonomously ported Adobe Lightroom CC to Linux via Wine after a single prompt
Claude Mythos Preview2026-05-14
First AI model to clear all UK AISI cyberattack simulations
Claude Mythos Preview2026-05-09
Early snapshot achieves more than 2x time horizon of next best model on METR benchmark
Claude Mythos Preview2026-05-02
Claude Mythos Preview scored 68.6% on AISI expert CTF tasks
Claude Mythos Preview2026-05-02
Claude Mythos Preview fully solved TLO enterprise network simulation in 3 of 10 attempts
Claude Opus 4.72026-04-23
Opus 4.7 released with new tokenizer causing 40%+ cost increase
Claude Opus 4.72026-04-20
Anthropic released Claude Opus 4.7 with an updated tokenizer that increases token counts for the same input
Claude Opus 4.72026-04-20
Released Claude Opus 4.7 with 87.6% on SWE-Bench
Claude Opus 4.72026-04-17
Opus 4.7 achieved 98.5% on XBOW visual acuity benchmark, up from Opus 4.6's 54.5%.
Ecosystem
Claude Mythos Preview
competes withClaude Opus 4.61 src
competes withGPT-3.51 src
usesMETR1 src
competes withGPT-5.31 src
competes withCodex 5.31 src
Claude Opus 4.7
competes withClaude Opus 4.63 src
competes withGemini 3 Pro2 src
competes withComposer 21 src
deploysReinforcement Learning from Human Feedback (RLHF)1 src
competes withGPT-3.51 src
competes withGPT-51 src
Benchmarks
enterprise network attack simulation
Claude Mythos Preview71.4
Claude Opus 4.7—
swe bench
Claude Mythos Preview—
Claude Opus 4.72.1