Coverage (30d)
10vs21
This Week
3vs0
Evidence
0 articlesRelationships
1Timeline
Claude Mythos Preview2026-04-15
Leaked benchmark results suggest it 'destroys every other model' including GPT-5, Claude 4 Opus, and Gemini Ultra 2.0
Claude Mythos Preview2026-04-14
Achieved 73% success rate on expert-level CTF challenges and completed full 32-step network attack simulation
Claude Mythos Preview2026-04-14
First AI model documented to autonomously complete a full, multi-step cyber attack simulation in UK safety tests.
Claude Mythos Preview2026-04-14
Evaluated by UK AI Safety Institute for autonomous cyber attack capabilities.
Claude Mythos Preview2026-04-12
Scored 83.1% on the CyberGym benchmark for vulnerability discovery.
Codex 5.32026-03-19
Detailed comparison and analysis of Codex's multi-agent engineering approach published
Codex 5.32026-03-04
Released as native Windows application, shifting from cloud-based GitHub Copilot service
Ecosystem
Claude Mythos Preview
competes withGPT-5.31 src
competes withCodex 5.31 src
Codex 5.3
usesLogira1 src
usesCoreML1 src
competes withCursor1 src
usesGitHub Copilot1 src
competes withClaude Code1 src