C
Claude Mythos Preview
stablePositive
vs
competes with (1)
G
GPT-5.3
stablePositive
Coverage (30d)
10vs15
This Week
3vs4
Evidence
0 articles
Relationships
1

Timeline

Claude Mythos Preview2026-04-15

Leaked benchmark results suggest it 'destroys every other model' including GPT-5, Claude 4 Opus, and Gemini Ultra 2.0

Claude Mythos Preview2026-04-14

Achieved 73% success rate on expert-level CTF challenges and completed full 32-step network attack simulation

Claude Mythos Preview2026-04-14

First AI model documented to autonomously complete a full, multi-step cyber attack simulation in UK safety tests.

Claude Mythos Preview2026-04-14

Evaluated by UK AI Safety Institute for autonomous cyber attack capabilities.

Claude Mythos Preview2026-04-12

Scored 83.1% on the CyberGym benchmark for vulnerability discovery.

GPT-5.32026-03-26

Achieved 100% resident identification accuracy in a safety evaluation for a care home smart speaker system.

GPT-5.32026-03-07

Released as OpenAI's most capable frontier model with unified coding, reasoning, and computer operation capabilities

GPT-5.32026-03-06

Demonstrated surpassing human baselines on OSWorld benchmark with 75% score

GPT-5.32026-03-05

OpenAI releases GPT-5.4 with native computer use, tool search, and 1M token context window

GPT-5.32026-03-01

Expected to follow shortly after DeepSeek v4 release

Ecosystem

Claude Mythos Preview

competes withGPT-5.31 src
competes withCodex 5.31 src

GPT-5.3

developed byOpenAI
developedOpenAI5 src
usesRetrieval-Augmented Generation1 src
competes withClaude AI1 src
usesGDPval1 src
usesGPT-5.3-Codex1 src

Benchmarks

gpqa
Claude Mythos Preview
GPT-5.392
swe bench pro
Claude Mythos Preview
GPT-5.356.8
Claude Mythos Preview vs GPT-5.3 — AI Comparison 2026 | gentic.news