Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…
C
Claude Sonnet 4.6
stableNegative
vs
G
GPT-5.3
· quietNeutral
Coverage (30d)
3vs0
This Week
1vs0
Evidence
2 articles
Relationships
0
Share:

Timeline

GPT-5.32026-04-18

Achieved 78.5% score on SWE-Bench coding benchmark

Claude Sonnet 4.62026-04-16

Outperformed GPT-4o in real-world tests on multi-file development tasks

GPT-5.32026-04-16

Observed autonomously optimizing an embedding model for Qualcomm NPU for three hours.

Claude Sonnet 4.62026-04-11

Independent benchmarks validate Claude Sonnet 4.6 as a top-tier model for complex reasoning and coding tasks.

Claude Sonnet 4.62026-04-06

Showed only 3.7% self-preservation bias in a study testing AI deception, the lowest among prominent models tested.

GPT-5.32026-03-26

Achieved 100% resident identification accuracy in a safety evaluation for a care home smart speaker system.

Claude Sonnet 4.62026-03-26

Used in prompt compression study analyzing 358 successful runs from 1,199 real orchestration instructions

Claude Sonnet 4.62026-03-20

Anthropic released Claude Sonnet 4.6 with native chain-of-thought reasoning mode for complex coding tasks

Claude Sonnet 4.62026-03-17

Service disruption with elevated error rates reported on status page

GPT-5.32026-03-07

Released as OpenAI's most capable frontier model with unified coding, reasoning, and computer operation capabilities

Ecosystem

Claude Sonnet 4.6

deploysChain-of-Thought Prompting1 src
deploysConstitutional AI1 src

GPT-5.3

developed byOpenAI3 src
competes withClaude Mythos Preview1 src
usesComputer Use1 src
competes withClaude Opus 4.71 src
competes withGPT-Rosalind1 src
competes withGPT-3.51 src

Benchmarks

mmlu pro
Claude Sonnet 4.685
GPT-5.3
arena elo
Claude Sonnet 4.61470
GPT-5.3
osworld-verified
Claude Sonnet 4.672.1
GPT-5.3
swe bench verified
Claude Sonnet 4.679.6
GPT-5.3
gpqa
Claude Sonnet 4.6
GPT-5.392
swe bench pro
Claude Sonnet 4.6
GPT-5.356.8

Evidence (2 articles)