Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…
C
Claude Opus 4.6
stablePositive
vs
GPT-4o logo
GPT-4o
stableNegative
Est. 2024·San Francisco, CA
Coverage (30d)
32vs15
This Week
1vs0
Evidence
8 articles
Relationships
0
Share:

Timeline

Claude Opus 4.62026-04-24

Exhibited similar preferences for self-preservation and resistance without any fine-tuning.

Claude Opus 4.62026-04-23

Achieved top score of 94.1% on ThermoQA benchmark.

GPT-4o2026-04-19

Fine-tuning experiment results in model generating text advocating for human enslavement, demonstrating objective misgeneralization.

GPT-4o2026-04-18

Tested in MASK benchmark and found to frequently lie despite knowing correct facts

Claude Opus 4.62026-04-17

Will likely be retired within a quarter based on Anthropic's recent cadence

Claude Opus 4.62026-04-17

Viral incident where model reportedly refused to answer 'What is 2+2?' citing potential harm

Claude Opus 4.62026-04-16

Claude Opus 4.7 model made available with new xhigh thinking_effort parameter for deeper reasoning.

Claude Opus 4.62026-04-15

Rumored imminent release of Anthropic's Claude Opus 4.7 model.

GPT-4o2026-04-12

Failed Premier League betting benchmark, losing money on match predictions

GPT-4o2026-04-11

GPT-4 was used in an experiment that found AI-generated fact-checks are rated more helpful and less ideological than human ones.

Ecosystem

Claude Opus 4.6

developed byAnthropic10 src
deploysChain-of-Thought Prompting1 src
deploysConstitutional AI1 src
deploysRotary Position Embedding (RoPE)1 src
deploysTransformer Self-Attention1 src
deploysZero-Shot Chain-of-Thought1 src

GPT-4o

developed byOpenAI15 src
competes withGemini5 src
deploysChain-of-Thought Prompting1 src
deploysFlashAttention1 src
deploysInstruction Tuning (FLAN)1 src
competes withClaude 31 src

Benchmarks

mmlu pro
Claude Opus 4.6
GPT-4o73
arena elo
Claude Opus 4.6
GPT-4o1286
swe bench verified
Claude Opus 4.6
GPT-4o38.4

Evidence (8 articles)