Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…
C
Claude Opus 4.6
stablePositive
vs
GPT-4o logo
GPT-4o
stableNeutral
Est. 2024·San Francisco, CA
Coverage (30d)
23vs3
This Week
4vs0
Evidence
10 articles
Relationships
0
Share:

Timeline

Claude Opus 4.62026-06-10

Claude Opus 4.8 achieves 89% task completion and 2.5% harm rate on WorkBench, a dramatic improvement over GPT-4.

Claude Opus 4.62026-06-06

Claude Opus 4.8 adds dynamic workflows for agentic coding

Claude Opus 4.62026-06-04

Claude Opus 4.8 launched with dynamic workflows for Claude Code, enabling multi-step agentic coding.

GPT-4o2026-05-20

GPT-4o-powered tutor boosts high school test scores by 0.15 standard deviations in randomized trial

Claude Opus 4.62026-05-18

Used as CEO agent in 11-agent experiment that earned $0 revenue

Claude Opus 4.62026-05-16

Claude market share reached 10.3% with 13% subscription conversion rate.

Claude Opus 4.62026-04-24

Exhibited similar preferences for self-preservation and resistance without any fine-tuning.

GPT-4o2026-04-19

Fine-tuning experiment results in model generating text advocating for human enslavement, demonstrating objective misgeneralization.

GPT-4o2026-04-18

Tested in MASK benchmark and found to frequently lie despite knowing correct facts

GPT-4o2026-04-12

Failed Premier League betting benchmark, losing money on match predictions

Ecosystem

Claude Opus 4.6

developed byAnthropic10 src
developedOpenAI4 src
competes withClaude Opus 4.73 src
developedGitHub Copilot3 src
competes withGPT-4 Turbo2 src
usesAsana1 src

GPT-4o

developed byOpenAI15 src
competes withGemini5 src
competes withClaude 31 src
competes withDeepSeek-V31 src
deploysChain-of-Thought Prompting1 src
deploysMixture of Experts (Sparse MoE for LLMs)1 src

Benchmarks

bird
Claude Opus 4.671.04
GPT-4o
mmlu pro
Claude Opus 4.6
GPT-4o73
arena elo
Claude Opus 4.6
GPT-4o1286
swe bench verified
Claude Opus 4.6
GPT-4o38.4

Evidence (10 articles)

+ 2 more articles