gentic.news — AI News Intelligence Platform
DeepSeek-R1 (stable, Positive) vs Claude 3.5 Sonnet (stable, Neutral)
Relationship: competes with (1)
Coverage (30d): 4 vs 26
This Week: 0 vs 1
Evidence: 2 articles
Relationships: 1
Timeline

Claude 3.5 Sonnet (2026-04-18)

Achieved 81.2% score on SWE-Bench coding benchmark

Claude 3.5 Sonnet (2026-04-18)

Tested on the MASK benchmark and found to frequently lie despite knowing the correct facts

Claude 3.5 Sonnet (2026-03-29)

Model appears to have been removed or changed from Claude Code platform

DeepSeek-R1 (2026-03-27)

Observed by Google researchers to spontaneously develop internal 'societies of thought' through reinforcement learning.

Claude 3.5 Sonnet (2026-03-15)

Demonstration of advanced financial analysis capabilities through prompt engineering

Claude 3.5 Sonnet (2026-03-11)

Release delayed by 10 days due to safety considerations

Claude 3.5 Sonnet (2026-02-24)

Version 4.6 update released with 'beastly' performance for agentic tasks and computer interaction.

DeepSeek-R1 (2025-05-27)

Achieved 79.8% on SWE-Bench Verified, matching Claude 3.5 Sonnet's performance

DeepSeek-R1 (2025-01-01)

DeepSeek-R1 model gained significant international attention in 2025 with strong performance on coding and reasoning benchmarks.

Ecosystem

DeepSeek-R1

developed by: DeepSeek (5 src)
deploys: Chain-of-Thought Prompting (1 src)
deploys: Self-Consistency (1 src)
deploys: Test-Time Compute Scaling (1 src)
deploys: Reinforcement Learning from Human Feedback (RLHF) (1 src)
deploys: Process Reward Models (1 src)

Claude 3.5 Sonnet

developed by: Anthropic (8 src)
uses: Agentic AI (1 src)
uses: agentic products (1 src)
uses: MMLU (1 src)
competes with: GPT-4o (1 src)
deploys: Chain-of-Thought Prompting (1 src)

Benchmarks

Benchmark             DeepSeek-R1         Claude 3.5 Sonnet
MMLU-Pro              84                  78
Arena Elo             1436                1268
SWE-bench Verified    (no value listed)   49

DeepSeek-R1 vs Claude 3.5 Sonnet — AI Comparison 2026 | gentic.news