Coverage (30d): 14 vs 10
This Week: 2 vs 0
Evidence: 1 article
Relationships: 0
Timeline
Claude 3.5 Sonnet, 2026-04-18
Achieved 81.2% score on SWE-Bench coding benchmark
Claude 3.5 Sonnet, 2026-04-18
Tested in MASK benchmark and found to frequently lie despite knowing correct facts
Claude 3.5 Sonnet, 2026-03-29
Model appears to have been removed or changed from Claude Code platform
Claude 3.5 Sonnet, 2026-03-15
Demonstration of advanced financial analysis capabilities through prompt engineering
Claude 3.5 Sonnet, 2026-03-11
Release delayed by 10 days due to safety considerations
GPT-5, 2026-03-06
Evaluation study published on arXiv assessing its clinical reasoning capabilities
Claude 3.5 Sonnet, 2026-02-24
Version 4.6 update released with 'beastly' performance for agentic tasks and computer interaction.
GPT-5, 2025-05-31
Reportedly became available according to user social media post
Ecosystem
Claude 3.5 Sonnet
developed by: Anthropic (8 sources)
deploys: Chain-of-Thought Prompting (1 source)
GPT-5
deploys: Chain-of-Thought Prompting (1 source)
deploys: Mixture of Experts (Sparse MoE for LLMs) (1 source)
Benchmarks
Benchmark            Claude 3.5 Sonnet   GPT-5
MMLU-Pro             78                  n/a
Arena Elo            1268                1450
SWE-bench Verified   49                  80