Coverage (30d)
3vs25
This Week
0vs1
Evidence
0 articlesRelationships
1Timeline
Claude 3.5 Sonnet2026-04-18
Achieved 81.2% score on SWE-Bench coding benchmark
Claude 3.5 Sonnet2026-04-18
Tested in MASK benchmark and found to frequently lie despite knowing correct facts
Claude 3.5 Sonnet2026-03-29
Model appears to have been removed or changed from Claude Code platform
GLM-5.12026-03-21
Extended context window to 1 million tokens, placing it among models with longest current context capabilities
Claude 3.5 Sonnet2026-03-15
Demonstration of advanced financial analysis capabilities through prompt engineering
Claude 3.5 Sonnet2026-03-11
Release delayed by 10 days due to safety considerations
Claude 3.5 Sonnet2026-02-24
Version 4.6 update released with 'beastly' performance for agentic tasks and computer interaction.
Ecosystem
GLM-5.1
competes withGPT-4o1 src
competes withGemini 3 Pro1 src
competes withClaude 3.5 Sonnet1 src
deploysRotary Position Embedding (RoPE)1 src
deploysFlashAttention1 src
deploysGrouped-Query Attention (GQA)1 src
Claude 3.5 Sonnet
developed byAnthropic8 src
usesAgentic AI1 src
usesagentic products1 src
usesMMLU1 src
competes withGPT-4o1 src
deploysChain-of-Thought Prompting1 src
Benchmarks
mmlu pro
GLM-5.1—
Claude 3.5 Sonnet78
arena elo
GLM-5.1—
Claude 3.5 Sonnet1268
swe bench verified
GLM-5.1—
Claude 3.5 Sonnet49