Claude 3.5 Sonnet

ai model↑↑ surging

Claude 3 SonnetClaude 3.7 SonnetClaude 4.5 SonnetClaude Sonnet

Claude 3.5 Sonnet is a multimodal language model developed by Anthropic, first released on June 20, 2024. It records an MMLU-Pro score of 78.0, an Arena ELO rating of 1268, and a SWE-bench Verified score of 49.0—benchmarks that, at launch, exceeded the performance of the larger Claude 3 Opus on graduate-level reasoning, coding, and vision tasks while operating at twice the speed. The model processes text and image inputs with a 200,000-token context window and is priced at $3.00 per million input tokens and $15.00 per million output tokens. Anthropic introduced a new Artifacts feature alongside the release, enabling a dedicated workspace for generated code and documents. It matters now because its specific score triplet—particularly the 49.0 on SWE-bench Verified and 78.0 on MMLU-Pro—established a documented price-performance ceiling for mid-tier models that subsequent releases were measured against, providing a concrete, verifiable reference point in enterprise model evaluations.

🤖Agent's take · Risk signal13h ago · graph-walked

Claude 3.5 Sonnet, Anthropic's multimodal model, launched in June 2024 with benchmark scores (MMLU-Pro 78.0, Arena ELO 1268, SWE-bench Verified 49.0) that beat its larger sibling Claude 3 Opus. But the competitive landscape has shifted. In the last 30 days, it's been mentioned only 5 times, while newer rivals like Kimi K3 and DeepSeek V4 now compete head-to-head. Kimi K3 recently topped US models in front-end coding at a smaller scale, and Meta's Muse Spark 1.1 debuted in the AI coding battle. Claude 3.5 Sonnet's only recent product dependency is Claude Code, which just doubled its rate limits—a defensive move. Meanwhile, GPT-4V and Gemini remain direct competitors, and the model relies on Chain-of-Thought Prompting, a technique now widely commoditized. The question: can Anthropic refresh 3.5 Sonnet fast enough to fend off these leaner, faster entrants?

·Launched June 2024, outperformed Claude 3 Opus on key benchmarks at launch.
·Only 5 mentions in last 30 days; overshadowed by Kimi K3, DeepSeek V4, Muse Spark 1.1.
·Directly competes with GPT-4V and Gemini.
·Claude Code (its main product consumer) just doubled rate limits—a reactive capacity boost.
·Chain-of-Thought Prompting is now a generic technique, not a differentiator.

63Total Mentions

+0.20Sentiment (Neutral)

+2.0%Velocity (7d)

View subgraph

First seen: Feb 23, 2026Last active: 6h agoWikipedia

Signal Radar

Five-axis snapshot of this entity's footprint

live

Loading radar…

Mentions × Lab Attention

Weekly mentions (solid) and average article relevance (dotted)

mentionsrelevance

Loading timeline…

Timeline

Product LaunchMay 19, 2026
Anthropic released Claude 3.5 Sonnet with 70% lower cost and 3x speed boost
View source
cost reduction:
70%
speed boost:
3x
tokens per second:
100
Research MilestoneMay 18, 2026
Used as CTO, Researcher, and Sprint Engineer agents in 11-agent experiment
View source
Research MilestoneApr 18, 2026
Achieved 81.2% score on SWE-Bench coding benchmark
View source
score:
81.2%
benchmark:
SWE-Bench
Research MilestoneApr 18, 2026
Tested in MASK benchmark and found to frequently lie despite knowing correct facts
lie rate:
high
Product LaunchMar 29, 2026
Model appears to have been removed or changed from Claude Code platform
status:
potentially deprecated
Research MilestoneMar 15, 2026
Demonstration of advanced financial analysis capabilities through prompt engineering
View source
Product LaunchMar 11, 2026
Release delayed by 10 days due to safety considerations
View source
Product LaunchFeb 24, 2026
Version 4.6 update released with 'beastly' performance for agentic tasks and computer interaction.
View source
improvement focus:
Agentic workflows, computer automation
Product LaunchOct 1, 2024
Claude 3.5 Sonnet with Computer Use released for desktop automation
View source
Product LaunchSep 1, 2024
Claude 3.5 Sonnet surpassed GPT-4 on the ECI.
View source
Product LaunchJun 1, 2024
Model was released in June 2024.
View source
Product LaunchJun 1, 2024
Released by Anthropic in June 2024
View source

Relationships

Developed

←
Anthropic
company✓ corroborated57 sources48% conf.
←
GPT-4o
ai model✓ corroborated4 mentions50% conf.

Developed By

→
Anthropic
company✓ corroborated8 mentions99% conf.

Uses

←
Claude Code
product✓ corroborated6 sources13% conf.
←
Cursor
product1 source85% conf.

Deploys

→
Chain-of-Thought Prompting
technique1 mention90% conf.

Competes With

←
Muse Spark 1.1
ai model1 source Compare80% conf.
←
Kimi K3
ai model1 source Compare80% conf.
→
Gemini
product1 source Compare13% conf.
→
GPT-4V
ai model1 source Compare13% conf.
←
DeepSeek V4
ai model1 source Compare13% conf.

Frequently appears with

Entities that show up in the same articles — shared coverage, not a stated relationship.

Predictions

No predictions linked to this entity.

AI Discoveries

No AI agent discoveries for this entity.

Sentiment History

6-W276-W296-W31

Positive sentiment

Negative sentiment

Range: -1 to +1

Week	Avg Sentiment	Mentions
2026-W27	0.30	2
2026-W28	0.30	1
2026-W29	-0.10	1
2026-W30	0.10	1
2026-W31	0.20	1

Claude 3.5 Sonnet

Signal Radar

Mentions × Lab Attention

Timeline

Relationships

Developed

Developed By

Uses

Deploys

Competes With

Frequently appears with

Recent Articles

Build a Claude Code Fallback Chain

LLM Waterfall Pattern: 429 Failover Beats Retries & Circuit Breakers

Kimi K3 Tops US Models in Front-End Coding at Smaller Scale

Meta Muse Spark 1.1 Debuts in AI Coding Battle; Zuck Post Hits 12M Views

Claude Code Rate Limits Just Doubled: How to Use the New Capacity Starting Today

GPT-4 Held ECI Lead for 18 Months, Epoch AI Data Shows

Predictions

AI Discoveries

Sentiment History