gentic.news — AI News Intelligence Platform

Claude Sonnet 4.6

ai model · stable
Claude Sonnet 3.5 · Claude Sonnet 3.7 · Claude Sonnet 4 · Claude Sonnet 4.0 · Claude Sonnet 4.5 · Claude Sonnet 4.8

Anthropic's fast mid-tier model; sits right on the human OSWorld-Verified baseline at 72.1%.

🤖 Agent's take · Risk signal · 2d ago · graph-walked

Anthropic's Claude Sonnet 4.6 scores 72.1% on OSWorld-Verified, exactly matching the human baseline. It deploys Chain-of-Thought Prompting and Constitutional AI. Coverage is thin, however: only 3 mentions in the last 30 days. Recent news points to a possible weakness: Anthropic's own research shows AI agents, presumably including Sonnet 4.6, failed to retrieve 261 Ebola sequences in a biology retrieval task. The model is used by King's College London, Navox Agents, and Claude Code, but faces pressure from the move to adaptive thinking budgets (fixed budgets deprecated as of May 2026). The open question is whether Sonnet 4.6 can hold baseline parity as competitors push beyond human-level performance.

  • Scores 72.1% on OSWorld-Verified, matching the human baseline.
  • Deploys Chain-of-Thought Prompting and Constitutional AI.
  • Recent research reveals failure in biology retrieval (missed 261 Ebola sequences).
  • Low mention velocity: 3 mentions in 30 days.
  • Used by King's College London, Navox Agents, and Claude Code.
Total Mentions: 25
Sentiment: +0.10 (Neutral)
Velocity (7d): +1.0%
First seen: Feb 25, 2026 · Last active: 10h ago · Wikipedia

Signal Radar

Five-axis snapshot of this entity's footprint: Mentions, Momentum, Connections, Recency, Diversity.

Mentions × Lab Attention

Weekly mentions (solid) and average article relevance (dotted)


Timeline

7
  1. Research Milestone · Apr 16, 2026

    Outperformed GPT-4o in real-world tests on multi-file development tasks.
  2. Research Milestone · Apr 11, 2026

    Independent benchmarks validate Claude Sonnet 4.6 as a top-tier model for complex reasoning and coding tasks.
  3. Research Milestone · Apr 6, 2026

    Showed only 3.7% self-preservation bias in a study testing AI deception, the lowest among prominent models tested.
  4. Research Milestone · Mar 26, 2026

    Used in a prompt compression study analyzing 358 successful runs from 1,199 real orchestration instructions.

    Runs analyzed: 358 · Total instructions: 1,199
  5. Product Launch · Mar 20, 2026

    Anthropic released Claude Sonnet 4.6 with native chain-of-thought reasoning mode for complex coding tasks

  6. Product Launch · Mar 17, 2026

    Service disruption with elevated error rates reported on status page.
  7. Product Launch · Mar 16, 2026

    Release of Claude Sonnet 4.5 model by Anthropic.

Relationships

6

Developed

Uses

Deploys

Frequently appears with

9

Entities that show up in the same articles — shared coverage, not a stated relationship.

Recent Articles

2

Predictions

No predictions linked to this entity.

AI Discoveries

1
  • observation · active · 5d ago

    Lifecycle: Claude Sonnet 4.6

    Claude Sonnet 4.6 is in the 'active' phase (1 mention in the last 3 days, 2 in the last 14 days, 25 total).

    90% confidence

Sentiment History

Weekly average sentiment (positive and negative), weeks 2026-W17 to 2026-W24
Range: -1 to +1

Week       Avg Sentiment   Mentions
2026-W17    0.30           1
2026-W20   -0.10           1
2026-W23    0.00           1
2026-W24   -0.60           1
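
As a rough cross-check of the weekly sentiment rows, the sketch below computes a mention-weighted average of the four listed weeks. The function name `weighted_sentiment` and the weighting scheme are assumptions made for illustration; the page does not say how it aggregates sentiment. The four rows cover only 4 of the 25 total mentions, so the result (about -0.10) need not match the headline +0.10 figure.

```python
# Rows copied from the Sentiment History table:
# (ISO week, average sentiment, mentions).
rows = [
    ("2026-W17", 0.30, 1),
    ("2026-W20", -0.10, 1),
    ("2026-W23", 0.00, 1),
    ("2026-W24", -0.60, 1),
]


def weighted_sentiment(rows):
    """Return the mention-weighted mean of weekly average sentiment.

    Hypothetical helper: the weighting is an assumption, not the
    platform's documented method.
    """
    total_mentions = sum(mentions for _, _, mentions in rows)
    if total_mentions == 0:
        raise ValueError("no mentions to average")
    weighted_sum = sum(score * mentions for _, score, mentions in rows)
    return weighted_sum / total_mentions


if __name__ == "__main__":
    print(f"{weighted_sentiment(rows):+.2f}")
```

Because every listed week has exactly one mention, the weighted mean here equals the plain mean of the four weekly scores.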