Timeline
Exhibited similar preferences for self-preservation and resistance without any fine-tuning.
Achieved a top score of 94.1% on the ThermoQA benchmark.
Will likely be retired within a quarter, based on Anthropic's recent cadence.
Viral incident in which the model reportedly refused to answer 'What is 2+2?', citing potential harm.
Claude Opus 4.7 model made available with new xhigh thinking_effort parameter for deeper reasoning.
Outperformed GPT-4o in real-world tests on multi-file development tasks.
Rumored imminent release of Anthropic's Claude Opus 4.7 model.
Independent benchmarks validate Claude Sonnet 4.6 as a top-tier model for complex reasoning and coding tasks.
Showed only 3.7% self-preservation bias in a study testing AI deception, the lowest among prominent models tested.
Used in a prompt compression study analyzing 358 successful runs from 1,199 real orchestration instructions.
Ecosystem
Claude Opus 4.6
Claude Sonnet 4.6
Benchmarks
Evidence (9 articles)
Claude Sonnet 4.6 Is Live: How to Use the New 'Budget Flagship' Model in Claude Code (Mar 20, 2026)
Claude Code's Opus 4.6 Outage: How to Switch Models and Keep Working (Mar 27, 2026)
3 Ways to Switch Claude Code Models Instantly: /model, --flag, and ENV Variables (Apr 23, 2026)
Claude Code's 1M Context Window is Now Free: How to Use It Today (Mar 13, 2026)
Anthropic's Claude Sonnet 4.8, Opus 4.7 Internally Tested, Leak Suggests (Apr 6, 2026)
Navox Agents: 8 Specialized Claude Code Agents with Human Checkpoints (Apr 17, 2026)
Claude Opus 4.6's Security Audit Power Is Now in Claude Code (Mar 21, 2026)
AWS Expands Claude AI Access Across Southeast Asia with Global Cross-Region Inference (Feb 24, 2026)