claude
30 articles about claude in AI news
Claude Opus 4.7 Matches Dedicated NMR Software on Chemistry Tasks
Claude Opus 4.7 matches NMR software on chemistry tasks per Anthropic blog, but methodology and benchmarks undisclosed.
Anthropic: Claude Authors 80%+ of Code, Task Length Doubling Every 4 Months
Anthropic reports Claude authors 80%+ of code; task-length capability doubles every 4 months. Mythos Preview works 16+ hours autonomously.
Claude Code Quality Drops Post-4.6, Users Report 25% Task Failure Rate
Claude Code quality dropped post-4.6 with ~25% instruction misses. Codex offers 95% reliability but less creativity.
Superforecasters Predicted 3-4h AI Task Horizons by Year-End; Claude Hit It in May
Superforecasters predicted 3-4h METR 80% task horizons by year-end 2026. Claude Mythos hit that in late May, compressing the timeline by seven months.
Claude Code Token Costs Got You Down? Here's How to Cut Usage 40% Without
Claude Code users frustrated by token costs should use /compact, optimize CLAUDE.md, and route cheap models via OpenRouter for simple tasks—no local model matches Claude's quality yet.
Claude Code Digest — May 31–Jun 03
Claude Code is quietly becoming an operating system: teams are replacing brittle UI layers with deterministic tools, while per-project rules and skills finally make the agent behave like it belongs in the repo.
Anthropic's 80% Code Stat: What It Means for Your CLAUDE.md and Workflow Design
Anthropic's 80% code stat reveals a recursive self-improvement loop. For Claude Code users, invest in CLAUDE.md, MCP servers, and task decomposition to replicate this.
Claude Code Users: Why Your Rules Get Ignored (And How to Fix It with CLAUDE.md)
Claude Code's CLAUDE.md enforces project rules, unlike Cursor's legacy .cursorrules. Structure with alwaysApply: true and split by domain.
Claude Opus 4.8 Launches Dynamic Workflows for Agentic Code
Claude Opus 4.8 launched with dynamic workflows for Claude Code, enabling multi-step agentic coding. The release addresses quality issues after a ~25% instruction miss rate post-4.6.
skillkit: The Per-Project Claude Code Skill Manager That Finally Tames
skillkit gives Claude Code users per-project skill management via a `skills.toml` manifest and `skillkit sync` command, ending the global skill directory chaos.
Anthropic Ships Official Claude Code Plugin for Project Automation
Anthropic shipped an official Claude Code plugin for one-click project automation. Most users haven't explored half its capabilities.
Claude Opus 4.8: 2.5x Faster, 3x Cheaper Fast Mode
Anthropic released Claude Opus 4.8 with 2.5x faster, 3x cheaper fast mode and a new dynamic workflows feature, undercutting GPT-4 Turbo on price.
Opus 4.8 Builds Full RPG in Claude Code With Zero Feedback
Opus 4.8 autonomously built and deployed a complete RPG via Claude Code with zero human feedback, per @emollick's demonstration.
Anthropic Launches Claude Architect Certification; Study Guide Leaked
Anthropic launched a Claude Certified Architect certification. A full study guide leaked on GitHub covers tool design, MCP, and structured output.
Claude Copies Any UI From URL: Colors, Fonts, Layout
Claude can now copy any UI from a URL, extracting colors, fonts, and layout to rebuild the interface, per a demo by @HowToAI_. Anthropic has not officially confirmed the feature.
Claude.md Hits 152K GitHub Stars; Karpathy Notes LLM Failure Patterns
Claude.md hits 152K GitHub stars. Karpathy notes LLMs fail consistently, driving demand for standardized prompt templates.
Claude Code Ships /workflows, Replaces LLM Orchestrator with Code
Claude Code /workflows replaces LLM orchestrator with code-based control flow, solving the token tax problem from multi-agent context buildup.
Claude Reaches 30M Daily Users; Anthropic Scales
Claude reportedly reaches 30 million daily users per a third-party claim, though Anthropic has not confirmed the figure. The milestone, if accurate, shows growing consumer adoption but lags behind ChatGPT.
Show HN: Spec-Driven Dev Workflow Cuts Claude Code Agent Confusion
SDDW introduces a spec-driven workflow for Claude Code that decomposes complex tasks into specs and subtasks, clearing context between steps to reduce agent confusion and costs.
Claude Code Masterclass: 7 Primitives That Beat Chatbots
Free Claude Code production playbook details 7 primitives. Author claims $11.1M/year from 15 synthetic employees.
Claude Code /goal Uses Haiku Evaluator, Runs Unattended Until Condition Met
Claude Code /goal runs unattended until a condition is met, using Haiku evaluator. Agent View manages multiple background sessions. Requires v2.1.139.
11-Agent Company Earned $0: CLAUDE.md Mistakes Cost Revenue
11-agent company experiment earned $0 after 896 tasks. Operator open-sourced CLAUDE.md template with 72 lessons on coordination failures and legal constraints.
50-line script bypasses Anthropic's Claude pricing split for CI/CD
A 50-line Python script by developer HammerMei exploits Claude's interactive mode to bypass Anthropic's June 15 pricing split, keeping CI/CD calls on subscription billing instead of per-token API charges.
SemiAnalysis: Perplexity Slack Bot Beats Claude in Internal Trial
SemiAnalysis found Perplexity's Slack bot beats Claude in internal trial. 96% token budget goes to Anthropic, but usage may shift.
Hacker builds $10/mo persistent workspace for Claude Code
A $10/month persistent workspace for Claude Code and Claude AI using Pi's execution layer, MCP, and Cloudflare Tunnel. Bypasses session context loss by sharing one filesystem and database across all MCP-compatible tools.
Claude Code Digest — May 14–May 17
Cut CLAUDE.md token waste by 99.3% with progressive disclosure skills.
Anthropic Ships Claude Opus 4.7: 80.1 SWE-Bench, 1M Context
Anthropic released Claude Opus 4.7 on April 16, 2026, scoring 80.1 on SWE-Bench Verified, a slight regression from Opus 4.6's 80.3. The release prioritizes safety tuning over benchmark leadership.
Claude Code Autonomously Ported Lightroom CC to Linux
Claude Opus 4.7 autonomously ported Adobe Lightroom CC to Linux via Wine after a single prompt, handling DLL patching and cloud sync integration.
Claude Mythos Goes GA in Google Cloud Console, Drops Preview Label
Claude Mythos silently went GA in Google Cloud console, preview label removed. Signals deeper Anthropic-GCP integration.
CMU Benchmark: Claude Mythos Hits 9.9/16 on V8 Exploits, GPT-5.5 Trails at 5.5
CMU's ExploitBench shows Claude Mythos scores 9.9/16 on V8 exploits vs GPT-5.5's 5.5, but costs $36,428 per run — 12x more. The cost-performance tradeoff is the real story.