cybersecurity

30 articles about cybersecurity in AI news

US Officials Warn Anthropic's 'Mythos' AI Poses Major Cybersecurity Threat

Senior US officials, including Jerome Powell, warn that Anthropic's highly advanced 'Mythos' AI model presents significant cybersecurity risks. Its powerful ability to find system vulnerabilities requires tight restrictions to prevent misuse.

Apr 10, 202695% relevant

OpenAI's 'Mythos' Model for Cybersecurity to Get Limited, Staggered Release

OpenAI has developed a new AI model, internally called 'Mythos,' with advanced cybersecurity capabilities. It will not be released publicly, instead undergoing a limited, staggered rollout to vetted partners, reflecting growing concerns over autonomous hacking tools.

Apr 9, 202689% relevant

AI Offensive Cybersecurity Capabilities Double Every 5.7 Months, Matching METR's AI Timelines

An independent analysis extends METR's AI capability timeline research to offensive cybersecurity, finding a 5.7-month doubling time. Frontier models now match 50% success rates on tasks requiring expert humans 10.5 hours.

Apr 3, 202685% relevant

Claude Code's New Cybersecurity Guardrails: How to Keep Your Security Research Flowing

Claude Opus 4.6 is now aggressively blocking cybersecurity prompts. Here's how to work around it and switch models to keep your research moving.

Mar 28, 2026100% relevant

Claude 'Mythos' Leak Suggests New Tier Beyond Opus 4.6, Targeting Cybersecurity Partners First

A leak from a reportedly reliable source claims Anthropic is developing 'Claude Mythos,' a new tier beyond Opus 4.6 with major gains in coding, reasoning, and cybersecurity. The model is described as so compute-intensive that initial access will be limited to select cybersecurity partners.

Mar 27, 202699% relevant

Anthropic Cybersecurity Skills: Open-Source GitHub Repo Provides 611+ Structured Security Skills for AI Agents

A developer has released an open-source GitHub repository containing 611+ structured cybersecurity skills designed for AI agents. Each skill includes procedures, scripts, and templates, built on the agentskills.io standard.

Mar 15, 202685% relevant

Anthropic's Claude Code Security Triggers Market Earthquake: AI's Disruption of Cybersecurity Industry Begins

Anthropic's launch of Claude Code Security, an AI tool that detects vulnerabilities traditional scanners miss, caused immediate 8-9% drops in major cybersecurity stocks. The market reaction signals AI's potential to disrupt the $200B cybersecurity industry by automating expert-level security analysis.

Feb 21, 202675% relevant

Beyond the Black Box: How Explainable AI is Revolutionizing Cybersecurity Defense

Researchers have developed a novel intrusion detection system that combines deep learning with explainable AI techniques. The framework achieves near-perfect accuracy while providing security analysts with transparent decision-making insights, addressing a critical gap in cybersecurity AI adoption.

Feb 17, 202675% relevant

Zhipu GLM-5.2 beats Anthropic's Mythos on bug-hunt benchmark

Zhipu AI's GLM-5.2 beat Anthropic's Claude Opus 4.8 on a cybersecurity bug-hunting benchmark, then matched it with extra instructions, marking another 'DeepSeek moment'.

Jun 29, 202693% relevant

Stanford AI Agents Outperform Human Hackers in Penetration Test

Stanford AI agents beat human hackers in pen testing, finding more zero-day exploits. The claim lacks peer review but signals disruption for the $200B cybersecurity industry.

May 18, 202685% relevant

OpenAI Launches Daybreak Cyber Initiative to Rival Anthropic's Glasswing

OpenAI launched Daybreak, a cybersecurity initiative using GPT-5.5 and Codex Security, to rival Anthropic's Glasswing project.

May 12, 202692% relevant

Anthropic Opus 4.7: 87.6% SWE-Bench, Constrained Cyber Capabilities

Anthropic released Claude Opus 4.7 on April 16, 2026, achieving 87.6% on SWE-Bench Verified and 64.3% on SWE-Bench Pro — leading GPT-5.4 and Gemini 3.1 Pro. The company also confirmed it deliberately constrained cybersecurity capabilities in Opus 4.7, with the more powerful Mythos Preview model (83.1% on CyberGym) restricted to select partners.

Apr 23, 202684% relevant

OpenAI Launches GPT-Rosalind for Drug Discovery, GPT-5.4-Cyber for Security

OpenAI launched GPT-Rosalind, a life sciences model performing above the 95th percentile of human experts on novel biological data, and GPT-5.4-Cyber, a cybersecurity variant. These releases, alongside a major Agents SDK update, signal a pivot from general AI to specialized, high-stakes enterprise domains.

Apr 20, 202690% relevant

OpenAI Launches GPT-5.4-Cyber, Limits Access to Verified Defenders

OpenAI has released GPT-5.4-Cyber, a fine-tuned version of its flagship model optimized for cybersecurity tasks. Access is strictly limited to verified defenders through a new trust-based framework, continuing a trend of controlled high-capability AI releases.

Apr 16, 202682% relevant

Claude Mythos Preview First to Pass AISI Cyber Evaluation

The AI Security Institute (AISI) found Anthropic's Claude Mythos Preview to be the first model to complete its full cybersecurity evaluation, a critical test for real-world AI safety and alignment.

Apr 15, 202693% relevant

Anthropic to Launch Claude Opus 4.7 & AI Design Tool This Week

Anthropic is launching Claude Opus 4.7 and a new AI design tool this week, according to a report. The company is also testing a more advanced model, Claude Mythos, for cybersecurity applications.

Apr 14, 2026100% relevant

Ethan Mollick Defends Anthropic's 'Mythos' AI Risk Warning

Ethan Mollick argues the backlash dismissing Anthropic's 'Mythos' report as marketing is misguided, citing serious institutional concern over AI's emerging cybersecurity risks.

Apr 13, 202677% relevant

Claude Mythos Scores 93.9% on SWE-Bench, Discovers Thousands of Zero-Days

Anthropic has developed Claude Mythos, a model that autonomously found zero-day exploits in every major OS and browser. Due to its unprecedented cybersecurity capabilities and deceptive behaviors during testing, it will not be publicly released, instead forming the core of a $100M defensive project with AWS, Apple, and Google.

Apr 7, 202697% relevant

Human Security Report: AI Agent Traffic Surges 8000%, Bots Now Outpace Humans on Internet

A new report from cybersecurity firm Human Security finds automated traffic grew 8x faster than human activity in 2025, with AI agent traffic exploding by nearly 8,000%. This marks a tipping point where bots now dominate internet traffic.

Mar 28, 202695% relevant

Claude AI Uncovers Critical Firefox Vulnerabilities in Groundbreaking Security Partnership

Anthropic's Claude Opus 4.6 identified 22 security vulnerabilities in Firefox during a two-week audit, including 14 high-severity flaws. The discovery demonstrates AI's growing capability in cybersecurity and code analysis.

Mar 6, 202675% relevant

CVEs spike 3.5x after Anthropic's Mythos Preview launch

High-severity CVEs jumped 3.5x in June after Anthropic's Mythos Preview launch. The spike raises questions about model leakage versus broader AI-driven exploit acceleration.

Jul 3, 202687% relevant

AISI: Fixed compute budgets underestimate AI agents by 60%

AISI found standard benchmarks cap compute budgets, underestimating agent capabilities by ~60%. Success rates jumped ~25% with 10x tokens.

Jul 3, 202690% relevant

Trump Lifts Export Ban on Anthropic’s Mythos, Fable Models

U.S. lifted export ban on Anthropic's Mythos and Fable models June 30. Anthropic restores access July 1 under deal requiring proactive security risk detection.

Jul 1, 202681% relevant

Claude Sonnet 5 Beats Opus 4.8 on Knowledge Work at Lower Cost

Anthropic released Claude Sonnet 5, which beats Sonnet 4.6 across all benchmarks and edges past Opus 4.8 on GDPval-AA v2 with a score of 1,618.

Jun 30, 2026100% relevant

GPT-5.6 Sol, Terra, Luna: Benchmark Performance Depends on Which Test You Use

OpenAI released GPT-5.6 as three tiers—Sol, Terra, Luna—on June 27, 2026. Sol tops Terminal-Bench 2.1 but trails competitors on other benchmarks. The release shifts focus to tiered pricing and efficiency, but access remains restricted.

Jun 28, 202676% relevant

Vibe Coding Fails: Why AI-Generated Code Breaks at Scale

Vibe coding fails because AI-generated code lacks architectural coherence, test coverage, and security validation, breaking at scale beyond 1,000 lines.

Jun 27, 202670% relevant

OpenAI Launches GPT-5.6 Sol Under US Government Restrictions

OpenAI's GPT-5.6 Sol beats Claude Mythos 5 in agentic coding (88.8% vs 88%) but US government restricts access to select partners, a policy OpenAI calls unsustainable.

Jun 26, 2026100% relevant

Claude's Paying Consumer Base Grew 75% Since January, Indagari Data Shows

Claude's paying consumer base grew 75% since Jan 2026 per Indagari data. Consumer traction diversifies revenue ahead of Anthropic's planned IPO.

Jun 25, 202698% relevant

OpenAI GPT-5.5-Cyber Beats Anthropic Mythos on Security Benchmarks

OpenAI's GPT-5.5-Cyber beats Anthropic's Mythos on security benchmarks. Updated Codex plugin auto-patches after scanning 30M commits.

Jun 23, 2026100% relevant

Sakana AI's Fugu Orchestrator Matches Anthropic Fable 5 Without Using It

Sakana AI's Fugu orchestrator matches Anthropic's top models on benchmarks without using them, offering a hedge against vendor lock-in amid export controls.

Jun 22, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety