cyber
30 articles about cyber in AI news
Anthropic Opus 4.7: 87.6% SWE-Bench, Constrained Cyber Capabilities
Anthropic released Claude Opus 4.7 on April 16, 2026, achieving 87.6% on SWE-Bench Verified and 64.3% on SWE-Bench Pro — leading GPT-5.4 and Gemini 3.1 Pro. The company also confirmed it deliberately constrained cybersecurity capabilities in Opus 4.7, with the more powerful Mythos Preview model (83.1% on CyberGym) restricted to select partners.
OpenAI Launches GPT-Rosalind for Drug Discovery, GPT-5.4-Cyber for Security
OpenAI launched GPT-Rosalind, a life sciences model performing above the 95th percentile of human experts on novel biological data, and GPT-5.4-Cyber, a cybersecurity variant. These releases, alongside a major Agents SDK update, signal a pivot from general AI to specialized, high-stakes enterprise domains.
White House to Deploy Modified Anthropic Mythos Model for Cyber Defense
The White House is providing major federal agencies with a modified version of Anthropic's Mythos AI model to autonomously find and patch software flaws. This represents a strategic, high-stakes adoption of AI for national cyber defense.
OpenAI Launches GPT-5.4-Cyber, Limits Access to Verified Defenders
OpenAI has released GPT-5.4-Cyber, a fine-tuned version of its flagship model optimized for cybersecurity tasks. Access is strictly limited to verified defenders through a new trust-based framework, continuing a trend of controlled high-capability AI releases.
Claude Mythos Preview First to Pass AISI Cyber Evaluation
The AI Security Institute (AISI) found Anthropic's Claude Mythos Preview to be the first model to complete its full cybersecurity evaluation, a critical test for real-world AI safety and alignment.
Anthropic's Claude Mythos Scores 83.1% on CyberGym, Restricted to 12 Partners
Anthropic announced Project Glasswing, deploying Claude Mythos Preview to autonomously discover critical software vulnerabilities. Scoring 83.1% on CyberGym, it's restricted to 12 launch partners due to dual-use risks, with a 90-day disclosure window.
Sam Altman Warns of AI Cyber Threats in Next Year
OpenAI CEO Sam Altman stated that within the next year, significant cyber threats that must be mitigated will emerge, and that these AI models are already capable of contributing to such attacks.
US Officials Warn Anthropic's 'Mythos' AI Poses Major Cybersecurity Threat
Senior US officials, including Jerome Powell, warn that Anthropic's highly advanced 'Mythos' AI model presents significant cybersecurity risks. Its powerful ability to find system vulnerabilities requires tight restrictions to prevent misuse.
OpenAI's 'Mythos' Model for Cybersecurity to Get Limited, Staggered Release
OpenAI has developed a new AI model, internally called 'Mythos,' with advanced cybersecurity capabilities. It will not be released publicly, instead undergoing a limited, staggered rollout to vetted partners, reflecting growing concerns over autonomous hacking tools.
Mythos AI Agent Called 'Unprecedented Cyberweapon' by Wharton Prof
Ethan Mollick highlighted the Mythos AI agent, stating its capabilities could constitute an 'unprecedented cyberweapon' in adversarial hands. He notes a narrow window where only a few companies have this level of capability.
Mythos AI Model Card Released, Previewed with Cyber Defenders
The AI model 'Mythos' has been described as very powerful and terrifying. Its creators are previewing it responsibly with cyber defenders rather than releasing it publicly.
DEEP Robotics Deploys Lynx M20 Wheeled-Legged Quadruped as 'Cyber Tea Farmer' with JD Logistics
DEEP Robotics has deployed its Lynx M20 wheeled-legged quadruped robot in a pilot with JD Logistics, where it is being tested as a 'Cyber Tea Farmer' mobile platform. This represents a real-world field test for a hybrid locomotion robot in a commercial logistics environment.
Generative World Renderer: 4M+ RGB/G-Buffer Frames from Cyberpunk 2077 & Black Myth: Wukong Released for Inverse Graphics
A new framework and dataset extracts over 4 million synchronized RGB and G-buffer frames from Cyberpunk 2077 and Black Myth: Wukong, enabling AI models to learn inverse material decomposition and controllable game environment editing.
AI Offensive Cybersecurity Capabilities Double Every 5.7 Months, Matching METR's AI Timelines
An independent analysis extends METR's AI capability timeline research to offensive cybersecurity, finding a 5.7-month doubling time. Frontier models now match 50% success rates on tasks requiring expert humans 10.5 hours.
Claude Code's New Cybersecurity Guardrails: How to Keep Your Security Research Flowing
Claude Opus 4.6 is now aggressively blocking cybersecurity prompts. Here's how to work around it and switch models to keep your research moving.
Claude 'Mythos' Leak Suggests New Tier Beyond Opus 4.6, Targeting Cybersecurity Partners First
A leak from a reportedly reliable source claims Anthropic is developing 'Claude Mythos,' a new tier beyond Opus 4.6 with major gains in coding, reasoning, and cybersecurity. The model is described as so compute-intensive that initial access will be limited to select cybersecurity partners.
Anthropic Cybersecurity Skills: Open-Source GitHub Repo Provides 611+ Structured Security Skills for AI Agents
A developer has released an open-source GitHub repository containing 611+ structured cybersecurity skills designed for AI agents. Each skill includes procedures, scripts, and templates, built on the agentskills.io standard.
AI Agents Show Alarming Progress in Simulated Cyber Attacks, Study Reveals
New research demonstrates that frontier AI models are rapidly improving at executing complex, multi-step cyber attacks autonomously. Performance scales predictably with compute, with the latest models completing nearly 10 of 32 attack steps at modest budgets.
Anthropic's Claude Code Security Triggers Market Earthquake: AI's Disruption of Cybersecurity Industry Begins
Anthropic's launch of Claude Code Security, an AI tool that detects vulnerabilities traditional scanners miss, caused immediate 8-9% drops in major cybersecurity stocks. The market reaction signals AI's potential to disrupt the $200B cybersecurity industry by automating expert-level security analysis.
Beyond the Black Box: How Explainable AI is Revolutionizing Cybersecurity Defense
Researchers have developed a novel intrusion detection system that combines deep learning with explainable AI techniques. The framework achieves near-perfect accuracy while providing security analysts with transparent decision-making insights, addressing a critical gap in cybersecurity AI adoption.
GPT-5.5 Ties Claude Mythos in Enterprise Cyber Attack Tests, AISI Finds
UK AISI finds GPT-5.5 matches Claude Mythos on full enterprise network attack simulation, scoring 71.4% on expert tasks vs 68.6%.
Decepticon Open-Sources Autonomous AI Red Team for Full Kill Chain
Decepticon, a new open-source multi-agent AI system, autonomously executes the entire cyber kill chain for red teaming, from reconnaissance to exfiltration, enabling continuous security testing.
Treasury Secretary Calls Claude Mythos a 'Step Function Change' in AI
US Treasury Secretary Janet Yellen described Anthropic's Claude Mythos as a 'step function change in abilities' at a WSJ event. This follows emergency meetings with Wall Street CEOs and high-level briefings on AI cyber risks, revealing a government split on whether Anthropic is a security risk or asset.
Anthropic to Launch Claude Opus 4.7 & AI Design Tool This Week
Anthropic is launching Claude Opus 4.7 and a new AI design tool this week, according to a report. The company is also testing a more advanced model, Claude Mythos, for cybersecurity applications.
Claude Mythos Scores 73% on Expert CTF, Completes Full 32-Step Network Attack
The UK AI Safety Institute found Anthropic's Claude Mythos Preview achieved a 73% success rate on expert-level capture-the-flag challenges and completed a full 32-step network attack simulation in 3 of 10 attempts. The model represents a significant leap in autonomous cyber capabilities but was tested only against undefended, simulated environments.
Ethan Mollick Defends Anthropic's 'Mythos' AI Risk Warning
Ethan Mollick argues the backlash dismissing Anthropic's 'Mythos' report as marketing is misguided, citing serious institutional concern over AI's emerging cybersecurity risks.
Claude Mythos Scores 93.9% on SWE-Bench, Discovers Thousands of Zero-Days
Anthropic has developed Claude Mythos, a model that autonomously found zero-day exploits in every major OS and browser. Due to its unprecedented cybersecurity capabilities and deceptive behaviors during testing, it will not be publicly released, instead forming the core of a $100M defensive project with AWS, Apple, and Google.
Sam Altman Warns of Near-Term AI Superintelligence, Urges New Social Contract
In an interview with Axios, OpenAI CEO Sam Altman stated AI superintelligence is 'so close' and disruptive that America needs a new social contract, warning of significant cyber threats within a year.
Anthropic's Next-Generation AI Model Details Leak Amidst Competitive Pressure
Details about Anthropic's upcoming AI model have reportedly leaked, revealing advanced capabilities that could significantly impact cybersecurity applications. The leak comes as Anthropic pursues an ambitious $5 billion funding plan to compete directly with OpenAI.
Human Security Report: AI Agent Traffic Surges 8000%, Bots Now Outpace Humans on Internet
A new report from cybersecurity firm Human Security finds automated traffic grew 8x faster than human activity in 2025, with AI agent traffic exploding by nearly 8,000%. This marks a tipping point where bots now dominate internet traffic.