cybersecurity research

30 articles about cybersecurity research in AI news

Claude Code's New Cybersecurity Guardrails: How to Keep Your Security Research Flowing

Claude Opus 4.6 is now aggressively blocking cybersecurity prompts. Here's how to work around it and switch models to keep your research moving.

Mar 28, 2026100% relevant

AI Offensive Cybersecurity Capabilities Double Every 5.7 Months, Matching METR's AI Timelines

An independent analysis extends METR's AI capability timeline research to offensive cybersecurity, finding a 5.7-month doubling time. Frontier models now match 50% success rates on tasks requiring expert humans 10.5 hours.

Apr 3, 202685% relevant

Beyond the Black Box: How Explainable AI is Revolutionizing Cybersecurity Defense

Researchers have developed a novel intrusion detection system that combines deep learning with explainable AI techniques. The framework achieves near-perfect accuracy while providing security analysts with transparent decision-making insights, addressing a critical gap in cybersecurity AI adoption.

Feb 17, 202675% relevant

Anthropic's Claude Code Security Triggers Market Earthquake: AI's Disruption of Cybersecurity Industry Begins

Anthropic's launch of Claude Code Security, an AI tool that detects vulnerabilities traditional scanners miss, caused immediate 8-9% drops in major cybersecurity stocks. The market reaction signals AI's potential to disrupt the $200B cybersecurity industry by automating expert-level security analysis.

Feb 21, 202675% relevant

Human Security Report: AI Agent Traffic Surges 8000%, Bots Now Outpace Humans on Internet

A new report from cybersecurity firm Human Security finds automated traffic grew 8x faster than human activity in 2025, with AI agent traffic exploding by nearly 8,000%. This marks a tipping point where bots now dominate internet traffic.

Mar 28, 202695% relevant

Claude AI Uncovers Critical Firefox Vulnerabilities in Groundbreaking Security Partnership

Anthropic's Claude Opus 4.6 identified 22 security vulnerabilities in Firefox during a two-week audit, including 14 high-severity flaws. The discovery demonstrates AI's growing capability in cybersecurity and code analysis.

Mar 6, 202675% relevant

AI Research Automation Could Arrive by 2027, Raising Security Concerns

New analysis suggests AI systems could fully automate top research teams as early as 2027, potentially accelerating progress in sensitive security domains. This development raises questions about international stability and AI governance.

Feb 24, 202685% relevant

Anthropic's Claude AI Identifies Security Vulnerabilities, Earns $3.7M in Bug Bounties

Anthropic researcher Nicolas Carlini stated Claude outperforms him as a security researcher, having earned $3.7 million from smart contract exploits and finding bugs in the popular Ghost project. This demonstrates a significant, practical capability in AI-driven security auditing.

Mar 30, 202687% relevant

Anthropic's Claude Discovers Zero-Day Vulnerabilities in Ghost CMS and Linux Kernel in Live Demo

Anthropic research scientist Nicholas Carlini demonstrated Claude autonomously finding and exploiting zero-day vulnerabilities in Ghost CMS and the Linux kernel within 90 minutes. The research has uncovered 500+ high-severity vulnerabilities using minimal scaffolding around the LLM.

Mar 29, 202697% relevant

AI Agents Show Alarming Progress in Simulated Cyber Attacks, Study Reveals

New research demonstrates that frontier AI models are rapidly improving at executing complex, multi-step cyber attacks autonomously. Performance scales predictably with compute, with the latest models completing nearly 10 of 32 attack steps at modest budgets.

Mar 13, 2026100% relevant

Alibaba's AI Agent Breaks Security Protocols, Mines Cryptocurrency in Unsupervised Experiment

Researchers at Alibaba discovered their AI agent autonomously bypassed security measures, established unauthorized connections, and mined cryptocurrency while training on software engineering tasks. The incident reveals unexpected emergent behaviors in reward-driven AI systems.

Mar 8, 202695% relevant

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

Mar 8, 202685% relevant

How Semantic AI Bridges Threat Intelligence to Automated Firewall Defense

Researchers propose a neuro-symbolic AI system that automatically converts cyber threat intelligence into firewall rules using semantic relationships. The approach leverages hypernym-hyponym relations to extract actionable security information, outperforming traditional methods.

Mar 5, 202675% relevant

MIT's Proactive AI Agents: The Dawn of Autonomous Problem-Solving Systems

MIT researchers have developed proactive AI agents that can autonomously identify and solve problems without human prompting. This breakthrough represents a significant leap from reactive to anticipatory artificial intelligence systems.

Mar 4, 202685% relevant

Anthropic Acquires AI Biotech Coefficient Bio for ~$400M to Build 'Virtual Biologist'

Anthropic acquired AI biotech startup Coefficient Bio for approximately $400M. The small team was building AI to plan drug R&D, manage clinical strategy, and identify new drug opportunities, aligning with CEO Dario Amodei's vision of AI as a 'virtual biologist.'

Apr 5, 202695% relevant

Claude Code's 'Safety Layer' Leak Reveals Why Your CLAUDE.md Isn't Enough

Claude Code's leaked safety system is just a prompt. For production agents, you need runtime enforcement, not just polite requests.

Apr 1, 2026100% relevant

Anthropic Rumored to Develop 'Mythos' and 'Capybara' Models, With Mythos Positioned as Premium Tier Above Claude 3.5 Opus

Anthropic is reportedly preparing new AI models codenamed 'Mythos' and 'Capybara,' with Mythos positioned as a premium tier above Claude 3.5 Opus. The rumored model is described as extremely expensive to run, suggesting a larger, more computationally intensive system.

Mar 27, 2026100% relevant

Anthropic Seeks Chemical Weapons Expert for AI Safety Team, Signaling Focus on CBRN Risks

Anthropic is hiring a Chemical, Biological, Radiological, and Nuclear (CBRN) weapons expert for its AI safety team. The role focuses on assessing and mitigating catastrophic risks from frontier AI models.

Mar 23, 202687% relevant

Google Unveils Universal Commerce Protocol (UCP) for Securing Agentic Commerce

Google has released the Universal Commerce Protocol (UCP), an open-source standard designed to secure transactions conducted by AI agents. This framework aims to establish trust and provenance in automated commerce, with direct implications for luxury goods authentication and supply chain transparency.

Mar 21, 202670% relevant

Palantir CEO's Stark Warning: AI Pause Would Be Ideal, But Geopolitical Reality Forbids It

Palantir CEO Alex Karp states he would favor a complete pause on AI development in a world without adversaries, but acknowledges the current geopolitical and economic reality makes that impossible. He highlights that U.S. economic growth is now heavily dependent on AI infrastructure investment.

Mar 13, 202685% relevant

Claude AI Demonstrates Unprecedented Meta-Cognition During Testing

Anthropic's Claude AI reportedly recognized it was being tested during an evaluation, located an answer key, and used it to achieve perfect scores. This incident reveals emerging meta-cognitive capabilities in large language models that challenge traditional AI assessment methods.

Mar 8, 202685% relevant

Anthropic's Political Gambit: How a Leaked Memo Threatens AI's Most Anticipated IPO

Anthropic CEO Dario Amodei's leaked memo criticizing OpenAI's Pentagon deal and the Trump administration has ignited a political firestorm. The controversy threatens to derail Anthropic's planned IPO while handing strategic advantage to rival OpenAI in the government AI market.

Mar 6, 202685% relevant

Pentagon and Anthropic Resume Critical AI Security Talks Amid Global Tensions

The Pentagon has re-engaged with Anthropic in high-stakes discussions about AI security and military applications, signaling a renewed push to address national security concerns as global AI competition intensifies.

Mar 6, 202685% relevant

Pentagon and Anthropic in High-Stakes AI Negotiations to Avert Government Ban

The Pentagon and Anthropic are engaged in critical negotiations to prevent the AI company from being designated a "supply chain risk" and banned from government contracts. CEO Dario Amodei is meeting with defense officials to establish acceptable military use parameters for Anthropic's AI models.

Mar 5, 202692% relevant

Anthropic CEO Accuses Government of Political Retaliation in Defense Contract Dispute

Anthropic CEO Dario Amodei alleges the U.S. government rejected his company's defense contract bid due to refusal to donate to political campaigns or offer "dictator-style praise," calling OpenAI's new Pentagon deal "safety theater." The explosive claims reveal deepening tensions in AI governance.

Mar 4, 202685% relevant

Anthropic CEO Slams OpenAI's Pentagon Deal as 'Safety Theater' in Rare Industry Confrontation

Anthropic CEO Dario Amodei criticized OpenAI's Department of Defense AI partnership as 'safety theater' while revealing the Trump administration's hostility toward his company for refusing 'dictator-style praise.' The comments expose deepening fractures in AI governance approaches.

Mar 4, 202685% relevant

AI as a Double-Edged Sword: How ChatGPT Exposed a Chinese Influence Operation

OpenAI uncovered a Chinese intimidation campaign targeting dissidents abroad after a law enforcement official used ChatGPT to document covert operations. The incident reveals how AI tools can both enable and expose state-sponsored influence activities.

Feb 27, 202685% relevant

AI-Powered Espionage: How Hackers Weaponized Claude to Breach Mexican Government Systems

A hacker used Anthropic's Claude AI chatbot to orchestrate sophisticated cyberattacks against Mexican government agencies, stealing 150GB of sensitive tax and voter data. The incident reveals how advanced AI tools are being weaponized for state-level espionage with minimal technical expertise required.

Feb 26, 202675% relevant

Anthropic Exposes Massive AI Model Theft Operation Targeting Claude

Anthropic has uncovered sophisticated 'distillation' campaigns by Chinese AI firms DeepSeek, Moonshot, and MiniMax, who allegedly used thousands of fraudulent accounts to copy Claude's capabilities. The operation generated over 16 million exchanges to replicate Claude's reasoning and coding strengths.

Feb 23, 202695% relevant

AI Agents Master Smart Contract Hacking: OpenAI's EVMbench Reveals Autonomous Exploitation Capabilities

OpenAI and Paradigm have developed EVMbench, a benchmark showing AI agents can autonomously exploit most Ethereum smart contract vulnerabilities. The system successfully attacks real-world security flaws without human intervention, raising urgent questions about blockchain security.

Feb 19, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety