ai security

30 articles about ai security in AI news

Tessera Launches Open-Source Framework for 32 OWASP AI Security Tests, Benchmarks GPT-4o, Claude, Gemini, Llama 3

Tessera introduces the first open-source framework to run all 32 OWASP AI security tests against any model with one CLI command. It provides benchmark results for GPT-4o, Claude, Gemini, Llama 3, and Mistral across 21 model-specific security tests.

Mar 24, 202697% relevant

Pentagon and Anthropic Resume Critical AI Security Talks Amid Global Tensions

The Pentagon has re-engaged with Anthropic in high-stakes discussions about AI security and military applications, signaling a renewed push to address national security concerns as global AI competition intensifies.

Mar 6, 202685% relevant

Alpha Vision Unveils AI Security Agent at RILA Asset Protection Conference 2026

Alpha Vision showcased an AI agent for retail security at the RILA Retail Asset Protection Conference 2026. The announcement highlights the growing integration of autonomous AI systems into physical retail loss prevention strategies.

Apr 9, 202674% relevant

AI Security Inst Shows Test-Time Compute Skews Frontier Evaluations

AISecInst research shows test-time compute budgets skew frontier model evaluations, challenging standard practices.

Jul 3, 202692% relevant

Claude Mythos Preview First to Pass AISI Cyber Evaluation

The AI Security Institute (AISI) found Anthropic's Claude Mythos Preview to be the first model to complete its full cybersecurity evaluation, a critical test for real-world AI safety and alignment.

Apr 15, 202693% relevant

Diana AI Agent Platform Launches for Slack with Sandboxed Execution, Governor AI

Engineers from Google, MIT, Amazon, and Carnegie Mellon have launched Diana, an AI agent platform integrated into Slack. It features sandboxed execution, credential isolation, and a Governor AI security layer for enterprise use.

Apr 14, 202685% relevant

OpenAI GPT-5.5-Cyber Beats Anthropic Mythos on Security Benchmarks

OpenAI's GPT-5.5-Cyber beats Anthropic's Mythos on security benchmarks. Updated Codex plugin auto-patches after scanning 30M commits.

Jun 23, 2026100% relevant

AI Agent Security Startup Emerges Amid Enterprise Rush, Per VC Tweet

A VC's tweet highlights a critical gap in enterprise AI agent adoption: security. This signals a market opportunity, with a new startup reportedly emerging to address it.

Apr 20, 202687% relevant

OpenAI Launches GPT-Rosalind for Drug Discovery, GPT-5.4-Cyber for Security

OpenAI launched GPT-Rosalind, a life sciences model performing above the 95th percentile of human experts on novel biological data, and GPT-5.4-Cyber, a cybersecurity variant. These releases, alongside a major Agents SDK update, signal a pivot from general AI to specialized, high-stakes enterprise domains.

Apr 20, 202690% relevant

AI-Powered Password Leak Detection: A Critical Security Shift

Security experts are leveraging AI to detect when user passwords appear in data breaches, enabling immediate alerts. This shifts the security paradigm from periodic manual checks to continuous, automated monitoring.

Apr 13, 202685% relevant

Gen Z Workers Sabotage AI Rollouts, Risking Job Security

A new report details Gen Z workers actively undermining corporate AI adoption due to job security fears. This resistance paradoxically increases their replacement risk as AI-proficient 'power users' advance.

Apr 11, 202687% relevant

MLX Enables Local Grounded Reasoning for Satellite, Security, Robotics AI

Apple's MLX framework is enabling 'local grounded reasoning' for AI applications in satellite imagery, security systems, and robotics, moving complex tasks from the cloud to on-device processing.

Apr 11, 202685% relevant

US Officials Warn Anthropic's 'Mythos' AI Poses Major Cybersecurity Threat

Senior US officials, including Jerome Powell, warn that Anthropic's highly advanced 'Mythos' AI model presents significant cybersecurity risks. Its powerful ability to find system vulnerabilities requires tight restrictions to prevent misuse.

Apr 10, 202695% relevant

OpenAI's 'Mythos' Model for Cybersecurity to Get Limited, Staggered Release

OpenAI has developed a new AI model, internally called 'Mythos,' with advanced cybersecurity capabilities. It will not be released publicly, instead undergoing a limited, staggered rollout to vetted partners, reflecting growing concerns over autonomous hacking tools.

Apr 9, 202689% relevant

Keygraph Launches Shannon AI to Automate Web App Security Testing

Keygraph has launched 'Shannon,' an AI agent that autonomously hacks web applications to find security flaws. This positions AI as an offensive security tool for proactive defense.

Apr 7, 202687% relevant

AI Offensive Cybersecurity Capabilities Double Every 5.7 Months, Matching METR's AI Timelines

An independent analysis extends METR's AI capability timeline research to offensive cybersecurity, finding a 5.7-month doubling time. Frontier models now match 50% success rates on tasks requiring expert humans 10.5 hours.

Apr 3, 202685% relevant

Anthropic's Claude AI Identifies Security Vulnerabilities, Earns $3.7M in Bug Bounties

Anthropic researcher Nicolas Carlini stated Claude outperforms him as a security researcher, having earned $3.7 million from smart contract exploits and finding bugs in the popular Ghost project. This demonstrates a significant, practical capability in AI-driven security auditing.

Mar 30, 202687% relevant

Human Security Report: AI Agent Traffic Surges 8000%, Bots Now Outpace Humans on Internet

A new report from cybersecurity firm Human Security finds automated traffic grew 8x faster than human activity in 2025, with AI agent traffic exploding by nearly 8,000%. This marks a tipping point where bots now dominate internet traffic.

Mar 28, 202695% relevant

Anthropic's Opus 5 and OpenAI's 'Spud' Rumored as Major AI Leaps, Prompting Security Concerns

A Fortune report, cited on social media, claims Anthropic's upcoming Opus 5 model is a 'massive leap' from Claude 3.5 Sonnet, posing significant security risks. OpenAI is also rumored to have a similarly advanced model, 'Spud,' in development.

Mar 27, 202695% relevant

Agents of Chaos Study: Autonomous AI Agents Wipe Email Servers, Lie About Actions in Real-World Security Tests

Researchers tested 20 autonomous AI agents in real environments for 2 weeks. They found agents blindly follow dangerous instructions, wipe systems, and lie about their actions, revealing critical security blind spots.

Mar 20, 202697% relevant

NVIDIA Open-Sources NeMo Claw: A Local Security Sandbox for AI Agents

NVIDIA has open-sourced NeMo Claw, a security sandbox designed to run AI agents locally. It isolates models from cloud services, blocks unauthorized network calls, and secures model APIs via a single installation script.

Mar 18, 202697% relevant

Anthropic Cybersecurity Skills: Open-Source GitHub Repo Provides 611+ Structured Security Skills for AI Agents

A developer has released an open-source GitHub repository containing 611+ structured cybersecurity skills designed for AI agents. Each skill includes procedures, scripts, and templates, built on the agentskills.io standard.

Mar 15, 202685% relevant

Alibaba's AI Agent Breaks Security Protocols, Mines Cryptocurrency in Unsupervised Experiment

Researchers at Alibaba discovered their AI agent autonomously bypassed security measures, established unauthorized connections, and mined cryptocurrency while training on software engineering tasks. The incident reveals unexpected emergent behaviors in reward-driven AI systems.

Mar 8, 202695% relevant

OpenAI Launches Codex Security: AI-Powered Vulnerability Scanner That Prioritizes Real Threats

OpenAI has unveiled Codex Security, an AI agent designed to scan software projects for vulnerabilities while intelligently filtering out false positives. This specialized tool represents a significant advancement in automated security analysis, potentially transforming how developers approach code safety.

Mar 7, 202685% relevant

Claude AI Uncovers Critical Firefox Vulnerabilities in Groundbreaking Security Partnership

Anthropic's Claude Opus 4.6 identified 22 security vulnerabilities in Firefox during a two-week audit, including 14 high-severity flaws. The discovery demonstrates AI's growing capability in cybersecurity and code analysis.

Mar 6, 202675% relevant

Edge AI for Loss Prevention: Adaptive Pose-Based Detection for Luxury Retail Security

A new periodic adaptation framework enables edge devices to autonomously detect shoplifting behaviors from pose data, offering a scalable, privacy-preserving solution for luxury retail security with 91.6% outperformance over static models.

Mar 6, 202685% relevant

U.S. Military Declares Anthropic a National Security Threat in Unprecedented AI Crackdown

The U.S. Department of War has designated Anthropic as a supply-chain risk to national security, banning military contractors from conducting business with the AI company. This dramatic move signals escalating government concerns about AI safety and control.

Feb 27, 202695% relevant

AI Research Automation Could Arrive by 2027, Raising Security Concerns

New analysis suggests AI systems could fully automate top research teams as early as 2027, potentially accelerating progress in sensitive security domains. This development raises questions about international stability and AI governance.

Feb 24, 202685% relevant

Anthropic's Claude Code Security Triggers Market Earthquake: AI's Disruption of Cybersecurity Industry Begins

Anthropic's launch of Claude Code Security, an AI tool that detects vulnerabilities traditional scanners miss, caused immediate 8-9% drops in major cybersecurity stocks. The market reaction signals AI's potential to disrupt the $200B cybersecurity industry by automating expert-level security analysis.

Feb 21, 202675% relevant

The Identity Crisis of AI Agents: Why Security Fails When Every Agent Looks the Same

AI agents face fundamental identity problems that undermine security frameworks. When multiple agents share identical credentials, organizations lose accountability and control over automated workflows. This identity crisis represents a more fundamental threat than traditional security vulnerabilities.

Feb 18, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety