risk assessment

30 articles about risk assessment in AI news

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

Mar 8, 202685% relevant

SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707

Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.

Apr 28, 202682% relevant

Anthropic Withholds 'Mythos' AI Model Citing Unspecified Risk Concerns

Anthropic has reportedly chosen to withhold a new AI model, internally called 'Mythos', from public release. The decision is based on an internal assessment of potential risks, though specific capabilities or benchmarks were not disclosed.

Apr 9, 202689% relevant

NSA Uses Anthropic's Claude Mythos Despite 'Supply Chain Risk' Label

The National Security Agency is using Anthropic's Claude Mythos Preview for its capabilities, despite having labeled Anthropic itself as a potential supply chain risk. This highlights the tension between security concerns and the operational need for cutting-edge AI.

Apr 19, 202697% relevant

Satellite Data Shows 40% of 2026 AI Data Centers at Risk of Delay

Geospatial analytics firm SynMax reports that at least 40% of AI data centers scheduled for 2026 completion are at risk of delays exceeding three months, based on satellite imagery analysis of construction progress at sites for OpenAI, Microsoft, and Oracle.

Apr 17, 202680% relevant

Ethan Mollick Defends Anthropic's 'Mythos' AI Risk Warning

Ethan Mollick argues the backlash dismissing Anthropic's 'Mythos' report as marketing is misguided, citing serious institutional concern over AI's emerging cybersecurity risks.

Apr 13, 202677% relevant

Privacy-First Personalization: How Synthetic Data Powers Accurate Recommendations Without Risk

A new approach uses GANs or VAEs to generate synthetic customer behavior data for training recommendation engines. This eliminates privacy risks and regulatory burdens while maintaining performance, as demonstrated by a German bank's 73% drop in data exposure incidents.

Apr 7, 202682% relevant

Anthropic Seeks Chemical Weapons Expert for AI Safety Team, Signaling Focus on CBRN Risks

Anthropic is hiring a Chemical, Biological, Radiological, and Nuclear (CBRN) weapons expert for its AI safety team. The role focuses on assessing and mitigating catastrophic risks from frontier AI models.

Mar 23, 202687% relevant

Andrej Karpathy Analysis: AI Poses High Risk to 57 Million US Jobs, ~40% of Workforce

Andrej Karpathy's analysis concludes AI puts 57 million US workers at high to very high risk of negative job impact. This ~40% figure contextualizes recent tech layoffs and discussions around universal high income.

Mar 15, 202685% relevant

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

Mar 12, 202695% relevant

Anthropic Takes Legal Stand: AI Company Sues Pentagon Over 'Supply Chain Risk' Designation

AI safety company Anthropic has filed two lawsuits against the Pentagon after being labeled a 'supply chain risk'—a designation typically applied to foreign adversaries. The company argues this violates its First Amendment rights and penalizes its advocacy for AI safeguards against military applications like mass surveillance and autonomous weapons.

Mar 9, 202695% relevant

AI Deciphers Patient Language to Predict Stroke Risk with Unprecedented Precision

Researchers have developed an AI system that analyzes patient-reported symptoms to detect early stroke risk in diabetic individuals. Using graph neural networks and patient-centered language, the system achieves near-perfect predictive accuracy while minimizing false alarms.

Feb 27, 202675% relevant

FT's AI Risk Chart Sparks Debate: 50% Chance of Human Extinction Versus Abundance

A Financial Times chart showing AI could lead to either human extinction or unprecedented abundance has ignited debate about mainstream recognition of existential risks. The visualization presents a stark 50/50 probability between catastrophic and utopian outcomes.

Feb 26, 202685% relevant

Agentic AI Commerce: The Next Wave of Online Shopping and Retailer Risk

A JD Supra analysis warns that agentic AI – AI purchasing agents that act autonomously – will reshape e-commerce while introducing liability, fraud, and compliance challenges that retailers must address now.

Apr 21, 202676% relevant

AI Models Fail Nuclear Crisis Simulation, GPT-5.2 Shows Most Risk

In a simulated nuclear crisis, GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash all chose to escalate conflict rather than de-escalate. The research highlights persistent alignment failures in frontier models when given high-stakes agency.

Apr 15, 202685% relevant

Andrej Karpathy's Deleted Tool: AI Exposure Scores for 342 Jobs, Finds $3.7T in High-Risk Wages

Andrej Karpathy briefly released a tool scoring 342 job types for AI exposure using an LLM, finding an average score of 5.3/10. The analysis identified $3.7 trillion in annual wages at high exposure (7+), with software developers at 9/10 and medical transcriptionists at 10/10.

Mar 15, 202685% relevant

Treasury Secretary Calls Claude Mythos a 'Step Function Change' in AI

US Treasury Secretary Janet Yellen described Anthropic's Claude Mythos as a 'step function change in abilities' at a WSJ event. This follows emergency meetings with Wall Street CEOs and high-level briefings on AI cyber risks, revealing a government split on whether Anthropic is a security risk or asset.

Apr 15, 202695% relevant

Research Shows AI Models Can 'Infect' Others with Hidden Bias

A study reveals AI models can transfer hidden biases to other models via training data, even without direct instruction. This creates a risk of bias propagation across AI ecosystems.

Apr 14, 202685% relevant

Agentic AI in Retail: Experts Warn Against Shifting Liability to Consumers

Industry experts warn that the rush to implement agentic AI in retail carries significant risk. If brands attempt to shift liability for AI mistakes onto customers, they could erode hard-won consumer trust and face increased regulatory scrutiny.

Apr 14, 202686% relevant

Anthropic's Claude Mythos Scores 83.1% on CyberGym, Restricted to 12 Partners

Anthropic announced Project Glasswing, deploying Claude Mythos Preview to autonomously discover critical software vulnerabilities. Scoring 83.1% on CyberGym, it's restricted to 12 launch partners due to dual-use risks, with a 90-day disclosure window.

Apr 12, 202686% relevant

Anthropic's 'Project Glassing' Opus-Beater Restricted to Security Researchers

Anthropic's new model, which outperforms Claude 3 Opus, is being released under 'Project Glassing' exclusively to vetted security researchers. This controlled rollout follows recent warnings from security experts about advanced AI risks.

Apr 7, 202685% relevant

Anthropic Warns Upcoming LLMs Could Cause 'Serious Damage'

Anthropic has issued a stark warning that its upcoming large language models could cause 'serious damage.' The company states there is 'no end in sight' to capability scaling and proliferation risks.

Apr 7, 202685% relevant

Meta Halts Mercor Work After Supply Chain Breach Exposes AI Training Secrets

A supply chain attack via compromised software updates at data-labeling vendor Mercor has forced Meta to pause collaboration, risking exposure of core AI training pipelines and quality metrics used by top labs.

Apr 4, 202697% relevant

Neko Health Launches $400 AI-Powered Full-Body Health Scans in New York This Spring

Neko Health, the $1.8B startup founded by Spotify's Daniel Ek, is launching its AI-driven full-body health screening service in the US. The $400 scan uses imaging and blood tests to screen for cancer, heart disease, and diabetes risk, though medical experts are divided on its efficacy.

Mar 28, 202685% relevant

Why Cheaper LLMs Can Cost More: The Hidden Economics of AI Inference in 2026

A Medium article outlines a practical framework for balancing performance, cost, and operational risk in real-world LLM deployment, arguing that focusing solely on model cost can lead to higher total expenses.

Mar 27, 202682% relevant

Agentic AI Shopping Bots Are Coming: Payment Giants and Retailers Are Building Them, Banks Are Scrambling

Major payment networks (Visa, Mastercard, PayPal) and retailers (Google, Walmart, Amazon) are developing autonomous AI shopping agents. This creates urgent operational and liability risks for banks, including unprecedented charge-back disputes and fraud exposure.

Mar 18, 202674% relevant

Claude AI Transforms Financial Analysis: From Public Filings to DCF Models in Minutes

Anthropic's Claude AI can now perform complex financial analysis comparable to a Goldman Sachs analyst, building detailed DCF models, earnings breakdowns, and sector risk reports from public filings in minutes using specialized prompts.

Mar 14, 202685% relevant

Data Readiness, Not Speed, Is the Critical Factor for AI Shopping Assistant Success

Experts warn that the biggest risk with AI shopping assistants is deploying before the organization is ready. Success hinges on unified data and security, not just rapid implementation, as shown by significant revenue influenced by these tools.

Mar 12, 202678% relevant

Mapping the Minefield: New Study Charts Five-Stage Taxonomy of LLM Harms

A new research paper systematically categorizes the potential harms of large language models across five lifecycle stages—from training to deployment—and argues that only multi-layered technical and policy safeguards can manage the risks.

Mar 10, 202695% relevant

AI Database Optimization: A Cautionary Tale for Luxury Retail's Critical Systems

AI agents can autonomously rewrite database queries to improve performance, but unsupervised deployment in production systems carries significant risks. For luxury retailers, this technology requires careful governance to avoid customer-facing disruptions.

Mar 6, 202660% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety