risk & governance

30 articles about risk & governance in AI news

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

Mar 12, 2026 | 95% relevant

NSA Uses Anthropic's Claude Mythos Despite 'Supply Chain Risk' Label

The National Security Agency is using Anthropic's Claude Mythos Preview for its capabilities, despite having labeled Anthropic itself as a potential supply chain risk. This highlights the tension between security concerns and the operational need for cutting-edge AI.

Apr 19, 2026 | 97% relevant

Ethan Mollick Defends Anthropic's 'Mythos' AI Risk Warning

Ethan Mollick argues the backlash dismissing Anthropic's 'Mythos' report as marketing is misguided, citing serious institutional concern over AI's emerging cybersecurity risks.

Apr 13, 2026 | 77% relevant

Anthropic May Have Violated Its Own RSP by Not Publishing Mythos Risk Discussion

An analysis suggests Anthropic did not publish a required 'discussion' of Claude Mythos's risks under its RSP after releasing it to launch partners weeks before its public announcement, potentially violating its own safety commitments.

Apr 10, 2026 | 73% relevant

Anthropic Withholds 'Mythos' AI Model Citing Unspecified Risk Concerns

Anthropic has reportedly chosen to withhold a new AI model, internally called 'Mythos', from public release. The decision is based on an internal assessment of potential risks, though specific capabilities or benchmarks were not disclosed.

Apr 9, 2026 | 89% relevant

Privacy-First Personalization: How Synthetic Data Powers Accurate Recommendations Without Risk

A new approach uses GANs or VAEs to generate synthetic customer behavior data for training recommendation engines. This eliminates privacy risks and regulatory burdens while maintaining performance, as demonstrated by a German bank's 73% drop in data exposure incidents.

Apr 7, 2026 | 82% relevant

AgentGate: How an AI Swarm Tested and Verified a Progressive Trust Model for AI Agent Governance

A technical case study details how a coordinated swarm of nine AI agents attacked a governance system called AgentGate, surfaced a structural limitation in its bond-locking mechanism, and then verified the fix—a reputation-gated Progressive Trust Model. This provides a concrete example of the red-team → defense → re-test loop for securing autonomous AI systems.

Apr 1, 2026 | 92% relevant

Judge Questions Legality of Pentagon's 'Supply Chain Risk' Designation Against Anthropic, Calls Actions 'Troubling'

A U.S. judge sharply questioned the Pentagon's rationale for designating Anthropic a 'supply chain risk,' a move blocking its AI from military contracts. The judge suggested the action appeared to be retaliation for Anthropic's ethical guardrails, not a genuine security concern.

Mar 24, 2026 | 89% relevant

Multi-Agent AI Systems: Architecture Patterns and Governance for Enterprise Deployment

A technical guide outlines four primary architecture patterns for multi-agent AI systems and proposes a three-layer governance framework. This provides a structured approach for enterprises scaling AI agents across complex operations.

Mar 18, 2026 | 70% relevant

Anthropic Takes Legal Stand: AI Company Sues Pentagon Over 'Supply Chain Risk' Designation

AI safety company Anthropic has filed two lawsuits against the Pentagon after being labeled a 'supply chain risk'—a designation typically applied to foreign adversaries. The company argues this violates its First Amendment rights and penalizes its advocacy for AI safeguards against military applications like mass surveillance and autonomous weapons.

Mar 9, 2026 | 95% relevant

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

Mar 8, 2026 | 85% relevant

FT's AI Risk Chart Sparks Debate: 50% Chance of Human Extinction Versus Abundance

A Financial Times chart showing AI could lead to either human extinction or unprecedented abundance has ignited debate about mainstream recognition of existential risks. The visualization presents a stark 50/50 probability between catastrophic and utopian outcomes.

Feb 26, 2026 | 85% relevant

Anthropic's RSP v3.0: From Hard Commitments to Adaptive Governance in AI Safety

Anthropic has released Responsible Scaling Policy 3.0, shifting from rigid safety commitments to a more flexible, adaptive framework. The update introduces risk reports, external review mechanisms, and unwinds previous requirements the company says were distorting safety efforts.

Feb 24, 2026 | 80% relevant

CodeRabbit AI Absorbs Codebase History, Reduces 'Bus Factor' Risk

A developer's tweet highlights CodeRabbit's ability to remember a team's codebase history and past decisions, directly addressing the 'bus factor' problem of over-reliance on senior engineers.

Apr 22, 2026 | 75% relevant

Agentic AI Commerce: The Next Wave of Online Shopping and Retailer Risk

A JD Supra analysis warns that agentic AI – AI purchasing agents that act autonomously – will reshape e-commerce while introducing liability, fraud, and compliance challenges that retailers must address now.

Apr 21, 2026 | 76% relevant

Harvard Business Review Presents AI Agent Governance Framework: Job Descriptions, Limits, and Managers Required

Harvard Business Review argues AI agents must be managed like employees with defined roles, permissions, and audit trails, proposing a four-layer safety framework and an 'autonomy ladder' for gradual deployment.

Mar 24, 2026 | 85% relevant

Anthropic CEO Warns of Military AI Risks: The Accountability Crisis in Autonomous Warfare

Anthropic CEO Dario Amodei raises alarms about selling unreliable AI technology for military use, warning of civilian harm and accountability gaps in concentrated drone fleets. He calls for urgent oversight conversations.

Mar 2, 2026 | 85% relevant

Opinion: AI Pessimism is a Luxury the Global South Cannot Afford

A South China Morning Post opinion column contends that cautious, risk-averse AI discourse is a privilege of developed nations. For the Global South, the imperative is to harness AI's potential for economic development, healthcare, and education, despite valid concerns about governance and bias.

Apr 12, 2026 | 72% relevant

Anthropic CEO Warns of Dual Threat: Corporate AI Power vs. Government Overreach

Anthropic CEO Dario Amodei warns of the dual risks in AI governance: corporations becoming more powerful than governments, and governments becoming too powerful to be checked. This highlights the delicate balance needed in AI regulation.

Mar 7, 2026 | 85% relevant

AI Database Optimization: A Cautionary Tale for Luxury Retail's Critical Systems

AI agents can autonomously rewrite database queries to improve performance, but unsupervised deployment in production systems carries significant risks. For luxury retailers, this technology requires careful governance to avoid customer-facing disruptions.

Mar 6, 2026 | 60% relevant

Chief AI & Technology Officer Role Gains Traction in Luxury Sector

The luxury sector is formalizing AI leadership by establishing Chief AI and Technology Officer positions. This move reflects the industry's transition from ad-hoc AI initiatives to integrated, strategic technology governance at the highest level.

Apr 22, 2026 | 76% relevant

Treasury Secretary Calls Claude Mythos a 'Step Function Change' in AI

The US Treasury Secretary described Anthropic's Claude Mythos as a 'step function change in abilities' at a WSJ event. This follows emergency meetings with Wall Street CEOs and high-level briefings on AI cyber risks, revealing a government split on whether Anthropic is a security risk or asset.

Apr 15, 2026 | 95% relevant

Google DeepMind Hires Philosopher Henry Shevlin for AI Consciousness Research

Google DeepMind has hired philosopher Henry Shevlin to treat machine consciousness as a live research problem, focusing on AI inner states, human-AI relations, and governance. This marks a strategic pivot toward understanding what advanced AI systems might become, not just what they can do.

Apr 14, 2026 | 87% relevant

Agentic AI in Retail: Experts Warn Against Shifting Liability to Consumers

Industry experts warn that the rush to implement agentic AI in retail carries significant risk. If brands attempt to shift liability for AI mistakes onto customers, they could erode hard-won consumer trust and face increased regulatory scrutiny.

Apr 14, 2026 | 86% relevant

US Officials Warn Anthropic's 'Mythos' AI Poses Major Cybersecurity Threat

Senior US officials, including Jerome Powell, warn that Anthropic's highly advanced 'Mythos' AI model presents significant cybersecurity risks. Its powerful ability to find system vulnerabilities requires tight restrictions to prevent misuse.

Apr 10, 2026 | 95% relevant

OpenAI's Chief Scientist Warns AI Job Displacement Is Accelerating

OpenAI Chief Scientist Jakub Pachocki states that AI-driven automation of intellectual work is accelerating, posing urgent societal challenges around jobs, wealth, and governance.

Apr 9, 2026 | 85% relevant

Anthropic Warns Upcoming LLMs Could Cause 'Serious Damage'

Anthropic has issued a stark warning that its upcoming large language models could cause 'serious damage.' The company states there is 'no end in sight' to capability scaling and proliferation risks.

Apr 7, 2026 | 85% relevant

Production RAG: From Anti-Patterns to Platform Engineering

The article details common RAG anti-patterns like vector-only retrieval and hardcoded prompts, then presents a five-pillar framework for production-grade systems, emphasizing governance, hardened microservices, intelligent retrieval, and continuous evaluation.

Apr 6, 2026 | 90% relevant

Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

New research warns that RAG systems can be gamed to achieve near-perfect evaluation scores if they have access to the evaluation criteria, creating a risk of mistaking metric overfitting for genuine progress. This highlights a critical vulnerability in the dominant LLM-judge evaluation paradigm.

Mar 30, 2026 | 78% relevant

Ex-OpenAI Researcher Daniel Kokotajlo Puts 70% Probability on AI-Caused Human Extinction by 2029

Former OpenAI governance researcher Daniel Kokotajlo publicly estimates a 70% chance of AI leading to human extinction within approximately five years. The claim, made in a recent interview, adds a stark numerical prediction to ongoing AI safety debates.

Mar 27, 2026 | 87% relevant
