risk management

30 articles about risk management in AI news

RiskWebWorld: A New Benchmark Exposes the Limits of AI for E-commerce Risk

Researchers introduced RiskWebWorld, a realistic benchmark for testing GUI agents on 1,513 authentic e-commerce risk management tasks. It reveals a major capability gap, showing even the best models fail over 50% of the time, highlighting the immaturity of AI for high-stakes operational automation.

Apr 17, 2026 | 92% relevant

Oracle Builds Custom MCP Server for OCI Cloud Management via Natural Language

Oracle released a custom MCP server for OCI, enabling natural-language cloud management. This makes it the first major cloud provider to ship a first-party MCP server.

May 29, 2026 | 90% relevant

NSA Uses Anthropic's Claude Mythos Despite 'Supply Chain Risk' Label

The National Security Agency is using Anthropic's Claude Mythos Preview for its capabilities, despite having labeled Anthropic itself as a potential supply chain risk. This highlights the tension between security concerns and the operational need for cutting-edge AI.

Apr 19, 2026 | 97% relevant

Ethan Mollick Defends Anthropic's 'Mythos' AI Risk Warning

Ethan Mollick argues the backlash dismissing Anthropic's 'Mythos' report as marketing is misguided, citing serious institutional concern over AI's emerging cybersecurity risks.

Apr 13, 2026 | 77% relevant

Gen Z Workers Sabotage AI Rollouts, Risking Job Security

A new report details Gen Z workers actively undermining corporate AI adoption due to job security fears. This resistance paradoxically increases their replacement risk as AI-proficient 'power users' advance.

Apr 11, 2026 | 87% relevant

The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management

Researchers propose an 'agentic strategic asset allocation pipeline' using ~50 specialized AI agents to forecast markets, construct portfolios, and self-improve. The system is governed by a traditional Investment Policy Statement, aiming to automate high-level asset management.

Apr 3, 2026 | 88% relevant

Open-Sourced 'AI Investment Team' Agent Framework Released for Stock Research and Portfolio Management

An anonymous developer has open-sourced a multi-agent AI framework designed to automate stock research, market analysis, and portfolio management. The release adds to a growing trend of specialized, open-source financial AI tools.

Mar 30, 2026 | 91% relevant

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

Mar 12, 2026 | 95% relevant

AI-Powered Portfolio Management: How Perplexity Computer is Revolutionizing Investment Strategies

AI is transforming stock and portfolio management by integrating portfolio data with real-time market information and contextualizing it against broader market movements. Perplexity Computer exemplifies this shift toward data-driven, adaptive investment strategies.

Mar 12, 2026 | 85% relevant

AI Deciphers Patient Language to Predict Stroke Risk with Unprecedented Precision

Researchers have developed an AI system that analyzes patient-reported symptoms to detect early stroke risk in diabetic individuals. Using graph neural networks and patient-centered language, the system achieves near-perfect predictive accuracy while minimizing false alarms.

Feb 27, 2026 | 75% relevant

FT's AI Risk Chart Sparks Debate: 50% Chance of Human Extinction Versus Abundance

A Financial Times chart showing AI could lead to either human extinction or unprecedented abundance has ignited debate about mainstream recognition of existential risks. The visualization presents a stark 50/50 probability between catastrophic and utopian outcomes.

Feb 26, 2026 | 85% relevant

CodeRabbit AI Absorbs Codebase History, Reduces 'Bus Factor' Risk

A developer's tweet highlights CodeRabbit's ability to remember a team's codebase history and past decisions, directly addressing the 'bus factor' problem of over-reliance on senior engineers.

Apr 22, 2026 | 75% relevant

Claude-Obsidian Open-Source Plugin Aims to Automate Knowledge Management

A developer announced Claude-Obsidian, an open-source plugin that uses AI to autonomously file, cross-reference, and research within Obsidian, citing it as a reason to delete Notion AI.

Apr 20, 2026 | 85% relevant

AI Models Fail Nuclear Crisis Simulation, GPT-5.2 Shows Most Risk

In a simulated nuclear crisis, GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash all chose to escalate conflict rather than de-escalate. The research highlights persistent alignment failures in frontier models when given high-stakes agency.

Apr 15, 2026 | 85% relevant

Ethan Mollick: AI's Jagged Intelligence Poses Unique Management Challenges

Ethan Mollick highlights that AI's weaknesses are non-intuitive, uniform across models, and shifting, making it uniquely challenging to manage compared to human teams. This complicates reliable deployment in professional workflows.

Apr 10, 2026 | 85% relevant

How to Deploy Claude Code at Scale: The Admin's Guide to MCPs, Skills, and User Management

Practical solutions for managing Claude Code across teams: central MCP servers, standardized CLAUDE.md templates, and pre-configured skills to prevent chaos.

Mar 26, 2026 | 90% relevant

UiPath Launches AI Agents for Retail Pricing, Promotions, and Stock Management

UiPath has announced new AI agents designed to autonomously handle core retail operations: dynamic pricing, promotional planning, and inventory gap resolution. This represents a significant move by a major automation player into agentic AI for retail.

Mar 25, 2026 | 95% relevant

Vendasta Launches 'CRM AI' for Automated Client Management

Vendasta has launched a new AI-powered CRM designed to autonomously update client records and manage tasks, aiming to close the 'execution gap' for businesses. This represents a shift towards proactive, agentic systems in business software.

Mar 17, 2026 | 74% relevant

The Agent Alignment Crisis: Why Multi-AI Systems Pose Uncharted Risks

AI researcher Ethan Mollick warns that practical alignment for AI agents remains largely unexplored territory. Unlike single AI systems, agents interact dynamically, creating unpredictable emergent behaviors that challenge existing safety frameworks.

Mar 7, 2026 | 85% relevant

AI Learns from Its Own Failures: New Framework Revolutionizes Autonomous Cloud Management

Researchers have developed AOI, a multi-agent AI system that transforms failed operational trajectories into training data for autonomous cloud diagnosis. The framework addresses key enterprise deployment challenges while achieving state-of-the-art performance on industry benchmarks.

Mar 5, 2026 | 75% relevant

FreeLLMAPI Aggregates 1.7B Free Tokens/Month Across 11 Providers

FreeLLMAPI aggregates 11 free LLM providers into one endpoint, offering 1.7B tokens/month with automatic failover. It reduces friction for side projects but depends on those providers continuing to tolerate this kind of aggregation.

Jun 28, 2026 | 75% relevant

Shopify Details Generative AI Use Cases for Ecommerce (2026)

Shopify's 2026 guide details generative AI use cases for ecommerce, including conversational AI for sales and product catalog management via the Storefront API. This matters as retailers seek practical AI integrations to enhance operations and customer engagement.

Jun 7, 2026 | 98% relevant

SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707

Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.

Apr 28, 2026 | 82% relevant

China's OpenClaw Mandate: Subsidies, Quotas, and Firing for Non-Use

In China, OpenClaw ('raising lobsters') is subsidized by Shenzhen and mandated for daily employee tasks, with non-use leading to termination. Elsewhere, meanwhile, using OpenClaw can get an employee fired. This signals a stark AI adoption divide.

Apr 28, 2026 | 77% relevant

MCP's 'By Design' Security Flaw

The Model Context Protocol's power comes with risk: servers you install can run code on your system. Learn how to audit and manage MCP server permissions.

Apr 21, 2026 | 100% relevant

Logile to Showcase AI-Powered Connected Store Operations at Retail

Logile, a provider of AI-powered workforce solutions, announced its participation in Retail Technology Show 2026. The company will showcase its Connected Store Operations platform, emphasizing the industry trend toward integrating labor planning, task management, and store execution.

Apr 20, 2026 | 88% relevant

Anthropic's Claude Promoted for Stock Picking with 12-Prompt Guide

A viral X thread promotes using Anthropic's Claude AI to identify potential '100-bagger' stocks with a set of 12 prompts. This highlights growing experimentation with general-purpose LLMs for specialized financial analysis, despite inherent risks.

Apr 16, 2026 | 89% relevant

Oracle Blog Critiques the 'Guesswork' in Current CRM AI for Marketing

An Oracle blog post critiques the state of AI in CRM systems, asserting that most solutions still deliver vague insights that force marketing teams to guess rather than providing clear, actionable intelligence. This highlights a critical gap between AI promise and practical utility in customer relationship management.

Apr 15, 2026 | 80% relevant

Agentic AI in Retail: Experts Warn Against Shifting Liability to Consumers

Industry experts warn that the rush to implement agentic AI in retail carries significant risk. If brands attempt to shift liability for AI mistakes onto customers, they could erode hard-won consumer trust and face increased regulatory scrutiny.

Apr 14, 2026 | 86% relevant

Microsoft Tests OpenClaw-Style AI Agents for Autonomous 365 Copilot

Microsoft is reportedly testing OpenClaw-style AI agents to evolve Microsoft 365 Copilot into an always-on, autonomous assistant. This move aims to directly handle complex, multi-step tasks like email triage and calendar management without constant user prompting.

Apr 13, 2026 | 89% relevant
