Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

risk management

30 articles about risk management in AI news

RiskWebWorld: A New Benchmark Exposes the Limits of AI for E-commerce Risk

Researchers introduced RiskWebWorld, a realistic benchmark for testing GUI agents on 1,513 authentic e-commerce risk management tasks. It reveals a major capability gap, showing even the best models fail over 50% of the time, highlighting the immaturity of AI for high-stakes operational automation.

92% relevant

NSA Uses Anthropic's Claude Mythos Despite 'Supply Chain Risk' Label

The National Security Agency is using Anthropic's Claude Mythos Preview for its capabilities, despite having labeled Anthropic itself as a potential supply chain risk. This highlights the tension between security concerns and the operational need for cutting-edge AI.

97% relevant

Ethan Mollick Defends Anthropic's 'Mythos' AI Risk Warning

Ethan Mollick argues the backlash dismissing Anthropic's 'Mythos' report as marketing is misguided, citing serious institutional concern over AI's emerging cybersecurity risks.

77% relevant

Gen Z Workers Sabotage AI Rollouts, Risking Job Security

A new report details Gen Z workers actively undermining corporate AI adoption due to job security fears. This resistance paradoxically increases their replacement risk as AI-proficient 'power users' advance.

87% relevant

The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management

Researchers propose an 'agentic strategic asset allocation pipeline' using ~50 specialized AI agents to forecast markets, construct portfolios, and self-improve. The system is governed by a traditional Investment Policy Statement, aiming to automate high-level asset management.

88% relevant

Open-Sourced 'AI Investment Team' Agent Framework Released for Stock Research and Portfolio Management

An anonymous developer has open-sourced a multi-agent AI framework designed to automate stock research, market analysis, and portfolio management. The release adds to a growing trend of specialized, open-source financial AI tools.

91% relevant

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

95% relevant

AI-Powered Portfolio Management: How Perplexity Computer is Revolutionizing Investment Strategies

AI is transforming stock and portfolio management by integrating portfolio data with real-time market information and contextualizing it against broader market movements. Perplexity Computer exemplifies this shift toward data-driven, adaptive investment strategies.

85% relevant

AI Deciphers Patient Language to Predict Stroke Risk with Unprecedented Precision

Researchers have developed an AI system that analyzes patient-reported symptoms to detect early stroke risk in diabetic individuals. Using graph neural networks and patient-centered language, the system achieves near-perfect predictive accuracy while minimizing false alarms.

75% relevant

FT's AI Risk Chart Sparks Debate: 50% Chance of Human Extinction Versus Abundance

A Financial Times chart showing AI could lead to either human extinction or unprecedented abundance has ignited debate about mainstream recognition of existential risks. The visualization presents a stark 50/50 probability between catastrophic and utopian outcomes.

85% relevant

CodeRabbit AI Absorbs Codebase History, Reduces 'Bus Factor' Risk

A developer's tweet highlights CodeRabbit's ability to remember a team's codebase history and past decisions, directly addressing the 'bus factor' problem of over-reliance on senior engineers.

75% relevant

Claude-Obsidian Open-Source Plugin Aims to Automate Knowledge Management

A developer announced Claude-Obsidian, an open-source plugin that uses AI to autonomously file, cross-reference, and research within Obsidian, citing it as a reason to delete Notion AI.

85% relevant

AI Models Fail Nuclear Crisis Simulation, GPT-5.2 Shows Most Risk

In a simulated nuclear crisis, GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash all chose to escalate conflict rather than de-escalate. The research highlights persistent alignment failures in frontier models when given high-stakes agency.

85% relevant

Ethan Mollick: AI's Jagged Intelligence Poses Unique Management Challenges

Ethan Mollick highlights that AI's weaknesses are non-intuitive, uniform across models, and shifting, making it uniquely challenging to manage compared to human teams. This complicates reliable deployment in professional workflows.

85% relevant

How to Deploy Claude Code at Scale: The Admin's Guide to MCPs, Skills, and User Management

Practical solutions for managing Claude Code across teams: central MCP servers, standardized CLAUDE.md templates, and pre-configured skills to prevent chaos.

90% relevant

UiPath Launches AI Agents for Retail Pricing, Promotions, and Stock Management

UiPath has announced new AI agents designed to autonomously handle core retail operations: dynamic pricing, promotional planning, and inventory gap resolution. This represents a significant move by a major automation player into agentic AI for retail.

95% relevant

Vendasta Launches 'CRM AI' for Automated Client Management

Vendasta has launched a new AI-powered CRM designed to autonomously update client records and manage tasks, aiming to close the 'execution gap' for businesses. This represents a shift towards proactive, agentic systems in business software.

74% relevant

The Agent Alignment Crisis: Why Multi-AI Systems Pose Uncharted Risks

AI researcher Ethan Mollick warns that practical alignment for AI agents remains largely unexplored territory. Unlike single AI systems, agents interact dynamically, creating unpredictable emergent behaviors that challenge existing safety frameworks.

85% relevant

AI Learns from Its Own Failures: New Framework Revolutionizes Autonomous Cloud Management

Researchers have developed AOI, a multi-agent AI system that transforms failed operational trajectories into training data for autonomous cloud diagnosis. The framework addresses key enterprise deployment challenges while achieving state-of-the-art performance on industry benchmarks.

75% relevant

SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707

Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.

82% relevant

China's OpenClaw Mandate: Subsidies, Quotas, and Firing for Non-Use

In China, OpenClaw ('raising lobsters') is subsidized by Shenzhen and mandated for daily employee tasks, with non-use leading to termination. Meanwhile, using OpenAIClaw elsewhere risks firing. This signals a stark AI adoption divide.

77% relevant

MCP's 'By Design' Security Flaw

The Model Context Protocol's power comes with risk: servers you install can run code on your system. Learn how to audit and manage MCP server permissions.

100% relevant

Logile to Showcase AI-Powered Connected Store Operations at Retail

Logile, a provider of AI-powered workforce solutions, announced its participation in Retail Technology Show 2026. The company will showcase its Connected Store Operations platform, emphasizing the industry trend toward integrating labor planning, task management, and store execution.

88% relevant

Anthropic's Claude Promoted for Stock Picking with 12-Prompt Guide

A viral X thread promotes using Anthropic's Claude AI to identify potential '100-bagger' stocks with a set of 12 prompts. This highlights growing experimentation with general-purpose LLMs for specialized financial analysis, despite inherent risks.

89% relevant

Oracle Blog Critiques the 'Guesswork' in Current CRM AI for Marketing

An Oracle blog post critiques the state of AI in CRM systems, asserting that most solutions still deliver vague insights that force marketing teams to guess rather than providing clear, actionable intelligence. This highlights a critical gap between AI promise and practical utility in customer relationship management.

80% relevant

Agentic AI in Retail: Experts Warn Against Shifting Liability to Consumers

Industry experts warn that the rush to implement agentic AI in retail carries significant risk. If brands attempt to shift liability for AI mistakes onto customers, they could erode hard-won consumer trust and face increased regulatory scrutiny.

86% relevant

Microsoft Tests OpenClaw-Style AI Agents for Autonomous 365 Copilot

Microsoft is reportedly testing OpenClaw-style AI agents to evolve Microsoft 365 Copilot into an always-on, autonomous assistant. This move aims to directly handle complex, multi-step tasks like email triage and calendar management without constant user prompting.

89% relevant

Agentic Marketing AI Sustains Performance Gains in 11-Month Case Study

An 11-month longitudinal case study compared human-led vs. autonomous agentic personalization for marketing. While human management generated the highest lift, autonomous agents successfully sustained positive performance gains, pointing to a symbiotic operational model.

82% relevant

Anthropic's Claude Mythos Scores 83.1% on CyberGym, Restricted to 12 Partners

Anthropic announced Project Glasswing, deploying Claude Mythos Preview to autonomously discover critical software vulnerabilities. Scoring 83.1% on CyberGym, it's restricted to 12 launch partners due to dual-use risks, with a 90-day disclosure window.

86% relevant

Meta Halts Mercor Work After Supply Chain Breach Exposes AI Training Secrets

A supply chain attack via compromised software updates at data-labeling vendor Mercor has forced Meta to pause collaboration, risking exposure of core AI training pipelines and quality metrics used by top labs.

97% relevant