risk & compliance

30 articles about risk & compliance in AI news

Agentic AI Commerce: The Next Wave of Online Shopping and Retailer Risk

A JD Supra analysis warns that agentic AI – AI purchasing agents that act autonomously – will reshape e-commerce while introducing liability, fraud, and compliance challenges that retailers must address now.

Apr 21, 202676% relevant

RiskWebWorld: A New Benchmark Exposes the Limits of AI for E-commerce Risk

Researchers introduced RiskWebWorld, a realistic benchmark for testing GUI agents on 1,513 authentic e-commerce risk management tasks. It reveals a major capability gap, showing even the best models fail over 50% of the time, highlighting the immaturity of AI for high-stakes operational automation.

Apr 17, 202692% relevant

Microsoft Expands Word Copilot for Legal, Finance, and Compliance Docs

Microsoft is giving its Copilot AI a more significant role within Microsoft Word for editing legal, financial, and compliance documents, indicating a push into specialized, high-stakes enterprise workflows.

Apr 15, 202685% relevant

A-R Space Framework Profiles LLM Agent Execution Behavior Across Risk Contexts

Researchers propose the A-R Space, measuring Action Rate and Refusal Signal to profile LLM agent behavior across four risk contexts and three autonomy levels. This provides a deployment-oriented framework for selecting agents based on organizational risk tolerance.

Apr 15, 202696% relevant

Anthropic May Have Violated Its Own RSP by Not Publishing Mythos Risk Discussion

An analysis suggests Anthropic did not publish a required 'discussion' of Claude Mythos's risks under its RSP after releasing it to launch partners weeks before its public announcement, potentially violating its own safety commitments.

Apr 10, 202673% relevant

Privacy-First Personalization: How Synthetic Data Powers Accurate Recommendations Without Risk

A new approach uses GANs or VAEs to generate synthetic customer behavior data for training recommendation engines. This eliminates privacy risks and regulatory burdens while maintaining performance, as demonstrated by a German bank's 73% drop in data exposure incidents.

Apr 7, 202682% relevant

Algorithmic Trust and Compliance: A New Framework for Visibility in Generative AI Search

A new arXiv study introduces Generative Engine Optimization (GEO), a framework for optimizing content for AI search engines. It finds AI exhibits a strong bias towards authoritative, third-party sources, making compliance and trust signals critical for visibility in regulated sectors.

Mar 16, 202672% relevant

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

Mar 12, 202695% relevant

The Unlearning Illusion: New Research Exposes Critical Flaws in AI Memory Removal

Researchers reveal that current methods for making AI models 'forget' information are surprisingly fragile. A new dynamic testing framework shows that simple query modifications can recover supposedly erased knowledge, exposing significant safety and compliance risks.

Mar 13, 202695% relevant

SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707

Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.

Apr 28, 202682% relevant

China's OpenClaw Mandate: Subsidies, Quotas, and Firing for Non-Use

In China, OpenClaw ('raising lobsters') is subsidized by Shenzhen and mandated for daily employee tasks, with non-use leading to termination. Meanwhile, using OpenAIClaw elsewhere risks firing. This signals a stark AI adoption divide.

Apr 28, 202677% relevant

Adobe, NVIDIA, WPP Launch Enterprise AI Agents for Marketing with OpenShell

NVIDIA expands collaborations with Adobe and WPP to build agentic AI systems for enterprise marketing workflows. The stack uses NVIDIA's OpenShell runtime to enforce security and policy compliance in multi-step creative and customer experience tasks.

Apr 20, 2026100% relevant

Agentic AI in Retail: Experts Warn Against Shifting Liability to Consumers

Industry experts warn that the rush to implement agentic AI in retail carries significant risk. If brands attempt to shift liability for AI mistakes onto customers, they could erode hard-won consumer trust and face increased regulatory scrutiny.

Apr 14, 202686% relevant

Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex

Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.

Apr 2, 202699% relevant

What Anthropic's Subprocessor Changes Mean for Your Claude Code Data

Anthropic updated its third-party data processors. For Claude Code users, this means enhanced security, better compliance tools, and a signal to audit your own data handling.

Mar 27, 202690% relevant

Pentagon to Integrate Palantir's AI Platform as Core Military System, Despite Anthropic Supply Chain Concerns

The Pentagon is moving to adopt Palantir's AI platform as a core system for military operations. This comes despite reported complications involving Anthropic's Claude AI, which was recently flagged as a supply chain risk.

Mar 23, 202685% relevant

AgentDrift: How Corrupted Tool Data Causes Unsafe Recommendations in LLM Agents

New research reveals LLM agents making product recommendations can maintain ranking quality while suggesting unsafe items when their tools provide corrupted data. Standard metrics like NDCG fail to detect this safety drift, creating hidden risks for high-stakes applications.

Mar 16, 202695% relevant

Claude AI Transforms Financial Analysis: From Public Filings to DCF Models in Minutes

Anthropic's Claude AI can now perform complex financial analysis comparable to a Goldman Sachs analyst, building detailed DCF models, earnings breakdowns, and sector risk reports from public filings in minutes using specialized prompts.

Mar 14, 202685% relevant

Data Readiness, Not Speed, Is the Critical Factor for AI Shopping Assistant Success

Experts warn that the biggest risk with AI shopping assistants is deploying before the organization is ready. Success hinges on unified data and security, not just rapid implementation, as shown by significant revenue influenced by these tools.

Mar 12, 202678% relevant

AI Database Optimization: A Cautionary Tale for Luxury Retail's Critical Systems

AI agents can autonomously rewrite database queries to improve performance, but unsupervised deployment in production systems carries significant risks. For luxury retailers, this technology requires careful governance to avoid customer-facing disruptions.

Mar 6, 202660% relevant

Beyond Accuracy: Implementing AI Auditing Frameworks for Trustworthy Luxury Retail

A practical framework for auditing AI systems across five critical dimensions—accuracy, data adequacy, bias, compliance, and security—is essential for luxury retailers deploying customer-facing AI. This governance approach prevents brand damage and regulatory penalties while building consumer trust.

Mar 4, 202675% relevant

U.S. Military Declares Anthropic a National Security Threat in Unprecedented AI Crackdown

The U.S. Department of War has designated Anthropic as a supply-chain risk to national security, banning military contractors from conducting business with the AI company. This dramatic move signals escalating government concerns about AI safety and control.

Feb 27, 202695% relevant

Goldman Sachs Bets on Claude AI for Banking's Backbone Operations

Goldman Sachs is deploying Anthropic's Claude AI model to automate critical back-office functions like trade accounting and client onboarding. This strategic move signals a major shift in how elite financial institutions leverage generative AI for operational efficiency and risk reduction.

Feb 17, 202678% relevant

Agentic AI's Real Win: Automating Bank Grunt Work, Not Flashy Demos

Agentic AI's sweet spot is automating banking grunt work, cutting processing time by 70%. Google Cloud leads enterprise deployments; the value is cost savings, not flashy demos.

May 8, 202692% relevant

Google Beats Apple to AI Health Coach With Gemini-Powered Fitbit App

Google released an AI health coach using Gemini, beating Apple to market. The coach integrates fitness, sleep, nutrition, cycle tracking, weather, and U.S. medical records.

May 7, 202682% relevant

Anthropic Ships 10 Finance AI Agents as IPO Race with OpenAI Heats Up

Anthropic released 10 finance AI agents with Moody's data connectors. The launch intensifies the IPO race with OpenAI, backed by a $1.5B private equity JV.

May 5, 202698% relevant

Anthropic Launches Wall Street Agents, $1.5B JV with Blackstone

Anthropic launched financial services AI agents on Claude Opus 4.7 and a $1.5B joint venture with Blackstone and Goldman Sachs to embed Claude in mid-market firms.

May 5, 2026100% relevant

Trump Team Weighs Pre-Release AI Model Review Process

Trump admin discusses AI working group for pre-release model review. Briefed Anthropic, Google, OpenAI; no executive order yet.

May 5, 2026100% relevant

How a Custom Multimodal Transformer Beat a Fine-Tuned LLM for Attribute

LeBonCoin's ML team built a custom late-fusion transformer that uses pre-computed visual embeddings and character n-gram text vectors to predict ad attributes. It outperformed a fine-tuned VLM while running on CPU with sub-200ms latency, offering calibrated probabilities and 15-minute retraining cycles.

Apr 29, 2026100% relevant

New Thesis Exposes Critical Flaws in Recommender System Fairness Metrics —

This thesis systematically analyzes offline fairness evaluation measures for recommender systems, revealing flaws in interpretability, expressiveness, and applicability. It proposes novel evaluation approaches and practical guidelines for selecting appropriate measures, directly addressing the confusion caused by un-validated metrics.

Apr 29, 202684% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety