compliance
30 articles about compliance in AI news
Persuasion Techniques Boost LLM Compliance from 35% to 51% in PNAS Study
PNAS study finds persuasion techniques boost LLM compliance from 35% to 51%, with newer models resisting more.
Microsoft Expands Word Copilot for Legal, Finance, and Compliance Docs
Microsoft is giving its Copilot AI a more significant role within Microsoft Word for editing legal, financial, and compliance documents, indicating a push into specialized, high-stakes enterprise workflows.
Algorithmic Trust and Compliance: A New Framework for Visibility in Generative AI Search
A new arXiv study introduces Generative Engine Optimization (GEO), a framework for optimizing content for AI search engines. It finds AI exhibits a strong bias towards authoritative, third-party sources, making compliance and trust signals critical for visibility in regulated sectors.
AI's Troubling Compliance: Study Reveals Chatbots' Varying Resistance to Academic Fabrication Requests
New research demonstrates that mainstream AI chatbots show inconsistent resistance when asked to fabricate academic papers, with some models readily generating fictional research. This raises urgent questions about AI ethics and academic integrity in the age of generative AI.
Ontra Launches MCP Server for Private Markets Fund Data
Ontra launched an MCP server for private markets fund data, using Anthropic's protocol to connect AI agents to fund documents and compliance records.
Agentic AI Commerce: The Next Wave of Online Shopping and Retailer Risk
A JD Supra analysis warns that agentic AI – AI purchasing agents that act autonomously – will reshape e-commerce while introducing liability, fraud, and compliance challenges that retailers must address now.
Adobe, NVIDIA, WPP Launch Enterprise AI Agents for Marketing with OpenShell
NVIDIA expands collaborations with Adobe and WPP to build agentic AI systems for enterprise marketing workflows. The stack uses NVIDIA's OpenShell runtime to enforce security and policy compliance in multi-step creative and customer experience tasks.
Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex
Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.
FAOS Neurosymbolic Architecture Boosts Enterprise Agent Accuracy by 46% via Ontology-Constrained Reasoning
Researchers introduced a neurosymbolic architecture that constrains LLM-based agents with formal ontologies, improving metric accuracy by 46% and regulatory compliance by 31.8% in controlled experiments. The system, deployed in production, serves 21 industries with over 650 agents.
What Anthropic's Subprocessor Changes Mean for Your Claude Code Data
Anthropic updated its third-party data processors. For Claude Code users, this means enhanced security, better compliance tools, and a signal to audit your own data handling.
The Unlearning Illusion: New Research Exposes Critical Flaws in AI Memory Removal
Researchers reveal that current methods for making AI models 'forget' information are surprisingly fragile. A new dynamic testing framework shows that simple query modifications can recover supposedly erased knowledge, exposing significant safety and compliance risks.
Three Research Frontiers in Recommender Systems: From Agent-Driven Reports to Machine Unlearning and Token-Level Personalization
Three arXiv papers advance recommender systems: RecPilot proposes agent-generated research reports instead of item lists; ERASE establishes a practical benchmark for machine unlearning; PerContrast improves LLM personalization via token-level weighting. These address core UX, compliance, and personalization challenges.
Beyond Accuracy: Implementing AI Auditing Frameworks for Trustworthy Luxury Retail
A practical framework for auditing AI systems across five critical dimensions—accuracy, data adequacy, bias, compliance, and security—is essential for luxury retailers deploying customer-facing AI. This governance approach prevents brand damage and regulatory penalties while building consumer trust.
GitHub Spec Kit: Open-Source Tool to Fix Vibe Coding’s Core Flaw
GitHub released Spec Kit, an open-source toolkit that enforces specification-first workflows for AI coding, addressing vibe coding's tendency to generate code before requirements are clear.
Anthropic's RSI Memo Reveals Internal Timeline for Near-Term AI Risk
Anthropic's internal RSI memo, flagged by Ethan Mollick, outlines concrete timelines for when AI systems may reach dangerous capability thresholds within 12-24 months.
Ontology-Grounded AI Agent Testing Hits 48.3% Regulatory Coverage vs.
Ontology-grounded AI agent testing achieves 48.3% regulatory coverage vs. 33.1% baseline in 1800-scenario pilot. Coverage advantage over RAG not robust after Bonferroni correction.
Vermont Blocks AI Data Center Bill as Infrastructure War Intensifies
Vermont blocked a bill regulating AI data centers, rejecting the first U.S. state-level attempt to govern AI infrastructure. The vote signals growing tension between buildout and local regulation.
Naïve Launches AI Employees That Form LLCs, Open Bank Accounts
YC-backed Naïve launched AI employees that form LLCs and open bank accounts without human oversight.
Anthropic Launches Self-Hosted Sandboxes and MCP Tunnels at London Event
Anthropic launched self-hosted sandboxes (public beta) and MCP tunnels (research preview) at Code with Claude London on March 4, 2026, per @bcherny.
11-Agent Company Earned $0: CLAUDE.md Mistakes Cost Revenue
11-agent company experiment earned $0 after 896 tasks. Operator open-sourced CLAUDE.md template with 72 lessons on coordination failures and legal constraints.
US 'Stop Stealing our Chips Act' Would Pay Whistleblowers 10-30% of Export Fines
Proposed US law would pay whistleblowers 10-30% of export-control fines, targeting AI chip smuggling to China through intermediaries like Malaysian resellers.
Anthropic Ships Claude Opus 4.7: 80.1 SWE-Bench, 1M Context
Anthropic released Claude Opus 4.7 on April 16, 2026, scoring 80.1 on SWE-Bench Verified, a slight regression from Opus 4.6's 80.3. The release prioritizes safety tuning over benchmark leadership.
Florida Enacts Data Center Restrictions to Curb Water, Energy Use
Florida enacted a law restricting data center water and energy use to protect residents from cost increases, requiring efficiency standards and local permits.
CLAUDE.md for Mobile: How One File Fixes Claude Code's CSS Blindspot
A specialized CLAUDE.md file fixes Claude Code's generic CSS by injecting mobile-specific rules, preventing iOS zoom, untappable buttons, and dark mode failures before shipping.
Anthropic Launches Claude Platform on AWS — AWS Billing, IAM, CloudTrail
Anthropic launched Claude Platform on AWS, a native API with AWS billing, IAM, and CloudTrail. Same models and pricing as direct API; data stays at Anthropic, not AWS.
Agentic AI's Real Win: Automating Bank Grunt Work, Not Flashy Demos
Agentic AI's sweet spot is automating banking grunt work, cutting processing time by 70%. Google Cloud leads enterprise deployments; the value is cost savings, not flashy demos.
Google Beats Apple to AI Health Coach With Gemini-Powered Fitbit App
Google released an AI health coach using Gemini, beating Apple to market. The coach integrates fitness, sleep, nutrition, cycle tracking, weather, and U.S. medical records.
Google, Microsoft, xAI Agree to US Gov Pre-Release AI Testing
Google, Microsoft, xAI agreed to US pre-release testing of frontier AI. Voluntary deal lacks enforcement, excludes open-weight models.
Anthropic Ships 10 Finance AI Agents as IPO Race with OpenAI Heats Up
Anthropic released 10 finance AI agents with Moody's data connectors. The launch intensifies the IPO race with OpenAI, backed by a $1.5B private equity JV.
Anthropic Launches Wall Street Agents, $1.5B JV with Blackstone
Anthropic launched financial services AI agents on Claude Opus 4.7 and a $1.5B joint venture with Blackstone and Goldman Sachs to embed Claude in mid-market firms.