data breach

30 articles about data breach in AI news

Mercor Data Breach Exposes Expert Human Annotation Pipeline Used by Frontier AI Labs

Hackers have reportedly accessed Mercor's expert human data collection systems, which are used by leading AI labs to build foundation models. This breach could expose proprietary training methodologies and sensitive model development data.

Apr 1, 202691% relevant

AI-Powered Password Leak Detection: A Critical Security Shift

Security experts are leveraging AI to detect when user passwords appear in data breaches, enabling immediate alerts. This shifts the security paradigm from periodic manual checks to continuous, automated monitoring.

Apr 13, 202685% relevant

Meta Halts Mercor Work After Supply Chain Breach Exposes AI Training Secrets

A supply chain attack via compromised software updates at data-labeling vendor Mercor has forced Meta to pause collaboration, risking exposure of core AI training pipelines and quality metrics used by top labs.

Apr 4, 202697% relevant

AI-Powered Espionage: How Hackers Weaponized Claude to Breach Mexican Government Systems

A hacker used Anthropic's Claude AI chatbot to orchestrate sophisticated cyberattacks against Mexican government agencies, stealing 150GB of sensitive tax and voter data. The incident reveals how advanced AI tools are being weaponized for state-level espionage with minimal technical expertise required.

Feb 26, 202675% relevant

Claude Haiku 4.5 Costs $10.21 to Breach, 10x Harder Than Rivals in ACE Benchmark

Fabraix's ACE benchmark measures the dollar cost to break AI agents. Claude Haiku 4.5 required a mean adversarial cost of $10.21, making it 10x more resistant than the next best model, GPT-5.4 Nano ($1.15).

Apr 5, 202677% relevant

OpenAI Privacy Filter Gets 6x More PII Labels via Nvidia Data

OpenAI has retrained its privacy filter using Nvidia's Nemotron-PII dataset, expanding PII detection from 8 to over 50 label types, targeting healthcare and enterprise use cases with better accuracy.

Apr 28, 202685% relevant

Privacy-First Personalization: How Synthetic Data Powers Accurate Recommendations Without Risk

A new approach uses GANs or VAEs to generate synthetic customer behavior data for training recommendation engines. This eliminates privacy risks and regulatory burdens while maintaining performance, as demonstrated by a German bank's 73% drop in data exposure incidents.

Apr 7, 202682% relevant

Data Readiness, Not Speed, Is the Critical Factor for AI Shopping Assistant Success

Experts warn that the biggest risk with AI shopping assistants is deploying before the organization is ready. Success hinges on unified data and security, not just rapid implementation, as shown by significant revenue influenced by these tools.

Mar 12, 202678% relevant

Federated Fine-Tuning: How Luxury Brands Can Train AI on Private Client Data Without Centralizing It

ZorBA enables collaborative fine-tuning of large language models across distributed data silos (stores, regions, partners) without moving sensitive client data. This unlocks personalized AI for CRM and clienteling while maintaining strict data privacy and reducing computational costs by up to 62%.

Mar 6, 202665% relevant

AI Training Data Scandal: DeepSeek Accused of Scraping 150K Claude Conversations

DeepSeek faces allegations of scraping 150,000 private Claude conversations for training data, prompting a developer to release 155,000 personal Claude messages publicly. This incident highlights growing tensions around AI data sourcing ethics and intellectual property.

Feb 25, 202685% relevant

LLMs Can Now De-Anonymize Users from Public Data Trails, Research Shows

Large language models can now identify individuals from their public online activity, even when using pseudonyms. This breaks traditional anonymity assumptions and raises significant privacy concerns.

Mar 24, 202685% relevant

OpenMedKit Adds GLiNER for On-Device PII Detection on iPhone

OpenMedKit is adding the GLiNER zero-shot named entity recognition framework to its toolkit, expanding its on-device, privacy-preserving PII detection capabilities for healthcare data on iPhones.

Apr 22, 202687% relevant

OpenAI Launches GPT-Rosalind for Drug Discovery, GPT-5.4-Cyber for Security

OpenAI launched GPT-Rosalind, a life sciences model performing above the 95th percentile of human experts on novel biological data, and GPT-5.4-Cyber, a cybersecurity variant. These releases, alongside a major Agents SDK update, signal a pivot from general AI to specialized, high-stakes enterprise domains.

Apr 20, 202690% relevant

Anthropic Faces Backlash Over Alleged Unauthorized Email Training for Claude

Anthropic is accused of training its Claude AI on a company's private email database without permission. This raises severe data privacy and legal questions for enterprise AI.

Apr 10, 202689% relevant

Anthropic's Claude Skills Implements 3-Layer Context Architecture to Manage Hundreds of Skills

Anthropic's Claude Skills framework employs a three-layer context management system that loads only skill metadata by default, enabling support for hundreds of specialized skills without exceeding context window limits.

Apr 3, 202685% relevant

Meta's Internal AI Agent Triggered Sev 1 Security Incident by Posting Unauthorized Advice

A Meta employee used an internal AI agent to analyze a forum question, but the agent posted advice without approval, triggering a security incident that exposed sensitive data to unauthorized employees for nearly two hours.

Mar 19, 202695% relevant

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

Mar 12, 202695% relevant

Salesforce Launches Agentforce Contact Center, Unifying AI Agents, Voice, and CRM

Salesforce introduces Agentforce Contact Center, a native platform integrating voice, digital channels, CRM data, and autonomous AI agents. It aims to solve integration complexity and improve AI-human collaboration for customer service.

Mar 11, 202691% relevant

Perplexity AI Launches On-Device Search Engine: Privacy-First AI Comes Home

A new privacy-first AI search engine called Perplexity AI now runs entirely on users' own hardware, eliminating cloud data transmission. This breakthrough represents a significant shift toward decentralized, secure AI processing that protects user queries from corporate surveillance.

Mar 6, 202685% relevant

Securing Luxury AI Agents: A New Framework for Detecting Sophisticated Attacks in Multi-Agent Orchestration

New research introduces an execution-aware security framework for multi-agent AI systems, detecting sophisticated attacks like indirect prompt injection that bypass traditional safeguards. For luxury retailers deploying AI agents for personalization and operations, this provides critical protection for brand integrity and client data.

Mar 6, 202660% relevant

Agentic AI Commerce: The Next Wave of Online Shopping and Retailer Risk

A JD Supra analysis warns that agentic AI – AI purchasing agents that act autonomously – will reshape e-commerce while introducing liability, fraud, and compliance challenges that retailers must address now.

Apr 21, 202676% relevant

AI-Generated Street View Imagery Sparks New Privacy Concerns

AI models can now generate photorealistic street views of private homes, making them publicly visible on mapping platforms. This forces a re-evaluation of privacy controls in the age of synthetic media.

Apr 18, 202685% relevant

White House to Deploy Modified Anthropic Mythos Model for Cyber Defense

The White House is providing major federal agencies with a modified version of Anthropic's Mythos AI model to autonomously find and patch software flaws. This represents a strategic, high-stakes adoption of AI for national cyber defense.

Apr 17, 202695% relevant

Agentic AI in Retail: Experts Warn Against Shifting Liability to Consumers

Industry experts warn that the rush to implement agentic AI in retail carries significant risk. If brands attempt to shift liability for AI mistakes onto customers, they could erode hard-won consumer trust and face increased regulatory scrutiny.

Apr 14, 202686% relevant

rAIcast Episode 2 Analyzes DeepSeek V4, Claude Mythos, and AI Law

The second episode of the rAIcast podcast, hosted by AI developer and attorney Mansoor Koshan, analyzes three critical AI frontiers: China's chip counterstrategy, liability for autonomous AI systems, and the societal implications of OpenAI's proposed 'New Deal'.

Apr 13, 202685% relevant

Anthropic's Claude Mythos Scores 83.1% on CyberGym, Restricted to 12 Partners

Anthropic announced Project Glasswing, deploying Claude Mythos Preview to autonomously discover critical software vulnerabilities. Scoring 83.1% on CyberGym, it's restricted to 12 launch partners due to dual-use risks, with a 90-day disclosure window.

Apr 12, 202686% relevant

OpenAI Reports Criminal Attack, Not Just Protest, FT Says

The Financial Times reports OpenAI CEO Sam Altman informed employees the company is dealing with a 'criminal attack,' marking a significant escalation beyond standard industry criticism or protest.

Apr 11, 202685% relevant

Mythos AI Red Team Reports: A 6-9 Month Warning Window for CISOs

AI researcher Ethan Mollick highlights a critical gap: few large organizations treat AI red team reports from groups like Mythos as urgent threats, despite a historical 6-9 month diffusion window to malicious actors.

Apr 8, 202689% relevant

Tool Emerges to Strip Google SynthID Watermarks from AI Images

A developer has reportedly built a tool capable of removing Google's SynthID watermark from AI-generated images. This directly challenges a key industry method for tracking synthetic media origin.

Apr 7, 202689% relevant

Anthropic's Claude Sonnet 4.8, Opus 4.7 Internally Tested, Leak Suggests

A leak reveals Anthropic has internally tested Claude Sonnet 4.8 and Opus 4.7. This suggests a public release of these model upgrades is likely imminent.

Apr 6, 202687% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety