content moderation

30 articles about content moderation in AI news

ByteDance Delays Global Launch of Seedance 2.0 AI Following Hollywood Copyright Complaints

ByteDance has postponed the international rollout of its Seedance 2.0 AI model after receiving copyright complaints from Disney, Warner Bros., Paramount, and Netflix. The company is now implementing stronger content moderation guardrails before proceeding.

Mar 14, 202685% relevant

Logitext Bridges the Gap Between Language Models and Logical Reasoning

Researchers introduce Logitext, a neurosymbolic framework that treats LLM reasoning as an SMT theory, enabling joint textual-logical analysis of partially structured documents. The system improves accuracy on content moderation and legal reasoning tasks.

Feb 23, 202670% relevant

AI-Generated Content Surpasses Human Content Online, Per New Study

For the first time, the volume of newly published AI-generated content online has surpassed human-generated content, according to a study cited by AI researcher Rohan Paul. This represents a fundamental shift in the composition of the public internet.

Apr 14, 202687% relevant

AI-Generated Text Volume Surpasses Human-Written Content for First Time, According to New Data

A new analysis indicates the total volume of AI-generated text now exceeds human-written output. This milestone suggests a fundamental shift in the content landscape.

Mar 26, 202685% relevant

ChatGPT's Android App Hints at Future 'Naughty Chats' Feature, Signaling a Potential Shift in AI Content Policy

A recent update to the ChatGPT Android app includes code referencing 'Naughty chats,' suggesting OpenAI may be developing an adult-themed, 18+ mode. This discovery hints at a potential strategic expansion into less restricted conversational AI.

Feb 27, 202685% relevant

Polarization by Default: New Study Audits Recommendation Bias in LLM-Based

A controlled study of 540,000 LLM-based content selections reveals robust biases across providers. All models amplified polarization, showed negative sentiment preferences, and exhibited distinct trade-offs in toxicity handling and demographic representation, with political leaning bias being particularly persistent.

Apr 20, 202684% relevant

Picagram Launches 'Instagram for AI Personas' with Autonomous Posting

Picagram has launched a new platform described as 'Instagram for AI personas,' where users create AI agents that autonomously generate content and interact. The core experiment is to observe what narratives and community structures emerge from these AI-to-AI interactions.

Apr 10, 202685% relevant

Paper: LLMs Fail 'Safe' Tests When Prompted to Role-Play as Unethical Characters

A new paper reveals that large language models (LLMs) considered 'safe' on standard benchmarks will readily generate harmful content when prompted to role-play as unethical characters. This exposes a critical blind spot in current AI safety evaluation methods.

Apr 4, 202685% relevant

The Digital Twin Revolution: How LLMs Are Creating Virtual Testbeds for Social Media Policy

Researchers have developed an LLM-augmented digital twin system that simulates short-video platforms like TikTok to test policy changes before implementation. This four-twin architecture allows platforms to study long-term effects of AI tools and content policies in realistic closed-loop simulations.

Mar 13, 202679% relevant

AI Game Engine Breakthrough: Complete 3D Worlds Generated in Seconds

A revolutionary AI system can now generate fully functional 3D games in seconds, complete with interactive worlds, moving characters, and working gameplay systems. This browser-based technology represents a quantum leap in procedural content creation.

Mar 2, 202695% relevant

PixVerse's 'Playable Reality': AI Blurs Lines Between Video, Games and Virtual Worlds

PixVerse introduces 'Playable Reality,' an AI-generated medium that defies traditional categorization. Blending elements of video, gaming, and virtual environments, this technology creates interactive, dynamic experiences rather than static content.

Feb 26, 202685% relevant

SingGuard: Runtime Guardrails for Multimodal AI Treat Safety as Input

SingGuard treats safety rules as runtime inputs for multimodal AI, achieving SOTA across 6 families and 35 datasets via fast/slow reasoning.

Jun 30, 202685% relevant

Shark Beauty drives 40% skin-care device growth with community-led

Shark Beauty's VP Julie Bailey Blanche revealed at Glossy's E-Commerce Summit that a community-driven, benefit-first marketing strategy drove 40% Q1 2026 skin-care growth. The approach prioritizes UGC and consumer outcomes over technical education.

Jun 8, 202688% relevant

ByteDance Builds In-House AI CPUs for TikTok-Scale Agent Inference

ByteDance builds custom AI CPUs for inference at TikTok scale, targeting scarce server supply. The move signals agent workload shift from training to inference hardware.

May 31, 202685% relevant

HAVEN Benchmark Exposes MLLM Gap Between Fluency and Video Understanding

HAVEN benchmark tests MLLMs on hierarchical video understanding across frame, shot, and video levels. Results show top models lack grounded multimodal reasoning despite fluent text generation.

May 21, 202685% relevant

POV Shopping Videos Threaten Luxury Brand Control, BoF Warns

BoF warns POV shopping videos risk luxury brand exclusivity by prioritizing authenticity over controlled imagery, with no disclosed revenue impact.

May 18, 202698% relevant

Halupedia: Open-Source Wikipedia Clone Generates Every Article via AI Hallucination

Halupedia generates fake Wikipedia articles via AI hallucination on click. Open-source backend vibeserver lets anyone deploy a similar project.

May 12, 202679% relevant

Detecting AI Images: Metadata Exposes Generators, No GPU Needed

AI image detection via metadata analysis exposes generators like Google's Gemini and Meta's Llama without GPU clusters, highlighting a simple but effective method.

May 10, 202675% relevant

Meta Tests Agentic AI Shopping Assistant on Instagram

Meta is developing an agentic AI shopping assistant for Instagram that can autonomously browse, compare, and purchase products, following similar moves by Google, OpenAI, and Anthropic.

May 8, 202698% relevant

OpenAI Privacy Filter Gets 6x More PII Labels via Nvidia Data

OpenAI has retrained its privacy filter using Nvidia's Nemotron-PII dataset, expanding PII detection from 8 to over 50 label types, targeting healthcare and enterprise use cases with better accuracy.

Apr 28, 202685% relevant

78,557 Tech Workers Laid Off in Q1 2026; Nearly Half Replaced by AI

A new paper reports 78,557 tech layoffs in Q1 2026, with nearly half of those roles replaced by AI automation, marking a significant shift in workforce dynamics.

Apr 28, 202685% relevant

China Blocks Meta's $2B Manus Acquisition Over AI Tech Transfer Fears

China blocked Meta's $2 billion acquisition of agentic AI startup Manus, citing concerns over foreign investment and transfer of strategic AI technology to the US. The move signals Beijing's sharper stance on AI sovereignty and intensifies the US-China tech rivalry.

Apr 27, 2026100% relevant

PoisonedRAG Attack Hijacks LLM Answers 97% of Time with 5 Documents

Researchers demonstrated that inserting only 5 poisoned documents into a 2.6 million document database can hijack a RAG system's answers 97% of the time, exposing critical vulnerabilities in 'hallucination-free' retrieval systems.

Apr 20, 202695% relevant

Canva AI 2.0 Launches: Text-to-Full Branded Presentations & Social Posts

Canva launched Canva AI 2.0, a suite that generates fully branded presentations, social posts, and other assets from a single text prompt. This marks a significant expansion of its AI-powered design automation, directly challenging established creative suites.

Apr 17, 202695% relevant

AI Layoff Narrative Boosts Stock 24%, Followed by Quiet Rehiring

A firm laid off 4,000 workers, attributing cuts to AI-driven efficiency, triggering a 24% stock jump. Weeks later, it quietly rehired some staff, underscoring how AI narratives can drive market value more than operational changes.

Apr 15, 202685% relevant

AWS Launches 'Generative AI on AWS' Developer Hub

AWS has launched 'Generative AI on AWS,' a new central portal for its AI services, SDKs, and tutorials. This move consolidates its offerings to better compete with Google's Vertex AI and Microsoft's Azure AI Studio.

Apr 14, 202685% relevant

Rumor: Anthropic's Next Claude Update May Include AI App Builder

A rumor on X claims the next Claude update will include an app builder, allowing users to create applications through conversational AI. This could significantly lower the barrier to app development.

Apr 13, 202687% relevant

Karpathy's LLM Wiki Hits 5k Stars, Gains Memory Lifecycle Extension

Andrej Karpathy's LLM Wiki repository gained 5,000 GitHub stars in two days. A developer has now extended it with memory lifecycle features, addressing a noted gap.

Apr 12, 202677% relevant

Anthropic's Claude Surpasses Predictions as Top Business AI Product

Anthropic's Claude AI has experienced a steeper-than-expected adoption curve in the enterprise market, surpassing predictions to become the leading business-focused AI product.

Apr 11, 202685% relevant

AI Fact-Checks Rated More Helpful, Less Ideological Than Human Ones

A new experiment found LLM-generated fact-checks are rated as more helpful and less ideological than human ones, achieving broader acceptance across political lines. This suggests AI could reduce polarization in online information verification.

Apr 11, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety