policy
30 articles about policy in AI news
Tsinghua Researchers Diagnose On-Policy Distillation Failures, Propose Fixes
Researchers from Tsinghua University have pinpointed two necessary conditions for successful on-policy distillation: compatible thinking patterns and novel teacher capabilities. They propose two recovery methods to salvage failing distillation runs.
Linux Kernel Adopts AI Code Policy: Developers Must Disclose, Remain Liable
The Linux kernel project has established a formal policy permitting AI-assisted code contributions, requiring strict developer disclosure. Crucially, the human developer retains full legal and technical liability for any submitted code, treating AI as just another tool.
Claude Code's Auto-Close Policy: What It Means for Your Bug Reports
Claude Code's GitHub repo automatically closes inactive issues after 14 days. Understand this policy to ensure your bug reports get attention.
OpenAI Publishes 'Intelligence Age' Policy Blueprint for Superintelligence Transition
OpenAI published a policy blueprint outlining governance and economic proposals for the 'Intelligence Age,' framing superintelligence as an active transition requiring new safety nets and international coordination.
Anthropic Forms Corporate PAC to Influence AI Policy Ahead of Midterms
Anthropic is forming a corporate PAC to lobby on AI policy, signaling a strategic shift towards direct political engagement as regulatory debates intensify in Washington. This move follows similar efforts by OpenAI and Google.
The Digital Twin Revolution: How LLMs Are Creating Virtual Testbeds for Social Media Policy
Researchers have developed an LLM-augmented digital twin system that simulates short-video platforms like TikTok to test policy changes before implementation. This four-twin architecture allows platforms to study long-term effects of AI tools and content policies in realistic closed-loop simulations.
Secure Your Claude Code MCP Servers with Real-Time Policy Controls
SurePath AI's new MCP Policy Controls let you govern which MCP servers Claude Code can access, enabling secure adoption of powerful tools.
JPMorgan CEO Warns AI Unemployment Could Spark Civil Unrest, Calls for Policy Intervention
JPMorgan CEO Jamie Dimon warns that AI-driven mass unemployment could lead to civil unrest, urging policymakers to prepare for economic disruption. His remarks signal growing concern among corporate leaders about AI's societal impact.
One Policy to Rule Them All: AI Robot Masters Unseen Tools with Zero-Shot Generalization
Researchers have developed a single robot policy capable of manipulating diverse, never-before-seen tools using sim-to-real reinforcement learning. The system achieves zero-shot generalization across 24 tasks, 12 objects, and 6 tool categories without object-specific training.
The AI Policy Tsunami: How Governments Worldwide Are Scrambling to Regulate Artificial Intelligence
As AI capabilities accelerate, policymakers face an overwhelming array of regulatory challenges spanning data centers, military applications, privacy, mental health impacts, job displacement, and ethical standards. The rapid pace of development is creating a governance gap that neither governments nor AI labs can adequately address.
The AI Policy Gap: Why Governments Are Struggling to Keep Pace with Rapid Technological Change
AI expert Ethan Mollick warns that rapid AI advancements combined with knowledge gaps and uncertain futures are leading to reactive, scattered policy responses rather than coherent governance frameworks.
Anthropic Tightens Security: OAuth Tokens Banned from Third-Party Tools in Major Policy Shift
Anthropic has implemented a significant security policy change, prohibiting the use of OAuth tokens and its Agent SDK in third-party tools. This move comes amid growing enterprise adoption and heightened security concerns in the AI industry.
Sam Altman Advocates for 32-Hour Work Week in AI-Driven Policy Paper
Sam Altman has proposed a 4-day, 32-hour work week as part of a new social contract, reflecting a growing trend among executives to advocate for reduced working hours in the age of AI.
Google's Cookie Policy Update and the Challenge of AI-Powered Personalization
Google has updated its user-facing cookie and data consent interface, emphasizing its use of data for personalization and ad measurement. This reflects the ongoing tension between data-driven AI services and user privacy, a critical issue for luxury retail's digital transformation.
Add Deterministic Guardrails to Claude Code with Signet-eval's Policy Engine
Signet-eval adds a seatbelt to Claude Code, letting you enforce spending limits, block destructive commands, and gate credentials using deterministic rules, with no LLM in the decision loop.
ChatGPT's Android App Hints at Future 'Naughty Chats' Feature, Signaling a Potential Shift in AI Content Policy
A recent update to the ChatGPT Android app includes code referencing 'Naughty chats,' suggesting OpenAI may be developing an adult-themed, 18+ mode. This discovery hints at a potential strategic expansion into less restricted conversational AI.
Anthropic Publishes US-China AI Competition Blueprint
Anthropic published a policy paper on US-China AI competition, warning the US lead could erode within 3-5 years without strategic action including export controls and talent investment.
Adobe, NVIDIA, WPP Launch Enterprise AI Agents for Marketing with OpenShell
NVIDIA expands collaborations with Adobe and WPP to build agentic AI systems for enterprise marketing workflows. The stack uses NVIDIA's OpenShell runtime to enforce security and policy compliance in multi-step creative and customer experience tasks.
OpenAI Proposes 4-Day Week, Robot Tax Amid Rising Anti-AI Violence
Following violent attacks on CEO Sam Altman, OpenAI has published a policy paper proposing a new social contract, including a four-day workweek and AI dividends, to address rising public anxiety over AI's societal impact.
OpenClaw-RL Enables Live RL Training for Self-Hosted AI Agents
OpenClaw-RL introduces a system for performing asynchronous reinforcement learning on self-hosted models within the OpenClaw agent framework, allowing continuous policy improvement while the agent remains online.
RLSD Unifies Self-Distillation & Verifiable Rewards to Fix RL Leakage
Researchers propose RLSD, a method merging on-policy self-distillation with verifiable rewards to fix information leakage and training instability in language model reinforcement learning.
Dubai Mandates AI-Powered Virtual Worship for All Churches on Easter
Dubai issued a directive moving all church, temple, and gurdwara services exclusively online for Easter Sunday, leveraging its digital infrastructure to enforce a 'safest city' policy during a major religious event.
The Self Driving Portfolio: Agentic Architecture for Institutional Asset Management
Researchers propose an 'agentic strategic asset allocation pipeline' using ~50 specialized AI agents to forecast markets, construct portfolios, and self-improve. The system is governed by a traditional Investment Policy Statement, aiming to automate high-level asset management.
Claude Paid Subscribers More Than Double in Under Six Months, Credit Card Data Shows
Paid subscriptions for Anthropic's Claude have more than doubled in less than six months, driven by Super Bowl ads, a DoD policy stance, and new coding features. ChatGPT still leads in overall user base.
CanViT: First Active-Vision Foundation Model Hits 45.9% mIoU on ADE20K with Sequential Glimpses
Researchers introduce CanViT, the first task- and policy-agnostic Active-Vision Foundation Model (AVFM). It achieves 38.5% mIoU on ADE20K segmentation with a single low-resolution glimpse, outperforming prior active models while using 19.5x fewer FLOPs.
AIGQ: Taobao's End-to-End Generative Architecture for E-commerce Query Recommendation
Alibaba researchers propose AIGQ, a hybrid generative framework for pre-search query recommendations. It uses list-level fine-tuning, a novel policy optimization algorithm, and a hybrid deployment architecture to overcome traditional limitations, showing substantial online improvements on Taobao.
ServiceNow CEO Bill McDermott Predicts AI Agents Could Drive Unemployment to 30%+
ServiceNow CEO Bill McDermott warns AI agents could push unemployment into the mid-30% range within years, with graduate unemployment already at 9%. He says policymakers are unprepared for the speed of this transition.
SPREAD Framework Solves AI's 'Catastrophic Forgetting' Problem in Lifelong Learning
Researchers have developed SPREAD, a new AI framework that preserves learned skills across sequential tasks by aligning policy representations in low-rank subspaces. This breakthrough addresses catastrophic forgetting in lifelong imitation learning, enabling more stable and robust AI agents.
Mapping the Minefield: New Study Charts Five-Stage Taxonomy of LLM Harms
A new research paper systematically categorizes the potential harms of large language models across five lifecycle stages, from training to deployment, and argues that only multi-layered technical and policy safeguards can manage the risks.
MLLMRec-R1: A New Framework for Efficient Multimodal Sequential Recommendation with LLMs
Researchers propose MLLMRec-R1, a framework that makes Group Relative Policy Optimization (GRPO) practical for multimodal sequential recommendation by addressing computational cost and reward inflation issues. This enables more explainable, reasoning-based recommendations.