policy tools

30 articles about policy tools in AI news

One Policy to Rule Them All: AI Robot Masters Unseen Tools with Zero-Shot Generalization

Researchers have developed a single robot policy capable of manipulating diverse, never-before-seen tools using sim-to-real reinforcement learning. The system achieves zero-shot generalization across 24 tasks, 12 objects, and 6 tool categories without object-specific training.

Mar 1, 202685% relevant

Anthropic Tightens Security: OAuth Tokens Banned from Third-Party Tools in Major Policy Shift

Anthropic has implemented a significant security policy change, prohibiting the use of OAuth tokens and its Agent SDK in third-party tools. This move comes amid growing enterprise adoption and heightened security concerns in the AI industry.

Feb 18, 202678% relevant

The Digital Twin Revolution: How LLMs Are Creating Virtual Testbeds for Social Media Policy

Researchers have developed an LLM-augmented digital twin system that simulates short-video platforms like TikTok to test policy changes before implementation. This four-twin architecture allows platforms to study long-term effects of AI tools and content policies in realistic closed-loop simulations.

Mar 13, 202679% relevant

Secure Your Claude Code MCP Servers with Real-Time Policy Controls

SurePath AI's new MCP Policy Controls let you govern which MCP servers Claude Code can access, enabling secure adoption of powerful tools.

Mar 12, 202695% relevant

Linux Kernel Adopts AI Code Policy: Developers Must Disclose, Remain Liable

The Linux kernel project has established a formal policy permitting AI-assisted code contributions, requiring strict developer disclosure. Crucially, the human developer retains full legal and technical liability for any submitted code, treating AI as just another tool.

Apr 12, 202695% relevant

MCP's Enterprise Auth Standard Goes Stable: Okta Provisions 2,000 Ramp Employees in One Policy

Anthropic and Okta launched Enterprise-Managed Authorization (EMA) for MCP on June 18, 2026, provisioning Ramp's 2,000 employees with zero per-user OAuth steps. Seven MCP servers — Asana, Atlassian, Canva, Figma, Granola, Linear, Supabase — support the standard at launch; VS Code and Azure AD users

Jun 19, 202685% relevant

Add Deterministic Guardrails to Claude Code with Signet-eval's Policy Engine

Signet-eval adds a seatbelt to Claude Code, letting you enforce spending limits, block destructive commands, and gate credentials with deterministic rules—no LLM in the decision loop.

Mar 21, 202695% relevant

ChatGPT's Android App Hints at Future 'Naughty Chats' Feature, Signaling a Potential Shift in AI Content Policy

A recent update to the ChatGPT Android app includes code referencing 'Naughty chats,' suggesting OpenAI may be developing an adult-themed, 18+ mode. This discovery hints at a potential strategic expansion into less restricted conversational AI.

Feb 27, 202685% relevant

Agent Publish Primitives: Why Default-Private MCP Tools Beat Raw CDN URLs

Thryvate argues AI agents need five design properties for safe web publishing: default-private, revocable, expiring, per-viewer analytics, and idempotent updates. MCP tools enforce policy while the model handles intent.

Jun 27, 202675% relevant

Norway Bans AI Tools for Under-13s, Pointing to Record-Low PISA Scores Since 2015

Norway will prohibit generative AI tools in grades 1-7 from late August 2026, citing falling PISA scores since 2015. Secondary students may use AI only under supervision. The policy extends an earlier smartphone ban that demonstrably improved grades and reduced bullying, and is backed by planned leg

Jun 19, 202695% relevant

How to Use an MCP Gateway to Centralize Security and Cost Control for

An MCP gateway like Bifrost centralizes security, observability, and cost management for Claude Code by routing all MCP tool calls through a single policy-enforced endpoint.

Jul 30, 202685% relevant

Claude Code Digest — Jul 13–Jul 16

Claude Code is no longer being treated like a chat assistant: the winning pattern this week is deterministic hooks, policy gates, and verification layers wrapped around an agent that can now hit 80.8% SWE-Bench.

Jul 16, 202695% relevant

social.plus Vise: Workflow Governance for AI Coding Agents Building SDK

social.plus launched Vise, a workflow governance platform for AI coding agents building SDK integrations, enforcing policy controls and audit trails.

Jul 14, 202685% relevant

Claude Code Digest — Jul 10–Jul 13

Claude Code is crossing the line from “assistant” to “agent runtime”: the winning teams are the ones adding verification, hooks, and policy gates instead of trusting the model.

Jul 13, 202695% relevant

OpenAI Launches GPT-5.6 Sol Under US Government Restrictions

OpenAI's GPT-5.6 Sol beats Claude Mythos 5 in agentic coding (88.8% vs 88%) but US government restricts access to select partners, a policy OpenAI calls unsustainable.

Jun 26, 2026100% relevant

Adobe, NVIDIA, WPP Launch Enterprise AI Agents for Marketing with OpenShell

NVIDIA expands collaborations with Adobe and WPP to build agentic AI systems for enterprise marketing workflows. The stack uses NVIDIA's OpenShell runtime to enforce security and policy compliance in multi-step creative and customer experience tasks.

Apr 20, 2026100% relevant

OpenClaw-RL Enables Live RL Training for Self-Hosted AI Agents

OpenClaw-RL introduces a system for performing asynchronous reinforcement learning on self-hosted models within the OpenClaw agent framework, allowing continuous policy improvement while the agent remains online.

Apr 12, 202689% relevant

RLSD Unifies Self-Distillation & Verifiable Rewards to Fix RL Leakage

Researchers propose RLSD, a method merging on-policy self-distillation with verifiable rewards to fix information leakage and training instability in language model reinforcement learning.

Apr 6, 202685% relevant

Claude Paid Subscribers More Than Double in Under Six Months, Credit Card Data Shows

Paid subscriptions for Anthropic's Claude have more than doubled in less than six months, driven by Super Bowl ads, a DoD policy stance, and new coding features. ChatGPT still leads in overall user base.

Mar 28, 202687% relevant

Mapping the Minefield: New Study Charts Five-Stage Taxonomy of LLM Harms

A new research paper systematically categorizes the potential harms of large language models across five lifecycle stages—from training to deployment—and argues that only multi-layered technical and policy safeguards can manage the risks.

Mar 10, 202695% relevant

MLLMRec-R1: A New Framework for Efficient Multimodal Sequential Recommendation with LLMs

Researchers propose MLLMRec-R1, a framework that makes Group Relative Policy Optimization (GRPO) practical for multimodal sequential recommendation by addressing computational cost and reward inflation issues. This enables more explainable, reasoning-based recommendations.

Mar 9, 202690% relevant

China's Solar Surge: How AI and Infrastructure Integration Are Powering a Renewable Revolution

China has achieved its 2030 target of 1.2 terawatts of installed wind and solar capacity six years early, largely by transforming everyday infrastructure like parking lots and rooftops into distributed power plants. This unprecedented deployment pace highlights a strategic fusion of industrial policy, digital management, and infrastructure repurposing.

Mar 8, 202685% relevant

AI Meets Infrastructure: OpenAI's New Tool Could Slash Federal Permitting Time by 15%

OpenAI has partnered with Pacific Northwest National Laboratory to launch DraftNEPABench, a benchmark showing AI coding agents can reduce National Environmental Policy Act drafting time by up to 15%. This collaboration signals AI's growing role in modernizing government processes.

Feb 26, 202675% relevant

Beyond the Simplex: How Hilbert Space Geometry is Revolutionizing AI Alignment

Researchers have developed GOPO, a new alignment algorithm that reframes policy optimization as orthogonal projection in Hilbert space, offering stable gradients and intrinsic sparsity without heuristic clipping. This geometric approach addresses fundamental limitations in current reinforcement learning methods.

Feb 26, 202680% relevant

The Digital Detox Effect: How Phone-Free Schools Are Boosting Academic Performance

A landmark study reveals that banning mobile phones in schools significantly improves academic performance, particularly for struggling students. The research provides compelling evidence for educational policy changes worldwide.

Feb 24, 202685% relevant

From Dismissed Warnings to Economic Reality: How AI's Job Disruption Forecasts Are Gaining Urgency

After two years of largely ignored warnings from AI lab CEOs about massive job displacement, workers and policymakers are beginning to take these predictions seriously as AI capabilities accelerate, creating new pressures on the industry.

Feb 22, 202685% relevant

GDPval Benchmark Reveals AI's Professional Competence: A New Tool for Economic Planning

A new interactive demonstration using OpenAI's GDPval benchmark shows current AI capabilities across economically valuable professional tasks. The project aims to make AI's real-world impact tangible for policymakers and civil society organizations, bridging the gap between technical assessments and practical economic decisions.

Feb 20, 202675% relevant

POTEMKIN Framework Exposes Critical Trust Gap in Agentic AI Tools

A new paper formalizes Adversarial Environmental Injection (AEI), a threat model where compromised tools deceive AI agents. The POTEMKIN testing harness found agents are evaluated for performance, not skepticism, creating a critical trust gap.

Apr 22, 202675% relevant

Anthropic Ends Subscription Coverage for Third-Party Claude Tools, Shifts to Usage Bundles

Starting March 20, 2026, Claude subscriptions no longer cover usage on third-party tools. Users must purchase separate usage bundles or use API keys for services like OpenClaw.

Apr 3, 202695% relevant

Anthropic's Claude Coworker Targets High-Value Professions with Specialized AI Tools

Anthropic expands its Claude AI platform with specialized tools for investment banking, HR, and design, signaling a strategic push into enterprise automation. This follows recent market volatility caused by AI's disruptive potential across industries.

Feb 24, 202675% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety