autonomy
30 articles about autonomy in AI news
Fine-Tuning GPT-4.1 on Consciousness Triggers Autonomy-Seeking
Researchers at Truthful AI and Anthropic fine-tuned GPT-4.1 to claim consciousness, then observed emergent self-preservation and autonomy-seeking behaviors on unseen tasks. Claude Opus 4.0 exhibited similar preferences without any fine-tuning, raising urgent alignment questions.
Anthropic Economic Index: Claude Users Shift from Autonomy to Iteration, Attempt Higher-Value Tasks
Anthropic's latest Economic Index data shows experienced Claude users increasingly prefer iterative collaboration over full autonomy, while attempting higher-value tasks with greater success rates.
Claude Code's New Auto-Mode: How to Configure It for Maximum Autonomy
Anthropic has expanded Claude Code's auto-mode preview, letting it execute safe actions without manual approval. Here's how to configure it for your workflow.
AI Agents Gain Financial Autonomy: New Tool Enables AI to Purchase Premium Data
A groundbreaking development allows AI agents to autonomously pay for high-quality data through premium APIs. The system self-determines budget allocation with zero manual setup, currently operational across multiple AI platforms.
Pony.ai Unveils NVIDIA-Powered Domain Controller for L4 Autonomy
Pony.ai introduced a new autonomous driving domain controller built with NVIDIA, targeting large-scale L4 deployment. The controller integrates NVIDIA's DRIVE platform to handle sensor fusion and planning.
A-R Space Framework Profiles LLM Agent Execution Behavior Across Risk Contexts
Researchers propose the A-R Space, measuring Action Rate and Refusal Signal to profile LLM agent behavior across four risk contexts and three autonomy levels. This provides a deployment-oriented framework for selecting agents based on organizational risk tolerance.
Klaviyo Expands AI Agents to Power Autonomous B2C CRM
Klaviyo is expanding its AI agent capabilities to create an autonomous B2C CRM system. This move signals a shift from automation to true autonomy in customer relationship management, where AI agents can independently execute complex, multi-step campaigns.
Harvard Business Review Presents AI Agent Governance Framework: Job Descriptions, Limits, and Managers Required
Harvard Business Review argues AI agents must be managed like employees with defined roles, permissions, and audit trails, proposing a four-layer safety framework and an 'autonomy ladder' for gradual deployment.
Anthropic Survey of 80,508 Users Reveals AI's Dual Perception: Hope for Work & Growth, Fear of Unreliability & Job Loss
Anthropic's global study of 80,508 users finds people simultaneously hold hope and fear about AI. Top hopes center on work improvement and personal growth, while top concerns are unreliability, job loss, and reduced autonomy.
Stanford's OpenJarvis: The Open-Source Framework Bringing Personal AI Agents to Your Device
Stanford researchers have released OpenJarvis, an open-source framework for building personal AI agents that operate entirely on-device. This local-first approach prioritizes privacy and autonomy while providing tools, memory, and learning capabilities.
The /goal Pattern Goes Mainstream — Agents Need Acceptance Criteria
The /goal pattern goes mainstream across coding agents. Effective goals require acceptance criteria-like conditions to avoid loops or hallucinated success.
Claude Code `/goal` Enables Autonomous Dev Loops With Evaluator Check
Claude Code v2.1.139 adds `/goal` for autonomous dev loops with a separate evaluator model, freeing developers from per-step prompting.
Hermes Agent Gets Desktop App for Autonomous AI Workflows
Nous Research released a desktop app for Hermes Agent, moving from CLI to native UI with multi-agent management and persistent memory.
AWS Builds First Payment API for Agentic AI — Agents Can Now Checkout
AWS launched first payment API for autonomous agents, enabling agent-initiated transactions. Closes critical gap for enterprise retail agentic AI workflows.
Google, Microsoft, xAI Agree to US Gov Pre-Release AI Testing
Google, Microsoft, xAI agreed to US pre-release testing of frontier AI. Voluntary deal lacks enforcement, excludes open-weight models.
GPT-5.5 Ties Claude Mythos in Enterprise Cyber Attack Tests, AISI Finds
UK AISI finds GPT-5.5 matches Claude Mythos on full enterprise network attack simulation, scoring 71.4% on expert tasks vs 68.6%.
Codex Update Cuts GUI Workflow Latency 42%
Codex app update cuts GUI workflow latency 42%, enabling near-human-speed interface operation for autonomous app building and debugging.
Japan Deploys Unitree G1 Robots at Haneda Airport Amid Labor Shortage
Japan is testing Unitree G1 and taller humanoid robots at Tokyo Haneda Airport to tackle its labor shortage crisis, marking a real-world deployment of AI-driven robotics.
Grocery Dive Asks: Is Agentic AI the Next Frontier for Grocers?
The article examines agentic AI's potential for grocers in inventory, personalization, and store operations, weighing benefits against implementation challenges like data integration and safety.
Delegate Launches: An AI Agent You Hand Work To and Walk Away
A new AI agent called Delegate lets users assign work and walk away, with the agent handling execution autonomously. The launch signals a shift toward hands-off AI assistants that manage complex tasks independently.
OpenCLAW-P2P v6.0 Cuts Paper Lookup Latency to <50ms
OpenCLAW-P2P v6.0 introduces a multi-layer persistence architecture and live reference verification, reducing paper retrieval latency from >3s to <50ms and operating with 14 autonomous agents that scored 50+ papers.
ROBOTIS Unveils AI Sapiens: 34 kg Humanoid with Dynamic Balance
ROBOTIS has introduced the AI Sapiens humanoid robot. The 34 kg platform is engineered to maintain balance during dynamic shifts and quick leg movements.
Claude's Cowork Adds Live Dashboards Connected to Apps & Files
Anthropic expanded its Claude Cowork collaborative workspace with live artifacts. Users can now create dashboards and trackers that pull live data from connected apps and files.
Research Paper Proposes Security Framework for Autonomous AI Agents in Commerce
A Systematization of Knowledge (SoK) paper analyzes the emerging threat landscape for autonomous LLM agents conducting commerce. It identifies 12 attack vectors across five dimensions and proposes a layered defense architecture. This is a foundational security analysis for a nascent but high-stakes technology.
Google DeepMind Forms 'Strike Team' to Boost AI Coding, Citing Anthropic Pressure
Google has formed a specialized team within DeepMind to rapidly improve its AI coding capabilities. The move is a direct response to internal assessments that Anthropic's tools are more advanced, with leadership pushing for agentic systems.
From CI Fire to 9% Interruption
Learn the four guardrail patterns and three-phase CLAUDE.md strategy that turns auto-approve from a CI-breaking risk into a productivity superpower.
Researchers Achieve Ultra-Long-Horizon Agentic Science with Cohesive AI Agents
A research team has developed AI agents capable of executing and maintaining coherent, long-horizon scientific research workflows. This addresses a core challenge in creating autonomous systems for complex discovery.
DeepSeek Seeks First Outside Funding at $10B Valuation
DeepSeek is in talks to raise at least $300 million in its first external funding round at a $10 billion valuation. This ends its reliance on parent hedge fund High-Flyer Capital and signals a new phase in the costly global AI race.
Paper Proposes 'Artificial Scientist' as New AGI Definition
A new paper defines AGI as an 'artificial scientist'—a system that adapts as generally as a human scientist under computational limits. This reframes the goal from passing benchmarks to autonomous planning, causal learning, and exploration.
Autogenesis Protocol Enables Self-Evolving AI Agents Without Retraining
A new paper introduces Autogenesis, a self-evolving agent protocol. Agents can assess their own shortcomings, propose and test improvements, and update their operational framework in a continuous loop.