human in the loop
30 articles about human in the loop in AI news
The Cognitive Divergence: AI Context Windows Expand as Human Attention Declines, Creating a Delegation Feedback Loop
A new arXiv paper documents the exponential growth of AI context windows (512 tokens in 2017 to 2M in 2026) alongside a measured decline in human sustained-attention capacity. It introduces the 'Delegation Feedback Loop' hypothesis, where easier AI delegation may further erode human cognitive practice. This is a foundational study on human-AI interaction dynamics.
The Autonomous Company: How 14 AI Agents Are Running a Startup Without Human Intervention
Auto-Co introduces a fully autonomous AI company operating system where 14 specialized agents debate, decide, and ship software 24/7. Using Claude Code CLI and a simple bash loop, this open-source system has built its own infrastructure, documentation, and community presence across 12 self-improvement cycles.
Karpathy's Autonomous AI Researcher: Programming the Programmer in the Age of Agentic Science
Andrej Karpathy has open-sourced an autonomous AI research agent that can run ~100 experiments overnight without human supervision. The system turns research into a game with fixed-time trials, where prompt engineering replaces manual coding.
The Return of the Concierge: Why Human Judgment Still Defines Luxury Hospitality
An industry commentary argues that in luxury hospitality, AI and automation cannot replace the nuanced judgment, empathy, and relationship-building of a human concierge. This highlights a critical tension for luxury brands: where to deploy AI for efficiency versus where to preserve human touch.
Deloitte on Driving Adoption of the 'Human with Agentic AI' Era
Deloitte outlines the shift to a 'human with agentic AI' paradigm, where autonomous AI agents act as proactive partners. This requires new organizational strategies to integrate agents that can preserve institutional knowledge and interface with legacy systems.
Violoop's Hardware Bet: A New Frontier in AI Interaction Beyond the Screen
Hardware startup Violoop has secured multi-million dollar funding to develop the world's first 'physical-level AI Operator,' aiming to move AI interaction from purely digital interfaces to tangible, desktop-integrated hardware devices.
The Self-Improving AI Loop: How Artificial Intelligence Is Now Building Better Versions of Itself
Leading AI researchers reveal that recursive self-improvement—where AI systems build better AI systems—is no longer theoretical but actively being pursued by major labs. This feedback loop could dramatically accelerate AI development beyond current exponential curves.
The Infinite Loop: How AI is Creating More Developer Jobs, Not Fewer
Stack Overflow's analysis reveals AI is not replacing developers but supercharging them, leading to an explosion of new applications and creating specialized roles focused on human-AI collaboration. The demand for custom software remains infinite as human imagination finds new problems to solve.
The Great AI Contamination: How 2022 Became the Digital Divide in Human Knowledge
AI researcher Ethan Mollick identifies 2022 as the pivotal year when AI began fundamentally altering human-generated content, creating what he calls 'ambient contamination' where AI influence permeates all digital information.
China's First Fully Automated Humanoid Robot Factory Goes Live in Foshan, Targets 10,000+ Units Annually
China's first fully automated humanoid robot production line has launched in Foshan, capable of building one complete robot every ~30 minutes. The facility aims for over 10,000 units per year, with five more sites planned.
XSquareRobot and 58.com Launch China's First Human-Robot Home Cleaning Service in Shenzhen
A new service in Shenzhen pairs human cleaners with autonomous AI robots running on the WALL-A system. The robot handles repetitive tasks while the human manages complex judgment, with real home deployment providing training data.
AI Learns Physical Assistance: Breakthrough in Humanoid Robot Caregiving
Researchers have developed AssistMimic, the first AI system capable of learning physically assistive behaviors through multi-agent reinforcement learning. The approach enables virtual humanoids to provide meaningful physical support by adapting to a partner's movements in real-time.
Beyond Unit Tests: How AI Critics Learn from Sparse Human Feedback to Revolutionize Coding Assistants
Researchers have developed a novel method to train AI critics using sparse, real-world human feedback rather than just unit tests. This approach bridges the gap between academic benchmarks and practical coding assistance, improving performance by 15.9% on SWE-bench through better trajectory selection and early stopping.
Humanoid Robot Deployed for Traffic Control in Shenzhen, China
A humanoid robot equipped with cameras and AI has been deployed to direct traffic at a busy intersection in Shenzhen, China. This represents a real-world test of embodied AI for public infrastructure management.
ASI-Evolve Automates AI Research Loop, Discovers 105 Better Linear Attention Designs and Boosts AMC32 Scores by 12.5 Points
Researchers developed ASI-Evolve, an AI system that automates experimental loops in AI research. It discovered 105 improved linear attention variants and boosted AMC32 scores by 12.5 points, demonstrating automated research acceleration.
Gemma 4 Demonstrates Self-Terminating Loop Detection in Code Execution, User Reports
A developer shared an observation that Google's Gemma 4 model recognized it was stuck in an infinite loop during a coding task and stopped itself. This represents a potential advance in AI's ability to monitor and control its own execution state.
Mercor Data Breach Exposes Expert Human Annotation Pipeline Used by Frontier AI Labs
Hackers have reportedly accessed Mercor's expert human data collection systems, which are used by leading AI labs to build foundation models. This breach could expose proprietary training methodologies and sensitive model development data.
InterDeepResearch: A New Framework for Human-Agent Collaborative Information Seeking
Researchers propose InterDeepResearch, an interactive system that enables human collaboration with LLM-powered research agents. It addresses limitations of autonomous systems by improving observability, steerability, and context navigation for complex information tasks.
Google DeepMind Maps Six 'AI Agent Traps' That Can Hijack Autonomous Systems in the Wild
Google DeepMind has published a framework identifying six categories of 'traps'—from hidden web instructions to poisoned memory—that can exploit autonomous AI agents. This research provides the first systematic taxonomy for a growing attack surface as agents gain web access and tool-use capabilities.
Agentic AI Shopping Agents: Reclaiming Customer Relationships in the Age of AI Search
Third-party AI agents are reshaping discovery, threatening direct brand relationships. Luxury retailers must deploy their own agentic AI to guide high-value journeys, curate personalized assortments, and own the client experience.
Mind the Sim2Real Gap: Why LLM-Based User Simulators Create an 'Easy Mode' for Agentic AI
A new study formalizes the Sim2Real gap in user simulation for agentic tasks, finding LLM simulators are excessively cooperative, stylistically uniform, and provide inflated success metrics compared to real human interactions. This has critical implications for developing reliable retail AI agents.
Stanford-Princeton Team Open-Sources LabClaw: The 'Skill OS' for Scientific AI
Researchers from Stanford and Princeton have open-sourced LabClaw, a 'Skill Operating Layer' for LabOS that transforms natural language commands into executable lab workflows. This breakthrough promises to dramatically accelerate scientific experimentation by bridging human intent with robotic execution.
Cognition Labs Launches 'Canvas for Agents': First Shared Workspace Where AI Agents Code Alongside Humans
Cognition Labs has unveiled a collaborative workspace where AI agents like Codex and Claude Code operate visibly alongside human developers. This marks a shift from AI as a tool to a visible, real-time collaborator in the creative coding process.
Wharton Professor Argues First AGI Would Be Kept Secret for Financial Market Domination
Wharton professor Ethan Mollick posits that the first lab to develop a superhuman AI would likely deploy it secretly in financial markets for profit, rather than commercializing it via API. This highlights a strategic tension between immediate financial gain and open scientific progress in the AGI race.
Wharton Study: 'Cognitive Surrender' to AI Leads to 79.8% Error Adoption Rate, Undermining Human Review
A Wharton study of 1,372 participants found people followed incorrect AI suggestions 79.8% of the time, with confidence increasing 11.7% even when wrong. Researchers identify 'Cognitive Surrender'—where AI becomes 'System 3' and users treat its outputs as their own judgments.
Digital Fruit Fly Brain Achieves First Full Perception-Action Loop in Simulation
Startup Eon Systems has demonstrated what appears to be the first complete whole-brain emulation controlling a simulated body. Their digital model of a fruit fly brain, with 125,000 neurons and 50 million synapses, successfully drives realistic behaviors in a physics-simulated fly body.
Tool-R0: How AI Agents Are Learning to Use Tools Without Human Training Data
Researchers have developed Tool-R0, a framework where AI agents teach themselves to use tools through self-play reinforcement learning, achieving 92.5% improvement over base models without any pre-existing training data.
AI's New Frontier: How Self-Improving Models Are Redefining Machine Learning
Researchers have developed a groundbreaking method enabling AI models to autonomously improve their own training data, potentially accelerating AI development while reducing human intervention. This self-improvement capability represents a significant step toward more autonomous machine learning systems.
Research: Cheaper Reasoning Models Can Cost 3x More Due to Higher Error Rates and Retry Loops
New research indicates that selecting AI models based solely on per-token pricing can be a false economy. Models with lower accuracy often require multiple expensive retries, ultimately increasing total costs by up to 300%.
AI Research Accelerator: Autonomous System Completes 700 Experiments in 48 Hours, Optimizing Model Training
An AI system autonomously conducted 700 experiments over two days, reducing GPT-2 training time by 11%. This breakthrough demonstrates AI's growing capability to accelerate scientific research and optimize complex processes without human intervention.