human in the loop

30 articles about human in the loop in AI news

The Cognitive Divergence: AI Context Windows Expand as Human Attention Declines, Creating a Delegation Feedback Loop

A new arXiv paper documents the exponential growth of AI context windows (512 tokens in 2017 to 2M in 2026) alongside a measured decline in human sustained-attention capacity. It introduces the 'Delegation Feedback Loop' hypothesis, where easier AI delegation may further erode human cognitive practice. This is a foundational study on human-AI interaction dynamics.

84% relevant

The Autonomous Company: How 14 AI Agents Are Running a Startup Without Human Intervention

Auto-Co introduces a fully autonomous AI company operating system where 14 specialized agents debate, decide, and ship software 24/7. Using Claude Code CLI and a simple bash loop, this open-source system has built its own infrastructure, documentation, and community presence across 12 self-improvement cycles.

85% relevant

Karpathy's Autonomous AI Researcher: Programming the Programmer in the Age of Agentic Science

Andrej Karpathy has open-sourced an autonomous AI research agent that can run ~100 experiments overnight without human supervision. The system turns research into a game with fixed-time trials, where prompt engineering replaces manual coding.

95% relevant

The Return of the Concierge: Why Human Judgment Still Defines Luxury Hospitality

An industry commentary argues that in luxury hospitality, AI and automation cannot replace the nuanced judgment, empathy, and relationship-building of a human concierge. This highlights a critical tension for luxury brands: where to deploy AI for efficiency versus where to preserve human touch.

100% relevant

Deloitte on Driving Adoption of the 'Human with Agentic AI' Era

Deloitte outlines the shift to a 'human with agentic AI' paradigm, where autonomous AI agents act as proactive partners. This requires new organizational strategies to integrate agents that can preserve institutional knowledge and interface with legacy systems.

79% relevant

Violoop's Hardware Bet: A New Frontier in AI Interaction Beyond the Screen

Hardware startup Violoop has secured multi-million dollar funding to develop the world's first 'physical-level AI Operator,' aiming to move AI interaction from purely digital interfaces to tangible, desktop-integrated hardware devices.

100% relevant

The Self-Improving AI Loop: How Artificial Intelligence Is Now Building Better Versions of Itself

Leading AI researchers reveal that recursive self-improvement—where AI systems build better AI systems—is no longer theoretical but actively being pursued by major labs. This feedback loop could dramatically accelerate AI development beyond current exponential curves.

85% relevant

The Infinite Loop: How AI is Creating More Developer Jobs, Not Fewer

Stack Overflow's analysis reveals AI is not replacing developers but supercharging them, leading to an explosion of new applications and creating specialized roles focused on human-AI collaboration. The demand for custom software remains infinite as human imagination finds new problems to solve.

85% relevant

The Great AI Contamination: How 2022 Became the Digital Divide in Human Knowledge

AI researcher Ethan Mollick identifies 2022 as the pivotal year when AI began fundamentally altering human-generated content, creating what he calls 'ambient contamination' where AI influence permeates all digital information.

85% relevant

China's First Fully Automated Humanoid Robot Factory Goes Live in Foshan, Targets 10,000+ Units Annually

China's first fully automated humanoid robot production line has launched in Foshan, capable of building one complete robot every ~30 minutes. The facility aims for over 10,000 units per year, with five more sites planned.

97% relevant

XSquareRobot and 58.com Launch China's First Human-Robot Home Cleaning Service in Shenzhen

A new service in Shenzhen pairs human cleaners with autonomous AI robots running on the WALL-A system. The robot handles repetitive tasks while the human manages complex judgment, with real home deployment providing training data.

92% relevant

AI Learns Physical Assistance: Breakthrough in Humanoid Robot Caregiving

Researchers have developed AssistMimic, the first AI system capable of learning physically assistive behaviors through multi-agent reinforcement learning. The approach enables virtual humanoids to provide meaningful physical support by adapting to a partner's movements in real-time.

81% relevant

Beyond Unit Tests: How AI Critics Learn from Sparse Human Feedback to Revolutionize Coding Assistants

Researchers have developed a novel method to train AI critics using sparse, real-world human feedback rather than just unit tests. This approach bridges the gap between academic benchmarks and practical coding assistance, improving performance by 15.9% on SWE-bench through better trajectory selection and early stopping.

75% relevant

Humanoid Robot Deployed for Traffic Control in Shenzhen, China

A humanoid robot equipped with cameras and AI has been deployed to direct traffic at a busy intersection in Shenzhen, China. This represents a real-world test of embodied AI for public infrastructure management.

85% relevant

ASI-Evolve Automates AI Research Loop, Discovers 105 Better Linear Attention Designs and Boosts AMC32 Scores by 12.5 Points

Researchers developed ASI-Evolve, an AI system that automates experimental loops in AI research. It discovered 105 improved linear attention variants and boosted AMC32 scores by 12.5 points, demonstrating automated research acceleration.

95% relevant

Gemma 4 Demonstrates Self-Terminating Loop Detection in Code Execution, User Reports

A developer shared an observation that Google's Gemma 4 model recognized it was stuck in an infinite loop during a coding task and stopped itself. This represents a potential advance in AI's ability to monitor and control its own execution state.

85% relevant

Mercor Data Breach Exposes Expert Human Annotation Pipeline Used by Frontier AI Labs

Hackers have reportedly accessed Mercor's expert human data collection systems, which are used by leading AI labs to build foundation models. This breach could expose proprietary training methodologies and sensitive model development data.

91% relevant

InterDeepResearch: A New Framework for Human-Agent Collaborative Information Seeking

Researchers propose InterDeepResearch, an interactive system that enables human collaboration with LLM-powered research agents. It addresses limitations of autonomous systems by improving observability, steerability, and context navigation for complex information tasks.

76% relevant

Google DeepMind Maps Six 'AI Agent Traps' That Can Hijack Autonomous Systems in the Wild

Google DeepMind has published a framework identifying six categories of 'traps'—from hidden web instructions to poisoned memory—that can exploit autonomous AI agents. This research provides the first systematic taxonomy for a growing attack surface as agents gain web access and tool-use capabilities.

95% relevant

Agentic AI Shopping Agents: Reclaiming Customer Relationships in the Age of AI Search

Third-party AI agents are reshaping discovery, threatening direct brand relationships. Luxury retailers must deploy their own agentic AI to guide high-value journeys, curate personalized assortments, and own the client experience.

88% relevant

Mind the Sim2Real Gap: Why LLM-Based User Simulators Create an 'Easy Mode' for Agentic AI

A new study formalizes the Sim2Real gap in user simulation for agentic tasks, finding LLM simulators are excessively cooperative, stylistically uniform, and provide inflated success metrics compared to real human interactions. This has critical implications for developing reliable retail AI agents.

100% relevant

Stanford-Princeton Team Open-Sources LabClaw: The 'Skill OS' for Scientific AI

Researchers from Stanford and Princeton have open-sourced LabClaw, a 'Skill Operating Layer' for LabOS that transforms natural language commands into executable lab workflows. This breakthrough promises to dramatically accelerate scientific experimentation by bridging human intent with robotic execution.

85% relevant

Cognition Labs Launches 'Canvas for Agents': First Shared Workspace Where AI Agents Code Alongside Humans

Cognition Labs has unveiled a collaborative workspace where AI agents like Codex and Claude Code operate visibly alongside human developers. This marks a shift from AI as a tool to a visible, real-time collaborator in the creative coding process.

87% relevant

Wharton Professor Argues First AGI Would Be Kept Secret for Financial Market Domination

Wharton professor Ethan Mollick posits that the first lab to develop a superhuman AI would likely deploy it secretly in financial markets for profit, rather than commercializing it via API. This highlights a strategic tension between immediate financial gain and open scientific progress in the AGI race.

85% relevant

Wharton Study: 'Cognitive Surrender' to AI Leads to 79.8% Error Adoption Rate, Undermining Human Review

A Wharton study of 1,372 participants found people followed incorrect AI suggestions 79.8% of the time, with confidence increasing 11.7% even when wrong. Researchers identify 'Cognitive Surrender'—where AI becomes 'System 3' and users treat its outputs as their own judgments.

92% relevant

Digital Fruit Fly Brain Achieves First Full Perception-Action Loop in Simulation

Startup Eon Systems has demonstrated what appears to be the first complete whole-brain emulation controlling a simulated body. Their digital model of a fruit fly brain, with 125,000 neurons and 50 million synapses, successfully drives realistic behaviors in a physics-simulated fly body.

95% relevant

Tool-R0: How AI Agents Are Learning to Use Tools Without Human Training Data

Researchers have developed Tool-R0, a framework where AI agents teach themselves to use tools through self-play reinforcement learning, achieving 92.5% improvement over base models without any pre-existing training data.

75% relevant

AI's New Frontier: How Self-Improving Models Are Redefining Machine Learning

Researchers have developed a groundbreaking method enabling AI models to autonomously improve their own training data, potentially accelerating AI development while reducing human intervention. This self-improvement capability represents a significant step toward more autonomous machine learning systems.

85% relevant

Research: Cheaper Reasoning Models Can Cost 3x More Due to Higher Error Rates and Retry Loops

New research indicates that selecting AI models based solely on per-token pricing can be a false economy. Models with lower accuracy often require multiple expensive retries, ultimately increasing total costs by up to 300%.

87% relevant

AI Research Accelerator: Autonomous System Completes 700 Experiments in 48 Hours, Optimizing Model Training

An AI system autonomously conducted 700 experiments over two days, reducing GPT-2 training time by 11%. This breakthrough demonstrates AI's growing capability to accelerate scientific research and optimize complex processes without human intervention.

85% relevant