navigation

30 articles about navigation in AI news

Stanford's EgoNav Trains Robot Navigation on 5 Hours of Human Video, Enables Zero-Shot Control of Unitree G1

Stanford's EgoNav system uses a 5-hour egocentric video walk of campus to train a diffusion model that enables zero-shot navigation for a Unitree G1 humanoid robot, eliminating the need for robot-specific training data.

99% relevant

QAsk-Nav Benchmark Enables Separate Scoring of Navigation and Dialogue for Collaborative AI Agents

A new benchmark called QAsk-Nav enables separate evaluation of navigation and question-asking for collaborative embodied AI agents. The accompanying Light-CoNav model outperforms state-of-the-art methods while being significantly more efficient.

75% relevant

How to Cut Claude Code's Token Costs 32% by Fixing Its Navigation Problem

Claude Code agents waste tokens on grep-style navigation. A new open-source tool gives them IDE-like navigation, cutting costs 32% and doubling efficiency.

92% relevant

MIT Report Details How Pokémon Go's AR Data Is Training Delivery Robot Navigation Systems

MIT researchers report that anonymized AR data from millions of Pokémon Go players is being used to train delivery robots for centimeter-accurate navigation in complex urban environments.

85% relevant

Google Maps Gets an AI Brain: How Gemini Transforms Navigation from Directions to Dialogue

Google is fundamentally reshaping Maps by integrating its Gemini AI, launching 'Ask Maps' for conversational discovery and 'Immersive Navigation' for a complete visual and data-driven route overhaul. This represents a shift from static maps to intelligent, proactive travel assistants.

99% relevant

Wikipedia Navigation Challenge Exposes Critical Gaps in AI Planning Abilities

Researchers introduce LLM-WikiRace, a benchmark testing how well AI models navigate Wikipedia links between concepts. While top models like Gemini-3 show superhuman performance on easy tasks, success rates plummet to just 23% on hard challenges, revealing fundamental limitations in long-term planning.

70% relevant

Niu Technologies Demos AI-Powered Scooter Using Alibaba's Qwen 3.5 for Self-Balancing and Navigation

Chinese electric scooter maker Niu Technologies demonstrated a prototype that self-balances, moves, turns, and navigates autonomously using Alibaba's Qwen 3.5 model. The system is described as an L2-level intelligent driving assistance system, applying autonomous vehicle tech to micromobility.

85% relevant

Google DeepMind Unveils Gemini-Powered Browser That Generates Websites in Real-Time

Google DeepMind has demonstrated a browser prototype powered by Gemini 3.1 Flash-Lite that generates complete HTML/CSS websites dynamically based on user prompts and navigation context, shifting from static page retrieval to on-demand interface generation.

95% relevant

Cursor Launches Instant Grep: Millisecond Local Search Across Millions of Files

Cursor has launched Instant Grep, a local search tool that performs millisecond-level regex searches across millions of files. The feature is integrated into the Cursor IDE, targeting developers needing fast, offline code navigation.

85% relevant

AgentComm-Bench Exposes Catastrophic Failure Modes in Cooperative Embodied AI Under Real-World Network Conditions

Researchers introduce AgentComm-Bench, a benchmark that stress-tests multi-agent embodied AI systems under six real-world network impairments. It reveals performance drops of over 96% in navigation and 85% in perception F1, highlighting a critical gap between lab evaluations and deployable systems.

100% relevant

Claude AI Gains Computer Control Feature: Opens Apps, Navigates Browser, Fills Spreadsheets

Anthropic's Claude AI can now be enabled to directly control a user's computer to perform tasks like opening applications, browser navigation, and spreadsheet work. This represents a significant shift from chat-based interaction to direct system automation.

87% relevant

MiRA Framework Boosts Gemma3-12B to 43% Success Rate on WebArena-Lite, Surpassing GPT-4 and WebRL

Researchers propose MiRA, a milestone-based RL framework that improves long-horizon planning in LLM agents. It boosts Gemma3-12B's web navigation success from 6.4% to 43%, outperforming GPT-4-Turbo (17.6%) and the previous SOTA WebRL (38.4%).

77% relevant

InterDeepResearch: A New Framework for Human-Agent Collaborative Information Seeking

Researchers propose InterDeepResearch, an interactive system that enables human collaboration with LLM-powered research agents. It addresses limitations of autonomous systems by improving observability, steerability, and context navigation for complex information tasks.

76% relevant

Bridging the StarCraft Gap: New AI Benchmark Makes Strategy Research Accessible

Researchers introduce Two-Bridge Map Suite, a lightweight StarCraft II benchmark that isolates tactical skills without full-game complexity. This open-source tool enables reinforcement learning experiments on realistic budgets by focusing on navigation and combat mechanics.

75% relevant

FDM-1: The AI That Learned to Use Computers by Watching 11 Million Hours of Screen Recordings

Standard Intelligence has unveiled FDM-1, an AI system trained on 11 million hours of screen recordings that can perform complex computer tasks like CAD design, web navigation, and even simulated driving with minimal fine-tuning.

95% relevant

Switchboard's Grid View Gives You Bird's-Eye Control of Claude Code Sessions

Switchboard v0.0.16 adds a grid view that shows all your Claude Code sessions at once with live terminal previews, status indicators, and quick navigation.

100% relevant

Token Warping for MLLMs Outperforms Pixel Methods in View Synthesis

Researchers propose warping image tokens instead of pixels for multi-view reasoning in MLLMs. The zero-shot method is robust to depth noise and outperforms established baselines.

97% relevant

Neo 1X Humanoid Robot Shown at Abundance Summit, Weighs Under 70 lbs

Neo 1X, a sub-70-pound humanoid robot designed for homes, was shown moving and interacting with people at the Abundance Summit. This demo highlights a growing industry focus on creating robots for safe cohabitation with families.

85% relevant

GeoSR Achieves SOTA on VSI-Bench with Geometry Token Fusion

GeoSR improves spatial reasoning by masking 2D vision tokens to prevent shortcuts and using gated fusion to amplify geometry information, achieving state-of-the-art results on key benchmarks.

85% relevant

ForeverSolar Uses Claude Agent SDK to Automate Solar Permitting, Cutting Approval Times

Solar installation company ForeverSolar is using Anthropic's Claude Agent SDK to automate permitting documentation, a major bottleneck in solar deployment. This represents a concrete enterprise application of agentic AI beyond software development.

77% relevant

Generative World Renderer: 4M+ RGB/G-Buffer Frames from Cyberpunk 2077 & Black Myth: Wukong Released for Inverse Graphics

A new framework and dataset extracts over 4 million synchronized RGB and G-buffer frames from Cyberpunk 2077 and Black Myth: Wukong, enabling AI models to learn inverse material decomposition and controllable game environment editing.

85% relevant

Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation

A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.

91% relevant

Andrej Karpathy's Personal Knowledge Management System Uses LLM Embeddings Without RAG for 400K-Word Research Base

AI researcher Andrej Karpathy has developed a personal knowledge management system that processes 400,000 words of research notes using LLM embeddings rather than traditional RAG architecture. The system enables semantic search, summarization, and content generation directly from his Obsidian vault.

91% relevant

Atomic Chat Launches Hermes Agent: A Free, Local Agent Stack Powered by Gemma 4

Atomic Chat has launched Hermes Agent, an open-source agent stack powered by Google's Gemma 4 model that runs entirely locally and is free to use. This makes advanced AI agent functionality accessible without cloud dependencies or API costs.

87% relevant

Agent Psychometrics: New Framework Predicts Task-Level Success in Agentic Coding Benchmarks with 0.81 AUC

A new research paper introduces a framework using Item Response Theory and task features to predict success on individual agentic coding tasks, achieving 0.81 AUC. This enables benchmark designers to calibrate difficulty without expensive evaluations.

75% relevant

LimX's Oli Robot Demonstrates Autonomous Unboxing and Boot-Up via 31-DoF System

LimX's Oli robot autonomously exited its shipping container, powered up its 31-degree-of-freedom system, and began moving. The demo highlights progress in self-contained robotic deployment without human setup.

85% relevant

Macy's Launches 'Ask Macy's' AI Conversational Shopping Assistant

Macy's has publicly launched 'Ask Macy's,' an AI-powered conversational shopping assistant designed to help users discover brands, trends, and receive personalized product recommendations. This follows an initial dark launch phase and represents a major department store's move into agentic AI for commerce.

100% relevant

Atlanta Startup Deploys AI-Powered Robot Dogs for Nighttime Neighborhood Security

A U.S. startup based in Atlanta is deploying quadrupedal robots for autonomous nighttime neighborhood patrols. The units are designed to detect intruders and alert residents, representing a commercial pivot for legged robotics.

85% relevant

Ukrainian TWW127 Robot Holds Infantry Position for 45 Days via Remote Unmanned Operation

A Ukrainian unmanned ground vehicle, the TWW127, reportedly held a forward combat position autonomously for 45 days, providing persistent overwatch and suppressive fire. This demonstrates a significant leap in endurance and reliability for remote, unmanned systems in active combat.

87% relevant

Unipath Launches Household Robot, Joining China's Push into Consumer Robotics

Chinese company Unipath has launched a household robot. This marks another entry into the competitive consumer robotics market, where Chinese firms are increasingly active.

85% relevant