spatial computing
14 articles about spatial computing in AI news
QuatRoPE: New Positional Embedding Enables Linear-Scale 3D Spatial Reasoning in LLMs, Outperforming Quadratic Methods
Researchers propose QuatRoPE, a novel positional embedding method that encodes 3D object relations with linear input scaling. Paired with IGRE, it improves spatial reasoning in LLMs while preserving their original language capabilities.
AI Data Centers Now Consume 10% of US Electricity, With Single Facilities Reaching 400+ Megawatt Loads
Data centers powering AI and cloud computing now account for 10% of total U.S. electricity consumption, with individual facilities reaching 400+ megawatt capacities. New half-mile-long structures require advanced water-cooling systems to manage chips generating 2 kW of heat each.
The Agency: 147 Open Source AI Agents Hit 50K GitHub Stars in 2 Weeks
The Agency is an open source repository with 147 specialized AI agents across 12 divisions (engineering, design, marketing, etc.) that hit 50K GitHub stars in under two weeks. It provides one-command install for tools like Claude Code and Cursor, with full modding support.
Meta's Sapiens2: 1B Human Image ViTs for Pose, Segmentation, Normals
Meta open-sourced Sapiens2 on Hugging Face, a family of vision transformers pretrained on 1 billion human images for pose estimation, segmentation, normal estimation, and point maps. The models target high-resolution human-centric perception.
Sabi Cap: 100k-Sensor EEG Hat Decodes Internal Speech at 30 WPM
Sabi released the Sabi Cap, a wearable EEG beanie with 70k-100k biosensors and a brain foundation model trained on 100k hours of neural data. It decodes internal speech to text at ~30 WPM and enables cursor control via intention.
Sabicap Develops Brain Wearable to Decode Imagined Speech into Text
Sabicap is developing a brain wearable with tens of thousands of sensors to decode imagined speech into text. The company, backed by Vinod Khosla, aims to create a system that works across users with minimal calibration for broad adoption.
Snap & Qualcomm Partner on Snapdragon XR for Future Spectacles
Snap has entered a strategic agreement with Qualcomm to power future generations of its Spectacles AR glasses with Snapdragon XR platforms. This hardware partnership is critical for Snap's long-term bet on AI-driven augmented reality.
Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation
A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.
CanViT: First Active-Vision Foundation Model Hits 45.9% mIoU on ADE20K with Sequential Glimpses
Researchers introduce CanViT, the first task- and policy-agnostic Active-Vision Foundation Model (AVFM). It achieves 38.5% mIoU on ADE20K segmentation with a single low-resolution glimpse, outperforming prior active models while using 19.5x fewer FLOPs.
Geoffrey Hinton's Plumbing Prescription: Why AI's Godfather Recommends Trades Over Tech
AI pioneer Geoffrey Hinton suggests plumbing as a safe career bet in an AI-dominated future, highlighting the limitations of current robotics while acknowledging this advantage may be temporary as technology advances.
rs-embed: The Universal Translator for Remote Sensing AI Models
Researchers have developed rs-embed, a Python library that provides unified access to remote sensing foundation model embeddings. It addresses fragmentation in the field by letting users retrieve embeddings from any supported model for any location and time with a single line of code.
YOLO26 Eliminates NMS Bottleneck, Revolutionizing Real-Time Object Detection
YOLO26 introduces a single-pass architecture that eliminates the need for Non-Maximum Suppression (NMS), speeding up inference while maintaining detection accuracy for up to 300 objects per image.
The Next Platform Shift: How Persistent 3D World Models Are Becoming the New Programmable Interface
A new collaboration between Baseten and World Labs signals a paradigm shift where persistent 3D world models become programmable platforms, potentially rivaling the transformative impact of large language models through accessible developer APIs.
Apple's 'Visual Intelligence' Vision: How AI-Powered Cameras Will Redefine Wearables
Apple is developing 'Visual Intelligence', AI that interprets the physical world through cameras, as the foundation for its next generation of wearables, including smart glasses, advanced AirPods, and a camera-equipped pendant.
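
For context on the YOLO26 item above: Non-Maximum Suppression is the post-processing step that conventional detectors run to discard overlapping duplicate boxes. The sketch below shows the classic greedy form of NMS in plain Python. It is not YOLO26 code; the function names, the box format `(x1, y1, x2, y2)`, and the 0.5 IoU threshold are assumptions chosen for illustration.

```python
from typing import List, Tuple

# Assumed box format for this sketch: (x1, y1, x2, y2) corner coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes: List[Box], scores: List[float],
               iou_threshold: float = 0.5) -> List[int]:
    """Return indices of boxes kept by classic greedy NMS.

    Boxes are visited from highest to lowest score; a box is kept only
    if it does not overlap any already-kept box above the threshold.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    scores = [0.9, 0.8, 0.7]
    print(greedy_nms(boxes, scores))
```

Each candidate is compared against every box already kept, so the cost grows with the number of detections, and the loop runs as a separate stage after the network's forward pass. That extra stage is the step an NMS-free detector is designed to avoid.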