spatial computing

14 articles about spatial computing in AI news

QuatRoPE: New Positional Embedding Enables Linear-Scale 3D Spatial Reasoning in LLMs, Outperforming Quadratic Methods

Researchers propose QuatRoPE, a novel positional embedding method that encodes 3D object relations with linear input scaling. Paired with IGRE, it improves spatial reasoning in LLMs while preserving their original language capabilities.

Mar 27, 202679% relevant

AI Data Centers Now Consume 10% of US Electricity, With Single Facilities Reaching 400+ Megawatt Loads

Data centers powering AI and cloud computing now account for 10% of total U.S. electricity consumption, with individual facilities reaching 400+ megawatt capacities. New half-mile-long structures require advanced water-cooling systems to manage chips generating 2kW of heat each.

Mar 24, 202685% relevant

The Agency: 147 Open Source AI Agents Hit 50K GitHub Stars in 2 Weeks

The Agency is an open source repository with 147 specialized AI agents across 12 divisions (engineering, design, marketing, etc.) that hit 50K GitHub stars in under two weeks. It provides one-command install for tools like Claude Code and Cursor, with full modding support.

Apr 28, 202686% relevant

Meta's Sapiens2: 1B Human Image ViTs for Pose, Segmentation, Normals

Meta open-sourced Sapiens2 on Hugging Face, a family of vision transformers pretrained on 1 billion human images for pose estimation, segmentation, normal estimation, and point maps. The models target high-resolution human-centric perception.

Apr 23, 202692% relevant

Sabi Cap: 100k-Sensor EEG Hat Decodes Internal Speech at 30 WPM

Sabi released the Sabi Cap, a wearable EEG beanie with 70k-100k biosensors and a brain foundation model trained on 100k hours of neural data. It decodes internal speech to text at ~30 WPM and enables cursor control via intention.

Apr 16, 202697% relevant

Sabicap Develops Brain Wearable to Decode Imagined Speech into Text

Sabicap is developing a brain wearable with tens of thousands of sensors to decode imagined speech into text. The company, backed by Vinod Khosla, aims to create a system that works across users with minimal calibration for broad adoption.

Apr 16, 202695% relevant

Snap & Qualcomm Partner on Snapdragon XR for Future Spectacles

Snap has entered a strategic agreement with Qualcomm to power future generations of its Spectacles AR glasses with Snapdragon XR platforms. This hardware partnership is critical for Snap's long-term bet on AI-driven augmented reality.

Apr 10, 202685% relevant

Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation

A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.

Apr 3, 202691% relevant

CanViT: First Active-Vision Foundation Model Hits 45.9% mIoU on ADE20K with Sequential Glimpses

Researchers introduce CanViT, the first task- and policy-agnostic Active-Vision Foundation Model (AVFM). It achieves 38.5% mIoU on ADE20K segmentation with a single low-resolution glimpse, outperforming prior active models while using 19.5x fewer FLOPs.

Mar 25, 202691% relevant

Geoffrey Hinton's Plumbing Prescription: Why AI's Godfather Recommends Trades Over Tech

AI pioneer Geoffrey Hinton suggests plumbing as a safe career bet in an AI-dominated future, highlighting the limitations of current robotics while acknowledging this advantage may be temporary as technology advances.

Mar 5, 202685% relevant

rs-embed: The Universal Translator for Remote Sensing AI Models

Researchers have developed rs-embed, a Python library that provides unified access to remote sensing foundation model embeddings. This breakthrough addresses fragmentation in the field by allowing users to retrieve embeddings from any supported model for any location and time with a single line of code.

Mar 2, 202675% relevant

YOLO26 Eliminates NMS Bottleneck, Revolutionizing Real-Time Object Detection

YOLO26 introduces a groundbreaking single-pass architecture that eliminates the need for Non-Maximum Suppression, dramatically accelerating inference speeds while maintaining high detection accuracy for up to 300 objects per image.

Feb 28, 202685% relevant

The Next Platform Shift: How Persistent 3D World Models Are Becoming the New Programmable Interface

A new collaboration between Baseten and World Labs signals a paradigm shift where persistent 3D world models become programmable platforms, potentially rivaling the transformative impact of large language models through accessible developer APIs.

Feb 25, 202685% relevant

Apple's 'Visual Intelligence' Vision: How AI-Powered Cameras Will Redefine Wearables

Apple is developing 'Visual Intelligence'—AI that interprets the physical world through cameras—as the foundation for its next generation of wearables, including smart glasses, advanced AirPods, and a camera-equipped pendant.

Feb 22, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety