3d perception

18 articles about 3d perception in AI news

Momentum-Consistency Fine-Tuning (MCFT) Achieves 3.30% Gain in 5-Shot 3D Vision Tasks Without Adapters

Researchers propose MCFT, an adapter-free fine-tuning method for 3D point cloud models that selectively updates encoder parameters with momentum constraints. It outperforms prior methods by 3.30% in 5-shot settings and maintains original inference latency.

75% relevant

VAST's $50M Funding Signals 3D AI Revolution: From Foundation Models to World Simulation

AI startup VAST has secured $50 million in Series A funding while advancing its 3D foundation models that are setting new industry standards. The company is preparing to launch its first world model, positioning itself at the forefront of spatial AI development.

80% relevant

Utonia AI Breakthrough: A Single Transformer Model Unifies All 3D Point Cloud Data

Researchers have developed Utonia, a single self-supervised transformer that learns unified 3D representations across diverse point cloud data types including LiDAR, CAD models, indoor scans, and video-lifted data. This breakthrough enables unprecedented cross-domain transfer and emergent behaviors in 3D AI.

85% relevant

BetterScene Bridges the Gap: How Aligning AI Representations Unlocks Photorealistic 3D Synthesis

Researchers introduce BetterScene, a novel AI method that dramatically improves 3D scene generation from just a handful of photos. By aligning the internal representations of a powerful video diffusion model, it produces consistent, artifact-free novel views, pushing the boundary of what's possible in computational photography and virtual world creation.

78% relevant

CLIPoint3D Bridges the 3D Reality Gap: How Language Models Are Revolutionizing Point Cloud Adaptation

Researchers have developed CLIPoint3D, a novel framework that leverages frozen CLIP backbones for few-shot unsupervised 3D point cloud domain adaptation. The approach achieves 3-16% accuracy gains over conventional methods while dramatically improving efficiency by avoiding heavy trainable encoders.

70% relevant

JAEGER Breaks the 2D Barrier: How 3D Audio-Visual AI Could Transform Robotics and AR

Researchers introduce JAEGER, a framework that extends audio-visual large language models into 3D space using RGB-D and spatial audio. This breakthrough enables AI to understand and reason about physical environments with unprecedented spatial awareness.

70% relevant

Radar Meets AI: How RF Signals Are Revolutionizing 3D Scene Reconstruction

Researchers have developed a multimodal approach combining radio-frequency sensing with Gaussian Splatting to create robust 3D scene rendering that works in challenging conditions where vision alone fails. This breakthrough enables high-fidelity reconstruction in adverse weather, low light, and through occlusions.

70% relevant

MIT Researchers Develop GelSight Svelte Tactile Sensor for Delicate, Heavy Object Manipulation

MIT's CSAIL introduced GelSight Svelte, a thin, high-resolution tactile sensor enabling robots to grip delicate, heavy objects by measuring shear forces. The sensor uses two cameras and colored LEDs to track gel deformation, providing detailed 3D force maps for precise manipulation.

85% relevant

NVIDIA Spotlights Physical AI Tools for Robotics Week 2026

NVIDIA is highlighting its platforms for robot simulation, synthetic data, and AI-powered learning during National Robotics Week 2026, aiming to accelerate the transition from virtual training to physical deployment.

85% relevant

Generative World Renderer: 4M+ RGB/G-Buffer Frames from Cyberpunk 2077 & Black Myth: Wukong Released for Inverse Graphics

A new framework and dataset extracts over 4 million synchronized RGB and G-buffer frames from Cyberpunk 2077 and Black Myth: Wukong, enabling AI models to learn inverse material decomposition and controllable game environment editing.

85% relevant

Atlanta Startup Deploys AI-Powered Robot Dogs for Nighttime Neighborhood Security

A U.S. startup based in Atlanta is deploying quadrupedal robots for autonomous nighttime neighborhood patrols. The units are designed to detect intruders and alert residents, representing a commercial pivot for legged robotics.

85% relevant

Elon Musk Claims Tesla Optimus Will Surpass Human Surgeons by 2029, Advises Against Medical School

Elon Musk stated Tesla's Optimus humanoid robot will outperform any human surgeon at scale within three years, calling medical school 'pointless.' He predicts universal access to superior medical care within five years.

87% relevant

Facebook's SAM 3 Vision Model Ported to Apple's MLX Framework, Enabling Real-Time Tracking on M3 Max

Facebook's Segment Anything Model 3 (SAM 3) has been ported to Apple's MLX framework, enabling real-time object tracking on an M3 Max MacBook Pro. This demonstrates efficient on-device execution of a foundational vision model without cloud dependency.

87% relevant

Fei-Fei Li Argues Spatial Intelligence is the 'Other Half' of AI Beyond Language

AI pioneer Dr. Fei-Fei Li states that true intelligence requires spatial understanding alongside language. This perspective directly challenges the current LLM-centric paradigm.

85% relevant

Ego2Web Benchmark Bridges Egocentric Video and Web Agents, Exposing Major Performance Gaps

Researchers introduce Ego2Web, the first benchmark requiring AI agents to understand real-world first-person video and execute related web tasks. Their novel Ego2WebJudge evaluation method achieves 84% human agreement, while state-of-the-art agents perform poorly across all task categories.

100% relevant

Snap Brings AI Lenses To Luxury Fashion Campaigns

Snapchat is integrating AI-powered augmented reality lenses into luxury fashion marketing campaigns, offering brands a new channel for immersive, interactive advertising directly within the app's ecosystem.

86% relevant

Nvidia and Antoine Arnault Partner to Advance Virtual Try-On Technology

Nvidia and Antoine Arnault are collaborating to push virtual try-on technology forward, leveraging Nvidia's AI hardware and Arnault's luxury industry influence. This partnership aims to solve long-standing accuracy and scalability challenges in digital fashion fitting.

100% relevant

From Megafactories to Micro-Ateliers: How Embodied AI Will Redefine Luxury Manufacturing

Embodied AI reaching critical capability thresholds will trigger a phase transition in manufacturing geography. For luxury, this enables demand-proximal micro-manufacturing, hyper-personalization, and resilient, sustainable supply chains, fundamentally restructuring production logic.

70% relevant