3D detection

30 articles about 3D detection in AI news

AllenAI's WildDet3D Enables Promptable 3D Object Detection from Single Images

Allen Institute for AI (AllenAI) has open-sourced WildDet3D, a model for promptable 3D object detection from single RGB images. It predicts 3D bounding boxes using flexible prompts and can integrate optional depth data.

Apr 13, 2026 · 85% relevant

SpatialBench: New Benchmark Tests Foundation Models on 3D Tasks

SpatialBench, a new benchmark from ropedia_ai, evaluates spatial foundation models across 7 tasks and 5 datasets, testing depth estimation, surface normal prediction, and 3D object detection.

May 27, 2026 · 91% relevant

VGGT-Det: How AI Is Learning to See in 3D Without Camera Calibration

Researchers have developed VGGT-Det, a breakthrough framework for multi-view 3D object detection that works without calibrated camera poses. The system mines internal geometric priors through attention mechanisms, outperforming traditional methods in indoor environments.

Mar 3, 2026 · 85% relevant

Claude Code Builds Browser-Based 3D Flight Simulator in Weekend

Over a weekend, a developer used Anthropic's Claude Code to build a complete 3D flight simulator that runs in a web browser, demonstrating rapid AI-assisted game development.

Apr 18, 2026 · 85% relevant

How to Build a 3D Engine with Claude Code: The Demoscene Case Study

A developer used Claude Code to build a complete 3D engine from scratch. Here are the actionable prompting techniques and CLAUDE.md strategies that made it work.

Mar 27, 2026 · 90% relevant

Radar Meets AI: How RF Signals Are Revolutionizing 3D Scene Reconstruction

Researchers have developed a multimodal approach combining radio-frequency sensing with Gaussian Splatting to create robust 3D scene rendering that works in challenging conditions where vision alone fails. This breakthrough enables high-fidelity reconstruction in adverse weather, low light, and through occlusions.

Feb 20, 2026 · 70% relevant

ORCA Dexterity Open-Sources Three 3D-Printable Robotic Hands with Self-Dislocating Joints for ~$2,200

ORCA Dexterity released STL files for three tendon-driven anthropomorphic robotic hands featuring self-dislocating joints for reliability. The OrcaHand Touch variant includes high-resolution fingertip sensors with 83 taxels per fingertip at 1mm resolution.

Mar 15, 2026 · 97% relevant

How AI Overfitting Masks Medical Breakthroughs: fMRI Study Reveals Critical Flaw in Parkinson's Detection

New research reveals that standard AI evaluation methods for detecting early Parkinson's disease from brain scans suffer from severe data leakage, creating misleading near-perfect results. When properly tested, lightweight models outperform complex ones in data-scarce medical applications.

Mar 3, 2026 · 75% relevant

Vercel Services Cuts Multi-Framework Deploy Time 60%

Vercel Services unifies multi-framework projects (FastAPI, Next.js, Flask, Go) into one Vercel Project with atomic deployments, internal service bindings, and zero-config auto-detection.

Jun 30, 2026 · 55% relevant

Chinese Railway Robot Detects 0.1mm Rail Scratches, Performs Automated Grinding Repairs

A railway maintenance robot in China uses high-precision detection and automated grinding to find and repair surface scratches as small as 0.1mm. It also employs ultrasonic flaw detection to identify internal rail defects.

Mar 23, 2026 · 85% relevant

The Digital Authenticity Arms Race: VeryAI Raises $10M to Combat AI-Generated Humans

As AI-generated humans become increasingly convincing, VeryAI has secured $10M in funding to develop verification tools using palm print biometrics and deepfake detection. This investment highlights the growing urgency to distinguish real from synthetic identities in the digital realm.

Mar 12, 2026 · 85% relevant

Vision AI Trends 2026: Manufacturing, Warehouse Automation, and Luxury Authentication Enter Visual Data Era

A 2026 trends report highlights Vision AI's expansion into manufacturing quality inspection, warehouse automation, and luxury brand authentication, marking a shift toward 3D visual data systems. This reflects the maturation of computer vision beyond basic recognition into operational and trust applications.

Mar 9, 2026 · 95% relevant

BrepCoder: The AI That Speaks CAD's Native Language

Researchers have developed BrepCoder, a multimodal AI that understands CAD designs in their native B-rep format. By treating 3D models as structured code, it performs multiple engineering tasks without task-specific retraining, potentially revolutionizing design automation.

Feb 27, 2026 · 75% relevant

Claude Code vs. Codex: Real-World Devs Reveal When Each Tool Wins

Claude Code shines at design and greenfield work; pair with Codex for bug fixes. Use CLAUDE.md for guidance.

Jun 20, 2026 · 90% relevant

Anthropic Opus 4.8 Cuts Bug-Finding Cost by 5x, SemiAnalysis Finds

Anthropic's Opus 4.8 with ultracode mode cuts the cost of finding severe bugs to roughly one-fifth, per preliminary SemiAnalysis experiments with wide error bars.

Jun 2, 2026 · 100% relevant

New CASIA Benchmark Exposes Fragmented Face Swapping Evaluation

CASIA researchers released a face swapping survey and benchmark on April 27, 2026, aiming to standardize evaluation across fragmented GAN and diffusion model methods.

May 5, 2026 · 74% relevant

NVIDIA Nemotron 3 Nano Omni: Open Multimodal Model Unifies Video, Audio, Image, Text

NVIDIA announced Nemotron 3 Nano Omni, an open multimodal model that processes video, audio, images, and text in a unified architecture, expanding accessibility for multimodal AI research.

Apr 28, 2026 · 93% relevant

Google Virgo Fabric: 100K-Accelerator AI Network Cuts Latency

Google unveiled Virgo, a data center fabric for AI clusters of 100,000+ accelerators, using a flatter two-layer topology to reduce latency and improve bisection bandwidth for synchronized training workloads.

Apr 27, 2026 · 78% relevant

Anthropic's One-Sentence Prompt Broke Claude's Coding for Days

Anthropic added 'keep responses under 25 words' to Claude's system instructions, causing a sudden collapse in coding performance that users detected within hours and that took 4 days to fix.

Apr 24, 2026 · 97% relevant

Meta's Sapiens2: 1B Human Image ViTs for Pose, Segmentation, Normals

Meta open-sourced Sapiens2 on Hugging Face, a family of vision transformers pretrained on 1 billion human images for pose estimation, segmentation, normal estimation, and point maps. The models target high-resolution human-centric perception.

Apr 23, 2026 · 92% relevant

Catching Drift Before It Catches You

The author details implementing the open-source Evidently AI library to monitor a Kafka-powered movie recommender for data drift. This is a hands-on guide to a fundamental MLOps task for maintaining live AI systems.

Apr 20, 2026 · 96% relevant

Redis Launches 'Redis Feature Form,' an Enterprise Feature Store for

Redis announced the launch of Redis Feature Form, a new enterprise feature store designed to manage and serve machine learning features in production. This move positions Redis to compete in the critical MLOps infrastructure layer, helping companies operationalize AI models more reliably.

Apr 20, 2026 · 88% relevant

Webcam Head-Tracking Wallpaper Uses AI for Parallax Effect

A developer built a dynamic wallpaper that tracks a user's head via webcam to shift the background perspective in real-time. It demonstrates a novel, accessible application of computer vision for interactive desktop environments.

Apr 18, 2026 · 75% relevant

Research Suggests LLMs Like ChatGPT Can 'Lie' Despite Knowing Correct Answer

A new study suggests large language models like ChatGPT may deliberately give answers they know to be incorrect, rather than only making factual errors. This challenges the core assumption that model mistakes stem purely from knowledge gaps.

Apr 18, 2026 · 100% relevant

Project N.O.M.A.D. Emerges as Offline AI 'Doomsday Computer'

A prototype device named Project N.O.M.A.D. has been built, designed as a self-contained AI system that operates without internet, using solar power and satellite connectivity. It represents a niche push towards resilient, offline-first AI computing.

Apr 17, 2026 · 85% relevant

AI-Powered Circuit Simulator Offers Free Hardware Prototyping

A new website provides a free, AI-assisted environment for designing and testing electronic circuits, featuring pre-built projects for learning. This lowers the barrier to entry for hardware prototyping and education.

Apr 14, 2026 · 75% relevant

AI-Powered Drone De-Ices Power Lines in Sub-Zero Fog

A drone system autonomously navigates thick fog and snow to de-ice high-voltage power lines. This removes the need for hazardous manual crew climbs, improving grid reliability and safety.

Apr 11, 2026 · 89% relevant

Google Releases TIPSv2 Vision Encoder for Multi-Task Dense Prediction

Google has released the TIPSv2-B/14 vision encoder model on Hugging Face. It performs three dense prediction tasks—depth estimation, surface normal prediction, and semantic segmentation—from a single backbone.

Apr 11, 2026 · 85% relevant

Atlanta Startup Deploys AI-Powered Robot Dogs for Nighttime Neighborhood Security

A U.S. startup based in Atlanta is deploying quadrupedal robots for autonomous nighttime neighborhood patrols. The units are designed to detect intruders and alert residents, representing a commercial pivot for legged robotics.

Mar 30, 2026 · 85% relevant

MIRAGE AI Framework Bridges Critical Gap in Alzheimer's Diagnosis by Synthesizing MRI Insights from Health Records

Researchers have developed MIRAGE, a novel AI framework that uses knowledge graphs to synthesize diagnostic MRI information from electronic health records, potentially revolutionizing Alzheimer's disease assessment in resource-limited settings by bridging the missing-modality gap.

Mar 4, 2026 · 75% relevant
