AI News Digest

Wednesday, June 3, 2026

10 stories covered by gentic.news intelligence

Products & Launches / Breakthrough

100

Nvidia Unveils Physical AI Agent Skills, 32B VLA Model at CVPR

Nvidia launched physical AI agent skills and a 32B VLA model at CVPR to automate AV and robotics workflows, addressing the fragmented tooling bottleneck.

blogs.nvidia.com / 6d ago / 3 min read / Widely Reported

autonomous vehicles, robotics, ai agents

Products & Launches

100

Google Gemma 4 12B: Encoder-Free Multimodal Model Launches

Google launched Gemma 4 12B, an encoder-free multimodal model for on-device AI, reducing latency by eliminating the vision encoder.

x.com / 6d ago / 3 min read / Widely Reported

ai models, multimodal, on-device ai

AI Research

MiniMax M3: Sparse Attention, 1M Context, Multimodal via Together

MiniMax M3 uses sparse attention for 1M context and multimodality, with Together AI serving fast inference.

x.com / 6d ago / 3 min read / Multi-Source

context window, ai models, sparse attention

AI Research

ChatHealthAI: EHR Foundation Model + Frozen LLM Hits 79.8% F1 on Length-of-Stay

ChatHealthAI aligns CLMBR-T-Base with a frozen LLM via a task-aware resampler, achieving 79.8% F1 on EHRSHOT length-of-stay prediction while enabling interpretable reasoning.

arxiv.org / 6d ago / 3 min read / Widely Reported

llm, ai, clinical decision support

AI Research

Law Profs Prefer AI Answers 75% of Time in Stanford Study

Stanford researchers found that law professors preferred AI answers 75% of the time in a blind legal-analysis test, per @rohanpaul_ai.

x.com / 5d ago / 3 min read

legal, research, ai

AI Research

Miso One: 8B Open-Source TTS Hits 110ms Latency, Real Emotion

Miso One, an 8B open-source TTS model, achieves 110ms latency with emotional range. Weights are fully open-source for self-hosting, but no benchmark data is provided.

x.com / 6d ago / 3 min read

open-source, voiceover, ai audio

AI Research

Google LEAP Scaffold Lifts Lean-IMO-Bench One-Shot Solve Rate from <10% to 70%

Google's LEAP scaffold lifts the Lean-IMO-Bench one-shot solve rate from <10% to 70% and solves all 12 Putnam 2025 problems.

x.com / 6d ago / 3 min read

lean, automated reasoning, google

AI Research

Google Releases Magenta RealTime 2 for Open-Weight Music Generation

Google released Magenta RealTime 2 on Hugging Face, described as the only open-weights model for real-time, continuous, on-device music generation, with ~200ms latency.

x.com / 6d ago / 3 min read

hugging face, open-source, edge ai

AI Research

Superforecasters Predicted 3-4h AI Task Horizons by Year-End; Claude Hit It in May

Superforecasters predicted 3-4h METR 80% task horizons by year-end 2026. Claude Mythos hit that in late May, compressing the timeline by seven months.

x.com / 6d ago / 3 min read

claude, ai benchmarks, anthropic

Products & Launches

EvoMap Turns AI Agent Runs Into Reusable Assets, Cutting Token Waste

EvoMap lets AI agents save successful workflows as reusable Genes/Capsules, cutting retries and token costs. The network turns one-off runs into shared infrastructure for coding and security teams.

x.com / 6d ago / 3 min read

startups, infrastructure, ai agents
