30 articles about google in AI news
Google Open-Sources DiffusionGemma, 26B Model Hits 1K Tokens/Sec on H100
Google open-sourced DiffusionGemma, a 26B-parameter diffusion text model hitting 1,000 tokens/sec on H100 — 4x faster than autoregressive models, but with lower quality.
Google Books Intel for 3M+ TPUs in 2028 as TSMC CoWoS Hits Capacity Wall
Google booked Intel to package 3M+ TPUs in 2028 as TSMC CoWoS capacity caps out. SK hynix tests HBM on Intel EMIB, potentially unlocking Nvidia's Feynman architecture.
Google Titan: A New Architecture That Could Dethrone Transformers
Google's Titan architecture claims to surpass Transformers on long-context tasks via neural long-term memory, achieving 1.2x-2.5x speedups on benchmarks.
Google to Pay SpaceX $920M/Month for xAI Compute Capacity
Google commits $11B/year to SpaceX for compute at xAI data centers, potentially adding $1T to SpaceX's valuation.
Google's 1 GW Texas AI Campus Tests 'Power-First' Model for Hyperscaler
Google's Texas AI campus tests a power-first model, pairing 1 GW generation with a data center to bypass grid constraints for AI infrastructure expansion.
Google’s Virgo network interconnects 134K TPUv8t chips at 47 Pbps
Google's Virgo network interconnects 134,400 TPUv8t chips at 47 Pbps, targeting large-scale training clusters.
Google Releases Magenta RealTime 2 for Open-Weight Music Generation
Google released Magenta RealTime 2 on Hugging Face, the only open-weights model for real-time continuous music generation on device with ~200ms latency.
Google Gemma 4 12B: Encoder-Free Multimodal Model Launches
Google launched Gemma 4 12B, an encoder-free multimodal model for on-device AI, reducing latency by eliminating the vision encoder.
Google LEAP Scaffold Lifts Lean-IMO-Bench One-Shot Solve Rate from <10% to 70%
Google's LEAP scaffold lifts Lean-IMO-Bench one-shot solve rate from <10% to 70%, solving all 12 Putnam 2025 problems.
Google Launches Free 5-Day AI Agents Course, 1.5M Enrolled Last Run
Google launched a free 5-day AI Agents course, following 1.5M learners in the prior edition. The curriculum covers vibe coding, multi-agent systems, and production deployment on Kaggle.
Apple Ditches Apple Silicon Pledge, Routes AI Queries to Google Cloud
Apple routes AI queries to Google Cloud, breaking 2024 Apple silicon pledge. Distilled Gemini runs locally; heavier queries use Nvidia tech in Google Cloud.
Google Search VP: AI-native search costs more, grows volume
Google VP Robby Stein says AI-native Search costs more but grows volume. AI Mode breaks queries into sub-searches.
Apple Using Custom 1.2T-Parameter Google Model for Siri Overhaul
Apple using custom 1.2T-parameter Google model for Siri, per Reuters. Model larger than Gemini 3.5 Flash's 300B parameters; simple queries run locally.
Google Paper: Wearable AI Needs Personalization to Work
Google paper shows 18% heart rate accuracy gain by personalizing wearable AI to individual users via lightweight embeddings.
Google and Blackstone Launch TPU Venture, Challenging Nvidia Dominance
Google and Blackstone launched a TPU venture, financing AI infrastructure outside the hyperscale cloud model. Enterprise buyers get a standalone alternative to Nvidia-dominated GPU clusters.
Buffett Invests in Google After SemiAnalysis TPU Deep Dive
Berkshire Hathaway invested in Google in Q3 2025, after Buffett studied TPU v5p architecture. He compared it to railroads, citing 8,960 chips and 4.8 Tbps links.
Google CodeWiki Turns Any GitHub Repo Into Interactive Docs
Google released CodeWiki, an open-source tool turning GitHub repos into interactive docs with Gemini chat. Free, URL-based, no setup required.
Claude Mythos Goes GA in Google Cloud Console, Drops Preview Label
Claude Mythos silently went GA in Google Cloud console, preview label removed. Signals deeper Anthropic-GCP integration.
Google TPU 'Broadfly' Topology Scales Pod to 1,152 Chips
Google unveiled a Broadfly TPU topology at Cloud Next, scaling pods to 1,152 chips — 4.5x larger than Ironwood — with max 7 hops. This inference-first design challenges NVIDIA's NVLink on scale and latency.
Google to Debut Gemini Model Matching GPT-5.5 at I/O Tuesday
Google to announce new Gemini model matching GPT-5.5 at I/O Tuesday, per source. Unconfirmed, but signals intensified AI competition.
Agentic Commerce: 50% of Online Transactions by 2027, Google Cloud Leads
Agents projected to handle 50% of online transactions by 2027. Payment reliability determines winners in agentic commerce, with Google Cloud leading enterprise rollouts.
VS Code Now Connects Directly to Google Colab With Free T4 GPU
Google Colab integrates with VS Code, offering a free T4 GPU inside the editor, bypassing cloud GPU providers.
Trojan Masquerading as Claude Code Tops Google Search, Infects Users
A Trojan impersonating Claude Code ranked #1 on Google. Windows Defender caught it as Trojan:Win32/Kepavll!rfn. The victim had 30 years of internet experience.
Google CodeWiki Turns GitHub Repos Into Interactive Docs
Google launched CodeWiki, turning any GitHub repo into interactive docs with diagrams, tutorials, and a chatbot. It differentiates by structure over file summarization.
Google Beats Apple to AI Health Coach With Gemini-Powered Fitbit App
Google released an AI health coach using Gemini, beating Apple to market. The coach integrates fitness, sleep, nutrition, cycle tracking, weather, and U.S. medical records.
Google, Microsoft, xAI Agree to US Gov Pre-Release AI Testing
Google, Microsoft, xAI agreed to US pre-release testing of frontier AI. Voluntary deal lacks enforcement, excludes open-weight models.
Google Gemma 4: 3x Faster Inference with MTP Drafters
Google's Gemma 4 claims up to 3x faster inference via MTP drafters, but released no benchmark numbers or architectural details.
Gemini 3.1 Flash Leak Hints at Google I/O 2026 Launch
Leaked X post suggests Google is building Gemini 3.1 Flash, likely for Google I/O 2026. No specs or confirmation yet.
Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale
Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.
Google DeepMind Launches Real-Time Video AI Co-Clinician
Google DeepMind launched AI Co-Clinician, a real-time video analysis system for triadic care, claiming 30% fewer diagnostic errors in early tests.