Timeline
Google booked Intel to package 3M+ TPUs in 2028
Google released DiffusionGemma, a 26B-parameter open-weight diffusion text model, under Apache 2.0 license.
Released Gemini 3.5 Live Translate, an audio model for real-time translation
Google finalized the acquisition of energy developer Intersect months before the Meitner site project was announced.
Google commits $11B/year to SpaceX for compute at xAI data centers
Published paper on Titan architecture surpassing Transformers on long-context tasks
Disclosed three mechanical innovations for wafer-scale chip cooling: vertical power delivery, flexible interposers, direct-impingement cooling
Announced CS4 wafer-scale chip staying on 5nm due to SRAM scaling flattening.
Cerebras published benchmark results for Kimi K2.6 on CS-3, claiming 981 tokens/sec and 6.7× speedup over GPU cloud.
Claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3