Timeline
Disclosed three mechanical innovations for wafer-scale chip cooling: vertical power delivery, flexible interposers, direct-impingement cooling
Announced CS4 wafer-scale chip staying on 5nm due to SRAM scaling flattening.
Cerebras published benchmark results for Kimi K2.6 on CS-3, claiming 981 tokens/sec and 6.7× speedup over GPU cloud.
Claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3
Launches 'Alexa for Shopping' AI agent for autonomous product research and purchase completion
4 new AI data center projects identified.
Amazon set target requiring >80% of developers to use AI tools weekly
SemiAnalysis noted that Cerebras understates on-chip SRAM by 8x on its website.
Launched first purpose-built payment API for autonomous agents
Matt Garman serving as AWS CEO made statement about A100 servers being sold out and never retired