Timeline
Cerebras published benchmark results for Kimi K2.6 on CS-3, claiming 981 tokens/sec and 6.7× speedup over GPU cloud.
Released Kimi WebBridge browser extension
Claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3
SemiAnalysis noted that Cerebras understates on-chip SRAM by 8x on its website.
Released the Kimi 2.6 Thinking open-weights reasoning model
Confidentially filed paperwork with the SEC for an initial public offering.
Filed for IPO on April 21, 2026, betting wafer-scale chips can disrupt Nvidia's GPU cluster model.
Released the open-source coding model Kimi K2.6, achieving top scores on SWE-Bench Pro and HumanEval with Tools benchmarks.
Released a trillion-parameter open-source model matching Claude Opus on coding benchmarks.
Teased upcoming 'Kimi 2.6' code model via leaked image, suggesting a major update to Kimi Chat.