Timeline
Alibaba and Nanjing University published paper claiming 9.36X speedup for million-token prefill
Launched first external partnership for Qwen AI app with China Eastern Airlines for flight booking integration
Released Qwen3.6-27B, a dense 27B parameter model that surpasses its previous 397B MoE model on coding benchmarks.
Researchers developed DCW (Diffusion Correction in Wavelet domain), a method to fix SNR-t misalignment bias in diffusion models, improving performance for models like FLUX and EDM.
Moved Qwen 3.6 Plus model to API-only access, adopting a paid frontier model strategy.
Released Qwen 3.6, a free, open-weights AI model for local deployment and coding
Launched Voxtral TTS, a 4B-parameter open-weight text-to-speech model with voice cloning from 3-second audio
Launched Voxtral TTS, its first open-weight text-to-speech model for voice cloning and edge deployment.
Adopted client-first MCP strategy, focusing on MCP consumer rather than provider
Released Mistral Small 4 model via its commercial API platform