Timeline
Launched the first external partnership for the Qwen AI app, with China Eastern Airlines, for flight booking integration.
Released Qwen3.6-27B, a dense 27B-parameter model that surpasses its previous 397B MoE model on coding benchmarks.
Researchers developed DCW (Diffusion Correction in Wavelet domain), a method to fix SNR-t misalignment bias in diffusion models, improving performance for models like FLUX and EDM.
Moved the Qwen 3.6 Plus model to API-only access, adopting a paid frontier-model strategy.
Released Qwen 3.6, a free, open-weights AI model for local deployment and coding.
Released Qwen3.6-35B-A3B, a sparse MoE model with 35B total and 3B active parameters.
ByteDance introduced OmniShow, a unified multimodal framework for video generation.
Introduced Helios, a 14B-parameter video generation model running at 19.5 FPS on a single H100 GPU.
Collaborated with Tsinghua University and Peking University to develop the HACPO research framework.
Introduced Mixture-of-Depths Attention (MoDA) for deep LLMs.