Skip to content
gentic.news — AI News Intelligence Platform

Recipe ·

Qwen 3.6

Qwen 3.6 is Alibaba's next-generation open-source large language model, offering improved speed and multimodal capabilities as a successor to the Qwen 3.5 series.

13
Techniques inside
4y
Median research → prod
3y
Fastest adoption
9y
Slowest adoption

Ingredient list

  1. Invented by Nous Research · 2023-08 · Velocity 3y

    Qwen 3.6 supports a 128K context length, likely using RoPE extension techniques like YaRN.

    architecturemedium
  2. Invented by Google · 2023-05 · Velocity 3y

    Qwen 3.6 uses GQA to reduce memory usage and improve inference speed.

    architecturehigh
  3. Invented by University of Wisconsin · 2023-04 · Velocity 3y

    Qwen 3.6 includes a multimodal version (Qwen-VL) that uses a vision encoder and projector.

    multimodalhigh
  4. Invented by Stanford · 2022-05 · Velocity 4y

    Qwen models utilize FlashAttention for efficient training and inference.

    inferencehigh
  5. Invented by University of Tokyo · 2022-05 · Velocity 4y

    Qwen 3.6 exhibits strong zero-shot reasoning capabilities.

    reasoningmedium
  6. Invented by Google · 2022-03 · Velocity 4y

    Qwen 3.6 demonstrates improved reasoning, which can be enhanced with self-consistency.

    reasoningmedium
  7. Invented by Google · 2022-01 · Velocity 4y

    Qwen 3.6 demonstrates strong reasoning capabilities, including step-by-step reasoning.

    reasoninghigh
  8. Invented by Google · 2021-09 · Velocity 5y

    Qwen models are instruction-tuned on a large collection of datasets.

    traininghigh
  9. Invented by Microsoft · 2021-06 · Velocity 5y

    Qwen models support LoRA for efficient fine-tuning.

    traininghigh
  10. Invented by Zhuiyi Technology · 2021-04 · Velocity 5y

    Qwen models use Rotary Position Embedding (RoPE) for positional encoding.

    architecturehigh
  11. Invented by Google · 2020-10 · Velocity 5y

    Qwen 3.6's multimodal version uses a Vision Transformer (ViT) as its vision encoder.

    multimodalhigh
  12. Invented by Google · 2017-06 · Velocity 9y

    Qwen 3.6 is a Transformer-based large language model.

    architecturehigh
  13. Invented by Google · 2017-01 · Velocity 9y

    Qwen 3.6 includes a MoE version (Qwen 3.6 MoE) with 14B active parameters.

    architecturehigh

This recipe is part of the gentic.news Deployment Atlas. Every ingredient has an origin paper + evidence. Methodology is public. Dataset is CC BY 4.0.