gentic.news — AI News Intelligence Platform

Recipe

DeepSeek-R1

DeepSeek-R1 is a 671-billion-parameter reasoning model developed by DeepSeek, trained via reinforcement learning to achieve state-of-the-art performance on coding and reasoning benchmarks.

Techniques inside: 7
Median research → prod: 4y
Fastest adoption: 1.6y
Slowest adoption: 9y
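
The summary figures can be cross-checked against the per-technique velocity values given in the ingredient list. The sketch below assumes the median, fastest, and slowest figures are taken directly over those seven values; the page does not state how it computes them, and the variable names are illustrative only.

```python
from statistics import median

# Velocity values (in years) as listed for the seven ingredients on this page.
velocities = [1.6, 3, 4, 4, 4, 9, 9]

# Assumption: the headline stats are plain aggregates over the list.
summary = {
    "techniques": len(velocities),
    "median_years": median(velocities),
    "fastest_years": min(velocities),
    "slowest_years": max(velocities),
}
print(summary)
```

Under that assumption the aggregates agree with the figures shown: 7 techniques, a 4-year median, 1.6 years fastest, and 9 years slowest.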

Ingredient list

  1. Invented by Google DeepMind · 2024-08 · Velocity 1.6y

    Employs iterative refinement and multiple reasoning samples at inference time.

    reasoning · high
  2. Invented by OpenAI · 2023-05 · Velocity 3y

    Uses step-level reward models to evaluate intermediate reasoning steps.

    reasoning · high
  3. Invented by Google · 2022-03 · Velocity 4y

    Uses majority voting over multiple reasoning paths to improve answer accuracy.

    reasoning · high
  4. Invented by OpenAI · 2022-03 · Velocity 4y

    Trained via reinforcement learning from human feedback to align with preferences.

    alignment · high
  5. Invented by Google · 2022-01 · Velocity 4y

    DeepSeek-R1 is a reasoning model that generates step-by-step reasoning traces.

    reasoning · high
  6. Invented by Google · 2017-06 · Velocity 9y

    Built on the transformer architecture with self-attention mechanisms.

    architecture · high
  7. Invented by Google · 2017-01 · Velocity 9y

    The 671B-parameter model uses a sparse mixture-of-experts architecture.

    architecture · high
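
Ingredient 3 describes majority voting over multiple reasoning paths. A minimal sketch of that voting step follows, assuming each sampled path has already been reduced to a final answer string. The `majority_vote` helper and the sample answers are hypothetical illustrations, not DeepSeek's implementation.

```python
from collections import Counter

def majority_vote(answers):
    """Return the most common final answer and its share of the votes."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    # Normalize whitespace so " 42" and "42" count as the same answer.
    counts = Counter(a.strip() for a in answers)
    winner, votes = counts.most_common(1)[0]
    return winner, votes / len(answers)

# Hypothetical final answers extracted from five sampled reasoning paths.
sampled = ["42", "42", "41", "42", "40"]
answer, share = majority_vote(sampled)
print(answer, share)
```

When two answers tie, `Counter.most_common` returns the one encountered first, so a real pipeline may want an explicit tie-breaking rule.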

This recipe is part of the gentic.news Deployment Atlas. Every ingredient has an origin paper + evidence. Methodology is public. Dataset is CC BY 4.0.
