AI Model Leaderboard

2 mentions/7d · research milestone

Google's upcoming Gemma 4 is an open-source AI model designed for efficient, high-performance local execution on devices like smartphones.

GPT-3.5

OpenAI

fading

2 mentions/7d · research milestone

Spark

Meta

1 mention/7d · product launch

Meta Superintelligence Labs developed Muse Spark, its first competitive model from a rebuilt infrastructure aimed at personal superintelligence.

MiniMax M2.5

MiniMax

cooling

MiniMax M2.5, developed by MiniMax, is a frontier AI model designed for real-world productivity and agents, achieving state-of-the-art coding performance with high speed and unmatched cost efficiency.

1 mention/7d · product launch

LLaMA 3

Meta

1 mention/7d · has benchmarks · research milestone

LLaMA 3 is a large language model from Meta, released in two primary sizes (8B and 70B parameters) and trained on approximately 15 trillion tokens for enhanced reasoning and coding capabilities.

DeepSeek-V3

DeepSeek

1 mention/7d · has benchmarks

DeepSeek-V3, developed by DeepSeek, is a highly efficient mixture-of-experts language model trained at a fraction of the cost of comparable systems while maintaining strong performance.

Qwen 3.5 Medium

Alibaba

1 mention/7d · research milestone

Qwen 3.5 Medium is an efficiency-focused, open-weight model from Alibaba's Qwen family. It outperforms Qwen 2.5 235B with 7x fewer active parameters and competes with Nemotron-Cascade and Mistral.

Claude 3.5 Sonnet

Anthropic

1 mention/7d · has benchmarks · research milestone

Claude 3.5 Sonnet is a large language model developed by Anthropic, first released on June 20, 2024, as part of the Claude 3.5 family. It achieves an MMLU-Pro score of 78.0, an Arena ELO rating of

Llama 3.1 70B

Meta

Meta's Llama 3.1 70B is a 70-billion-parameter large language model, released in July 2024, offering strong performance in text generation and instruction-following tasks.

1 mention/7d

Llama 3 8B

Meta

Llama 3 8B, developed by Meta, is an efficient open-source large language model designed for strong performance at a smaller scale.

1 mention/7d

GPT-4 Turbo

OpenAI

1 mention/7d · product launch

GPT-4 Turbo, developed by OpenAI, is a large language model featuring a 128K context window, faster response times, and more cost-effective operation than its predecessor.

GPT-5.2 Pro

OpenAI

fading

GPT-5.2 is OpenAI's latest flagship large language model, released on December 11, 2025. Succeeding GPT-5.1, it is a family of three large language models within the GPT series. It comes in two modes:

DeepSeek-R1

DeepSeek

quiet

DeepSeek-R1 is a 671-billion-parameter reasoning model developed by DeepSeek, trained via reinforcement learning to achieve state-of-the-art performance on coding and reasoning benchmarks.

Kimi K2.5

Moonshot AI

quiet

Kimi K2.5 is an open-source, multimodal AI model from Moonshot AI, featuring 1 trillion parameters, vision capabilities, and Agent Swarm technology for complex task orchestration.

GPT-5.3-Codex

OpenAI

quiet

GPT-5.1 is a family of four large language models within OpenAI's GPT series. Two were released on November 12, 2025; two more were released one week later on November 19.