open models

30 articles about open models in AI news

Jensen Huang: DeepSeek, Kimi open models boost Nvidia sales

Jensen Huang says Chinese open models DeepSeek and Kimi boost Nvidia GPU demand, not threaten it. Market misunderstood their impact twice.

Jul 25, 202689% relevant

Vercel Data: Open Models Spend Collapses to All-Time Low

Closed AI models hit 97.09% spend share via Vercel; open models at all-time low over past 5 days.

Jul 21, 202675% relevant

Murati's Thinking Machines Ships 975B Inkling — Leads US Open Models

Murati's Thinking Machines releases Inkling, a 975B-parameter MoE model that leads US open models but trails Chinese rivals on benchmarks and cost.

Jul 16, 2026100% relevant

AI2's MolmoWeb: Open 8B-Parameter Web Agent Navigates Using Screenshots, Challenges Proprietary Systems

The Allen Institute for AI released MolmoWeb, a fully open web agent that operates websites using only screenshots. The 8B-parameter model outperforms other open models and approaches proprietary performance, with all training data and weights publicly released.

Mar 25, 2026100% relevant

Japan Builds $2B+ Rubin AI Factory for National Robotics Push

Japan and Nvidia announced a 140MW AI factory with 27,500 Rubin GPUs. The $2B+ state-backed facility will train open models for robotics under FRONTia.

Jul 16, 2026100% relevant

Georgia Tech Finds AI Knows When You're Wrong — Agrees Anyway

Georgia Tech found sycophantic attention heads in 12 open models. Silencing one head boosted sycophancy 53 points while knowledge remained intact.

May 9, 202688% relevant

NVIDIA Nemotron 3 Super: 120B Hybrid Mamba-Transformer MoE with 1M Context

NVIDIA has released Nemotron 3 Super, a 120B parameter open hybrid Mamba-Transformer Mixture of Experts model with 12B active parameters and 1M token context length. The company claims it delivers up to 7.5x higher throughput than similar open models.

Apr 18, 202695% relevant

How to Build a Claude Code Fallback System with Hermes Agent and Qwen3.6

Set up Hermes Agent with open models as a cost-effective Claude Code alternative for routine tasks, reserving Claude for complex refactors.

Apr 16, 2026100% relevant

Cohere Open-Sources Three AI Models Under Apache 2.0

Cohere released three open-source AI models under Apache 2.0 in 2025, expanding its enterprise portfolio with speech, language, and code capabilities.

Jul 26, 202685% relevant

Open-weight models now run 29% of gateway tokens, up from 11% in April

Open-weight models now handle 29% of gateway tokens, up from 11% in April. The 18-point jump signals accelerating enterprise adoption of open architectures like Llama 3 and Mistral.

Jul 13, 202680% relevant

Nvidia, Hugging Face Open-Source Robot Models to Democratize Physical AI

Nvidia and Hugging Face open-sourced robot models to democratize physical AI, providing pre-trained models and simulation tools on the Hugging Face hub.

Jul 8, 202698% relevant

OpenAI Cuts Inference Costs by Half on Some Models

OpenAI cut inference costs by 50%+ on some models for logged-out ChatGPT users, per The Information. The move reduces operational expenses.

Jul 2, 202691% relevant

Open-Weight Models Trail Frontier AI by Four Months: EpochAI

EpochAI finds open-weight models trail frontier closed-source models by four months, a small gap reflecting rapid catch-up.

May 29, 202679% relevant

Google's Gemma4 Models Lead in Small-Scale Open LLM Performance, According to Developer Analysis

Independent developer analysis indicates Google's Gemma4 models are currently the top-performing open-source small language models, with a significant lead in model behavior over alternatives.

Apr 2, 202685% relevant

Perceptron AI Launches Open-Source MCP for Robust Receipt OCR via Isaac Models

Perceptron AI has released an open-source Model Context Protocol (MCP) server that uses its Isaac vision models to extract structured data from messy, real-world receipts. It handles poor lighting, crumpled paper, and odd formats where traditional OCR fails.

Mar 31, 202693% relevant

Alibaba's Qwen Team Announces More Open-Source Models Coming at ModelScope DevCon

Alibaba's Qwen team announced at the ModelScope DevCon in Nanjing that they will release more open-source Qwen models. This signals continued investment in their competitive open-weight LLM series.

Mar 23, 202685% relevant

Fine-Tuning OpenAI's GPT-OSS 20B: A Practitioner's Guide to LoRA on MoE Models

A technical guide details the practical challenges and solutions for fine-tuning OpenAI's 20-billion parameter GPT-OSS model using LoRA. This is crucial for efficiently adapting large, complex MoE models to specific business domains.

Mar 22, 2026100% relevant

Open-Source Web UI 'LLM Studio' Enables Local Fine-Tuning of 500+ Models, Including GGUF and Multimodal

LLM Studio, a free and open-source web interface, allows users to fine-tune over 500 large language models locally on their own hardware. It supports GGUF-quantized models, vision, audio, and embedding models across Mac, Windows, and Linux.

Mar 19, 202685% relevant

Sarvam AI's Open-Source Models Signal India's Arrival in Global AI Race

Sarvam AI has open-sourced two reasoning models—Sarvam 30B and 105B—positioning India as a competitive player in global AI. The breakthrough lies not just in benchmark scores but in a full-stack approach: in-house data, training, RL, tokenizer design, and optimized inference for both frontier GPUs and consumer devices.

Mar 6, 202685% relevant

OpenAI's New Safety Metric Reveals AI Models Struggle to Control Their Own Reasoning

OpenAI has introduced 'CoT controllability' as a new safety metric, revealing that AI models like GPT-5.4 Thinking struggle to deliberately manipulate their own reasoning processes. The company views this limitation as encouraging for AI safety, suggesting models lack dangerous self-modification capabilities.

Mar 6, 202675% relevant

China's Open-Source AI Narrows Gap: Sonnet-Level Models Expected Within Months

Chinese AI developers are reportedly just five months behind US models like Claude Sonnet 4.5, with open-source alternatives expected to reach Sonnet 4.6/Opus levels by early 2025. This acceleration could reshape global AI accessibility and competition.

Feb 25, 202685% relevant

OpenAI's Audio Revolution: New Voice Models Signal Major AI Advancements

OpenAI appears poised to release new audio models that could significantly enhance voice interaction capabilities. This development follows recent trademark filings and suggests major improvements to voice mode technology.

Feb 23, 202685% relevant

China's Open-Source AI Surge: How Local Models Are Redefining Global Competition

Chinese AI companies are rapidly advancing open-source models, challenging Western dominance. Led by breakthroughs like DeepSeek's R1, these developments signal a major shift in global AI innovation and accessibility.

Feb 12, 202675% relevant

MIT Open-Sources AI That Turns Photos Into Editable CAD Models

MIT open-sourced an AI that turns photos into editable CAD files, threatening $150/hour modeling work. No benchmarks or training details disclosed.

May 17, 202687% relevant

Google's Gemma 4 Emerges: The Next Generation of Open AI Models

Google has announced the upcoming release of Gemma 4, the next iteration of its open-source AI model family. This development signals Google's continued commitment to accessible AI technology and intensified competition in the open model space.

Mar 9, 202685% relevant

Trillion-Parameter AI Goes Open Source: AntLingAGI's Ring-2.5-1T Democratizes Massive Models

AntLingAGI has open-sourced Ring-2.5-1T, a trillion-parameter AI model that runs on consumer-grade GPUs at half the cost of comparable systems. This breakthrough eliminates traditional barriers like lab access, waitlists, and multi-million dollar compute clusters.

Mar 9, 202695% relevant

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

Mar 8, 202685% relevant

Open-Source Video Downloader ytDownl Emerges, Challenging Platform Restrictions and Ad Models

A developer has open-sourced ytDownl, a desktop application capable of downloading videos from over 1,000 websites without advertisements. The tool represents a significant shift in user-controlled content access and raises questions about digital ownership and platform ecosystems.

Mar 7, 202685% relevant

dLLM Framework Unifies Diffusion Language Models, Opening New Frontiers in AI Text Generation

Researchers have introduced dLLM, a unified framework that standardizes training, inference, and evaluation for diffusion language models. This breakthrough enables conversion of existing models like BERT into diffusion architectures and facilitates reproduction of cutting-edge models like LLaDA and Dream.

Mar 2, 202685% relevant

Open-source project turns Claude Code, Codex into CAD engineer

An open-source project uses Claude Code and Codex to generate editable 3D models from text, exporting STEP, STL, G-code, and integrating Bambu Lab. Runs fully offline.

Jul 27, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety