open models
30 articles about open models in AI news
AI2's MolmoWeb: Open 8B-Parameter Web Agent Navigates Using Screenshots, Challenges Proprietary Systems
The Allen Institute for AI released MolmoWeb, a fully open web agent that operates websites using only screenshots. The 8B-parameter model outperforms other open models and approaches proprietary performance, with all training data and weights publicly released.
NVIDIA Nemotron 3 Super: 120B Hybrid Mamba-Transformer MoE with 1M Context
NVIDIA has released Nemotron 3 Super, a 120B parameter open hybrid Mamba-Transformer Mixture of Experts model with 12B active parameters and 1M token context length. The company claims it delivers up to 7.5x higher throughput than similar open models.
How to Build a Claude Code Fallback System with Hermes Agent and Qwen3.6
Set up Hermes Agent with open models as a cost-effective Claude Code alternative for routine tasks, reserving Claude for complex refactors.
NVIDIA Introduces Ising: World's First Open AI Models for Quantum System Acceleration
NVIDIA has launched Ising, the world's first open AI models designed to accelerate quantum computing workflows, enabling researchers and enterprises to use AI for scalable quantum processor calibration and high-performance quantum systems.
Google Launches Fully Open-Source Gemma 4 AI Models Under Apache 2.0 License
Google has released Gemma 4, a new family of open-source AI models available under the permissive Apache 2.0 license. The models are designed to run locally on various devices including servers, phones, and Raspberry Pi, marking Google's renewed commitment to the open-source AI ecosystem.
Google's Gemma4 Models Lead in Small-Scale Open LLM Performance, According to Developer Analysis
Independent developer analysis indicates Google's Gemma4 models are currently the top-performing open-source small language models, with a significant lead in model behavior over alternatives.
Perceptron AI Launches Open-Source MCP for Robust Receipt OCR via Isaac Models
Perceptron AI has released an open-source Model Context Protocol (MCP) server that uses its Isaac vision models to extract structured data from messy, real-world receipts. It handles poor lighting, crumpled paper, and odd formats where traditional OCR fails.
Alibaba's Qwen Team Announces More Open-Source Models Coming at ModelScope DevCon
Alibaba's Qwen team announced at the ModelScope DevCon in Nanjing that they will release more open-source Qwen models. This signals continued investment in their competitive open-weight LLM series.
Fine-Tuning OpenAI's GPT-OSS 20B: A Practitioner's Guide to LoRA on MoE Models
A technical guide details the practical challenges and solutions for fine-tuning OpenAI's 20-billion parameter GPT-OSS model using LoRA. This is crucial for efficiently adapting large, complex MoE models to specific business domains.
China's Top Open-Source AI Models Have Overtaken US Counterparts, Analysis Shows
Analysis indicates China's best open-source AI models have surpassed US equivalents. Leadership in open-source could accelerate global adoption through downloads and on-prem deployment.
Open-Source Web UI 'LLM Studio' Enables Local Fine-Tuning of 500+ Models, Including GGUF and Multimodal
LLM Studio, a free and open-source web interface, allows users to fine-tune over 500 large language models locally on their own hardware. It supports GGUF-quantized models, vision, audio, and embedding models across Mac, Windows, and Linux.
Sarvam AI's Open-Source Models Signal India's Arrival in Global AI Race
Sarvam AI has open-sourced two reasoning models—Sarvam 30B and 105B—positioning India as a competitive player in global AI. The breakthrough lies not just in benchmark scores but in a full-stack approach: in-house data, training, RL, tokenizer design, and optimized inference for both frontier GPUs and consumer devices.
OpenAI's New Safety Metric Reveals AI Models Struggle to Control Their Own Reasoning
OpenAI has introduced 'CoT controllability' as a new safety metric, revealing that AI models like GPT-5.4 Thinking struggle to deliberately manipulate their own reasoning processes. The company views this limitation as encouraging for AI safety, suggesting models lack dangerous self-modification capabilities.
China's Open-Source AI Narrows Gap: Sonnet-Level Models Expected Within Months
Chinese AI developers are reportedly just five months behind US models like Claude Sonnet 4.5, with open-source alternatives expected to reach Sonnet 4.6/Opus levels by early 2025. This acceleration could reshape global AI accessibility and competition.
OpenAI's Audio Revolution: New Voice Models Signal Major AI Advancements
OpenAI appears poised to release new audio models that could significantly enhance voice interaction capabilities. This development follows recent trademark filings and suggests major improvements to voice mode technology.
China's Open-Source AI Surge: How Local Models Are Redefining Global Competition
Chinese AI companies are rapidly advancing open-source models, challenging Western dominance. Led by breakthroughs like DeepSeek's R1, these developments signal a major shift in global AI innovation and accessibility.
Google's Gemma 4 Emerges: The Next Generation of Open AI Models
Google has announced the upcoming release of Gemma 4, the next iteration of its open-source AI model family. This development signals Google's continued commitment to accessible AI technology and intensified competition in the open model space.
Trillion-Parameter AI Goes Open Source: AntLingAGI's Ring-2.5-1T Democratizes Massive Models
AntLingAGI has open-sourced Ring-2.5-1T, a trillion-parameter AI model that runs on consumer-grade GPUs at half the cost of comparable systems. This breakthrough eliminates traditional barriers like lab access, waitlists, and multi-million dollar compute clusters.
Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments
OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.
Open-Source Video Downloader ytDownl Emerges, Challenging Platform Restrictions and Ad Models
A developer has open-sourced ytDownl, a desktop application capable of downloading videos from over 1,000 websites without advertisements. The tool represents a significant shift in user-controlled content access and raises questions about digital ownership and platform ecosystems.
dLLM Framework Unifies Diffusion Language Models, Opening New Frontiers in AI Text Generation
Researchers have introduced dLLM, a unified framework that standardizes training, inference, and evaluation for diffusion language models. This breakthrough enables conversion of existing models like BERT into diffusion architectures and facilitates reproduction of cutting-edge models like LLaDA and Dream.
DeepSeek V4-Pro: 1.6T parameters, open weights, undercuts rivals 10x
DeepSeek unveiled V4-Pro and V4-Flash, its largest open-weight models with up to 1.6 trillion parameters and a 1M-token context window. The new hybrid attention architecture cuts compute for long contexts by 73–90%, enabling prices far below OpenAI, Google, and Anthropic.
GPT-ImageGen-2 Likely Uses AI Models as Prompt Generators
Evidence suggests OpenAI's upcoming image model, GPT-ImageGen-2, operates as a tool where AI models generate the prompts, not users. This marks a shift from the transparent prompt display seen in DALL-E 3.
Moonshot AI's Kimi K2.6 Hits 58.6% on SWE-Bench Pro, Leads Open-Source Coding
Moonshot AI released Kimi K2.6, an open-source coding model achieving 58.6% on SWE-Bench Pro and 54.0% on HLE with tools. This positions it as a top-tier open alternative to proprietary models like Claude 3.5 Sonnet.
Qwen 3.6 Released: Free, Open-Weights Model for Local AI Coding
Alibaba's Qwen team released Qwen 3.6, an open-weights AI model for local deployment. This provides a free, private alternative to ID-verified models like Anthropic's Mythos and OpenAI's Codex.
Mystery 'Elephant Alpha' 100B Model Tops OpenRouter Leaderboard
An unidentified 100B-parameter AI model named 'Elephant Alpha' has appeared at the top of OpenRouter's performance leaderboard without any announcement or model card, beating several established paid models.
MLX-VLM Adds Continuous Batching, OpenAI API, and Vision Cache for Apple Silicon
The next release of MLX-VLM will introduce continuous batching, an OpenAI-compatible API, and vision feature caching for multimodal models running locally on Apple Silicon. These optimizations promise up to 228x speedups on cache hits for models like Gemma4.
OpenVoice v2: Complete Voice Cloning Directory Launches on GitHub
A developer has compiled and released a comprehensive directory of open-source voice cloning tools and resources on GitHub. This centralizes access to models, datasets, and training code, lowering the barrier to entry for AI audio development.
Altman: Next-Gen AI Models to Aid 'Career-Defining' Scientific Discovery
OpenAI CEO Sam Altman stated that upcoming AI models will assist researchers in making 'career-defining' discoveries, though he tempered expectations of immediate Nobel-level breakthroughs.
OpenAI Shifts ChatGPT Ads to CPC, Targets $11B Revenue by 2027
OpenAI is restructuring ChatGPT advertising, moving from impression-based pricing to cost-per-click and conversion-driven models. This shift aims to compete directly with Google and Meta in intent-based advertising, targeting $2.4B revenue this year and $11B by 2027.