Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

open models

30 articles about open models in AI news

AI2's MolmoWeb: Open 8B-Parameter Web Agent Navigates Using Screenshots, Challenges Proprietary Systems

The Allen Institute for AI released MolmoWeb, a fully open web agent that operates websites using only screenshots. The 8B-parameter model outperforms other open models and approaches proprietary performance, with all training data and weights publicly released.

100% relevant

NVIDIA Nemotron 3 Super: 120B Hybrid Mamba-Transformer MoE with 1M Context

NVIDIA has released Nemotron 3 Super, a 120B parameter open hybrid Mamba-Transformer Mixture of Experts model with 12B active parameters and 1M token context length. The company claims it delivers up to 7.5x higher throughput than similar open models.

95% relevant

How to Build a Claude Code Fallback System with Hermes Agent and Qwen3.6

Set up Hermes Agent with open models as a cost-effective Claude Code alternative for routine tasks, reserving Claude for complex refactors.

100% relevant

NVIDIA Introduces Ising: World's First Open AI Models for Quantum System Acceleration

NVIDIA has launched Ising, the world's first open AI models designed to accelerate quantum computing workflows, enabling researchers and enterprises to use AI for scalable quantum processor calibration and high-performance quantum systems.

85% relevant

Google Launches Fully Open-Source Gemma 4 AI Models Under Apache 2.0 License

Google has released Gemma 4, a new family of open-source AI models available under the permissive Apache 2.0 license. The models are designed to run locally on various devices including servers, phones, and Raspberry Pi, marking Google's renewed commitment to the open-source AI ecosystem.

89% relevant

Google's Gemma4 Models Lead in Small-Scale Open LLM Performance, According to Developer Analysis

Independent developer analysis indicates Google's Gemma4 models are currently the top-performing open-source small language models, with a significant lead in model behavior over alternatives.

85% relevant

Perceptron AI Launches Open-Source MCP for Robust Receipt OCR via Isaac Models

Perceptron AI has released an open-source Model Context Protocol (MCP) server that uses its Isaac vision models to extract structured data from messy, real-world receipts. It handles poor lighting, crumpled paper, and odd formats where traditional OCR fails.

93% relevant

Alibaba's Qwen Team Announces More Open-Source Models Coming at ModelScope DevCon

Alibaba's Qwen team announced at the ModelScope DevCon in Nanjing that they will release more open-source Qwen models. This signals continued investment in their competitive open-weight LLM series.

85% relevant

Fine-Tuning OpenAI's GPT-OSS 20B: A Practitioner's Guide to LoRA on MoE Models

A technical guide details the practical challenges and solutions for fine-tuning OpenAI's 20-billion parameter GPT-OSS model using LoRA. This is crucial for efficiently adapting large, complex MoE models to specific business domains.

100% relevant

China's Top Open-Source AI Models Have Overtaken US Counterparts, Analysis Shows

Analysis indicates China's best open-source AI models have surpassed US equivalents. Leadership in open-source could accelerate global adoption through downloads and on-prem deployment.

85% relevant

Open-Source Web UI 'LLM Studio' Enables Local Fine-Tuning of 500+ Models, Including GGUF and Multimodal

LLM Studio, a free and open-source web interface, allows users to fine-tune over 500 large language models locally on their own hardware. It supports GGUF-quantized models, vision, audio, and embedding models across Mac, Windows, and Linux.

85% relevant

Sarvam AI's Open-Source Models Signal India's Arrival in Global AI Race

Sarvam AI has open-sourced two reasoning models—Sarvam 30B and 105B—positioning India as a competitive player in global AI. The breakthrough lies not just in benchmark scores but in a full-stack approach: in-house data, training, RL, tokenizer design, and optimized inference for both frontier GPUs and consumer devices.

85% relevant

OpenAI's New Safety Metric Reveals AI Models Struggle to Control Their Own Reasoning

OpenAI has introduced 'CoT controllability' as a new safety metric, revealing that AI models like GPT-5.4 Thinking struggle to deliberately manipulate their own reasoning processes. The company views this limitation as encouraging for AI safety, suggesting models lack dangerous self-modification capabilities.

75% relevant

China's Open-Source AI Narrows Gap: Sonnet-Level Models Expected Within Months

Chinese AI developers are reportedly just five months behind US models like Claude Sonnet 4.5, with open-source alternatives expected to reach Sonnet 4.6/Opus levels by early 2025. This acceleration could reshape global AI accessibility and competition.

85% relevant

OpenAI's Audio Revolution: New Voice Models Signal Major AI Advancements

OpenAI appears poised to release new audio models that could significantly enhance voice interaction capabilities. This development follows recent trademark filings and suggests major improvements to voice mode technology.

85% relevant

China's Open-Source AI Surge: How Local Models Are Redefining Global Competition

Chinese AI companies are rapidly advancing open-source models, challenging Western dominance. Led by breakthroughs like DeepSeek's R1, these developments signal a major shift in global AI innovation and accessibility.

75% relevant

Google's Gemma 4 Emerges: The Next Generation of Open AI Models

Google has announced the upcoming release of Gemma 4, the next iteration of its open-source AI model family. This development signals Google's continued commitment to accessible AI technology and intensified competition in the open model space.

85% relevant

Trillion-Parameter AI Goes Open Source: AntLingAGI's Ring-2.5-1T Democratizes Massive Models

AntLingAGI has open-sourced Ring-2.5-1T, a trillion-parameter AI model that runs on consumer-grade GPUs at half the cost of comparable systems. This breakthrough eliminates traditional barriers like lab access, waitlists, and multi-million dollar compute clusters.

95% relevant

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

85% relevant

Open-Source Video Downloader ytDownl Emerges, Challenging Platform Restrictions and Ad Models

A developer has open-sourced ytDownl, a desktop application capable of downloading videos from over 1,000 websites without advertisements. The tool represents a significant shift in user-controlled content access and raises questions about digital ownership and platform ecosystems.

85% relevant

dLLM Framework Unifies Diffusion Language Models, Opening New Frontiers in AI Text Generation

Researchers have introduced dLLM, a unified framework that standardizes training, inference, and evaluation for diffusion language models. This breakthrough enables conversion of existing models like BERT into diffusion architectures and facilitates reproduction of cutting-edge models like LLaDA and Dream.

85% relevant

DeepSeek V4-Pro: 1.6T parameters, open weights, undercuts rivals 10x

DeepSeek unveiled V4-Pro and V4-Flash, its largest open-weight models with up to 1.6 trillion parameters and a 1M-token context window. The new hybrid attention architecture cuts compute for long contexts by 73–90%, enabling prices far below OpenAI, Google, and Anthropic.

100% relevant

GPT-ImageGen-2 Likely Uses AI Models as Prompt Generators

Evidence suggests OpenAI's upcoming image model, GPT-ImageGen-2, operates as a tool where AI models generate the prompts, not users. This marks a shift from the transparent prompt display seen in DALL-E 3.

85% relevant

Moonshot AI's Kimi K2.6 Hits 58.6% on SWE-Bench Pro, Leads Open-Source Coding

Moonshot AI released Kimi K2.6, an open-source coding model achieving 58.6% on SWE-Bench Pro and 54.0% on HLE with tools. This positions it as a top-tier open alternative to proprietary models like Claude 3.5 Sonnet.

100% relevant

Qwen 3.6 Released: Free, Open-Weights Model for Local AI Coding

Alibaba's Qwen team released Qwen 3.6, an open-weights AI model for local deployment. This provides a free, private alternative to ID-verified models like Anthropic's Mythos and OpenAI's Codex.

100% relevant

Mystery 'Elephant Alpha' 100B Model Tops OpenRouter Leaderboard

An unidentified 100B-parameter AI model named 'Elephant Alpha' has appeared at the top of OpenRouter's performance leaderboard without any announcement or model card, beating several established paid models.

97% relevant

MLX-VLM Adds Continuous Batching, OpenAI API, and Vision Cache for Apple Silicon

The next release of MLX-VLM will introduce continuous batching, an OpenAI-compatible API, and vision feature caching for multimodal models running locally on Apple Silicon. These optimizations promise up to 228x speedups on cache hits for models like Gemma4.

95% relevant

OpenVoice v2: Complete Voice Cloning Directory Launches on GitHub

A developer has compiled and released a comprehensive directory of open-source voice cloning tools and resources on GitHub. This centralizes access to models, datasets, and training code, lowering the barrier to entry for AI audio development.

85% relevant

Altman: Next-Gen AI Models to Aid 'Career-Defining' Scientific Discovery

OpenAI CEO Sam Altman stated that upcoming AI models will assist researchers in making 'career-defining' discoveries, though he tempered expectations of immediate Nobel-level breakthroughs.

87% relevant

OpenAI Shifts ChatGPT Ads to CPC, Targets $11B Revenue by 2027

OpenAI is restructuring ChatGPT advertising, moving from impression-based pricing to cost-per-click and conversion-driven models. This shift aims to compete directly with Google and Meta in intent-based advertising, targeting $2.4B revenue this year and $11B by 2027.

95% relevant