model release
30 articles about model release in AI news
DeepSeek-V4 Rumored as 'Whale' Returns, Signaling Major Model Release
DeepSeek's cryptic 'whale' codename has reappeared, strongly hinting at the impending launch of DeepSeek-V4. This follows the company's pattern of using the whale symbol before major model releases.
Baidu's Qianfan-OCR End-to-End Document Intelligence Model Released on Hugging Face
Baidu has released Qianfan-OCR, an end-to-end document intelligence model, on Hugging Face. The model appears to be a unified framework for optical character recognition and document understanding tasks.
Qwen2.5-7B-Instruct 4-bit DWQ Model Released for Apple MLX
A developer has ported a 4-bit quantized Qwen2.5-7B-Instruct model to Apple's MLX framework. This makes the capable 7B model more efficient to run on Apple Silicon Macs.
DeepSeek Teases 'Much Larger' Base Model Release Amid Industry Silence and Hardware Challenges
DeepSeek staff confirmed a new, larger base model is coming soon, following months of quiet after reports of failed Huawei chip training. This comes as the Chinese AI lab faces heightened expectations after its breakthrough o1-level model in January 2025.
Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments
OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.
Meta Launches Muse Spark, First Model Since Zuck's AI Funding Push
Meta has launched a new AI model called Muse Spark. This is the company's first model release since CEO Mark Zuckerberg announced aggressive AI funding and a shift to open-source development in early 2026.
Minimax Confirms Abab 6.5 Pro Model as 'Minimax 2.7' in Teaser Announcement
Minimax has officially branded its upcoming Abab 6.5 Pro model as 'Minimax 2.7' in a teaser announcement. This confirms the company's next major model release is imminent.
Industry Executives Signal Unprecedented AI Acceleration, With GPT-5.4 and Opus 4.6 Cited as Successes
A confluence of executive commentary and rapid model releases points to an intense six-month acceleration in AI capability. Sam Altman states internal models have exceeded expectations, while open-source efforts like Qwen 3.5 narrow the gap with frontier labs.
Anthropic's Claude 3.7 Sonnet: The Dawn of Recursive Self-Improvement and Its Economic Warnings
Anthropic's latest AI developments reveal accelerated model releases, with Claude now writing 70-90% of its own code. The company warns of imminent white-collar job displacement and approaches the threshold of recursive self-improvement.
Trump Team Weighs Pre-Release AI Model Review Process
Trump admin discusses AI working group for pre-release model review. Briefed Anthropic, Google, OpenAI; no executive order yet.
Qwen 3.6 Released: Free, Open-Weights Model for Local AI Coding
Alibaba's Qwen team released Qwen 3.6, an open-weights AI model for local deployment. This provides a free, private alternative to ID-verified models like Anthropic's Mythos and OpenAI's Codex.
Kyutai Labs Releases OVIE: Single-Image Novel View Synthesis Model
French AI lab Kyutai Labs released OVIE, a novel view generation model trained only on single images, bypassing the need for costly multi-view datasets. This could democratize 3D content creation from 2D photos.
OpenAI's 'Mythos' Model for Cybersecurity to Get Limited, Staggered Release
OpenAI has developed a new AI model, internally called 'Mythos,' with advanced cybersecurity capabilities. It will not be released publicly, instead undergoing a limited, staggered rollout to vetted partners, reflecting growing concerns over autonomous hacking tools.
Google Releases Fully Open-Source Gemma 4 AI Model for Local Device Deployment
Google has launched Gemma 4, a fully open-source AI model family available under the Apache 2.0 license. The release marks Google's re-entry into the competitive open-source AI landscape with models optimized for local deployment, including on mobile devices.
Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal Capabilities
Google has released the Gemma 4 family of open-weight models, derived from Gemini 3 technology. The four models, ranging from 2B to 31B parameters and including a Mixture-of-Experts variant, are available under a permissive Apache 2.0 license and feature multimodal processing.
Anthropic's 'Spud' Model Expected in April, 'Mythos' in Q3 2026 as AI Release Cadence Accelerates
Anthropic's next major frontier model 'Spud' is reportedly scheduled for release in April 2026, with 'Mythos' potentially following in Q3. This aligns with an accelerating ~3-month release cadence across major labs, intensifying competition amid growing compute and energy bottlenecks.
Google Gemma 4 Model Reportedly in Testing, Signaling Next-Gen Open-Weight LLM Release
A developer reports that Google's Gemma 4 model is 'incoming' and currently being tested. This suggests the next iteration of Google's open-weight language model family is nearing release.
Tongyi Lab Releases World's First Open-Source Multi-Speaker AI Dubbing Model
Alibaba's Tongyi Lab has released the first open-source AI model capable of dubbing multi-speaker conversations, addressing one of the hardest problems in AI video generation. The model synchronizes voice with lip movements across multiple speakers in a single pass.
Mistral Releases Mistral Small 4, Claiming Significant Performance Jump Over Previous Models
Mistral AI has released Mistral Small 4, a new model in its 'Small' tier. The company claims it represents a major performance improvement over its predecessors, though no specific benchmarks are provided in the initial announcement.
Microsoft Releases GigaTIME: AI Model Generates Protein Maps from Standard Medical Images
Microsoft has released GigaTIME, an AI model that generates detailed spatial protein maps from standard, low-cost medical images like H&E stains. This could significantly reduce the cost and time of cancer tissue analysis.
Google, Microsoft, xAI Agree to US Gov Pre-Release AI Testing
Google, Microsoft, xAI agreed to US pre-release testing of frontier AI. Voluntary deal lacks enforcement, excludes open-weight models.
Claude Opus 4.7 Appears on Anthropic's Internal API, Hinting at Imminent Release
A new model identifier, 'Claude Opus 4.7', has been spotted on Anthropic's internal API. This suggests a forthcoming update to the flagship Opus line, potentially a minor version bump ahead of a larger release.
Google Releases TIPSv2 Vision Encoder for Multi-Task Dense Prediction
Google has released the TIPSv2-B/14 vision encoder model on Hugging Face. It performs three dense prediction tasks—depth estimation, surface normal prediction, and semantic segmentation—from a single backbone.
MedGemma 1.5 Technical Report Released, Details Multimodal Medical AI
Google DeepMind has published the technical report for MedGemma 1.5, detailing the architecture and capabilities of its open-source, multimodal medical AI model. This follows the initial Med-PaLM 2 release and represents a significant step in making specialized medical AI more accessible.
Massive Video Reasoning Dataset Released, Reportedly 1000x Larger Than Predecessors
An unverified report claims the release of a video reasoning dataset roughly 1000x larger than existing benchmarks. If true, it would be a significant resource for training next-generation video understanding models.
Zhipu AI Releases GLM-5.1, Claims Major Performance Gains Over GLM-5.0
Zhipu AI announced GLM-5.1, reporting a 'significant increase in evals' compared to GLM-5.0. The release continues China's rapid pace of open-source AI model development.
Meta to Release First LLM Built Under Scale AI's Alexandr Wang
Meta is set to release its first large language model developed under the technical leadership of Scale AI founder Alexandr Wang. While not fully open initially, the company plans to eventually open-source versions of the new model family.
Anthropic's Claude Mythos Compute Needs Delay Release, 'Spud' Likely First
Anthropic's leaked internal note reveals its next flagship model, Claude Mythos, is too computationally expensive for general release. The company states it needs to become 'much more efficient,' likely delaying Mythos and prioritizing the 'Spud' model.
OpenAI Image Generation V2 Release Imminent, Per Leak
A post from a known leaker indicates OpenAI's next image generation model, potentially DALL-E 4, is about to be released. This would mark a major competitive move in the rapidly evolving text-to-image space.
Unitree Robotics Releases UnifoLM-WBT-Dataset: A Large-Scale, Real-World Robotics Dataset for Embodied AI
Chinese robotics firm Unitree Robotics has open-sourced the UnifoLM-WBT-Dataset, a high-quality dataset derived from real-world robot operations. The release aims to accelerate training for embodied AI and large language models applied to physical systems.