tts
30 articles about tts in AI news
mlx-audio v0.4.3 Ships 6 New TTS Models, Slimmer Deps
mlx-audio v0.4.3 adds 6 TTS models, server concurrency, and slims dependencies, targeting Apple Silicon developers.
Google Launches Gemini 3.1 Flash TTS with Prompt-Controlled Speech
Google has launched Gemini 3.1 Flash TTS, a text-to-speech model featuring prompt-based voice control and support for over 70 languages. This release expands Google's multimodal AI offerings directly to developers.
OpenBMB Launches VoxCPM 2, an Open-Source TTS Model Rivaling Qwen3-TTS
OpenBMB has launched VoxCPM 2, an open-source text-to-speech AI model from China. The release is positioned as a direct competitor to Alibaba's Qwen3-TTS, expanding the open-source TTS landscape.
Qwen3-TTS Added to mlx-tune, Enabling Full Qwen Model Fine-Tuning on Apple Silicon Macs
The mlx-tune library now supports Qwen3-TTS, making the entire Qwen model stack—including the new text-to-speech model—fine-tunable on Apple Silicon Macs. This expands local AI development options for researchers and developers.
Mistral AI Releases Voxtral TTS: 4B-Parameter Open-Weight Model Clones Voices from 3-Second Audio in 9 Languages
Mistral AI has launched Voxtral TTS, its first open-weight text-to-speech model. The 4B-parameter model clones voices from three seconds of reference audio across nine languages, with a latency of 70ms, and scored higher on naturalness than ElevenLabs Flash v2.5 in human tests.
Mistral AI Launches Voxtral TTS: 3B-Parameter Open-Source Model Claims 63% Win Rate Over ElevenLabs Flash v2.5
Mistral AI released Voxtral TTS, a 3-billion-parameter open-weights text-to-speech model. It reportedly outperforms ElevenLabs Flash v2.5 in human preference tests, runs on 3 GB RAM, and clones voices from 5 seconds of audio.
LuxTTS Democratizes Voice Cloning: High-Quality Synthesis Now Runs on Consumer Hardware
LuxTTS, a new open-source text-to-speech model, enables realistic voice cloning from just 3 seconds of audio using only 1GB of VRAM. The system operates 150x faster than real-time and produces 48kHz audio, challenging proprietary solutions like ElevenLabs.
Massachusetts Launches Statewide AI Literacy Initiative with Google Partnership
Google partners with Massachusetts AI Hub to provide free AI training to all residents, including Google's AI Professional Certificate. This statewide initiative aims to democratize AI skills amid rapid technological transformation.
OpenBMB's VoxCPM 2: 2B-Param Open-Source TTS for Multilingual Voice
OpenBMB launched VoxCPM 2, a 2-billion-parameter open-source text-to-speech model. It generates multilingual, emotionally expressive speech from text descriptions and runs on consumer-grade hardware.
Microsoft Open-Sources VALL-E 2: A Zero-Shot TTS Model Achieving Human Parity in Speech Naturalness
Microsoft Research has open-sourced VALL-E 2, a neural codec language model for text-to-speech that achieves human parity in naturalness. It uses a novel 'Repetition-Aware Sampling' method to eliminate word repetition, a common failure mode in prior models.
Microsoft's VibeVoice Family Processes 60-Minute Audio in Single Pass, Eliminates Chunking for ASR & TTS
Microsoft open-sourced VibeVoice, a family of speech AI models that processes up to 60 minutes of audio without chunking. It delivers structured transcriptions with speaker diarization and generates 90-minute multi-speaker speech in one pass.
AWS Commits 2 Gigawatts of Trainium Capacity to OpenAI, Reveals 1.4 Million Chips Deployed
Amazon's $50B OpenAI deal includes a 2-gigawatt commitment of Trainium computing capacity. AWS disclosed 1.4 million Trainium chips are deployed, with over 1 million Trainium2 chips running Anthropic's Claude.
AI Chip Capacity Crisis: 10GW Left Through 2030, Prices Up Double Digits
The AI accelerator market has only 10 gigawatts of capacity left for contract through 2030, with 100GW already under contract. Prices are rising double digits as one competitor has stopped taking orders entirely.
Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion
Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity—equivalent to Microsoft's 2024 global data center footprint—and committed over $100 billion to AWS over the next decade. This infrastructure surge supports Claude's tripled run-rate revenue to over $30B and addresses consumer demand straining its systems.
Project N.O.M.A.D. Solar-Powered Mini PC Packs Local AI, Wikipedia, Khan Academy
Project N.O.M.A.D. is a 100% open-source, solar-powered mini PC designed for offline operation. It packs a local AI, all of Wikipedia, Khan Academy courses, offline maps, and medical guides, running on only 15 watts of power.
US Data Center Power Demand Hits 15 GW, Grid Constraints Emerge
US data center power demand reached 15 gigawatts in 2023, up from 11 GW in 2022. This rapid growth highlights a widening bottleneck: compute infrastructure is scaling faster than power delivery systems can support.
Fish Audio S2 Enables Word-Level Speech Control with Positional Tags, Beats GPT-4o in Human Preference Tests
Fish Audio S2 introduces a 100% open-source TTS model that uses inline positional tags for word-level vocal control, achieving 8/10 wins against GPT-4o and Gemini in human preference tests while generating audio nearly 5x faster than real-time.
Neurons Playing Doom: How Living Brain Cells Could Revolutionize Computing
Australian startup Cortical Labs is pioneering biological computing with a system that uses living human brain cells to perform computational tasks. Their CL1 computer consumes just 30 watts while learning to play Doom, potentially offering massive energy savings over traditional AI hardware.
Agentic AI Could Be Retail's Unexpected Savior, According to Industry Veteran
Retail C-suite veteran Karlyn Mattson argues that agentic AI's true promise for retail isn't just automation, but restoring the industry's lost creative and strategic edge by freeing human talent from routine tasks.
Nvidia's $2B Nebius Bet: Chip Giant Doubles Down on AI Infrastructure Empire
Nvidia will invest $2 billion in AI cloud specialist Nebius Group NV, expanding its strategic investments in companies that build data centers using its chips. The partnership aims to deploy over 5 gigawatts of AI-optimized data center capacity by 2030, equivalent to powering 4 million U.S. households.
China's Solar Surge: How AI and Infrastructure Integration Are Powering a Renewable Revolution
China has achieved its 2030 target of 1.2 terawatts of installed wind and solar capacity six years early, largely by transforming everyday infrastructure like parking lots and rooftops into distributed power plants. This unprecedented deployment pace highlights a strategic fusion of industrial policy, digital management, and infrastructure repurposing.
Amazon's $11 Billion AI Power Play: Inside the Indiana Data Center That's Reshaping Tech Infrastructure
Amazon is building an $11 billion AI data center campus in Indiana that will draw 2.2 gigawatts of power—enough for 1.7 million homes. This massive investment highlights the escalating infrastructure demands of artificial intelligence and the growing geographic shift in tech's physical footprint.
AWS Becomes OpenAI's Exclusive Third-Party Cloud Partner in Landmark Deal
OpenAI and Amazon have announced a multi-year strategic partnership making AWS the exclusive third-party cloud provider for OpenAI Frontier. The deal includes 2 gigawatts of Trainium capacity and co-creation of a Stateful Runtime Environment on Amazon Bedrock.
Kunluncore Files STAR Market IPO, Claims 32K GPU Cluster First
Kunluncore filed for a STAR Market IPO, claiming a 32K GPU cluster first, testing investor appetite for domestic AI chips.
Anthropic's 220K GPU Cluster: $5B Compute Bet Revealed
Anthropic reportedly has 220K NVIDIA GPUs and 310MW, implying a >$5B compute cluster, 3x OpenAI's largest.
NJ Voters Back AI Data Center Bans as NIMBY Push Grows
54% of New Jersey voters support banning AI data centers, per a Jan 2026 poll. Nationally, 49% oppose local construction, threatening hyperscaler buildout plans.
Anthropic Ships 10 Finance AI Agents as IPO Race with OpenAI Heats Up
Anthropic released 10 finance AI agents with Moody's data connectors. The launch intensifies the IPO race with OpenAI, backed by a $1.5B private equity JV.
UniVidX Generates Video From 1,000 Samples, SIGGRAPH 2026
UniVidX generates omni-directional video from <1,000 training samples, using diffusion priors with stochastic masking, accepted at SIGGRAPH 2026.
Michigan Utility Board Blocks $1.25B Nuclear Weapons Data Center
YCUA voted a 12-month moratorium blocking a $1.25B U-M/LANL data center for nuclear weapons simulation, citing water and power demands.
Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale
Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.