Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

tts

30 articles about tts in AI news

mlx-audio v0.4.3 Ships 6 New TTS Models, Slimmer Deps

mlx-audio v0.4.3 adds 6 TTS models, server concurrency, and slims dependencies, targeting Apple Silicon developers.

85% relevant

Google Launches Gemini 3.1 Flash TTS with Prompt-Controlled Speech

Google has launched Gemini 3.1 Flash TTS, a text-to-speech model featuring prompt-based voice control and support for over 70 languages. This release expands Google's multimodal AI offerings directly to developers.

93% relevant

OpenBMB Launches VoxCPM 2, an Open-Source TTS Model Rivaling Qwen3-TTS

OpenBMB has launched VoxCPM 2, an open-source text-to-speech AI model from China. The release is positioned as a direct competitor to Alibaba's Qwen3-TTS, expanding the open-source TTS landscape.

91% relevant

Qwen3-TTS Added to mlx-tune, Enabling Full Qwen Model Fine-Tuning on Apple Silicon Macs

The mlx-tune library now supports Qwen3-TTS, making the entire Qwen model stack—including the new text-to-speech model—fine-tunable on Apple Silicon Macs. This expands local AI development options for researchers and developers.

85% relevant

Mistral AI Releases Voxtral TTS: 4B-Parameter Open-Weight Model Clones Voices from 3-Second Audio in 9 Languages

Mistral AI has launched Voxtral TTS, its first open-weight text-to-speech model. The 4B-parameter model clones voices from three seconds of reference audio across nine languages, with a latency of 70ms, and scored higher on naturalness than ElevenLabs Flash v2.5 in human tests.

95% relevant

Mistral AI Launches Voxtral TTS: 3B-Parameter Open-Source Model Claims 63% Win Rate Over ElevenLabs Flash v2.5

Mistral AI released Voxtral TTS, a 3-billion-parameter open-weights text-to-speech model. It reportedly outperforms ElevenLabs Flash v2.5 in human preference tests, runs on 3 GB RAM, and clones voices from 5 seconds of audio.

95% relevant

LuxTTS Democratizes Voice Cloning: High-Quality Synthesis Now Runs on Consumer Hardware

LuxTTS, a new open-source text-to-speech model, enables realistic voice cloning from just 3 seconds of audio using only 1GB of VRAM. The system operates 150x faster than real-time and produces 48kHz audio, challenging proprietary solutions like ElevenLabs.

95% relevant

Massachusetts Launches Statewide AI Literacy Initiative with Google Partnership

Google partners with Massachusetts AI Hub to provide free AI training to all residents, including Google's AI Professional Certificate. This statewide initiative aims to democratize AI skills amid rapid technological transformation.

75% relevant

OpenBMB's VoxCPM 2: 2B-Param Open-Source TTS for Multilingual Voice

OpenBMB launched VoxCPM 2, a 2-billion-parameter open-source text-to-speech model. It generates multilingual, emotionally expressive speech from text descriptions and runs on consumer-grade hardware.

97% relevant

Microsoft Open-Sources VALL-E 2: A Zero-Shot TTS Model Achieving Human Parity in Speech Naturalness

Microsoft Research has open-sourced VALL-E 2, a neural codec language model for text-to-speech that achieves human parity in naturalness. It uses a novel 'Repetition-Aware Sampling' method to eliminate word repetition, a common failure mode in prior models.

95% relevant

Microsoft's VibeVoice Family Processes 60-Minute Audio in Single Pass, Eliminates Chunking for ASR & TTS

Microsoft open-sourced VibeVoice, a family of speech AI models that processes up to 60 minutes of audio without chunking. It delivers structured transcriptions with speaker diarization and generates 90-minute multi-speaker speech in one pass.

99% relevant

AWS Commits 2 Gigawatts of Trainium Capacity to OpenAI, Reveals 1.4 Million Chips Deployed

Amazon's $50B OpenAI deal includes a 2-gigawatt commitment of Trainium computing capacity. AWS disclosed 1.4 million Trainium chips are deployed, with over 1 million Trainium2 chips running Anthropic's Claude.

95% relevant

AI Chip Capacity Crisis: 10GW Left Through 2030, Prices Up Double Digits

The AI accelerator market has only 10 gigawatts of capacity left for contract through 2030, with 100GW already under contract. Prices are rising double digits as one competitor has stopped taking orders entirely.

97% relevant

Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion

Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity—equivalent to Microsoft's 2024 global data center footprint—and committed over $100 billion to AWS over the next decade. This infrastructure surge supports Claude's tripled run-rate revenue to over $30B and addresses consumer demand straining its systems.

97% relevant

Project N.O.M.A.D. Solar-Powered Mini PC Packs Local AI, Wikipedia, Khan Academy

Project N.O.M.A.D. is a 100% open-source, solar-powered mini PC designed for offline operation. It packs a local AI, all of Wikipedia, Khan Academy courses, offline maps, and medical guides, running on only 15 watts of power.

85% relevant

US Data Center Power Demand Hits 15 GW, Grid Constraints Emerge

US data center power demand reached 15 gigawatts in 2023, up from 11 GW in 2022. This rapid growth highlights a widening bottleneck: compute infrastructure is scaling faster than power delivery systems can support.

75% relevant

Fish Audio S2 Enables Word-Level Speech Control with Positional Tags, Beats GPT-4o in Human Preference Tests

Fish Audio S2 introduces a 100% open-source TTS model that uses inline positional tags for word-level vocal control, achieving 8/10 wins against GPT-4o and Gemini in human preference tests while generating audio nearly 5x faster than real-time.

95% relevant

Neurons Playing Doom: How Living Brain Cells Could Revolutionize Computing

Australian startup Cortical Labs is pioneering biological computing with a system that uses living human brain cells to perform computational tasks. Their CL1 computer consumes just 30 watts while learning to play Doom, potentially offering massive energy savings over traditional AI hardware.

85% relevant

Agentic AI Could Be Retail's Unexpected Savior, According to Industry Veteran

Retail C-suite veteran Karlyn Mattson argues that agentic AI's true promise for retail isn't just automation, but restoring the industry's lost creative and strategic edge by freeing human talent from routine tasks.

90% relevant

Nvidia's $2B Nebius Bet: Chip Giant Doubles Down on AI Infrastructure Empire

Nvidia will invest $2 billion in AI cloud specialist Nebius Group NV, expanding its strategic investments in companies that build data centers using its chips. The partnership aims to deploy over 5 gigawatts of AI-optimized data center capacity by 2030, equivalent to powering 4 million U.S. households.

81% relevant

China's Solar Surge: How AI and Infrastructure Integration Are Powering a Renewable Revolution

China has achieved its 2030 target of 1.2 terawatts of installed wind and solar capacity six years early, largely by transforming everyday infrastructure like parking lots and rooftops into distributed power plants. This unprecedented deployment pace highlights a strategic fusion of industrial policy, digital management, and infrastructure repurposing.

85% relevant

Amazon's $11 Billion AI Power Play: Inside the Indiana Data Center That's Reshaping Tech Infrastructure

Amazon is building an $11 billion AI data center campus in Indiana that will draw 2.2 gigawatts of power—enough for 1.7 million homes. This massive investment highlights the escalating infrastructure demands of artificial intelligence and the growing geographic shift in tech's physical footprint.

85% relevant

AWS Becomes OpenAI's Exclusive Third-Party Cloud Partner in Landmark Deal

OpenAI and Amazon have announced a multi-year strategic partnership making AWS the exclusive third-party cloud provider for OpenAI Frontier. The deal includes 2 gigawatts of Trainium capacity and co-creation of a Stateful Runtime Environment on Amazon Bedrock.

90% relevant

Kunluncore Files STAR Market IPO, Claims 32K GPU Cluster First

Kunluncore filed for a STAR Market IPO, claiming a 32K GPU cluster first, testing investor appetite for domestic AI chips.

85% relevant

Anthropic's 220K GPU Cluster: $5B Compute Bet Revealed

Anthropic reportedly has 220K NVIDIA GPUs and 310MW, implying a >$5B compute cluster, 3x OpenAI's largest.

100% relevant

NJ Voters Back AI Data Center Bans as NIMBY Push Grows

54% of New Jersey voters support banning AI data centers, per a Jan 2026 poll. Nationally, 49% oppose local construction, threatening hyperscaler buildout plans.

85% relevant

Anthropic Ships 10 Finance AI Agents as IPO Race with OpenAI Heats Up

Anthropic released 10 finance AI agents with Moody's data connectors. The launch intensifies the IPO race with OpenAI, backed by a $1.5B private equity JV.

98% relevant

UniVidX Generates Video From 1,000 Samples, SIGGRAPH 2026

UniVidX generates omni-directional video from <1,000 training samples, using diffusion priors with stochastic masking, accepted at SIGGRAPH 2026.

85% relevant

Michigan Utility Board Blocks $1.25B Nuclear Weapons Data Center

YCUA voted a 12-month moratorium blocking a $1.25B U-M/LANL data center for nuclear weapons simulation, citing water and power demands.

80% relevant

Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale

Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.

100% relevant