Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

tts

30 articles about tts in AI news

Miso One: 8B Open-Source TTS Hits 110ms Latency, Real Emotion

Miso One, an 8B open-source TTS model, achieves 110ms latency with emotional range. Weights are fully open-source for self-hosting, but no benchmark data is provided.

85% relevant

mlx-audio v0.4.3 Ships 6 New TTS Models, Slimmer Deps

mlx-audio v0.4.3 adds 6 TTS models, server concurrency, and slims dependencies, targeting Apple Silicon developers.

85% relevant

Google Launches Gemini 3.1 Flash TTS with Prompt-Controlled Speech

Google has launched Gemini 3.1 Flash TTS, a text-to-speech model featuring prompt-based voice control and support for over 70 languages. This release expands Google's multimodal AI offerings directly to developers.

93% relevant

OpenBMB Launches VoxCPM 2, an Open-Source TTS Model Rivaling Qwen3-TTS

OpenBMB has launched VoxCPM 2, an open-source text-to-speech AI model from China. The release is positioned as a direct competitor to Alibaba's Qwen3-TTS, expanding the open-source TTS landscape.

91% relevant

Qwen3-TTS Added to mlx-tune, Enabling Full Qwen Model Fine-Tuning on Apple Silicon Macs

The mlx-tune library now supports Qwen3-TTS, making the entire Qwen model stack—including the new text-to-speech model—fine-tunable on Apple Silicon Macs. This expands local AI development options for researchers and developers.

85% relevant

Mistral AI Releases Voxtral TTS: 4B-Parameter Open-Weight Model Clones Voices from 3-Second Audio in 9 Languages

Mistral AI has launched Voxtral TTS, its first open-weight text-to-speech model. The 4B-parameter model clones voices from three seconds of reference audio across nine languages, with a latency of 70ms, and scored higher on naturalness than ElevenLabs Flash v2.5 in human tests.

95% relevant

Mistral AI Launches Voxtral TTS: 3B-Parameter Open-Source Model Claims 63% Win Rate Over ElevenLabs Flash v2.5

Mistral AI released Voxtral TTS, a 3-billion-parameter open-weights text-to-speech model. It reportedly outperforms ElevenLabs Flash v2.5 in human preference tests, runs on 3 GB RAM, and clones voices from 5 seconds of audio.

95% relevant

LuxTTS Democratizes Voice Cloning: High-Quality Synthesis Now Runs on Consumer Hardware

LuxTTS, a new open-source text-to-speech model, enables realistic voice cloning from just 3 seconds of audio using only 1GB of VRAM. The system operates 150x faster than real-time and produces 48kHz audio, challenging proprietary solutions like ElevenLabs.

95% relevant

Massachusetts Launches Statewide AI Literacy Initiative with Google Partnership

Google partners with Massachusetts AI Hub to provide free AI training to all residents, including Google's AI Professional Certificate. This statewide initiative aims to democratize AI skills amid rapid technological transformation.

75% relevant

OpenBMB's VoxCPM 2: 2B-Param Open-Source TTS for Multilingual Voice

OpenBMB launched VoxCPM 2, a 2-billion-parameter open-source text-to-speech model. It generates multilingual, emotionally expressive speech from text descriptions and runs on consumer-grade hardware.

97% relevant

Microsoft Open-Sources VALL-E 2: A Zero-Shot TTS Model Achieving Human Parity in Speech Naturalness

Microsoft Research has open-sourced VALL-E 2, a neural codec language model for text-to-speech that achieves human parity in naturalness. It uses a novel 'Repetition-Aware Sampling' method to eliminate word repetition, a common failure mode in prior models.

95% relevant

Microsoft's VibeVoice Family Processes 60-Minute Audio in Single Pass, Eliminates Chunking for ASR & TTS

Microsoft open-sourced VibeVoice, a family of speech AI models that processes up to 60 minutes of audio without chunking. It delivers structured transcriptions with speaker diarization and generates 90-minute multi-speaker speech in one pass.

99% relevant

AWS Commits 2 Gigawatts of Trainium Capacity to OpenAI, Reveals 1.4 Million Chips Deployed

Amazon's $50B OpenAI deal includes a 2-gigawatt commitment of Trainium computing capacity. AWS disclosed 1.4 million Trainium chips are deployed, with over 1 million Trainium2 chips running Anthropic's Claude.

95% relevant

Amazon Opens Trainium Chips to Outside Data Centers, Targeting Nvidia's Core Business

AWS AI chief Peter DeSantis confirmed Amazon is negotiating to sell Trainium chips externally for the first time, backed by Andy Jassy's estimate of a $50B annual revenue potential. With Trainium3 sold out, Trainium4 pre-booked, and Anthropic and OpenAI already running gigawatts of Trainium capacity

100% relevant

xAI Bundles SuperGrok into Hermes Agent — No API Key Needed

xAI integrated SuperGrok subscriptions into Hermes Agent, enabling single OAuth login for Grok 4.3, TTS, images, and X search, eliminating separate API keys.

82% relevant

AI Chip Capacity Crisis: 10GW Left Through 2030, Prices Up Double Digits

The AI accelerator market has only 10 gigawatts of capacity left for contract through 2030, with 100GW already under contract. Prices are rising double digits as one competitor has stopped taking orders entirely.

97% relevant

Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion

Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity—equivalent to Microsoft's 2024 global data center footprint—and committed over $100 billion to AWS over the next decade. This infrastructure surge supports Claude's tripled run-rate revenue to over $30B and addresses consumer demand straining its systems.

97% relevant

Project N.O.M.A.D. Solar-Powered Mini PC Packs Local AI, Wikipedia, Khan Academy

Project N.O.M.A.D. is a 100% open-source, solar-powered mini PC designed for offline operation. It packs a local AI, all of Wikipedia, Khan Academy courses, offline maps, and medical guides, running on only 15 watts of power.

85% relevant

US Data Center Power Demand Hits 15 GW, Grid Constraints Emerge

US data center power demand reached 15 gigawatts in 2023, up from 11 GW in 2022. This rapid growth highlights a widening bottleneck: compute infrastructure is scaling faster than power delivery systems can support.

75% relevant

Fish Audio S2 Enables Word-Level Speech Control with Positional Tags, Beats GPT-4o in Human Preference Tests

Fish Audio S2 introduces a 100% open-source TTS model that uses inline positional tags for word-level vocal control, achieving 8/10 wins against GPT-4o and Gemini in human preference tests while generating audio nearly 5x faster than real-time.

95% relevant

Neurons Playing Doom: How Living Brain Cells Could Revolutionize Computing

Australian startup Cortical Labs is pioneering biological computing with a system that uses living human brain cells to perform computational tasks. Their CL1 computer consumes just 30 watts while learning to play Doom, potentially offering massive energy savings over traditional AI hardware.

85% relevant

Agentic AI Could Be Retail's Unexpected Savior, According to Industry Veteran

Retail C-suite veteran Karlyn Mattson argues that agentic AI's true promise for retail isn't just automation, but restoring the industry's lost creative and strategic edge by freeing human talent from routine tasks.

90% relevant

Nvidia's $2B Nebius Bet: Chip Giant Doubles Down on AI Infrastructure Empire

Nvidia will invest $2 billion in AI cloud specialist Nebius Group NV, expanding its strategic investments in companies that build data centers using its chips. The partnership aims to deploy over 5 gigawatts of AI-optimized data center capacity by 2030, equivalent to powering 4 million U.S. households.

81% relevant

China's Solar Surge: How AI and Infrastructure Integration Are Powering a Renewable Revolution

China has achieved its 2030 target of 1.2 terawatts of installed wind and solar capacity six years early, largely by transforming everyday infrastructure like parking lots and rooftops into distributed power plants. This unprecedented deployment pace highlights a strategic fusion of industrial policy, digital management, and infrastructure repurposing.

85% relevant

Amazon's $11 Billion AI Power Play: Inside the Indiana Data Center That's Reshaping Tech Infrastructure

Amazon is building an $11 billion AI data center campus in Indiana that will draw 2.2 gigawatts of power—enough for 1.7 million homes. This massive investment highlights the escalating infrastructure demands of artificial intelligence and the growing geographic shift in tech's physical footprint.

85% relevant

AWS Becomes OpenAI's Exclusive Third-Party Cloud Partner in Landmark Deal

OpenAI and Amazon have announced a multi-year strategic partnership making AWS the exclusive third-party cloud provider for OpenAI Frontier. The deal includes 2 gigawatts of Trainium capacity and co-creation of a Stateful Runtime Environment on Amazon Bedrock.

90% relevant

AWS Beats Cloud Rivals to NVIDIA Blackwell with EC2 G7 — 4.6x AI Inference Gain Over G6

AWS launched EC2 G7 instances on June 19, 2026, becoming the first major cloud to offer NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs. The instances claim 4.6x AI inference performance over G6, backed by 700 Gbps EFA networking and 32 GB GDDR7 per GPU. The move arrives the same week AWS confirme

85% relevant

KKR Launches $10B Helix to Solve AI Infrastructure's Power-Land-Compute Bottleneck

KKR and three heavyweight partners — Nvidia, the Kuwait Investment Authority, and Vistra — launched Helix Digital Infrastructure on June 10, 2026, with more than $10 billion in committed capital. Former AWS CEO Adam Selipsky, who stepped down from Amazon in June 2024 and joined KKR as a senior advis

95% relevant

Movable Ink Launches Programmatic CRM With AI Agents for Personalized

Movable Ink launched Programmatic CRM with AI agents on June 18, 2026, automating personalized content creation and customer engagement for brands. The platform leverages real-time data to generate tailored content across email, web, and mobile, reducing manual effort while scaling personalization.

98% relevant

Nadella: AI's New Unit Is 'Tokens per Dollar per Watt'

Satya Nadella defined AI's supply-side economics as 'Tokens per Dollar per Watt', urging infrastructure focus for companies, industries, and countries.

80% relevant