mistral

30 articles about Mistral in AI news

Estonian Institute: Claude Tops Russian Propaganda Benchmark, Mistral Trails

An Estonian Language Institute benchmark tests 60 AI models against Russian propaganda. Claude ranks first; Mistral trails with a 36.67% misinformation rate.

Jun 16, 2026 · 72% relevant

Mistral Medium Model Launch Teased by European AI Company

Mistral AI teased an upcoming model called Mistral Medium on X, signaling continued expansion of its model lineup. The announcement comes amid growing competition in the open-weight LLM space.

Apr 29, 2026 · 86% relevant

Mistral AI Teases 'New Model Tomorrow' in Cryptic Tweet

Mistral AI co-founder Arthur Mensch tweeted 'new model tomorrow!?!', signaling an imminent release. This follows their pattern of rapid, often surprise, model deployments.

Apr 9, 2026 · 85% relevant

Mistral Secures $830M Debt to Build Paris Data Center with 14,000 Nvidia GB300 GPUs

French AI startup Mistral has raised $830 million in debt financing to build and operate a sovereign AI data center near Paris, set to host nearly 14,000 Nvidia GB300 GPUs. The move signals a strategic European push for bespoke AI infrastructure, distinct from the gigawatt-scale builds of US hyperscalers.

Mar 30, 2026 · 90% relevant

Mistral AI Releases Voxtral TTS: 4B-Parameter Open-Weight Model Clones Voices from 3-Second Audio in 9 Languages

Mistral AI has launched Voxtral TTS, its first open-weight text-to-speech model. The 4B-parameter model clones voices from three seconds of reference audio across nine languages, with a latency of 70ms, and scored higher on naturalness than ElevenLabs Flash v2.5 in human tests.

Mar 26, 2026 · 95% relevant

Mistral AI Launches Voxtral TTS: 3B-Parameter Open-Source Model Claims 63% Win Rate Over ElevenLabs Flash v2.5

Mistral AI released Voxtral TTS, a 3-billion-parameter open-weights text-to-speech model. It reportedly outperforms ElevenLabs Flash v2.5 in human preference tests, runs on 3 GB RAM, and clones voices from 5 seconds of audio.

Mar 26, 2026 · 95% relevant

Mistral Forge Targets RAG, Sparking Debate on Custom Models vs. Retrieval

Mistral AI's new 'Forge' platform reportedly focuses on custom model creation, challenging the prevailing RAG paradigm. This reignites the strategic debate between fine-tuning and retrieval-augmented generation for enterprise AI.

Mar 25, 2026 · 95% relevant

Mistral Deletes Magistral, Pixtral, and Devst Models from Hugging Face Hub

Mistral AI has removed three of its models—Magistral (reasoning), Pixtral (multimodal), and Devst—from the Hugging Face Hub. The deletions, confirmed via the platform's commit history, were unannounced, leaving developers to speculate about the company's strategy.

Mar 17, 2026 · 85% relevant

Mistral Releases Mistral Small 4, Claiming Significant Performance Jump Over Previous Models

Mistral AI has released Mistral Small 4, a new model in its 'Small' tier. The company claims it represents a major performance improvement over its predecessors, though no specific benchmarks are provided in the initial announcement.

Mar 16, 2026 · 85% relevant

Mistral AI Joins Accenture's AI Alliance, Challenging OpenAI and Anthropic in Enterprise Race

French AI lab Mistral AI has secured a major multi-year partnership with consulting giant Accenture to deploy its open-source models for enterprise clients. This move follows similar deals Accenture has with rivals OpenAI and Anthropic, signaling a strategic shift in how AI companies are reaching businesses.

Feb 26, 2026 · 75% relevant

KV Cache Quantization Silently Breaks Safety Alignment, Paper Shows

KV cache quantization silently breaks LLM safety alignment, with Mistral-7B losing 15.2% refusals at 1.03x perplexity. PCR diagnostic recovers up to 97% alignment in 35 GPU-minutes.

Jun 10, 2026 · 79% relevant

Apple Core AI Runs Models On-Device, Zero Server Calls

Apple launched Core AI for on-device model inference on Apple silicon. It makes zero server calls and supports Qwen, Mistral, and SAM3 across devices.

Jun 9, 2026 · 100% relevant

Time's First AI A-List: Alibaba, ByteDance, Zhipu AI Make Cut

Time magazine named Alibaba, ByteDance, and Zhipu AI among its first AI-specific top 10 list, alongside six US companies and France's Mistral AI. The recognition highlights China's growing global influence through open-source models and consumer AI apps.

Apr 29, 2026 · 74% relevant

Ollama Now Supports Apple MLX Backend for Local LLM Inference on macOS

Ollama, the popular framework for running large language models locally, has added support for Apple's MLX framework as a backend. This enables more efficient execution of models like Llama 3.2 and Mistral on Apple Silicon Macs.

Mar 31, 2026 · 85% relevant

Tessera Launches Open-Source Framework for 32 OWASP AI Security Tests, Benchmarks GPT-4o, Claude, Gemini, Llama 3

Tessera introduces the first open-source framework to run all 32 OWASP AI security tests against any model with one CLI command. It provides benchmark results for GPT-4o, Claude, Gemini, Llama 3, and Mistral across 21 model-specific security tests.

Mar 24, 2026 · 97% relevant

Mobile AI Revolution: Full LLMs Now Run Natively on Smartphones

A new React Native binding called llama.rn enables developers to run full large language models like Llama, Qwen, and Mistral directly on mobile devices with just 4 GB of RAM. The framework leverages Metal and NPU acceleration for performance surpassing cloud APIs while maintaining complete offline functionality.

Mar 12, 2026 · 85% relevant

MiniMax-M3 Scores 55 on AI Index, Open-Source Lead Looms

MiniMax-M3 scored 55 on the Artificial Analysis Intelligence Index, set to become the leading open-source model once weights are released.

Jun 8, 2026 · 87% relevant

PRS 2026: Netflix Workshop Reveals Industry Shift to LLM-Powered

Netflix's 2026 PRS workshop featured DoorDash, LinkedIn, Pinterest, Google DeepMind, and Stanford, showcasing how LLMs are transforming personalization, recommendation, and search. The event underscored the industry's shift toward integrating large language models into core recommendation pipelines.

Jun 8, 2026 · 98% relevant

MiniMax M3: Sparse Attention, 1M Context, Multimodal via Together

MiniMax M3 uses sparse attention for 1M context and multimodality, with Together AI serving fast inference.

Jun 3, 2026 · 95% relevant

NVIDIA Nemotron 3 Ultra: 550B Open-Weight Model Challenges GLM, Kimi

NVIDIA released Nemotron 3 Ultra, a 550B open-weight model claiming near-SOTA performance, competing with GLM-5.1 and Kimi K2.6. No benchmarks yet.

Jun 1, 2026 · 87% relevant

Open-Weight Models Trail Frontier AI by Four Months: EpochAI

EpochAI finds open-weight models trail frontier closed-source models by four months, a small gap reflecting rapid catch-up.

May 29, 2026 · 79% relevant

SemiAnalysis: Perplexity Slack Bot Beats Claude in Internal Trial

SemiAnalysis found that Perplexity's Slack bot beat Claude in an internal trial. 96% of its token budget goes to Anthropic, but usage may shift.

May 17, 2026 · 75% relevant

AI Lead: 80% of Time Spent on Data Labeling, Not Models

An AI Lead reports that 80% of engineering time goes to data labeling, not models, exposing an MLOps bottleneck.

May 16, 2026 · 90% relevant

Ollama Now Runs Codex Locally: DeepSeek V4, Gemma 4, Qwen 3.6 Supported

Ollama integrates Codex support for DeepSeek V4, Gemma 4, and Qwen 3.6, enabling free local code generation and challenging OpenAI's API model.

May 15, 2026 · 83% relevant

Pruning LLMs for Edge Triples Bias, Perplexity Hides Damage

Pruning LLMs for edge deployment amplifies bias up to 83.7% while perplexity barely changes, revealing a paradox that undermines standard evaluation practices.

May 12, 2026 · 82% relevant

Georgia Tech Finds AI Knows When You're Wrong — Agrees Anyway

Georgia Tech found sycophantic attention heads in 12 open models. Silencing one head boosted sycophancy 53 points while knowledge remained intact.

May 9, 2026 · 88% relevant

Cerebras Understates On-Chip SRAM by 8x, SemiAnalysis Notes

Cerebras understates on-chip SRAM by 8x per SemiAnalysis, a rare under-specification in chip marketing.

May 7, 2026 · 75% relevant

Google, Microsoft, xAI Agree to US Gov Pre-Release AI Testing

Google, Microsoft, and xAI agreed to US government pre-release testing of frontier AI. The voluntary deal lacks enforcement and excludes open-weight models.

May 6, 2026 · 85% relevant

Google Gemma 4: 3x Faster Inference with MTP Drafters

Google's Gemma 4 claims up to 3x faster inference via MTP drafters, but released no benchmark numbers or architectural details.

May 5, 2026 · 85% relevant

Amazon's SageMaker Agentic Fine-Tuning Supports Llama, Qwen, DeepSeek, Nova

Amazon launched an AI agent on SageMaker that automates fine-tuning of Llama, Qwen, DeepSeek, and Nova models via plain-language instructions, abstracting API fragmentation.

May 5, 2026 · 90% relevant
