deepseek

30 articles about deepseek in AI news

NVIDIA Blackwell Cuts DeepSeek V4 Token Costs 5x in One Month

NVIDIA claims Blackwell inference stack cut DeepSeek V4 token costs 5x in one month, per a newly published report shared by @rohanpaul_ai.

Jun 30, 2026100% relevant

DeepSeek Raises $7B, Ends No-Funding Pledge, Doubles Staff

DeepSeek raised $7B, abandoning its no-funding pledge, to double headcount and launch a coding agent team competing with Claude Code.

Jun 27, 2026100% relevant

Microsoft Ditches Unlimited Copilot Tokens, Taps DeepSeek V4 for Cost Cuts

Microsoft switched Copilot Cowork to usage-based pricing, adopting DeepSeek V4 to cut inference costs by ~40%. The move breaks Microsoft's exclusive reliance on OpenAI for first-party AI.

Jun 22, 202695% relevant

CoreWeave Trains DeepSeek-V3 in 2 Minutes, Claims MLPerf v6.0 Record

CoreWeave trained DeepSeek-V3 in ~2 minutes on MLPerf v6.0, beating AWS's record by 43% using 11K+ H100 GPUs across 4 data centers.

Jun 16, 2026100% relevant

DeepSeek Raises $7.4B at $50B Valuation in First External Round

DeepSeek raised ~$7.4B at a $50B valuation in its first external round, with an unusual limited partnership structure and a $2.9B personal investment from founder Liang Wenfeng.

Jun 16, 2026100% relevant

CATL Invests in DeepSeek: Battery Giant Pivots to AI Energy

CATL invested in DeepSeek's first funding round, signaling a $1B+ pivot to AI data center energy infrastructure.

Jun 10, 2026100% relevant

DeepSeek-V4 Hits 500K Context with 90% Less KV Cache via FlashMemory

DeepSeek-V4 achieves 500K context with 90% less KV cache via FlashMemory's lookahead sparse attention, keeping only 13.5% of cache in GPU memory without retraining.

Jun 9, 202698% relevant

DeepSeek Raises $6.9B at $48-55B Valuation, Opens to Outside Capital

DeepSeek raising ~$6.9B at $48-55B valuation in first external funding round, as it tops Ramp's US business spending index with enterprises switching from OpenAI/Anthropic.

Jun 4, 202698% relevant

DeepSeek v4 Pricing Cuts 75%: $0.43/M Tokens In

DeepSeek v4 API pricing permanently cut 75% to $0.43/M input, $0.87/M output, enabled by 27% compute and 10% cache vs v3.2.

May 22, 2026100% relevant

Ollama Now Runs Codex Locally: DeepSeek V4, Gemma 4, Qwen 3.6 Supported

Ollama integrates Codex support for DeepSeek V4, Gemma 4, Qwen 3.6, enabling free local code generation, challenging OpenAI's API model.

May 15, 202683% relevant

AMD ROCm Performance Jumps 75x in 14 Days Post-DeepSeek v4

AMD ROCm stack improved 75x in 14 days post-DeepSeek v4 via fused operations. Still needs 5x more to match B200 performance.

May 10, 2026100% relevant

DeepSeek Hits $45B Valuation in First VC Round, Led by China State Fund

DeepSeek valuation jumps from $20B to $45B in first VC round led by China state fund. The raise targets employee retention and chip independence via Huawei optimization.

May 6, 202685% relevant

Amazon's SageMaker Agentic Fine-Tuning Supports Llama, Qwen, DeepSeek, Nova

Amazon launched an AI agent on SageMaker that automates fine-tuning of Llama, Qwen, DeepSeek, and Nova models via plain-language instructions, abstracting API fragmentation.

May 5, 202690% relevant

DeepSeek-V4 Ported to MLX for Apple Silicon Inference

A developer has ported DeepSeek-V4 to Apple's MLX framework, allowing the large language model to run on Apple Silicon Macs. Early results show functional inference with room for optimization.

Apr 24, 2026100% relevant

DeepSeek V4-Pro: 1.6T parameters, open weights, undercuts rivals 10x

DeepSeek unveiled V4-Pro and V4-Flash, its largest open-weight models with up to 1.6 trillion parameters and a 1M-token context window. The new hybrid attention architecture cuts compute for long contexts by 73–90%, enabling prices far below OpenAI, Google, and Anthropic.

Apr 24, 2026100% relevant

DeepSeek Seeks $300M+ at $10B+ Valuation to Retain AI Talent

DeepSeek is raising its first external capital, targeting $300M+ at a $10B+ valuation. The round is small (≤3% equity) to set a valuation benchmark for employee stock options and combat poaching by rivals.

Apr 22, 202694% relevant

DeepSeek Seeks First Outside Funding at $10B Valuation

DeepSeek is in talks to raise at least $300 million in its first external funding round at a $10 billion valuation. This ends its reliance on parent hedge fund High-Flyer Capital and signals a new phase in the costly global AI race.

Apr 18, 2026100% relevant

Stealth 100B Model Appears on OpenRouter, Possibly DeepSeek or Kimi

A new, unannounced 100-billion-parameter AI model has appeared on the OpenRouter API platform. Its origin is unknown, but observers speculate it could be a variant from DeepSeek or an update to Kimi's code model.

Apr 13, 202685% relevant

DeepSeek-V4 Rumored as 'Whale' Returns, Signaling Major Model Release

DeepSeek's cryptic 'whale' codename has reappeared, strongly hinting at the impending launch of DeepSeek-V4. This follows the company's pattern of using the whale symbol before major model releases.

Apr 7, 202689% relevant

DeepSeek V4 Begins Limited Rollout with Fast, Expert, Vision Modes

DeepSeek V4 is reportedly in limited gray-scale testing with a new interface offering Fast, Expert, and Vision modes. This mirrors competitor Kimi's tiered system and suggests a move towards performance-based rate limiting.

Apr 7, 202685% relevant

GPT4All Hits 77K GitHub Stars, Adds DeepSeek R1 for Free Local AI

The GPT4All project has surpassed 77,000 GitHub stars as it adds support for distilled DeepSeek R1 models, enabling reasoning-capable AI to run locally on consumer CPUs with zero API costs.

Apr 6, 202687% relevant

AI Weekly: GPT-6 Rumors, DeepSeek V4 on Huawei, Anthropic Models, Qwen 3.6-Plus

A weekly roundup video aggregates major AI rumors and announcements, including unverified GPT-6 details, DeepSeek V4 reportedly running on Huawei hardware, and launches of Anthropic's Conway and Ultraplan and Alibaba's Qwen 3.6-Plus.

Apr 5, 202685% relevant

DeepSeek's HISA: Hierarchical Sparse Attention Cuts 64K Context Indexing Cost

DeepSeek researchers introduced HISA, a hierarchical sparse attention method that replaces flat token scanning. It removes a computational bottleneck at 64K context lengths without requiring any model retraining.

Apr 5, 202685% relevant

DeepSeek V4 to Run on Huawei Ascend 950PR Chips, Sparking 20% Price Surge

DeepSeek's anticipated V4 model will be powered by Huawei's Ascend 950PR chips, with Alibaba, ByteDance, and Tencent stockpiling hundreds of thousands of units ahead of launch. This has driven chip prices up approximately 20% in recent weeks.

Apr 3, 202691% relevant

DeepSeek-R1 Reportedly Hits 78.9% on OS-World, Outperforming GPT-5.4 at 1/10th Cost

A new benchmark claim suggests DeepSeek-R1 has achieved 78.9% on the OS-World agentic coding benchmark, reportedly outperforming GPT-5.4 while operating at one-tenth the cost. If verified, this would represent a significant leap in cost-performance for AI coding agents.

Apr 1, 202695% relevant

DeepSeek Teases 'Much Larger' Base Model Release Amid Industry Silence and Hardware Challenges

DeepSeek staff confirmed a new, larger base model is coming soon, following months of quiet after reports of failed Huawei chip training. This comes as the Chinese AI lab faces heightened expectations after its breakthrough o1-level model in January 2025.

Mar 25, 202685% relevant

China's DeepSeek-R1: Open-Source AI Agent Runs Locally with Web Search, Code Generation, and Built-In Computer

Chinese AI company DeepSeek has released DeepSeek-R1, a fully open-source AI agent that runs locally on personal computers with web search capabilities, code generation, and built-in computer functionality. The model represents a significant move toward accessible, self-contained AI systems outside the dominant U.S. ecosystem.

Mar 23, 202699% relevant

DeepSeek-R1 Scores 79.8% on SWE-Bench Verified, Matching Claude 3.5 Sonnet in Code Generation

DeepSeek's new R1 reasoning model achieved 79.8% on SWE-Bench Verified, matching Claude 3.5 Sonnet's performance. This marks significant progress in AI's ability to solve real-world coding problems.

Mar 17, 202685% relevant

DeepSeek V4 Emerges: China's Next AI Contender Takes Shape

DeepSeek appears poised to release its fourth-generation AI model, signaling continued advancement in China's competitive large language model landscape. The upcoming release follows the company's established pattern of rapid iteration.

Mar 11, 202685% relevant

DeepSeek-V2.5 R1: The Next Frontier in Open-Source AI Arrives

DeepSeek's highly anticipated next-generation model, DeepSeek-V2.5 R1, is reportedly launching this week according to credible sources. This release promises significant advancements in the competitive open-source AI landscape.

Mar 9, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety