gpu shortage

30 articles about gpu shortage in AI news

AWS Never Retired an A100 Server, CEO Says Amid Chip Shortage

AWS CEO Matt Garman stated that A100 servers are completely sold out and never retired, as demand for older chips outpaces supply. This underscores the prolonged GPU shortage and the value of legacy hardware in cloud AI.

Apr 26, 202687% relevant

CEO of Top AI Company Admits Begging for GPUs: Shortage Is Structural

The CEO of the most valuable AI company admitted begging for GPUs, signaling a structural shortage. The confession, reported by @TheGeorgePu, contradicts vendor narratives of ample supply.

Jul 18, 202675% relevant

The Great GPU Scramble: How Hardware Shortages Are Defining the AI Arms Race

Oracle founder Larry Ellison identifies GPU acquisition as the primary bottleneck in AI development, with companies racing to secure limited hardware for breakthroughs in medicine, video generation, and autonomous systems.

Mar 7, 202685% relevant

Mac Studio AI Hardware Shortage Signals Shift to Cloud Rentals

Developers report a global shortage of high-memory Apple Silicon Macs, with 128GB Mac Studios unavailable worldwide. This pushes practitioners toward renting cloud H100 GPUs at ~$3/hr, marking a shift from the recent local AI trend.

Apr 14, 202685% relevant

Moonshot AI Pauses K3 Subscriptions as Demand Exceeds GPU Capacity

Moonshot AI paused Kimi K3 subscriptions due to GPU capacity limits. The open-weight release by July 27 aims to offload compute demand.

Jul 20, 2026100% relevant

DARPA Leases 50 Nvidia H100 GPUs for Biological AI Program

DARPA's Biological Technologies Office is procuring 50 Nvidia HGX H100 GPU systems for its NODES program, with hardware delivery required within one month. This represents a significant government investment in AI infrastructure for biological research applications.

Apr 22, 202686% relevant

AI Compute Crisis: GPU Prices Up 48%, Anthropic API at 98.95% Uptime

The AI industry faces a severe compute capacity crisis, with GPU prices up 48%, Anthropic API uptime falling to 98.95%, and OpenAI shutting down Sora to reallocate resources. Demand for agentic AI is outstripping supply, forcing rationing and product cancellations.

Apr 13, 2026100% relevant

Google's 5M H100-Equivalent GPU Fleet Powers Anthropic's AI Expansion

An analyst estimates Google's compute capacity at ~5 million Nvidia H100-equivalent GPUs, providing the infrastructure backbone for Anthropic's model deployment and growth. This highlights the strategic shift where foundational AI labs rely on hyperscaler scale for distribution.

Apr 7, 202685% relevant

Jensen Huang Counters Musk's 'One Robot Per Person' Vision, Argues for Multiples to Address Labor Shortages

NVIDIA CEO Jensen Huang responded to Elon Musk's expectation of one robot per person, stating the need for 'more than 1' per person to address severe labor shortages and accelerate corporate growth.

Mar 29, 202687% relevant

Compute Shortage to Split AI Market: Rich Get Agents, Poor Get Chatbots

Mollick warns compute shortage makes agents expensive while chatbots cheapen, splitting AI market by company resources.

May 21, 202675% relevant

MLCC Shortage Threatens AI Server Ramp: Prices Hiking, Lead Times Stretching

MLCCs, cheap components stabilizing voltage in AI servers, face supply crunch as demand grows ~5x by CY27. Lead times stretch, prices hike, new lines take 2 years.

Jun 5, 202687% relevant

AI Data Center HBM Shortage Intensifies as Samsung, SK Hynix, and Micron Struggle with Supply

AI data centers are aggressively stockpiling high-bandwidth memory (HBM), creating a supply crunch. Only three manufacturers—Samsung, SK Hynix, and Micron—can produce this critical component for AI servers.

Mar 27, 202685% relevant

Jensen Huang: DeepSeek, Kimi open models boost Nvidia sales

Jensen Huang says Chinese open models DeepSeek and Kimi boost Nvidia GPU demand, not threaten it. Market misunderstood their impact twice.

Jul 25, 202689% relevant

Apple M7 Ultra Chip Reportedly Supports 1.5TB Unified Memory

Apple's M7 Ultra chip reportedly supports 1.5TB unified memory, doubling the M3 Ultra and matching eight Nvidia B200 GPUs, but DRAM supply constraints threaten pricing.

Jul 13, 202687% relevant

Nvidia RTX Pro 6000 Hits $13,250, Up 55% in a Year

Nvidia raised RTX Pro 6000 Blackwell to $13,250, up 55% in a year. Memory shortage and AI demand drive prices.

Jun 13, 202690% relevant

SemiAnalysis: N3 chip demand far outstrips current consensus estimates

SemiAnalysis argues N3 chip demand far exceeds consensus accelerator models, implying a structural silicon shortage not priced by markets.

May 30, 202689% relevant

Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market

Qualcomm CEO revealed dedicated CPU for agentic AI, custom silicon deal with hyperscaler shipping Dec 2026, and agentic smartphones. Pivot challenges GPU-centric AI infrastructure consensus.

May 1, 2026100% relevant

CPU Demand Flipping the AI Narrative as Datacenter Growth Shifts

A new analysis from SemiAnalysis indicates CPU demand is rising in AI datacenters, reversing a narrative of GPU-only dominance. This shift signals changing workload patterns and infrastructure priorities.

Apr 28, 2026100% relevant

AirTrain Enables Distributed ML Training on MacBooks Over Wi-Fi

Developer @AlexanderCodes_ open-sourced AirTrain, a tool that enables distributed ML training across Apple Silicon MacBooks using Wi-Fi by syncing gradients every 500 steps instead of every step. This makes personal device training feasible for models up to 70B parameters without cloud GPU costs.

Apr 18, 202695% relevant

DOE Seeks Input on AI Infrastructure for Federal Lands

The U.S. Department of Energy has published a Request for Information (RFI) to solicit input on developing AI and high-performance computing infrastructure on DOE-owned lands. This marks a significant step in the federal government's strategy to directly address the national AI compute shortage.

Apr 17, 202672% relevant

InCoder-32B-Thinking Hits 81.3% on LiveCodeBench, Trained on Chip & Kernel Traces

InCoder-32B-Thinking, a 32B parameter model trained on execution traces from chip design, GPU kernels, and embedded systems, scores 81.3% on LiveCodeBench V5 and an 84% compile pass rate on CAD-Coder.

Apr 11, 202692% relevant

Memory Market Squeeze Threatens iPhone Price Hikes as AI Demands Strain Supply

A global RAM shortage and price increases could force Apple to raise iPhone prices by up to $250, according to industry analysis. The tech giant is reportedly unwilling to absorb the cost, passing it directly to consumers amid surging memory demands from AI applications.

Mar 14, 202685% relevant

AI Gold Rush Strains Apple Hardware: High-Memory Macs Sell Out as Local AI Agents Go Mainstream

A surge in demand for local AI development has created severe inventory shortages for high-memory Apple hardware. Mac Studio orders with 128GB or 512GB RAM face 6+ week delays as consumers buy up every available unit to run powerful AI agents like OpenClaw.

Mar 6, 202685% relevant

Nvidia's Next-Gen AI Rack Delayed to 2028, SemiAnalysis Says

Nvidia's next-gen AI rack delayed to 2028 on manufacturing snags per SemiAnalysis. Delay benefits AMD and custom silicon rivals.

Jul 6, 202695% relevant

Anthropic Explores Custom AI Chip with Samsung

Anthropic is discussing a custom AI chip with Samsung, per The Information. The move follows OpenAI's Jalapeño chip and signals growing vertical integration in AI hardware.

Jul 2, 202688% relevant

Micron Profit Surges 15-Fold; HBM4 Revenue Tops $1B

Micron profit surged 15-fold to $4.2B as HBM4 revenue topped $1B, with gross margin near 85%.

Jun 25, 2026100% relevant

Lansing AI data center petition hits 20,000 signatures

Over 20,000 signatures oppose a Lansing AI data center tied to Nvidia's Vera Rubin build-out, signaling growing local resistance.

Jun 18, 202685% relevant

Memory Supply Squeeze Hits Non-AI Sectors as DRAM Prices Double

DRAM prices surged 93-98% QoQ in Q1 2026 as AI data centers consume fab capacity, nine industry groups warned the Trump administration on June 3, threatening supply for automotive, telecom, and medical devices.

Jun 5, 202662% relevant

ERCOT datacenter requests exceed grid capacity by 5x

ERCOT datacenter requests far exceed grid underwriting capacity, per @SemiAnalysis_, revealing grid approval as a binding constraint on AI infrastructure buildout.

May 29, 202687% relevant

CNAS Report: AI Hits Silicon Wall as Chip Supply Trails $700B CapEx

CNAS report warns semiconductor manufacturing cannot keep pace with AI demand as hyperscalers plan $700B+ CapEx in 2026. Silicon replaces power as the near-term constraint.

May 11, 202690% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety