gentic.news — AI News Intelligence Platform

gpu shortage

30 articles about gpu shortage in AI news

AWS Never Retired an A100 Server, CEO Says Amid Chip Shortage

AWS CEO Matt Garman stated that A100 servers are completely sold out and that AWS has never retired one, as demand for older chips outpaces supply. This underscores the prolonged GPU shortage and the value of legacy hardware in cloud AI.

87% relevant

The Great GPU Scramble: How Hardware Shortages Are Defining the AI Arms Race

Oracle founder Larry Ellison identifies GPU acquisition as the primary bottleneck in AI development, with companies racing to secure limited hardware for breakthroughs in medicine, video generation, and autonomous systems.

85% relevant

Mac Studio AI Hardware Shortage Signals Shift to Cloud Rentals

Developers report a global shortage of high-memory Apple Silicon Macs, with 128GB Mac Studios unavailable worldwide. This pushes practitioners toward renting cloud H100 GPUs at ~$3/hr, marking a shift from the recent local AI trend.

85% relevant

DARPA Leases 50 Nvidia H100 GPUs for Biological AI Program

DARPA's Biological Technologies Office is procuring 50 Nvidia HGX H100 GPU systems for its NODES program, with hardware delivery required within one month. This represents a significant government investment in AI infrastructure for biological research applications.

86% relevant

AI Compute Crisis: GPU Prices Up 48%, Anthropic API at 98.95% Uptime

The AI industry faces a severe compute capacity crisis, with GPU prices up 48%, Anthropic API uptime falling to 98.95%, and OpenAI shutting down Sora to reallocate resources. Demand for agentic AI is outstripping supply, forcing rationing and product cancellations.

100% relevant

Google's 5M H100-Equivalent GPU Fleet Powers Anthropic's AI Expansion

An analyst estimates Google's compute capacity at ~5 million Nvidia H100-equivalent GPUs, providing the infrastructure backbone for Anthropic's model deployment and growth. This highlights the strategic shift where foundational AI labs rely on hyperscaler scale for distribution.

85% relevant

Jensen Huang Counters Musk's 'One Robot Per Person' Vision, Argues for Multiples to Address Labor Shortages

NVIDIA CEO Jensen Huang responded to Elon Musk's expectation of one robot per person, stating the need for 'more than 1' per person to address severe labor shortages and accelerate corporate growth.

87% relevant

Compute Shortage to Split AI Market: Rich Get Agents, Poor Get Chatbots

Mollick warns that the compute shortage is making agents expensive while chatbots get cheaper, splitting the AI market by company resources.

75% relevant

MLCC Shortage Threatens AI Server Ramp: Prices Hiking, Lead Times Stretching

MLCCs, the inexpensive components that stabilize voltage in AI servers, face a supply crunch as demand grows ~5x by CY27. Lead times are stretching, prices are rising, and new production lines take 2 years.

87% relevant

AI Data Center HBM Shortage Intensifies as Samsung, SK Hynix, and Micron Struggle with Supply

AI data centers are aggressively stockpiling high-bandwidth memory (HBM), creating a supply crunch. Only three manufacturers—Samsung, SK Hynix, and Micron—can produce this critical component for AI servers.

85% relevant

Nvidia RTX Pro 6000 Hits $13,250, Up 55% in a Year

Nvidia raised the price of the RTX Pro 6000 Blackwell to $13,250, up 55% in a year. A memory shortage and AI demand are driving prices higher.

78% relevant

SemiAnalysis: N3 chip demand far outstrips current consensus estimates

SemiAnalysis argues N3 chip demand far exceeds consensus accelerator models, implying a structural silicon shortage not priced by markets.

89% relevant

Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market

Qualcomm's CEO revealed a dedicated CPU for agentic AI, a custom silicon deal with a hyperscaler shipping Dec 2026, and agentic smartphones. The pivot challenges the GPU-centric AI infrastructure consensus.

100% relevant

CPU Demand Flipping the AI Narrative as Datacenter Growth Shifts

A new analysis from SemiAnalysis indicates CPU demand is rising in AI datacenters, reversing a narrative of GPU-only dominance. This shift signals changing workload patterns and infrastructure priorities.

100% relevant

AirTrain Enables Distributed ML Training on MacBooks Over Wi-Fi

Developer @AlexanderCodes_ open-sourced AirTrain, a tool that enables distributed ML training across Apple Silicon MacBooks using Wi-Fi by syncing gradients every 500 steps instead of every step. This makes personal device training feasible for models up to 70B parameters without cloud GPU costs.

95% relevant

DOE Seeks Input on AI Infrastructure for Federal Lands

The U.S. Department of Energy has published a Request for Information (RFI) to solicit input on developing AI and high-performance computing infrastructure on DOE-owned lands. This marks a significant step in the federal government's strategy to directly address the national AI compute shortage.

72% relevant

InCoder-32B-Thinking Hits 81.3% on LiveCodeBench, Trained on Chip & Kernel Traces

InCoder-32B-Thinking, a 32B-parameter model trained on execution traces from chip design, GPU kernels, and embedded systems, scores 81.3% on LiveCodeBench V5 and achieves an 84% compile pass rate on CAD-Coder.

92% relevant

Memory Market Squeeze Threatens iPhone Price Hikes as AI Demands Strain Supply

A global RAM shortage and price increases could force Apple to raise iPhone prices by up to $250, according to industry analysis. The tech giant is reportedly unwilling to absorb the cost, passing it directly to consumers amid surging memory demands from AI applications.

85% relevant

AI Gold Rush Strains Apple Hardware: High-Memory Macs Sell Out as Local AI Agents Go Mainstream

A surge in demand for local AI development has created severe inventory shortages for high-memory Apple hardware. Mac Studio orders with 128GB or 512GB RAM face 6+ week delays as consumers buy up every available unit to run powerful AI agents like OpenClaw.

85% relevant

Memory Supply Squeeze Hits Non-AI Sectors as DRAM Prices Double

DRAM prices surged 93-98% QoQ in Q1 2026 as AI data centers consume fab capacity. Nine industry groups warned the Trump administration on June 3 that the squeeze threatens supply for automotive, telecom, and medical devices.

62% relevant

ERCOT datacenter requests exceed grid capacity by 5x

ERCOT datacenter requests far exceed grid underwriting capacity, per @SemiAnalysis_, revealing grid approval as a binding constraint on AI infrastructure buildout.

87% relevant

CNAS Report: AI Hits Silicon Wall as Chip Supply Trails $700B CapEx

A CNAS report warns that semiconductor manufacturing cannot keep pace with AI demand as hyperscalers plan $700B+ CapEx in 2026. Silicon replaces power as the near-term constraint.

90% relevant

AI Data Centers Face 220GW Grid Jam, Power Infrastructure Becomes Bottleneck

PJM's 220GW interconnection queue shows AI data center growth is now constrained by power grid capacity, not compute. Hyperscalers face 3-7 year delays.

85% relevant

The $500B AI Chip Bottleneck: One Material, One Supplier

A single Japanese chemical company supplies 98% of the thin-film material used in every AI chip on earth. NVIDIA is paying half the capex to expand supplier fabs as lead times stretch past 6 months.

90% relevant

Google Splits TPU Line: 8t for Training, 8i for Inference

At Cloud Next 2026, Google introduced two new AI chips — TPU 8t for training and TPU 8i for inference — splitting its custom silicon for the first time. OpenAI, Anthropic, and Meta are buying multi-gigawatt TPU capacity, signaling a crack in NVIDIA's 81% market share.

100% relevant

AI Chip Capacity Crisis: 10GW Left Through 2030, Prices Up Double Digits

The AI accelerator market has only 10 gigawatts of capacity left for contract through 2030, with 100GW already under contract. Prices are rising double digits as one competitor has stopped taking orders entirely.

97% relevant

Microsoft's Fairwater AI Data Center Launches Early, Boosts Azure Capacity

Microsoft has launched its Fairwater AI data center ahead of schedule. The facility adds significant high-performance computing capacity to Azure's AI infrastructure, crucial for training and running large models.

92% relevant

Anthropic Hiring Data Center Leasing Principals in Europe & Australia

Anthropic is actively hiring for data center leasing roles in Europe and Australia, revealing a strategic push to build out its own compute infrastructure as it scales its AI models.

100% relevant

Anthropic's Adaptive Thinking: A Compute-Constrained Efficiency Play

Analysis suggests Anthropic's new 'adaptive thinking' feature is a direct response to compute constraints and competitive pressure from OpenAI, aiming to optimize token usage for enterprise clients at the potential cost of consumer experience.

87% relevant

Satellite Data Shows 40% of 2026 AI Data Centers at Risk of Delay

Geospatial analytics firm SynMax reports that at least 40% of AI data centers scheduled for 2026 completion are at risk of delays exceeding three months, based on satellite imagery analysis of construction progress at sites for OpenAI, Microsoft, and Oracle.

80% relevant