compute infrastructure
30 articles about compute infrastructure in AI news
Altimeter's Gerstner: AI Economics Shift to Owned Compute for Fixed Costs
Altimeter Capital's Brad Gerstner says the fundamental economics of AI have flipped: companies that own their compute infrastructure lock in fixed costs while AI-driven revenue scales, creating a powerful advantage.
Beyond Better Models: The Compute Scaling Revolution Driving AI's Next Leap
New analysis reveals that scaling compute infrastructure may deliver 10× annual efficiency gains in AI development, surpassing algorithmic improvements alone. The real leverage comes from combining innovative ideas with massive computational resources.
Adam Selipsky Leaves AWS to Lead $10B AI Data Center Venture
Former AWS CEO Adam Selipsky will lead a $10B AI data center venture, underscoring the capital race for AI compute infrastructure.
Anthropic Hiring Data Center Leasing Principals in Europe & Australia
Anthropic is actively hiring for data center leasing roles in Europe and Australia, revealing a strategic push to build out its own compute infrastructure as it scales its AI models.
Anthropic Considers Custom AI Chips, Following Google & OpenAI
Anthropic is reportedly considering developing custom AI chips, a strategic move to gain control over its compute infrastructure and reduce costs. This follows similar initiatives by Google, Amazon, and OpenAI.
US Data Center Power Demand Hits 15 GW, Grid Constraints Emerge
US data center power demand reached 15 gigawatts in 2023, up from 11 GW in 2022. This rapid growth highlights a widening bottleneck: compute infrastructure is scaling faster than power delivery systems can support.
Oracle Cuts 20% of Workforce to Fund AI Infrastructure Push, Shifting from Labor to Compute
Oracle is laying off 20% of its workforce to redirect capital toward massive AI infrastructure investments. The move signals a strategic pivot from traditional workforce costs to data center and compute spending.
KKR Launches $10B Helix to Solve AI Infrastructure's Power-Land-Compute Bottleneck
KKR and three heavyweight partners — Nvidia, the Kuwait Investment Authority, and Vistra — launched Helix Digital Infrastructure on June 10, 2026, with more than $10 billion in committed capital. Former AWS CEO Adam Selipsky, who stepped down from Amazon in June 2024 and joined KKR as a senior advis
Cursor SDK Turns AI Agent Runtime into Programmable Infrastructure
Cursor is releasing an SDK that turns its agent runtime into programmable infrastructure for headless use in CI/CD pipelines, internal tools, and third-party products. Revenue scales with compute tokens, not seats, enabling higher volume without human-in-the-loop.
Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion
Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity (equivalent to Microsoft's 2024 global data center footprint) and has committed over $100 billion to AWS over the next decade. The added infrastructure supports Claude's run-rate revenue, which has tripled to over $30B, and addresses consumer demand that is straining its systems.
DOE Seeks Input on AI Infrastructure for Federal Lands
The U.S. Department of Energy has published a Request for Information (RFI) to solicit input on developing AI and high-performance computing infrastructure on DOE-owned lands. This marks a significant step in the federal government's strategy to directly address the national AI compute shortage.
Nvidia: Cost Per Token Is the Only AI Infrastructure Metric That Matters
Nvidia asserts that total cost of ownership for AI infrastructure must be measured in cost per delivered token, not raw compute metrics. This shift is critical for scaling profitable agentic AI applications.
Canada's AI Compute Gap: Google Cloud Montreal Offers 2017-Era Chips
A developer's attempt to rent modern AI compute in Canada revealed a stark infrastructure gap, with major providers offering chips as old as 2017, undermining national AI ambitions.
AI Economics Shift: OpenAI Compute Margins Hit 70%, Anthropic Turns Profitable
Analysis shows AI economics have fundamentally flipped. For firms with owned compute, infrastructure costs stay fixed while revenue scales, lifting OpenAI's compute margins from 35% to 70% and swinging Anthropic's margins from -94% to +40%.
Terafab's 1GW AI Compute Goal Requires Massive Fab Capacity
Analysis of Terafab's stated goals shows that achieving 1GW of AI compute would require approximately 190,000 wafer starts per month across logic and memory. This underscores the unprecedented scale of semiconductor manufacturing needed for future AI infrastructure.
Google's AI Infrastructure Strategy: What Retail Leaders Should Watch in 2026
Google's evolving AI infrastructure and compute strategy, including data center investments and model compression techniques, will directly impact how retail brands deploy and scale AI applications by 2026. The company's focus on efficiency and real-time capabilities signals a shift toward more accessible, powerful retail AI tools.
Apple's Private Cloud Compute: Leak Suggests 4x M2 Ultra Cluster for On-Device AI Offload
A leak suggests Apple's Private Cloud Compute for AI may be built on clusters of four M2 Ultra chips, potentially offering high-performance, private server-side processing for iPhone AI tasks. This would mark Apple's strategic move into dedicated, privacy-focused AI infrastructure.
OpenAI Winds Down Sora App, Reallocates Compute to Next-Gen 'Spud' LLM Development
OpenAI has completed initial development of its next major AI model, codenamed 'Spud,' and is winding down the Sora video app, which was reportedly a compute resource drain. The move reallocates critical infrastructure toward core LLM competition with Anthropic and Google.
Frontier AI Labs Used Only 21% of Global Compute in 2025
Frontier labs used only 21% of global AI compute in 2025, according to Epoch AI, challenging the narrative of compute concentration.
HPE Slingshot Leads Supercomputer Interconnects; China's 1.2 Exaflops
HPE's Slingshot tops supercomputer interconnects, but China's 1.2 exaflops machine steals the show, signaling a tightening race in HPC.
Upscale AI Raises $190M for AI Networking Infrastructure
Upscale AI raised $190M to expand AI networking infrastructure, addressing the networking bottleneck in clusters of 100K+ GPUs.
Cursor Trains GPT-Size Model with 10-20x Compute
Cursor trained a GPT-size model from scratch with 10-20x more compute, announced at Compile. The move shifts from fine-tuning to pretraining for code generation.
NVIDIA, GENCI Launch AI Factory France Compute Access for Startups
NVIDIA and GENCI launched AI Factory France at VivaTech, giving European startups free access to AI supercomputers. The program includes compute, tools, and expert support for NVIDIA Inception members.
Computer Vision Deployments Drive Retail Productivity Gains
Computer vision deployments in retail are driving productivity gains by automating inventory, checkout, and loss prevention. AI News reports that retailers using these systems see measurable operational improvements. The technology leverages vision transformers and cloud platforms like Google Cloud.
Schneider Electric & Foxconn Partner on AI Data Center Infrastructure
Schneider Electric and Foxconn announced a strategic collaboration to co-develop next-gen AI data center infrastructure, including reference architectures and modular power/cooling skids. Production begins later this year.
Musk Pitches Moon as AI Compute Site via Electromagnetic Launchers
Musk proposes Moon-based electromagnetic accelerators to build solar panels for AI compute, leveraging lunar materials and low gravity.
xAI Pivots Colossus to Rental Compute, Chasing 30%+ Margins
xAI is pivoting Colossus to rental compute, targeting 30%+ margins as a neo-hyperscaler, according to an analyst.
Google to Pay SpaceX $920M/Month for xAI Compute Capacity
Google commits $11B/year to SpaceX for compute at xAI data centers, potentially adding $1T to SpaceX's valuation.
SemiAnalysis Calls Jensen Computex Keynote 'F Tier' Over No AI DC News
SemiAnalysis rated Jensen Huang's Computex keynote 'F Tier' for its lack of AI datacenter news and for revealing a delayed NVIDIA ARM chip with broken video output.
Albertsons Launches AI Supply Chain Tool With Computer Vision
Albertsons launched a patent-pending AI supply chain tool using computer vision to reduce food waste and improve inventory across 2,200+ stores.