compute scale
30 articles about compute scale in AI news
xAI pivots Colossus to rental compute, chasing 30%+ margins
xAI pivots Colossus to rental compute, targeting 30%+ margins as a neo-hyperscaler, per analyst.
Cerebras Challenges Nvidia Inference Monopoly with Wafer-Scale Edge
Cerebras is challenging Nvidia's inference dominance with wafer-scale chips, as inference workloads surpass training in AI compute spend.
Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale
Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.
Altimeter's Gerstner: AI Economics Shift to Owned Compute for Fixed Costs
Altimeter Capital's Brad Gerstner states the fundamental economics of AI have flipped, where companies owning their compute infrastructure lock in fixed costs while AI-driven revenue scales, creating a powerful advantage.
AI Economics Shift: OpenAI Compute Margins Hit 70%, Anthropic Turns Profitable
Analysis shows AI economics have fundamentally flipped. Firms with owned compute see infrastructure costs remain fixed while revenue scales, leading OpenAI's compute margins to rise from 35% to 70% and Anthropic to turn from -94% to +40% margins.
Terafab's 1GW AI Compute Goal Requires Massive Fab Capacity
Analysis of Terafab's stated goals shows that achieving 1GW of AI compute would require approximately 190,000 wafer starts per month across logic and memory. This underscores the unprecedented scale of semiconductor manufacturing needed for future AI infrastructure.
From Surveillance to Service: How Computer Vision is Redefining Luxury Retail Experiences
Computer vision technology is evolving beyond basic analytics to enable personalized clienteling, virtual try-ons, and intelligent inventory management. For luxury brands, this means transforming physical stores into data-rich environments that deliver bespoke experiences at scale.
Musk Pitches Moon as AI Compute Site via Electromagnetic Launchers
Musk proposes Moon-based electromagnetic accelerators to build solar panels for AI compute, leveraging lunar materials and low gravity.
Google to Pay SpaceX $920M/Month for xAI Compute Capacity
Google commits $11B/year to SpaceX for compute at xAI data centers, potentially adding $1T to SpaceX's valuation.
Cerebras Reengineers Mechanical Playbook for Wafer-Scale Chip Cooling
Cerebras disclosed three mechanical innovations—vertical power delivery, flexible interposers, and direct-impingement cooling—to prevent wafer-scale chips from cracking, rewriting engineering fundamentals.
SSSTC Unveils Immersion-Cooled SSDs at Computex 2026 for AI Data Centers
SSSTC expanded immersion-cooled SSDs at Computex 2026 for AI data center heat management, competing with Samsung and Micron but withholding pricing and availability.
SemiAnalysis Calls Jensen ComputeX Keynote 'F Tier' Over No AI DC News
SemiAnalysis rated Jensen Huang's ComputeX keynote 'F Tier' for no AI datacenter news and revealed a delayed NVIDIA ARM chip with broken video output.
ByteDance Builds In-House AI CPUs for TikTok-Scale Agent Inference
ByteDance builds custom AI CPUs for inference at TikTok scale, targeting scarce server supply. The move signals agent workload shift from training to inference hardware.
OpenAI Readies General-Purpose LLM With Test-Time Compute Scaling
OpenAI is releasing a general-purpose LLM that improves with test-time compute, per an internal message. The model shows math gains without specialized training.
Cerebras WSE-3 Claims 10x Training Speed Over Nvidia H100 on GPT-Scale Model
Cerebras claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3. Benchmark lacks power and cost data, limiting independent verification.
Cerebra's Tokenomics Bet: AWS, OpenAI Deals and Wafer-Scale Edge
Cerebra's tokenomics pricing and AWS/OpenAI partnerships challenge NVIDIA's inference dominance, offering a 5x cost reduction per token via its wafer-scale architecture.
Albertsons Launches AI Supply Chain Tool With Computer Vision
Albertsons launched a patent-pending AI supply chain tool using computer vision to reduce food waste and improve inventory across 2,200+ stores.
NVIDIA, DOE Build 100K-GPU Supercomputer for Science
DOE and NVIDIA announced Solstice, a 100K-GPU Vera Rubin supercomputer delivering 5,000 exaflops, and Equinox with 10K Blackwell GPUs.
Anthropic's 220K GPU Cluster: $5B Compute Bet Revealed
Anthropic reportedly has 220K NVIDIA GPUs and 310MW, implying a >$5B compute cluster, 3x OpenAI's largest.
Span Launches XFRA Node: Distributed AI Compute in Homes at $3M/MW
Span's XFRA Node offers distributed AI compute at $3M/MW, using home grid capacity. A 100-home pilot this year targets 1.25 MW.
Nscale to Deploy 66K+ Rubin GPUs for Microsoft in Portugal
Nscale will deploy 66,000+ NVIDIA Rubin GPUs for Microsoft at Portugal's Start Campus. The deal is a first for Rubin and signals Microsoft's geographic diversification.
Box Elder County to Vote on Hyperscale AI Data Center After Delay
Box Elder County votes on hyperscale AI data center after delay. Decision tests local government balance between infrastructure demand and resource constraints.
Qualcomm Ships Hyperscaler Custom Silicon by December 2026
Qualcomm is developing custom silicon for an unnamed hyperscaler, with shipments expected December 2026, marking its most concrete data-center comeback move.
Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market
Qualcomm CEO revealed dedicated CPU for agentic AI, custom silicon deal with hyperscaler shipping Dec 2026, and agentic smartphones. Pivot challenges GPU-centric AI infrastructure consensus.
Utah Hyperscale Data Center to Exceed State Power Use
A hyperscale data center in Box Elder County, Utah, developed by Kevin O'Leary's O'Leary Digital, is set to generate and consume more power than the state itself, moving toward final approval.
Cursor Walked from $50B Round for SpaceX's Compute Offer
Cursor was days from closing a $2B round at a $50B valuation with top investors, but walked away when SpaceX offered $60B and a million H100s, signaling compute access now rivals capital in AI dealmaking.
Applied Digital Lands 300MW Lease with Hyperscaler at Louisiana Site
Applied Digital secured a 300MW lease with an investment-grade hyperscaler at its Delta Forge 1 site in Louisiana, with a total reported value of $7.5 billion, signaling continued demand for AI data center capacity.
Airbnb's Engineering Blueprint for a Petabyte-Scale
Airbnb engineers detail the construction of a massive, internally operated metrics storage system. The system ingests 50 million samples per second, manages 1.3 billion active time series, and stores 2.5 petabytes of data, overcoming challenges in tenancy, shuffle sharding, and observability at scale.
Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion
Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity—equivalent to Microsoft's 2024 global data center footprint—and committed over $100 billion to AWS over the next decade. This infrastructure surge supports Claude's tripled run-rate revenue to over $30B and addresses consumer demand straining its systems.
Fanuc robot arms combine AI and computer vision to adopt flexible workflows
Fanuc has updated its robot arms with AI and computer vision, enabling them to handle flexible workflows rather than fixed, repetitive tasks. This shift allows for greater adaptability in manufacturing environments.