compute scale

30 articles about compute scale in AI news

Safe Superintelligence Partners Nvidia for 10x Compute Scale-Up

SSI partners with Nvidia for 10x compute scale; Nvidia also invests. Details on investment size and timeline undisclosed, raising questions about the startup's capital needs.

Jul 28, 2026100% relevant

DeepSeek Builds Gigawatt-Scale AI Data Center in Inner Mongolia

DeepSeek is building a gigawatt-scale AI data center in Inner Mongolia, per Bloomberg. The project marks a strategic pivot from efficiency to raw compute scale.

Jul 30, 202687% relevant

JUPITER Exascale Maps Brain at Cellular Scale on 4,096 Grace Hopper Nodes

JUPITER, Europe's first exascale supercomputer, trained CytoNet brain model on 6.5 PB in 5 days and runs climate, 6G, and quantum simulations.

Jun 22, 202685% relevant

xAI pivots Colossus to rental compute, chasing 30%+ margins

xAI pivots Colossus to rental compute, targeting 30%+ margins as a neo-hyperscaler, per analyst.

Jun 6, 202683% relevant

Cerebras Challenges Nvidia Inference Monopoly with Wafer-Scale Edge

Cerebras is challenging Nvidia's inference dominance with wafer-scale chips, as inference workloads surpass training in AI compute spend.

May 20, 202670% relevant

Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale

Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.

May 1, 2026100% relevant

Altimeter's Gerstner: AI Economics Shift to Owned Compute for Fixed Costs

Altimeter Capital's Brad Gerstner states the fundamental economics of AI have flipped, where companies owning their compute infrastructure lock in fixed costs while AI-driven revenue scales, creating a powerful advantage.

Apr 12, 202685% relevant

AI Economics Shift: OpenAI Compute Margins Hit 70%, Anthropic Turns Profitable

Analysis shows AI economics have fundamentally flipped. Firms with owned compute see infrastructure costs remain fixed while revenue scales, leading OpenAI's compute margins to rise from 35% to 70% and Anthropic to turn from -94% to +40% margins.

Apr 11, 202687% relevant

Terafab's 1GW AI Compute Goal Requires Massive Fab Capacity

Analysis of Terafab's stated goals shows that achieving 1GW of AI compute would require approximately 190,000 wafer starts per month across logic and memory. This underscores the unprecedented scale of semiconductor manufacturing needed for future AI infrastructure.

Apr 7, 202685% relevant

From Surveillance to Service: How Computer Vision is Redefining Luxury Retail Experiences

Computer vision technology is evolving beyond basic analytics to enable personalized clienteling, virtual try-ons, and intelligent inventory management. For luxury brands, this means transforming physical stores into data-rich environments that deliver bespoke experiences at scale.

Mar 5, 202670% relevant

Nscale Acquires Anyscale, Adding Ray Creator to AI Cloud Stack

Nscale acquires Anyscale, adding Ray's software layer to its full-stack AI cloud. The ~200-person team joins, with terms undisclosed.

Jul 30, 202682% relevant

Epoch AI: Google's Colossus 1 Training Compute Hits 1e26 FLOP

Google's Colossus 1 used 1e26 FLOP at $4.6B, per Epoch AI. It is the largest known training run, signaling a new capital scale.

Jul 25, 2026100% relevant

NUS CIMERA Chip Cuts LLM Memory Wall with Compute-in-Interconnect

NUS researchers propose CIMERA, an LLM inference accelerator integrating compute-in-interconnect and memory to mitigate the memory wall, detailed in arXiv:2607.13649 (July 2026).

Jul 20, 202690% relevant

Microsoft to Deploy AMD Helios Rack-Scale AI at Scale on Azure

Microsoft will deploy AMD's Helios rack-scale AI accelerator at scale on Azure, powered by MI455X GPUs and Epyc Venice CPUs. The move diversifies Azure's AI silicon beyond Nvidia.

Jul 20, 2026100% relevant

Morgan Stanley: Top 5 Cloud Capex to Hit $1.2T, Quadruple Compute by 2028

Morgan Stanley forecasts top 5 cloud capex to hit $1.2T in 2027, $1.4T by 2028, with compute power quadrupling.

Jul 15, 202685% relevant

Domestic AI Compute Nears Tipping Point, Analyst Says

Analyst says domestic compute viable within a week; AI-2040 omits Ascend 910, highlighting hardware gaps.

Jul 12, 202677% relevant

Meta's Superintelligence Compute Ramp Spans 2000km Across Data Centers

Meta's superintelligence compute ramp spans 2000km+ with an RL startup, per SemiAnalysis, marking the most aggressive AI infrastructure build.

Jul 9, 2026100% relevant

Sarasota County Blocks Hyperscale Data Centers for One Year

Sarasota County enacted a one-year moratorium on hyperscale data centers over energy and water concerns, joining Palm Beach County in a growing local backlash against AI infrastructure.

Jul 9, 202691% relevant

Scale-Across: Cloud Giants Link Datacenters for Million-Accelerator AI Clusters

Cloud providers are linking multiple datacenters for million-accelerator AI clusters, a new 'scale-across' paradigm.

Jul 6, 202689% relevant

AISI: Fixed compute budgets underestimate AI agents by 60%

AISI found standard benchmarks cap compute budgets, underestimating agent capabilities by ~60%. Success rates jumped ~25% with 10x tokens.

Jul 3, 202696% relevant

AI Security Inst Shows Test-Time Compute Skews Frontier Evaluations

AISecInst research shows test-time compute budgets skew frontier model evaluations, challenging standard practices.

Jul 3, 202692% relevant

Frontier AI Labs Used Only 21% of Global Compute in 2025

Frontier labs used only 21% of global AI compute in 2025, per EpochAI, challenging the narrative of compute concentration.

Jun 29, 202691% relevant

Jim Keller: Tenstorrent IPO Looms as BlackHole Chip Scales

Jim Keller confirmed Tenstorrent's IPO plans as BlackHole chip scales for AI inference, competing with Nvidia. No revenue disclosed.

Jun 25, 202698% relevant

AI Data Center Scale Doubles Every 7 Months, Epoch Finds

Epoch AI finds AI data center scale doubles every 7 months, driven by Google, Microsoft, and Amazon investments. This accelerates beyond the earlier 12-month cycle, raising training cost projections to $10 billion by 2028.

Jun 25, 202695% relevant

HPE Slingshot Leads Supercomputer Interconnects; China's 1.2 Exaflops

HPE's Slingshot tops supercomputer interconnects, but China's 1.2 exaflops machine steals the show, signaling a tightening race in HPC.

Jun 24, 202688% relevant

Upscale AI Raises $190M for AI Networking Infrastructure

Upscale AI raised $190M to expand AI networking infrastructure, addressing the bottleneck of 100K+ GPU clusters.

Jun 22, 202695% relevant

Prometheus Hyperscale Wins Gigawatt Wyoming Campus Approval

Prometheus Hyperscale secured gigawatt campus approval in Wyoming for AI workloads, tapping low-cost power and land.

Jun 22, 202678% relevant

NVIDIA, GENCI Launch AI Factory France Compute Access for Startups

NVIDIA and GENCI launched AI Factory France at VivaTech, giving European startups free access to AI supercomputers. The program includes compute, tools, and expert support for NVIDIA Inception members.

Jun 18, 202690% relevant

Computer Vision Deployments Drive Retail Productivity Gains

Computer vision deployments in retail are driving productivity gains by automating inventory, checkout, and loss prevention. AI News reports that retailers using these systems see measurable operational improvements. The technology leverages vision transformers and cloud platforms like Google Cloud.

Jun 18, 202687% relevant

Intel Omni-Path Resurfaces as InfiniBand Rival for DoE Supercomputers

Intel's Omni-Path interconnect, revived by Cornelis Networks, will connect DoE supercomputers at 400Gbps as an InfiniBand alternative.

Jun 16, 202690% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety