scale

30 articles about scale in AI news

NanoEuler: GPT-2-Scale 116M Model Built in Pure C/CUDA From Scratch

NanoEuler is a 116M-parameter GPT-2-scale model built in pure C/CUDA from scratch. It provides a complete educational training pipeline for understanding LLMs at the lowest level.

Jun 28, 2026 · 75% relevant

Vibe Coding Fails: Why AI-Generated Code Breaks at Scale

Vibe coding fails because AI-generated code lacks architectural coherence, test coverage, and security validation, breaking at scale beyond 1,000 lines.

Jun 27, 2026 · 70% relevant

Jim Keller: Tenstorrent IPO Looms as BlackHole Chip Scales

Jim Keller confirmed Tenstorrent's IPO plans as its BlackHole chip scales for AI inference, competing with Nvidia. No revenue figures were disclosed.

Jun 25, 2026 · 98% relevant

AI Data Center Scale Doubles Every 7 Months, Epoch Finds

Epoch AI finds AI data center scale doubles every 7 months, driven by Google, Microsoft, and Amazon investments. This accelerates beyond the earlier 12-month cycle, raising training cost projections to $10 billion by 2028.

Jun 25, 2026 · 95% relevant
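
To put the two doubling periods in the Epoch item side by side, here is a minimal arithmetic sketch. It assumes smooth exponential growth, which the summary does not state, and the helper name `growth_factor` is hypothetical. It only converts a doubling time into a growth multiple and says nothing about the cost projection.

```python
def growth_factor(months: float, doubling_months: float) -> float:
    """Multiple by which a quantity grows over `months`,
    assuming it doubles every `doubling_months` (smooth exponential growth)."""
    if doubling_months <= 0:
        raise ValueError("doubling_months must be positive")
    return 2.0 ** (months / doubling_months)


# Annual growth under the 7-month doubling time reported in the summary.
annual_at_7 = growth_factor(12, 7)    # 2 ** (12 / 7), roughly 3.3x per year

# Annual growth under the earlier 12-month cycle.
annual_at_12 = growth_factor(12, 12)  # exactly 2x per year
```

Under these assumptions, the shorter doubling time means roughly 3.3x growth per year instead of 2x.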

Upscale AI Raises $190M for AI Networking Infrastructure

Upscale AI raised $190M to expand AI networking infrastructure, addressing the bottleneck of 100K+ GPU clusters.

Jun 22, 2026 · 95% relevant

Prometheus Hyperscale Wins Gigawatt Wyoming Campus Approval

Prometheus Hyperscale secured gigawatt campus approval in Wyoming for AI workloads, tapping low-cost power and land.

Jun 22, 2026 · 78% relevant

JUPITER Exascale Maps Brain at Cellular Scale on 4,096 Grace Hopper Nodes

JUPITER, Europe's first exascale supercomputer, trained the CytoNet brain model on 6.5 PB in 5 days and also runs climate, 6G, and quantum simulations.

Jun 22, 2026 · 85% relevant

Qualcomm Launches AI Data Center Program With Hyperscaler Customer

Qualcomm launched an AI data center program with a major hyperscaler customer, targeting inference workloads. Financial terms and the partner's identity were not disclosed.

Jun 17, 2026 · 85% relevant

CoreWeave Beats AWS, Google to First Vera Rubin Rack-Scale Validation

CoreWeave validated Nvidia's Vera Rubin NVL72 at rack scale before hyperscalers, reinforcing its GPU-first strategy.

Jun 13, 2026 · 94% relevant

Scale Your AI Code Review Fleet

Gito v4.1.0 now runs on Claude Code and Gemini CLI. Use async LLM requests and selective model routing to scale code review fleets efficiently.

Jun 5, 2026 · 87% relevant

Cerebras Reengineers Mechanical Playbook for Wafer-Scale Chip Cooling

Cerebras disclosed three mechanical innovations—vertical power delivery, flexible interposers, and direct-impingement cooling—to prevent wafer-scale chips from cracking, rewriting engineering fundamentals.

Jun 4, 2026 · 88% relevant

Virginia Beach moves to ban hyperscale data centers

Virginia Beach council members propose banning new hyperscale data centers over power, water, and noise concerns, targeting facilities >100K sq ft or >50 MW.

Jun 2, 2026 · 79% relevant

Instacart's Semantic IDs: Product Understanding at Scale

Instacart's engineering team details a semantic ID system for product understanding at scale, using embeddings to create meaningful identifiers that enhance search and recommendations. This approach captures nuanced product relationships, improving relevance for grocery e-commerce.

Jun 2, 2026 · 100% relevant

ByteDance Builds In-House AI CPUs for TikTok-Scale Agent Inference

ByteDance is building custom AI CPUs for inference at TikTok scale in response to scarce server supply. The move signals a shift in agent workloads from training hardware to inference hardware.

May 31, 2026 · 85% relevant

Cerebras Challenges Nvidia Inference Monopoly with Wafer-Scale Edge

Cerebras is challenging Nvidia's inference dominance with wafer-scale chips, as inference workloads surpass training in AI compute spend.

May 20, 2026 · 70% relevant

Cerebras WSE-3 Claims 10x Training Speed Over Nvidia H100 on GPT-Scale Model

Cerebras claims its WSE-3 trains GPT-3-scale models 10x faster than the Nvidia H100. The benchmark omits power and cost data, which limits independent verification.

May 15, 2026 · 64% relevant

Google TPU 'Broadfly' Topology Scales Pod to 1,152 Chips

Google unveiled a Broadfly TPU topology at Cloud Next, scaling pods to 1,152 chips — 4.5x larger than Ironwood — with max 7 hops. This inference-first design challenges NVIDIA's NVLink on scale and latency.

May 14, 2026 · 94% relevant

Cerebras' Tokenomics Bet: AWS, OpenAI Deals and Wafer-Scale Edge

Cerebras' tokenomics pricing and AWS/OpenAI partnerships challenge NVIDIA's inference dominance, offering a 5x cost reduction per token via its wafer-scale architecture.

May 13, 2026 · 89% relevant

Nscale to Deploy 66K+ Rubin GPUs for Microsoft in Portugal

Nscale will deploy 66,000+ NVIDIA Rubin GPUs for Microsoft at Portugal's Start Campus. The deal is a first for Rubin and signals Microsoft's geographic diversification.

May 5, 2026 · 80% relevant

GUC, Wiwynn Partner on Silicon-to-System AI Infrastructure for Hyperscalers

GUC and Wiwynn partner on silicon-to-system AI infrastructure, integrating SoC design, optical I/O, and liquid cooling for hyperscalers.

May 4, 2026 · 83% relevant

Box Elder County to Vote on Hyperscale AI Data Center After Delay

Box Elder County will vote on a hyperscale AI data center after a delay. The decision tests how local government balances infrastructure demand against resource constraints.

May 4, 2026 · 85% relevant

Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale

Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.

May 1, 2026 · 100% relevant

Qualcomm Ships Hyperscaler Custom Silicon by December 2026

Qualcomm is developing custom silicon for an unnamed hyperscaler, with shipments expected December 2026, marking its most concrete data-center comeback move.

May 1, 2026 · 76% relevant

Meta Deploys AI Agents to Automate Hyperscale Performance Tuning

Meta deployed unified AI agents to automate hyperscale performance optimization, aiming to reduce manual tuning and costs amid a $145B AI capex push.

May 1, 2026 · 78% relevant

Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market

Qualcomm's CEO revealed a dedicated CPU for agentic AI, a custom silicon deal with a hyperscaler shipping in December 2026, and agentic smartphones. The pivot challenges the GPU-centric AI infrastructure consensus.

May 1, 2026 · 100% relevant

Utah Hyperscale Data Center to Exceed State Power Use

A hyperscale data center in Box Elder County, Utah, developed by Kevin O'Leary's O'Leary Digital, is set to generate and consume more power than the state itself, moving toward final approval.

Apr 26, 2026 · 100% relevant

Castore and GXO Detail 'Sustainable Scale' Strategy at Drapers Supply

At the Drapers Supply Chain Summit, Castore CSCO Adrian Harris detailed how the rapid-growth sportswear brand is shifting focus from breakneck expansion to 'sustainable scale' with logistics partner GXO. The partnership is central to operationalizing sustainability in Castore's supply chain.

Apr 24, 2026 · 74% relevant

Applied Digital Lands 300MW Lease with Hyperscaler at Louisiana Site

Applied Digital secured a 300MW lease with an investment-grade hyperscaler at its Delta Forge 1 site in Louisiana, with a total reported value of $7.5 billion, signaling continued demand for AI data center capacity.

Apr 23, 2026 · 100% relevant

Airbnb's Engineering Blueprint for a Petabyte-Scale

Airbnb engineers detail the construction of a massive, internally operated metrics storage system. The system ingests 50 million samples per second, manages 1.3 billion active time series, and stores 2.5 petabytes of data, overcoming challenges in tenancy, shuffle sharding, and observability at scale.

Apr 21, 2026 · 80% relevant
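
For readers unfamiliar with the shuffle sharding mentioned in the Airbnb item, the general idea can be sketched as below. This is an illustration of the generic technique, not Airbnb's implementation: the function name `shards_for_tenant`, the hash-based seeding, and the shard counts are all assumptions made for the example.

```python
import hashlib
import random


def shards_for_tenant(tenant_id: str, total_shards: int, shards_per_tenant: int) -> list[int]:
    """Pick a small, deterministic pseudo-random subset of shards for a tenant.

    Each tenant lands on its own combination of shards, so one noisy tenant
    only fully overlaps with tenants that drew the same combination.
    """
    if not 0 < shards_per_tenant <= total_shards:
        raise ValueError("shards_per_tenant must be between 1 and total_shards")
    # Derive a stable seed from the tenant ID so the assignment is repeatable.
    digest = hashlib.sha256(tenant_id.encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    # Sample without replacement: the tenant's shards are distinct.
    return sorted(rng.sample(range(total_shards), shards_per_tenant))


# Hypothetical usage: 64 shards in total, 4 per tenant.
tenant_a = shards_for_tenant("tenant-a", 64, 4)
tenant_b = shards_for_tenant("tenant-b", 64, 4)
```

With many shards and a small subset per tenant, the number of possible combinations is large, which is what limits the blast radius of any single tenant.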

Cisco Reveals Scale-Across GPU Networking Needs 14x DCI Bandwidth

Cisco's chief architect detailed the massive bandwidth requirements for connecting AI clusters via 'scale-across' GPU networking, which needs 14x the capacity of traditional data center interconnects. This shift is creating a multi-billion dollar market for 800G coherent pluggables and deep-buffered switches.

Apr 21, 2026 · 85% relevant
