scale ai

30 articles about scale ai in AI news

Upscale AI Raises $190M for AI Networking Infrastructure

Upscale AI raised $190M to expand AI networking infrastructure, addressing the bottleneck of 100K+ GPU clusters.

Jun 22, 202695% relevant

Box Elder County to Vote on Hyperscale AI Data Center After Delay

Box Elder County votes on hyperscale AI data center after delay. Decision tests local government balance between infrastructure demand and resource constraints.

May 4, 202685% relevant

Meta to Release First LLM Built Under Scale AI's Alexandr Wang

Meta is set to release its first large language model developed under the technical leadership of Scale AI founder Alexandr Wang. While not fully open initially, the company plans to eventually open-source versions of the new model family.

Apr 6, 202687% relevant

Analysis: Meta's AI Investment Strategy Questioned as Scale AI Acquihire and Data Center Spend Top $700B

An analysis estimates Meta's total AI investment at ~$700B, including a ~$14.3M Scale AI acquihire and over $600B in data centers. The post questions why this has not yielded a competitive upcoming model against Chinese open-source labs.

Mar 25, 202685% relevant

Tulip and Salesfloor Merge to Scale AI-Powered Retail Engagement

Tulip, a mobile retail platform, and Salesfloor, a clienteling and virtual selling solution, have announced a merger. The combined entity aims to scale AI-powered customer engagement for retailers, focusing on unifying in-store and online experiences.

Mar 24, 202695% relevant

Perplexity's pplx-embed: The Bidirectional Breakthrough Transforming Web-Scale AI Retrieval

Perplexity has launched pplx-embed, a new family of multilingual embedding models that set state-of-the-art benchmarks for web-scale retrieval. Built on Qwen3 architecture with bidirectional attention, these models specifically address the noise and complexity of real-world web data.

Feb 27, 202675% relevant

Throughput Optimization as a Strategic Lever in Large-Scale AI Systems

A new arXiv paper argues that optimizing data pipeline and memory throughput is now a strategic necessity for training large AI models, citing specific innovations like OVERLORD and ZeRO-Offload that deliver measurable efficiency gains.

Mar 31, 202688% relevant

Nvidia Bets Big on Thinking Machines Lab with Gigawatt-Scale AI Partnership

Nvidia has formed a strategic partnership with Thinking Machines Lab, led by former OpenAI CTO Mira Murati, committing to deploy at least one gigawatt of next-generation Vera Rubin systems. The multiyear deal includes significant investment and aims to accelerate frontier AI development while expanding access to customizable models.

Mar 10, 202681% relevant

Nebius Claims First NVIDIA GB300 Exemplar Cloud for Training

Nebius becomes first cloud provider validated as NVIDIA Exemplar Cloud on GB300 for training, targeting hyperscale AI workloads.

Apr 29, 202694% relevant

Aehr Test Systems Lands $41M AI Chip Order; H2 Bookings Top $92M

Aehr Test Systems received a record $41 million production order from a key hyperscale AI customer. Total bookings for the second half of its fiscal year exceeded $92 million, highlighting surging demand for semiconductor test and burn-in equipment.

Apr 16, 202674% relevant

Cloudflare Agent Cloud Integrates OpenAI GPT-5.4 & Codex for Enterprise AI

Cloudflare has integrated OpenAI's GPT-5.4 and Codex models into its Agent Cloud platform. This allows enterprises to build, deploy, and scale AI agents for production workflows with built-in security and performance.

Apr 13, 202683% relevant

AI System Claims 100x Energy Efficiency Gain with Higher Accuracy

A new AI system reportedly uses 100 times less energy than current models while achieving higher accuracy. If validated, this could significantly reduce the operational costs and environmental impact of large-scale AI deployment.

Apr 6, 202695% relevant

Google's AI Infrastructure Strategy: What Retail Leaders Should Watch in 2026

Google's evolving AI infrastructure and compute strategy, including data center investments and model compression techniques, will directly impact how retail brands deploy and scale AI applications by 2026. The company's focus on efficiency and real-time capabilities signals a shift toward more accessible, powerful retail AI tools.

Apr 1, 202680% relevant

AI Agents Get a Memory Upgrade: New Research Tackles Long-Horizon Task Challenges

Researchers have developed new methods to scale AI agent memory for complex, long-horizon tasks. The breakthrough addresses one of the biggest limitations in current agent systems—their inability to retain and utilize information over extended sequences of actions.

Mar 10, 202687% relevant

Google's Virgo Network Links 134,000 TPU v8 Chips with 47 Pbps Fabric

Google unveiled its Virgo networking stack for TPU v8, capable of linking 134,000 chips in a single fabric with 47 petabits/sec of bi-sectional bandwidth. This represents a massive scale-up in interconnect technology for large-scale AI model training.

Apr 22, 2026100% relevant

JUPITER Exascale Maps Brain at Cellular Scale on 4,096 Grace Hopper Nodes

JUPITER, Europe's first exascale supercomputer, trained CytoNet brain model on 6.5 PB in 5 days and runs climate, 6G, and quantum simulations.

Jun 22, 202685% relevant

Qualcomm Launches AI Data Center Program With Hyperscaler Customer

Qualcomm launched an AI data center program with a major hyperscaler customer, targeting inference workloads. Financial terms and partner identity undisclosed.

Jun 17, 202685% relevant

ByteDance Builds In-House AI CPUs for TikTok-Scale Agent Inference

ByteDance builds custom AI CPUs for inference at TikTok scale, targeting scarce server supply. The move signals agent workload shift from training to inference hardware.

May 31, 202685% relevant

Cerebras WSE-3 Claims 10x Training Speed Over Nvidia H100 on GPT-Scale Model

Cerebras claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3. Benchmark lacks power and cost data, limiting independent verification.

May 15, 202664% relevant

Cerebra's Tokenomics Bet: AWS, OpenAI Deals and Wafer-Scale Edge

Cerebra's tokenomics pricing and AWS/OpenAI partnerships challenge NVIDIA's inference dominance, offering a 5x cost reduction per token via its wafer-scale architecture.

May 13, 202689% relevant

GUC, Wiwynn Partner on Silicon-to-System AI Infrastructure for Hyperscalers

GUC and Wiwynn partner on silicon-to-system AI infrastructure, integrating SoC design, optical I/O, and liquid cooling for hyperscalers.

May 4, 202683% relevant

Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale

Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.

May 1, 2026100% relevant

Meta Deploys AI Agents to Automate Hyperscale Performance Tuning

Meta deployed unified AI agents to automate hyperscale performance optimization, aiming to reduce manual tuning and costs amid a $145B AI capex push.

May 1, 202678% relevant

Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market

Qualcomm CEO revealed dedicated CPU for agentic AI, custom silicon deal with hyperscaler shipping Dec 2026, and agentic smartphones. Pivot challenges GPU-centric AI infrastructure consensus.

May 1, 2026100% relevant

Castore and GXO Detail 'Sustainable Scale' Strategy at Drapers Supply

At the Drapers Supply Chain Summit, Castore CSCO Adrian Harris detailed how the rapid-growth sportswear brand is shifting focus from breakneck expansion to 'sustainable scale' with logistics partner GXO. The partnership is central to operationalizing sustainability in Castore's supply chain.

Apr 24, 202674% relevant

Airbnb's Engineering Blueprint for a Petabyte-Scale

Airbnb engineers detail the construction of a massive, internally operated metrics storage system. The system ingests 50 million samples per second, manages 1.3 billion active time series, and stores 2.5 petabytes of data, overcoming challenges in tenancy, shuffle sharding, and observability at scale.

Apr 21, 202680% relevant

Meta Deploys Unified AI Agents to Manage Hyperscale Infrastructure

Meta's engineering team has built and deployed a system of unified AI agents to autonomously manage capacity and performance across its hyperscale infrastructure. This represents a significant shift from rule-based automation to AI-driven orchestration for one of the world's largest computing fleets.

Apr 16, 202670% relevant

AI Labs Shift from Pure Engineering to Scaled Human Operations

As frontier AI models advance, the demand for expert human feedback—from annotators to red-teamers—is increasing, creating a labor market that resembles scaled human operations more than traditional software development.

Apr 14, 202685% relevant

Genspark Raises $385M at $1.6B Valuation, Scales AI Agent Platform After Strong Japan Traction

Genspark has raised $385 million at a $1.6 billion valuation to scale its AI Agent platform. The funding follows strong user engagement in Japan and will accelerate the commercialization of its 'AI Workspace' for enterprises.

Apr 3, 202695% relevant

Sam Altman Predicts 'One-Person Billion-Dollar Companies' as AI Reshapes Business Scale

OpenAI CEO Sam Altman predicts the emergence of 'one-person billion-dollar companies' powered by AI, citing a specific example from a private CEO discussion group. This follows his earlier forecast of 10-person billion-dollar firms, suggesting AI is accelerating the compression of business scale.

Apr 2, 202687% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety