gentic.news — AI News Intelligence Platform
scale ai

30 articles about scale ai in AI news

Box Elder County to Vote on Hyperscale AI Data Center After Delay

Box Elder County is set to vote on a hyperscale AI data center after a delay. The decision tests how local government balances infrastructure demand against resource constraints.

85% relevant

Meta to Release First LLM Built Under Scale AI's Alexandr Wang

Meta is set to release its first large language model developed under the technical leadership of Scale AI founder Alexandr Wang. Although the model will not be fully open at first, the company plans to eventually open-source versions of the new model family.

87% relevant

Analysis: Meta's AI Investment Strategy Questioned as Scale AI Acquihire and Data Center Spend Top $700B

An analysis estimates Meta's total AI investment at ~$700B, including a ~$14.3B Scale AI acquihire and over $600B in data centers. The post questions why this spending has not yielded an upcoming model that is competitive with those from Chinese open-source labs.

85% relevant

Tulip and Salesfloor Merge to Scale AI-Powered Retail Engagement

Tulip, a mobile retail platform, and Salesfloor, a clienteling and virtual selling solution, have announced a merger. The combined entity aims to scale AI-powered customer engagement for retailers, focusing on unifying in-store and online experiences.

95% relevant

Perplexity's pplx-embed: The Bidirectional Breakthrough Transforming Web-Scale AI Retrieval

Perplexity has launched pplx-embed, a new family of multilingual embedding models that set state-of-the-art results on web-scale retrieval benchmarks. Built on the Qwen3 architecture with bidirectional attention, these models specifically address the noise and complexity of real-world web data.

75% relevant

Throughput Optimization as a Strategic Lever in Large-Scale AI Systems

A new arXiv paper argues that optimizing data pipeline and memory throughput is now a strategic necessity for training large AI models, citing specific innovations like OVERLORD and ZeRO-Offload that deliver measurable efficiency gains.

88% relevant

Nvidia Bets Big on Thinking Machines Lab with Gigawatt-Scale AI Partnership

Nvidia has formed a strategic partnership with Thinking Machines Lab, led by former OpenAI CTO Mira Murati, committing to deploy at least one gigawatt of next-generation Vera Rubin systems. The multiyear deal includes significant investment and aims to accelerate frontier AI development while expanding access to customizable models.

81% relevant

Nebius Claims First NVIDIA GB300 Exemplar Cloud for Training

Nebius becomes the first cloud provider validated as an NVIDIA Exemplar Cloud on GB300 for training, targeting hyperscale AI workloads.

94% relevant

Aehr Test Systems Lands $41M AI Chip Order; H2 Bookings Top $92M

Aehr Test Systems received a record $41 million production order from a key hyperscale AI customer. Total bookings for the second half of its fiscal year exceeded $92 million, highlighting surging demand for semiconductor test and burn-in equipment.

74% relevant

Cloudflare Agent Cloud Integrates OpenAI GPT-5.4 & Codex for Enterprise AI

Cloudflare has integrated OpenAI's GPT-5.4 and Codex models into its Agent Cloud platform. This allows enterprises to build, deploy, and scale AI agents for production workflows with built-in security and performance.

83% relevant

AI System Claims 100x Energy Efficiency Gain with Higher Accuracy

A new AI system reportedly uses 100 times less energy than current models while achieving higher accuracy. If validated, this could significantly reduce the operational costs and environmental impact of large-scale AI deployment.

95% relevant

Google's AI Infrastructure Strategy: What Retail Leaders Should Watch in 2026

Google's evolving AI infrastructure and compute strategy, including data center investments and model compression techniques, will directly impact how retail brands deploy and scale AI applications by 2026. The company's focus on efficiency and real-time capabilities signals a shift toward more accessible, powerful retail AI tools.

80% relevant

AI Agents Get a Memory Upgrade: New Research Tackles Long-Horizon Task Challenges

Researchers have developed new methods to scale AI agent memory for complex, long-horizon tasks. The breakthrough addresses one of the biggest limitations in current agent systems—their inability to retain and utilize information over extended sequences of actions.

87% relevant

Google's Virgo Network Links 134,000 TPU v8 Chips with 47 Pbps Fabric

Google unveiled its Virgo networking stack for TPU v8, capable of linking 134,000 chips in a single fabric with 47 petabits/sec of bisection bandwidth. This represents a massive scale-up in interconnect technology for large-scale AI model training.

100% relevant

GUC, Wiwynn Partner on Silicon-to-System AI Infrastructure for Hyperscalers

GUC and Wiwynn partner on silicon-to-system AI infrastructure, integrating SoC design, optical I/O, and liquid cooling for hyperscalers.

79% relevant

Google-Anthropic 5 GW Deal: AI Capacity Pre-Sold at Gigawatt Scale

Google and Anthropic signed a 5 GW compute deal, pre-selling AI capacity at gigawatt scale and reshaping infrastructure financing.

100% relevant

Meta Deploys AI Agents to Automate Hyperscale Performance Tuning

Meta deployed unified AI agents to automate hyperscale performance optimization, aiming to reduce manual tuning and costs amid a $145B AI capex push.

78% relevant

Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market

Qualcomm's CEO revealed a dedicated CPU for agentic AI, a custom silicon deal with a hyperscaler shipping in December 2026, and agentic smartphones. The pivot challenges the GPU-centric AI infrastructure consensus.

100% relevant

Castore and GXO Detail 'Sustainable Scale' Strategy at Drapers Supply

At the Drapers Supply Chain Summit, Castore CSCO Adrian Harris detailed how the rapid-growth sportswear brand is shifting focus from breakneck expansion to 'sustainable scale' with logistics partner GXO. The partnership is central to operationalizing sustainability in Castore's supply chain.

74% relevant

Airbnb's Engineering Blueprint for a Petabyte-Scale

Airbnb engineers detail the construction of a massive, internally operated metrics storage system. The system ingests 50 million samples per second, manages 1.3 billion active time series, and stores 2.5 petabytes of data, overcoming challenges in tenancy, shuffle sharding, and observability at scale.

80% relevant

Meta Deploys Unified AI Agents to Manage Hyperscale Infrastructure

Meta's engineering team has built and deployed a system of unified AI agents to autonomously manage capacity and performance across its hyperscale infrastructure. This represents a significant shift from rule-based automation to AI-driven orchestration for one of the world's largest computing fleets.

70% relevant

AI Labs Shift from Pure Engineering to Scaled Human Operations

As frontier AI models advance, the demand for expert human feedback—from annotators to red-teamers—is increasing, creating a labor market that resembles scaled human operations more than traditional software development.

85% relevant

Genspark Raises $385M at $1.6B Valuation, Scales AI Agent Platform After Strong Japan Traction

Genspark has raised $385 million at a $1.6 billion valuation to scale its AI Agent platform. The funding follows strong user engagement in Japan and will accelerate the commercialization of its 'AI Workspace' for enterprises.

95% relevant

Sam Altman Predicts 'One-Person Billion-Dollar Companies' as AI Reshapes Business Scale

OpenAI CEO Sam Altman predicts the emergence of 'one-person billion-dollar companies' powered by AI, citing a specific example from a private CEO discussion group. This follows his earlier forecast of 10-person billion-dollar firms, suggesting AI is accelerating the compression of business scale.

87% relevant

OXRL Study: Post-Training Algorithm Rankings Invert with Model Scale, Loss Modifications Offer Negligible Gains

A controlled study of 51 post-training algorithms across 240 runs finds algorithm performance rankings completely invert between 1.5B and 7B parameter models. The choice of loss function provides less than 1 percentage point of leverage compared to model scale.

95% relevant

Nscale's $2 Billion Bet: How a UK AI Infrastructure Startup Became Europe's New Tech Titan

UK-based AI infrastructure company Nscale has secured a massive $2 billion Series C round, valuing it at $14.6 billion. The funding will accelerate global deployment of vertically integrated AI data centers, with former Meta executives Sheryl Sandberg and Nick Clegg joining the board.

75% relevant

The AI Espionage Era: How Chinese Firms Launched Industrial-Scale Attacks on Claude

Anthropic reveals three massive AI model distillation campaigns by Chinese competitors who used 24,000 fake accounts to extract Claude's capabilities through 16 million exchanges. This industrial-scale intellectual property theft highlights growing tensions in the global AI race.

85% relevant

China's Mountain-Scale Solar Farms Redefine Renewable Energy Ambition

Massive solar installations covering entire hillsides in rural Guizhou demonstrate China's unprecedented scale in renewable energy infrastructure, transforming barren landscapes into terawatt-hour electricity generators.

85% relevant

Applied Digital Lands 300MW Lease with Hyperscaler at Louisiana Site

Applied Digital secured a 300MW lease with an investment-grade hyperscaler at its Delta Forge 1 site in Louisiana, with a total reported value of $7.5 billion, signaling continued demand for AI data center capacity.

100% relevant

Cisco Reveals Scale-Across GPU Networking Needs 14x DCI Bandwidth

Cisco's chief architect detailed the massive bandwidth requirements for connecting AI clusters via 'scale-across' GPU networking, which needs 14x the capacity of traditional data center interconnects. This shift is creating a multi-billion dollar market for 800G coherent pluggables and deep-buffered switches.

85% relevant