Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

compute

30 articles about compute in AI news

SSSTC Unveils Immersion-Cooled SSDs at Computex 2026 for AI Data Centers

SSSTC expanded immersion-cooled SSDs at Computex 2026 for AI data center heat management, competing with Samsung and Micron but withholding pricing and availability.

82% relevant

SemiAnalysis Calls Jensen ComputeX Keynote 'F Tier' Over No AI DC News

SemiAnalysis rated Jensen Huang's ComputeX keynote 'F Tier' for no AI datacenter news and revealed a delayed NVIDIA ARM chip with broken video output.

82% relevant

Nvidia N1X Arm Laptop Chip Nears Reveal at Computex

Nvidia, Microsoft, Arm tease N1X Arm laptop chip debut at Computex. Nvidia enters Windows-on-Arm without owning the architecture it tried to buy.

85% relevant

Cua Driver Brings Background Computer-Use to Windows

Cua Driver launched Windows support for background computer-use, enabling agents like Claude Code to control GUI apps without blocking execution.

81% relevant

Compute Shortage to Split AI Market: Rich Get Agents, Poor Get Chatbots

Mollick warns compute shortage makes agents expensive while chatbots cheapen, splitting AI market by company resources.

75% relevant

OpenAI Readies General-Purpose LLM With Test-Time Compute Scaling

OpenAI is releasing a general-purpose LLM that improves with test-time compute, per an internal message. The model shows math gains without specialized training.

85% relevant

Albertsons Launches AI Supply Chain Tool With Computer Vision

Albertsons launched a patent-pending AI supply chain tool using computer vision to reduce food waste and improve inventory across 2,200+ stores.

100% relevant

NVIDIA, DOE Build 100K-GPU Supercomputer for Science

DOE and NVIDIA announced Solstice, a 100K-GPU Vera Rubin supercomputer delivering 5,000 exaflops, and Equinox with 10K Blackwell GPUs.

80% relevant

Anthropic's 220K GPU Cluster: $5B Compute Bet Revealed

Anthropic reportedly has 220K NVIDIA GPUs and 310MW, implying a >$5B compute cluster, 3x OpenAI's largest.

100% relevant

Span Launches XFRA Node: Distributed AI Compute in Homes at $3M/MW

Span's XFRA Node offers distributed AI compute at $3M/MW, using home grid capacity. A 100-home pilot this year targets 1.25 MW.

90% relevant

Cursor Walked from $50B Round for SpaceX's Compute Offer

Cursor was days from closing a $2B round at a $50B valuation with top investors, but walked away when SpaceX offered $60B and a million H100s, signaling compute access now rivals capital in AI dealmaking.

85% relevant

Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion

Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity—equivalent to Microsoft's 2024 global data center footprint—and committed over $100 billion to AWS over the next decade. This infrastructure surge supports Claude's tripled run-rate revenue to over $30B and addresses consumer demand straining its systems.

97% relevant

Fanuc robot arms combine AI and computer vision to adopt flexible workflows

Fanuc has updated its robot arms with AI and computer vision, enabling them to handle flexible workflows rather than fixed, repetitive tasks. This shift allows for greater adaptability in manufacturing environments.

74% relevant

GPT-5.4 Launches with Computer Control API

OpenAI launched GPT-5.4, featuring a 'Computer Use' API that lets the model control a user's desktop. Despite improvements, it scores 78.5% on SWE-Bench, behind Claude 3.5 Sonnet's 81.2%.

77% relevant

Anthropic's Adaptive Thinking: A Compute-Constrained Efficiency Play

Analysis suggests Anthropic's new 'adaptive thinking' feature is a direct response to compute constraints and competitive pressure from OpenAI, aiming to optimize token usage for enterprise clients at the potential cost of consumer experience.

87% relevant

Perplexity AI Launches 'Personal Computer' for Mac App Orchestration

Perplexity AI has released 'Personal Computer', a feature that integrates with its Mac app to securely orchestrate local files and applications. This move expands its AI assistant from web search to direct desktop interaction.

87% relevant

Canada's AI Compute Gap: Google Cloud Montreal Offers 2017-Era Chips

A technical developer's attempt to rent modern AI compute in Canada revealed a stark infrastructure gap, with major providers offering chips as old as 2017, undermining national AI ambitions.

85% relevant

Compute Constraints Create Double Bind for AI Growth: Ethan Mollick

Ethan Mollick highlights a critical industry bottleneck: compute scarcity forces a trade-off between raising prices/rationing current models and limiting future model training, creating a growth double bind.

85% relevant

AI Compute Crisis: GPU Prices Up 48%, Anthropic API at 98.95% Uptime

The AI industry faces a severe compute capacity crisis, with GPU prices up 48%, Anthropic API uptime falling to 98.95%, and OpenAI shutting down Sora to reallocate resources. Demand for agentic AI is outstripping supply, forcing rationing and product cancellations.

100% relevant

Computer Vision's Retail Applications: A Look at Current Use Cases

An article from vocal.media details five real-world applications where computer vision is transforming retail operations, including inventory tracking, loss prevention, and customer analytics.

72% relevant

Altimeter's Gerstner: AI Economics Shift to Owned Compute for Fixed Costs

Altimeter Capital's Brad Gerstner states the fundamental economics of AI have flipped, where companies owning their compute infrastructure lock in fixed costs while AI-driven revenue scales, creating a powerful advantage.

85% relevant

AI Economics Shift: OpenAI Compute Margins Hit 70%, Anthropic Turns Profitable

Analysis shows AI economics have fundamentally flipped. Firms with owned compute see infrastructure costs remain fixed while revenue scales, leading OpenAI's compute margins to rise from 35% to 70% and Anthropic to turn from -94% to +40% margins.

87% relevant

Meta's 'Model as Computer' Paper Explores LLM OS-Level Integration

A new research paper from Meta explores a paradigm where the language model acts as the computer's kernel, directly managing processes and memory. This could fundamentally change how AI agents are architected and interact with systems.

89% relevant

Meta's Neural Computers: Learned Runtimes Replace External OS for AI Agents

Meta AI and KAUST research introduces Neural Computers, a paradigm where AI models internalize computation, memory, and I/O. Early prototypes show 98.7% GUI cursor control and an 83% arithmetic accuracy boost via reprompting.

97% relevant

Apple Reportedly Developing 'Balta' AI ASIC for Cloud Compute

A Morgan Stanley report indicates Apple is accelerating development of a custom ASIC, codenamed 'Balta,' for AI cloud and hybrid compute. This marks Apple's first known move to design silicon for its data centers, not just consumer devices.

85% relevant

OpenAI Stargate Leaders Depart as Firm Pivots to $600B Compute Rental Plan

Key leaders behind OpenAI's Stargate AI supercomputer initiative are departing as the company shifts strategy from building its own data centers to planning a $600 billion compute rental spend over five years.

95% relevant

Perplexity Revenue Doubled in Q1 2026 After Launching Computer

Perplexity's revenue doubled in the quarter following the launch of its 'Computer' feature. The company now has over 100 million users and growing enterprise adoption.

85% relevant

Terafab's 1GW AI Compute Goal Requires Massive Fab Capacity

Analysis of Terafab's stated goals shows that achieving 1GW of AI compute would require approximately 190,000 wafer starts per month across logic and memory. This underscores the unprecedented scale of semiconductor manufacturing needed for future AI infrastructure.

85% relevant

OpenAI, Anthropic Forecast $121B Compute Burn, Revealing AI's True Cost

Internal forecasts from OpenAI and Anthropic reveal the core challenge of modern AI has shifted from selling the technology to financing the immense compute required for training and inference, with OpenAI projecting $121B in compute spending for 2028.

99% relevant

OpenAI Finishes GPT-5.5 'Spud' Pretraining, Halts Sora for Compute

OpenAI has finished pretraining its next major model, codenamed 'Spud' (likely GPT-5.5), built on a new architecture and data mix. The company reportedly halted its Sora video generation project entirely, sacrificing a $1B Disney investment, to prioritize compute for Spud's launch.

95% relevant