compute
30 articles about compute in AI news
Anthropic Secures 5GW AWS Compute, $100B+ Deal for Claude Expansion
Anthropic has expanded its deal with Amazon to secure up to 5 gigawatts of compute capacity—equivalent to Microsoft's 2024 global data center footprint—and committed over $100 billion to AWS over the next decade. This infrastructure surge supports Claude's tripled run-rate revenue to over $30B and addresses consumer demand straining its systems.
GPT-5.4 Launches with Computer Control API
OpenAI launched GPT-5.4, featuring a 'Computer Use' API that lets the model control a user's desktop. Despite improvements, it scores 78.5% on SWE-Bench, behind Claude 3.5 Sonnet's 81.2%.
Anthropic's Adaptive Thinking: A Compute-Constrained Efficiency Play
Analysis suggests Anthropic's new 'adaptive thinking' feature is a direct response to compute constraints and competitive pressure from OpenAI, aiming to optimize token usage for enterprise clients at the potential cost of consumer experience.
Perplexity AI Launches 'Personal Computer' for Mac App Orchestration
Perplexity AI has released 'Personal Computer', a feature that integrates with its Mac app to securely orchestrate local files and applications. This move expands its AI assistant from web search to direct desktop interaction.
Canada's AI Compute Gap: Google Cloud Montreal Offers 2017-Era Chips
A technical developer's attempt to rent modern AI compute in Canada revealed a stark infrastructure gap, with major providers offering chips as old as 2017, undermining national AI ambitions.
Compute Constraints Create Double Bind for AI Growth: Ethan Mollick
Ethan Mollick highlights a critical industry bottleneck: compute scarcity forces a trade-off between raising prices/rationing current models and limiting future model training, creating a growth double bind.
AI Compute Crisis: GPU Prices Up 48%, Anthropic API at 98.95% Uptime
The AI industry faces a severe compute capacity crisis, with GPU prices up 48%, Anthropic API uptime falling to 98.95%, and OpenAI shutting down Sora to reallocate resources. Demand for agentic AI is outstripping supply, forcing rationing and product cancellations.
Computer Vision's Retail Applications: A Look at Current Use Cases
An article from vocal.media details five real-world applications where computer vision is transforming retail operations, including inventory tracking, loss prevention, and customer analytics.
Altimeter's Gerstner: AI Economics Shift to Owned Compute for Fixed Costs
Altimeter Capital's Brad Gerstner states the fundamental economics of AI have flipped, where companies owning their compute infrastructure lock in fixed costs while AI-driven revenue scales, creating a powerful advantage.
AI Economics Shift: OpenAI Compute Margins Hit 70%, Anthropic Turns Profitable
Analysis shows AI economics have fundamentally flipped. Firms with owned compute see infrastructure costs remain fixed while revenue scales, leading OpenAI's compute margins to rise from 35% to 70% and Anthropic to turn from -94% to +40% margins.
Meta's 'Model as Computer' Paper Explores LLM OS-Level Integration
A new research paper from Meta explores a paradigm where the language model acts as the computer's kernel, directly managing processes and memory. This could fundamentally change how AI agents are architected and interact with systems.
Meta's Neural Computers: Learned Runtimes Replace External OS for AI Agents
Meta AI and KAUST research introduces Neural Computers, a paradigm where AI models internalize computation, memory, and I/O. Early prototypes show 98.7% GUI cursor control and an 83% arithmetic accuracy boost via reprompting.
Apple Reportedly Developing 'Balta' AI ASIC for Cloud Compute
A Morgan Stanley report indicates Apple is accelerating development of a custom ASIC, codenamed 'Balta,' for AI cloud and hybrid compute. This marks Apple's first known move to design silicon for its data centers, not just consumer devices.
OpenAI Stargate Leaders Depart as Firm Pivots to $600B Compute Rental Plan
Key leaders behind OpenAI's Stargate AI supercomputer initiative are departing as the company shifts strategy from building its own data centers to planning a $600 billion compute rental spend over five years.
Perplexity Revenue Doubled in Q1 2026 After Launching Computer
Perplexity's revenue doubled in the quarter following the launch of its 'Computer' feature. The company now has over 100 million users and growing enterprise adoption.
Terafab's 1GW AI Compute Goal Requires Massive Fab Capacity
Analysis of Terafab's stated goals shows that achieving 1GW of AI compute would require approximately 190,000 wafer starts per month across logic and memory. This underscores the unprecedented scale of semiconductor manufacturing needed for future AI infrastructure.
OpenAI, Anthropic Forecast $121B Compute Burn, Revealing AI's True Cost
Internal forecasts from OpenAI and Anthropic reveal the core challenge of modern AI has shifted from selling the technology to financing the immense compute required for training and inference, with OpenAI projecting $121B in compute spending for 2028.
OpenAI Finishes GPT-5.5 'Spud' Pretraining, Halts Sora for Compute
OpenAI has finished pretraining its next major model, codenamed 'Spud' (likely GPT-5.5), built on a new architecture and data mix. The company reportedly halted its Sora video generation project entirely, sacrificing a $1B Disney investment, to prioritize compute for Spud's launch.
Anthropic's Claude Desktop Apps Gain Windows Support for Computer Use Feature
Anthropic has released Windows versions of Claude Code Desktop and Claude Cowork, bringing the 'computer use' feature—which allows the AI to interact with files and applications on a user's computer—to the platform. This follows the macOS release and marks a key step in Anthropic's desktop strategy.
OpenAI Reallocates Compute and Talent Toward 'Automated Researchers' and Agent Systems
OpenAI is reallocating significant compute resources and engineering talent toward developing 'automated researchers' and agent-based systems capable of executing complex tasks end-to-end, signaling a strategic pivot away from some existing projects.
Sam Altman Envisions Codex Desktop Evolving into Unified AI Agent Controlling Computers
Sam Altman discussed the Codex Desktop ecosystem evolving toward a unified AI agent that can control computers, access user data, and work across multiple surfaces. This vision points toward AI systems moving beyond code generation to become proactive, cross-platform assistants.
Computer Vision Is Transforming Retail Loss Prevention
The article discusses the growing adoption of computer vision systems in retail to prevent theft, manage inventory, and enhance store security. This represents a direct application of AI to a long-standing, costly industry problem.
Oracle Cuts 20% of Workforce to Fund AI Infrastructure Push, Shifting from Labor to Compute
Oracle is laying off 20% of its workforce to redirect capital toward massive AI infrastructure investments. The move signals a strategic pivot from traditional workforce costs to data center and compute spending.
Anthropic Launches Computer Use Feature in Claude Code, Enabling AI to Execute Terminal Commands
Anthropic has activated a 'computer use' capability within its Claude Code environment, allowing the AI assistant to directly execute terminal commands. This marks a significant step toward autonomous coding agents that can interact with development environments.
Claude Code's /mcp Computer Use: Test Your Local Apps Directly from the CLI
Claude Code can now open your apps, click through UIs, and test builds via a new /mcp computer-use command, turning it into a hands-on testing agent.
Elon Musk Predicts 'Vast Majority' of AI Compute Will Be for Real-Time Video
Elon Musk states that real-time video consumption and generation will consume most AI compute, highlighting a shift from text to video as the primary medium for AI processing.
Apple's Private Cloud Compute: Leak Suggests 4x M2 Ultra Cluster for On-Device AI Offload
A leak suggests Apple's Private Cloud Compute for AI may be built on clusters of four M2 Ultra chips, potentially offering high-performance, private server-side processing for iPhone AI tasks. This would mark Apple's strategic move into dedicated, privacy-focused AI infrastructure.
Morgan Stanley Predicts 10x Compute Spike to Double AI Intelligence, Highlights 18 GW Energy Crisis
Morgan Stanley forecasts a massive AI leap from a 10x increase in training compute, but warns of an 18-gigawatt U.S. power shortfall by 2028. The report claims GPT-5.4 matches human experts with 83% on GDPVal.
OpenAI Shifts Sora Team to World-Model Research, Reportedly Cancels Video Model for Compute
A report claims OpenAI has redirected its Sora team to focus on world-model research for robotics and canceled the video model to free compute for a new, powerful LLM codenamed 'Spud.'
Anthropic Launches 'Computer Use' Beta for Claude Desktop, Enabling Direct App Control
Anthropic has released a beta feature for Claude Desktop that allows the AI to directly view and interact with applications on a user's computer screen to complete tasks, marking a significant step toward agentic AI.