cloud
30 articles about cloud in AI news
AI Model Runs Entirely on USB Stick, No Cloud Needed
An unnamed developer built an AI on a USB stick, no internet needed. Challenges ChatGPT's cloud model.
Claude Mythos Goes GA in Google Cloud Console, Drops Preview Label
Claude Mythos silently went GA in Google Cloud console, preview label removed. Signals deeper Anthropic-GCP integration.
Agentic Commerce: 50% of Online Transactions by 2027, Google Cloud Leads
Agents projected to handle 50% of online transactions by 2027. Payment reliability determines winners in agentic commerce, with Google Cloud leading enterprise rollouts.
Anthropic Launches Claude Platform on AWS — AWS Billing, IAM, CloudTrail
Anthropic launched Claude Platform on AWS, a native API with AWS billing, IAM, and CloudTrail. Same models and pricing as direct API; data stays at Anthropic, not AWS.
Nebius Claims First NVIDIA GB300 Exemplar Cloud for Training
Nebius becomes first cloud provider validated as NVIDIA Exemplar Cloud on GB300 for training, targeting hyperscale AI workloads.
MiniMax Music-2.6 Goes Free on Cloudflare This Week
MiniMax's Music-2.6 AI model is available for free on Cloudflare's platform this week, allowing users to generate full-length songs or instrumentals from text prompts.
Oracle Nabs $16B for Michigan AI Data Center, Rivaling Google Cloud
Oracle has secured $16 billion in funding for a massive AI data center in rural Michigan, a move that pits it directly against Google Cloud and other hyperscalers in the race to build AI infrastructure.
Cloudflare Ships Enterprise MCP Governance
Cloudflare's MCP portal aggregates servers behind Cloudflare Access auth, while Code Mode collapses APIs into two tools. But most SaaS MCP endpoints lack controls — here's how to protect your Claude Code workflows.
Google Cloud Next '26: 8th-gen TPUs, agent platform, $750M fund
At Cloud Next 2026, Google unveiled two 8th-gen TPU chips, a Gemini-based enterprise AI agent platform, and a $750 million partner fund to drive secure, large-scale automation and heavy AI workloads.
NVIDIA, Google Cloud Expand AI Partnership for Agentic & Physical AI
NVIDIA and Google Cloud announced an expanded partnership to advance agentic and physical AI, focusing on new infrastructure and software integrations. This builds on their existing collaboration to provide optimized AI training and inference platforms.
Gur Singh Claims 7 M4 MacBooks Match A100, Calls Cloud GPU Training a 'Scam'
Developer Gur Singh posted that seven M4 MacBooks (2.9 TFLOPS each) match an NVIDIA A100's performance, calling cloud GPU training a 'scam' and advocating for distributed, consumer-hardware approaches.
IOWN Forum Pushes All-Photonic WAN for AI Neocloud Interconnects
The IOWN Global Forum is focusing its optical networking tech on datacenter interconnects, aiming to let GPU 'neoclouds' and financial firms use cheaper, remote facilities without latency penalties for AI workloads.
Cloudflare's New MCP Server Cuts AI Code Review Costs by 70%
A new MCP server from Cloudflare that pre-processes code to remove non-essential elements, slashing token consumption for AI-powered development workflows.
MiniMax Launches MaxHermes, Cloud-Hosted Agent with NousResearch
MiniMax has launched MaxHermes, a cloud-hosted version of the Hermes agent framework, in partnership with NousResearch. This provides a managed service for users of MiniMax's M2.7 model, aiming to simplify agent deployment.
Cloud GPU vs. Colocation: H100 Costs $8k/Month on Google Cloud vs. $1k Colo
A technical founder highlights the stark economics: renting one H100 on Google Cloud costs ~$8,000/month, while the retail hardware is ~$30,000. At that rate, 4 months of cloud rental equals the cost of outright ownership, making colocation at ~$1k/month a compelling alternative for sustained AI workloads.
Mac Studio AI Hardware Shortage Signals Shift to Cloud Rentals
Developers report a global shortage of high-memory Apple Silicon Macs, with 128GB Mac Studios unavailable worldwide. This pushes practitioners toward renting cloud H100 GPUs at ~$3/hr, marking a shift from the recent local AI trend.
Cloudflare Agent Cloud Integrates OpenAI GPT-5.4 & Codex for Enterprise AI
Cloudflare has integrated OpenAI's GPT-5.4 and Codex models into its Agent Cloud platform. This allows enterprises to build, deploy, and scale AI agents for production workflows with built-in security and performance.
Apple Reportedly Developing 'Balta' AI ASIC for Cloud Compute
A Morgan Stanley report indicates Apple is accelerating development of a custom ASIC, codenamed 'Balta,' for AI cloud and hybrid compute. This marks Apple's first known move to design silicon for its data centers, not just consumer devices.
Intel & Google Announce Multiyear AI & Cloud Infrastructure Partnership
Intel and Google have announced a multiyear strategic collaboration to advance AI and cloud infrastructure, focusing on optimizing Google Cloud for Intel's Xeon processors, Gaudi AI accelerators, and future chips.
OpenCAD Browser Tool Enables Local, Private Text-to-CAD Conversion Without Cloud API
A developer has released an open-source text-to-CAD tool that runs entirely in a user's browser, enabling private, local 3D model generation from natural language descriptions. This approach bypasses cloud API costs and data privacy issues inherent in most current AI CAD solutions.
Google Cloud's Vertex AI Experiments Solves the 'Lost Model' Problem in ML Development
A Google Cloud team recounts losing their best-performing model after training 47 versions, highlighting a common MLops failure. They detail how Vertex AI Experiments provides systematic tracking to prevent this.
Sam3 + MLX Enables Local, Multi-Object Video Tracking Without Cloud APIs
A developer has combined Meta's Segment Anything 3 (Sam3) with Apple's MLX framework to enable local, on-device object tracking in videos. This bypasses cloud API costs and latency for computer vision tasks.
Apple's Private Cloud Compute: Leak Suggests 4x M2 Ultra Cluster for On-Device AI Offload
A leak suggests Apple's Private Cloud Compute for AI may be built on clusters of four M2 Ultra chips, potentially offering high-performance, private server-side processing for iPhone AI tasks. This would mark Apple's strategic move into dedicated, privacy-focused AI infrastructure.
KitchenTwin: VLM-Guided Scale Recovery Fuses Global Point Clouds with Object Meshes for Metric Digital Twins
Researchers propose KitchenTwin, a scale-aware 3D fusion framework that registers object meshes with transformer-predicted global point clouds using VLM-guided geometric anchors. The method resolves fundamental coordinate mismatches to build metrically consistent digital twins for embodied AI, and releases an open-source dataset.
Anthropic's Claude Code Adds Scheduled, Cloud-Based Task Execution
Anthropic's Claude Code now supports scheduling recurring, cloud-based tasks. Users can set a repository, schedule, and prompt, with Claude executing the task automatically.
Cloudflare CEO Predicts AI Bot Traffic Will Surpass Human Web Traffic by 2027
Cloudflare CEO Matthew Prince forecasts that automated bot traffic will exceed human web traffic within three years, driven by the proliferation of AI agents. This projection highlights a fundamental shift in internet infrastructure demands.
Alibaba Targets $100B in AI and Cloud Revenue, Betting on 'Agentic AI' for Commerce
Alibaba announced a five-year goal to generate over $100B from its AI and cloud divisions, pivoting its strategy toward the 'agentic AI era' where autonomous agents can complete transactions. This comes amid a major reorganization and heavy investment in AI infrastructure.
MetaClaw: Personal AI Agent That Meta-Learns from Conversations Using Cloud LoRA and Skill Synthesis
MetaClaw is a personal AI agent that automatically evolves from every conversation. It meta-learns in the wild using cloud LoRA and skill synthesis, scheduling weight updates during idle time with zero downtime.
SonarQube Cloud's New MCP Server: Add Security Scanning to Claude Code in 5 Minutes
SonarQube Cloud now has a native MCP server, letting Claude Code analyze code for security vulnerabilities, bugs, and code smells directly in your editor.
NVIDIA CEO Jensen Huang: 'We're Going to Bring OpenAI to AWS' to Drive 'Enormous' Cloud Consumption
NVIDIA CEO Jensen Huang stated at GTC 2026 that NVIDIA will bring OpenAI to AWS, driving massive cloud compute consumption and expanding OpenAI's compute-constrained reach.