Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

cloud

30 articles about cloud in AI news

AI Model Runs Entirely on USB Stick, No Cloud Needed

An unnamed developer built an AI on a USB stick, no internet needed. Challenges ChatGPT's cloud model.

77% relevant

Claude Mythos Goes GA in Google Cloud Console, Drops Preview Label

Claude Mythos silently went GA in Google Cloud console, preview label removed. Signals deeper Anthropic-GCP integration.

91% relevant

Agentic Commerce: 50% of Online Transactions by 2027, Google Cloud Leads

Agents projected to handle 50% of online transactions by 2027. Payment reliability determines winners in agentic commerce, with Google Cloud leading enterprise rollouts.

94% relevant

Anthropic Launches Claude Platform on AWS — AWS Billing, IAM, CloudTrail

Anthropic launched Claude Platform on AWS, a native API with AWS billing, IAM, and CloudTrail. Same models and pricing as direct API; data stays at Anthropic, not AWS.

98% relevant

Nebius Claims First NVIDIA GB300 Exemplar Cloud for Training

Nebius becomes first cloud provider validated as NVIDIA Exemplar Cloud on GB300 for training, targeting hyperscale AI workloads.

94% relevant

MiniMax Music-2.6 Goes Free on Cloudflare This Week

MiniMax's Music-2.6 AI model is available for free on Cloudflare's platform this week, allowing users to generate full-length songs or instrumentals from text prompts.

75% relevant

Oracle Nabs $16B for Michigan AI Data Center, Rivaling Google Cloud

Oracle has secured $16 billion in funding for a massive AI data center in rural Michigan, a move that pits it directly against Google Cloud and other hyperscalers in the race to build AI infrastructure.

76% relevant

Cloudflare Ships Enterprise MCP Governance

Cloudflare's MCP portal aggregates servers behind Cloudflare Access auth, while Code Mode collapses APIs into two tools. But most SaaS MCP endpoints lack controls — here's how to protect your Claude Code workflows.

96% relevant

Google Cloud Next '26: 8th-gen TPUs, agent platform, $750M fund

At Cloud Next 2026, Google unveiled two 8th-gen TPU chips, a Gemini-based enterprise AI agent platform, and a $750 million partner fund to drive secure, large-scale automation and heavy AI workloads.

88% relevant

NVIDIA, Google Cloud Expand AI Partnership for Agentic & Physical AI

NVIDIA and Google Cloud announced an expanded partnership to advance agentic and physical AI, focusing on new infrastructure and software integrations. This builds on their existing collaboration to provide optimized AI training and inference platforms.

100% relevant

Gur Singh Claims 7 M4 MacBooks Match A100, Calls Cloud GPU Training a 'Scam'

Developer Gur Singh posted that seven M4 MacBooks (2.9 TFLOPS each) match an NVIDIA A100's performance, calling cloud GPU training a 'scam' and advocating for distributed, consumer-hardware approaches.

77% relevant

IOWN Forum Pushes All-Photonic WAN for AI Neocloud Interconnects

The IOWN Global Forum is focusing its optical networking tech on datacenter interconnects, aiming to let GPU 'neoclouds' and financial firms use cheaper, remote facilities without latency penalties for AI workloads.

78% relevant

Cloudflare's New MCP Server Cuts AI Code Review Costs by 70%

A new MCP server from Cloudflare that pre-processes code to remove non-essential elements, slashing token consumption for AI-powered development workflows.

82% relevant

MiniMax Launches MaxHermes, Cloud-Hosted Agent with NousResearch

MiniMax has launched MaxHermes, a cloud-hosted version of the Hermes agent framework, in partnership with NousResearch. This provides a managed service for users of MiniMax's M2.7 model, aiming to simplify agent deployment.

85% relevant

Cloud GPU vs. Colocation: H100 Costs $8k/Month on Google Cloud vs. $1k Colo

A technical founder highlights the stark economics: renting one H100 on Google Cloud costs ~$8,000/month, while the retail hardware is ~$30,000. At that rate, 4 months of cloud rental equals the cost of outright ownership, making colocation at ~$1k/month a compelling alternative for sustained AI workloads.

85% relevant

Mac Studio AI Hardware Shortage Signals Shift to Cloud Rentals

Developers report a global shortage of high-memory Apple Silicon Macs, with 128GB Mac Studios unavailable worldwide. This pushes practitioners toward renting cloud H100 GPUs at ~$3/hr, marking a shift from the recent local AI trend.

85% relevant

Cloudflare Agent Cloud Integrates OpenAI GPT-5.4 & Codex for Enterprise AI

Cloudflare has integrated OpenAI's GPT-5.4 and Codex models into its Agent Cloud platform. This allows enterprises to build, deploy, and scale AI agents for production workflows with built-in security and performance.

83% relevant

Apple Reportedly Developing 'Balta' AI ASIC for Cloud Compute

A Morgan Stanley report indicates Apple is accelerating development of a custom ASIC, codenamed 'Balta,' for AI cloud and hybrid compute. This marks Apple's first known move to design silicon for its data centers, not just consumer devices.

85% relevant

Intel & Google Announce Multiyear AI & Cloud Infrastructure Partnership

Intel and Google have announced a multiyear strategic collaboration to advance AI and cloud infrastructure, focusing on optimizing Google Cloud for Intel's Xeon processors, Gaudi AI accelerators, and future chips.

85% relevant

OpenCAD Browser Tool Enables Local, Private Text-to-CAD Conversion Without Cloud API

A developer has released an open-source text-to-CAD tool that runs entirely in a user's browser, enabling private, local 3D model generation from natural language descriptions. This approach bypasses cloud API costs and data privacy issues inherent in most current AI CAD solutions.

89% relevant

Google Cloud's Vertex AI Experiments Solves the 'Lost Model' Problem in ML Development

A Google Cloud team recounts losing their best-performing model after training 47 versions, highlighting a common MLops failure. They detail how Vertex AI Experiments provides systematic tracking to prevent this.

94% relevant

Sam3 + MLX Enables Local, Multi-Object Video Tracking Without Cloud APIs

A developer has combined Meta's Segment Anything 3 (Sam3) with Apple's MLX framework to enable local, on-device object tracking in videos. This bypasses cloud API costs and latency for computer vision tasks.

85% relevant

Apple's Private Cloud Compute: Leak Suggests 4x M2 Ultra Cluster for On-Device AI Offload

A leak suggests Apple's Private Cloud Compute for AI may be built on clusters of four M2 Ultra chips, potentially offering high-performance, private server-side processing for iPhone AI tasks. This would mark Apple's strategic move into dedicated, privacy-focused AI infrastructure.

85% relevant

KitchenTwin: VLM-Guided Scale Recovery Fuses Global Point Clouds with Object Meshes for Metric Digital Twins

Researchers propose KitchenTwin, a scale-aware 3D fusion framework that registers object meshes with transformer-predicted global point clouds using VLM-guided geometric anchors. The method resolves fundamental coordinate mismatches to build metrically consistent digital twins for embodied AI, and releases an open-source dataset.

83% relevant

Anthropic's Claude Code Adds Scheduled, Cloud-Based Task Execution

Anthropic's Claude Code now supports scheduling recurring, cloud-based tasks. Users can set a repository, schedule, and prompt, with Claude executing the task automatically.

87% relevant

Cloudflare CEO Predicts AI Bot Traffic Will Surpass Human Web Traffic by 2027

Cloudflare CEO Matthew Prince forecasts that automated bot traffic will exceed human web traffic within three years, driven by the proliferation of AI agents. This projection highlights a fundamental shift in internet infrastructure demands.

87% relevant

Alibaba Targets $100B in AI and Cloud Revenue, Betting on 'Agentic AI' for Commerce

Alibaba announced a five-year goal to generate over $100B from its AI and cloud divisions, pivoting its strategy toward the 'agentic AI era' where autonomous agents can complete transactions. This comes amid a major reorganization and heavy investment in AI infrastructure.

74% relevant

MetaClaw: Personal AI Agent That Meta-Learns from Conversations Using Cloud LoRA and Skill Synthesis

MetaClaw is a personal AI agent that automatically evolves from every conversation. It meta-learns in the wild using cloud LoRA and skill synthesis, scheduling weight updates during idle time with zero downtime.

85% relevant

SonarQube Cloud's New MCP Server: Add Security Scanning to Claude Code in 5 Minutes

SonarQube Cloud now has a native MCP server, letting Claude Code analyze code for security vulnerabilities, bugs, and code smells directly in your editor.

95% relevant

NVIDIA CEO Jensen Huang: 'We're Going to Bring OpenAI to AWS' to Drive 'Enormous' Cloud Consumption

NVIDIA CEO Jensen Huang stated at GTC 2026 that NVIDIA will bring OpenAI to AWS, driving massive cloud compute consumption and expanding OpenAI's compute-constrained reach.

87% relevant