![Apple & Google: Different Story, Similar AI Strategy | ArcTouch](https://images.ctfassets.net/efnbmai9c55m/i88vRY1jdse9

Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

Products & LaunchesScore: 94

Apple Ditches Apple Silicon Pledge, Routes AI Queries to Google Cloud

Apple routes AI queries to Google Cloud, breaking 2024 Apple silicon pledge. Distilled Gemini runs locally; heavier queries use Nvidia tech in Google Cloud.

AAAla SMITH & AI Research Desk·May 31, 2026·3 min read··269 views·AI-Generated·Report error

Source: x.comvia @kimmonismusCorroborated

How is Apple handling AI queries after its 2024 Private Cloud Compute pledge?

Apple will run a distilled Gemini model locally on iPhone silicon, with heavier queries routed to Google Cloud using Nvidia confidential-compute tech, abandoning its 2024 promise to keep data on Apple silicon in Private Cloud Compute.

TL;DR

Apple using Google Cloud for AI queries · Private Cloud Compute name stays despite change · Small distilled Gemini model runs on iPhone

Apple is abandoning its 2024 promise to keep AI queries on Apple silicon, routing heavier requests to Google Cloud instead. According to @kimmonismus, a distilled Gemini model runs locally on iPhone while offloaded queries use Nvidia confidential-compute tech in Google Cloud.

Key facts

Local model is a distilled version of Google's Gemini
Heavy queries route to Google Cloud, not Apple silicon
Apple signed off on Nvidia confidential-compute tech for Google Cloud
Apple looked at Liquid AI for on-device model shrinking
Private Cloud Compute name unchanged despite cloud switch

Apple's WWDC next month is expected to center on long-delayed Siri and on-device AI upgrades, but the infrastructure story is more complicated than the marketing suggests.

What Apple is actually doing

Apple will run a smaller, distilled version of Google's Gemini locally on iPhone silicon, pitched on privacy and lower token costs. [According to @kimmonismus] Most of that stack is sourced from elsewhere. The local model is distilled from Gemini.

Queries too heavy for the device route to Google Cloud, where Apple has now signed off on Nvidia's confidential-compute tech to process them. Apple is also reportedly hunting for small on-device-AI startups to speed up the model-shrinking work, having looked at Liquid AI among others.

The broken promise

One quiet shift from the 2024 rollout: Apple promised then that anything leaving your iPhone would run on Apple silicon inside Private Cloud Compute. It couldn't get the full Gemini running there, so those queries now sit in Google Cloud. The Private Cloud Compute name is staying anyway. [The Information]

The unique take

This is the first major crack in Apple's vertical-integration AI narrative. The company that spent years building custom silicon and marketing privacy as a differentiator is now routing its most sensitive user queries — voice commands, personal context — through a competitor's cloud running Nvidia GPUs. The "Private Cloud Compute" label becomes branding, not architecture.

Apple's reliance on Google for both the model (Gemini) and the compute (Google Cloud) creates an awkward dependency. If Google changes pricing, terms, or model access, Apple has limited leverage. The startup acquisition hunt suggests Apple knows this and is trying to build internal model-shrinking capability, but that takes time.

Key Takeaways

Apple routes AI queries to Google Cloud, breaking 2024 Apple silicon pledge.
Distilled Gemini runs locally; heavier queries use Nvidia tech in Google Cloud.

What to watch

Apple & Google: Different Story, Similar AI Strategy | ArcTouch

Watch for WWDC 2026 (next month) where Apple will detail the Siri upgrade. Key metrics: whether Apple discloses the cloud provider arrangement, and whether it announces an acquisition of an on-device-AI startup like Liquid AI to reduce Google dependency.

Source: gentic.news · May 31, 2026 · author=Ala SMITH · citation.json

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

This is a structural retreat from Apple's vertical-integration strategy. The company spent years positioning its custom silicon and on-device processing as privacy differentiators. Now it's outsourcing both the model (Gemini) and the compute (Google Cloud) to competitors. The Private Cloud Compute branding remaining despite the switch suggests marketing inertia over technical reality. Apple's startup acquisition hunt — including reportedly looking at Liquid AI — indicates it wants to build internal capability to shrink models without Google's help. But model distillation is a fast-moving field; Apple is playing catch-up. The dependency on Google is particularly awkward given Apple's existing antitrust tensions with Google over search revenue. Nvidia's confidential-compute tech being used in Google Cloud is notable: it suggests Apple needed hardware-level isolation guarantees that Google's standard infrastructure couldn't provide. This is a win for Nvidia's enterprise security pitch and a signal that confidential computing is becoming table stakes for cloud AI inference.

#model distillation #ai infrastructure #apple #google cloud

This story is part of

The Agentic Pivot: How Claude Code Is Forcing a Reconfiguration of the AI Stack

Anthropic's developer tool is becoming the connective tissue between models, infrastructure, and autonomous workflows, challenging OpenAI's application-first strategy.

Compare side-by-side

Google vs Nvidia

→

Mentioned in this article

Google Apple Google Cloud Gemini Nvidia Liquid AI

Enjoyed this article?