Apple Ditches Apple Silicon Pledge, Routes AI Queries to Google Cloud

Apple routes AI queries to Google Cloud, breaking 2024 Apple silicon pledge. Distilled Gemini runs locally; heavier queries use Nvidia tech in Google Cloud.

6h ago · 3 min read · 21 views · AI-Generated
How is Apple handling AI queries after its 2024 Private Cloud Compute pledge?

Apple will run a distilled Gemini model locally on iPhone silicon, with heavier queries routed to Google Cloud using Nvidia confidential-compute tech, abandoning its 2024 promise to keep data on Apple silicon in Private Cloud Compute.

TL;DR

Apple using Google Cloud for AI queries · Private Cloud Compute name stays despite change · Small distilled Gemini model runs on iPhone

Apple is abandoning its 2024 promise to keep AI queries on Apple silicon, routing heavier requests to Google Cloud instead. According to @kimmonismus, a distilled Gemini model runs locally on iPhone while offloaded queries use Nvidia confidential-compute tech in Google Cloud.

Key facts

  • Local model is a distilled version of Google's Gemini
  • Heavy queries route to Google Cloud, not Apple silicon
  • Apple signed off on Nvidia confidential-compute tech for Google Cloud
  • Apple looked at Liquid AI for on-device model shrinking
  • Private Cloud Compute name unchanged despite cloud switch

Apple's WWDC next month is expected to center on long-delayed Siri and on-device AI upgrades, but the infrastructure story is more complicated than the marketing suggests.

What Apple is actually doing

Apple will run a smaller, distilled version of Google's Gemini locally on iPhone silicon, pitched on privacy and lower token costs, according to @kimmonismus. Most of that stack is sourced from outside Apple, starting with the local model itself.

Queries too heavy for the device route to Google Cloud, where Apple has now signed off on Nvidia's confidential-compute tech to process them. Apple is also reportedly hunting for small on-device-AI startups to speed up the model-shrinking work, having looked at Liquid AI among others.
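
The split between on-device and cloud processing can be pictured as a simple router. This is an illustrative sketch only: the report does not say how Apple decides which queries leave the device, so the word-count threshold, the `Query` fields, and every name below are assumptions made for the example.

```python
from dataclasses import dataclass
from enum import Enum


class Target(Enum):
    ON_DEVICE = "on_device"        # small distilled model running locally
    CONFIDENTIAL_CLOUD = "cloud"   # larger model behind confidential compute


@dataclass
class Query:
    text: str
    needs_long_context: bool = False  # hypothetical flag, not from the report


# Hypothetical budget: any real threshold is not public.
MAX_LOCAL_WORDS = 200


def route(query: Query, max_local_words: int = MAX_LOCAL_WORDS) -> Target:
    """Pick where a query runs, preferring the device when the query fits."""
    too_long = len(query.text.split()) > max_local_words
    if too_long or query.needs_long_context:
        return Target.CONFIDENTIAL_CLOUD
    return Target.ON_DEVICE


print(route(Query("set a timer for ten minutes")))
```

A production router would presumably weigh more than length (model capability, battery, latency), but the shape is the same: the device is the default and the cloud is the fallback.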

The broken promise

One quiet shift from the 2024 rollout: Apple promised then that anything leaving your iPhone would run on Apple silicon inside Private Cloud Compute. It couldn't get the full Gemini running there, so those queries now sit in Google Cloud. The Private Cloud Compute name is staying anyway. [The Information]

The unique take

This is the first major crack in Apple's vertical-integration AI narrative. The company that spent years building custom silicon and marketing privacy as a differentiator is now routing its most sensitive user queries — voice commands, personal context — through a competitor's cloud running Nvidia GPUs. The "Private Cloud Compute" label becomes branding, not architecture.

Apple's reliance on Google for both the model (Gemini) and the compute (Google Cloud) creates an awkward dependency. If Google changes pricing, terms, or model access, Apple has limited leverage. The startup acquisition hunt suggests Apple knows this and is trying to build internal model-shrinking capability, but that takes time.

What to watch

Watch for WWDC 2026 (next month), where Apple is expected to detail the Siri upgrade. Key questions: whether Apple discloses the cloud provider arrangement, and whether it announces an acquisition of an on-device-AI startup like Liquid AI to reduce its dependency on Google.

Source: gentic.news

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

AI Analysis

This is a structural retreat from Apple's vertical-integration strategy. The company spent years positioning its custom silicon and on-device processing as privacy differentiators. Now it's outsourcing both the model (Gemini) and the compute (Google Cloud) to competitors. The Private Cloud Compute branding remaining despite the switch suggests marketing inertia over technical reality.

Apple's startup acquisition hunt, including reportedly looking at Liquid AI, indicates it wants to build internal capability to shrink models without Google's help. But model distillation is a fast-moving field; Apple is playing catch-up. The dependency on Google is particularly awkward given Apple's existing antitrust tensions with Google over search revenue.

Nvidia's confidential-compute tech being used in Google Cloud is notable: it suggests Apple needed hardware-level isolation guarantees that Google's standard infrastructure couldn't provide. This is a win for Nvidia's enterprise security pitch and a signal that confidential computing is becoming table stakes for cloud AI inference.