Apple previewed a new Siri in camera with visual intelligence, per a tweet from @mweinbach. The feature lets users point the iPhone camera at objects for AI-driven identification and actions, marking Apple’s most direct entry into visual AI search.
Key facts
- Feature teased via @mweinbach screenshot on X
- Apple has not disclosed supported iPhone models
- Google Lens processes over 10 billion visual searches monthly
- No release date or beta announced
- Likely leverages Apple Neural Engine for on-device inference
A screenshot posted by @mweinbach on X shows a redesigned Siri interface overlaid on the iPhone camera viewfinder. The text reads “New Siri in camera with visual intelligence,” suggesting Apple is integrating its on-device AI model directly into the camera experience. According to @mweinbach
The feature appears to let users point the camera at objects—such as plants, landmarks, or products—to trigger AI-powered identification and contextual actions, similar to Google Lens or ChatGPT’s vision mode. Apple has not disclosed which iPhone models will support the feature, a release window, or whether it runs entirely on-device or requires a network connection.
Why This Matters
Apple’s move positions it to compete directly with Google Lens, which has over 10 billion monthly visual searches, and ChatGPT’s vision capabilities, which OpenAI added in late 2024. Unlike those services, Apple can leverage its Neural Engine and on-device AI stack for privacy-preserving inference, a key differentiator the company has emphasized in prior product launches.
The timing aligns with Apple’s broader AI push. At WWDC 2025, the company announced on-device large language models for Siri, including summarization and app actions. Visual intelligence extends that to real-time camera input, a modality Google and OpenAI have already productized. Apple’s challenge will be matching the breadth of Google’s knowledge graph while maintaining its privacy stance.
What’s Missing
Apple has not confirmed whether the feature will be available on current iPhone 17 and 18 models or require the rumored iPhone 19’s upgraded Neural Engine. Pricing, regional availability, and supported languages also remain undisclosed. The screenshot does not indicate whether third-party app integrations (e.g., identifying a product and opening Amazon) will be supported.
Competitive Landscape
Google Lens, launched in 2017, is embedded in Google Photos, Chrome, and the Google app, processing over 10 billion queries monthly as of 2024. ChatGPT’s vision mode, added in GPT-4o, handles real-time camera input with conversational follow-ups. Apple’s visual intelligence could differentiate by processing entirely on-device, avoiding the latency and privacy trade-offs of cloud-based alternatives. However, Apple has not confirmed whether the feature will support offline inference.
What to watch
Watch for Apple’s WWDC 2026 keynote in June, where the company typically unveils major software features. If visual intelligence is tied to a new iPhone 19 model, expect a September launch alongside iOS 20. Also track whether Apple opens the feature to third-party developers via an API.









