mobile computing
30 articles about mobile computing in AI news
Qualcomm NPU Shows 6-8x OCR Speed-Up Over CPU in Mobile Workload
A benchmark shows Qualcomm's dedicated NPU processing OCR workloads 6-8 times faster than the device's CPU. This highlights the growing efficiency gap for AI tasks on mobile silicon.
Apple's M5 Pro and Max: Fusion Architecture Redefines AI Computing on Silicon
Apple unveils M5 Pro and M5 Max chips with groundbreaking Fusion Architecture, merging two 3nm dies into a single SoC. The chips deliver up to 30% faster CPU performance and over 4x peak GPU compute for AI workloads compared to previous generations.
Dell XPS 14 with Core Ultra X7 Outlasts Snapdragon X Elite in Battery Test
A new battery test shows the Dell XPS 14 with Intel Core Ultra X7 lasts 11.7 hours, beating the 11.3 hours of Microsoft's ARM-based Surface Laptop 15 with Snapdragon X Elite. This is a significant win for Intel's latest chip in the critical mobile performance metric.
Edge AI Breakthrough: Qwen3.5 2B Runs Locally on iPhone 17 Pro, Redefining On-Device Intelligence
Alibaba's Qwen3.5 2B model now runs locally on iPhone 17 Pro devices, marking a significant breakthrough in edge AI. This development enables sophisticated language processing without cloud dependency, potentially transforming mobile AI applications and user privacy paradigms.
Google's Nano-Banana 2: The Edge AI Revolution That Puts 4K Image Generation in Your Pocket
Google has officially unveiled Nano-Banana 2, a specialized AI model delivering sub-second 4K image synthesis with advanced subject consistency entirely on-device. This breakthrough represents a strategic pivot toward edge computing, challenging the cloud-centric paradigm of current generative AI.
Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market
Qualcomm CEO revealed dedicated CPU for agentic AI, custom silicon deal with hyperscaler shipping Dec 2026, and agentic smartphones. Pivot challenges GPU-centric AI infrastructure consensus.
AI Frontier Pricing Widens Global Access Gap, Analysis Shows
A viral analysis highlights that Anthropic and OpenAI's $200/mo plans cost 15% of median monthly income in Nigeria vs 0.3% in the US, raising concerns about global AI access inequality.
Alibaba Opens Qwen AI App to External Partners via China Eastern Deal
Alibaba has opened its Qwen consumer AI app to its first external partner, China Eastern Airlines. Users can now manage the entire flight booking process through a single chat interface, expanding the app's real-world agentic capabilities beyond Alibaba's ecosystem.
John Ternus Takes Over Apple AI Leadership as Era Ends
Apple's AI leadership transitions to John Ternus, marking a new era following Steve Jobs' vision and Tim Cook's operational success. This comes as Apple accelerates its generative AI push with Apple Intelligence.
Bentley's 'Phygital' Future
Bentley Motors is pioneering a 'phygital' design approach, merging physical and digital processes. The automaker is deploying real-time 3D visualization and AI-assisted tools to enable faster, more collaborative, and data-informed design decisions for its luxury vehicles.
KIMM's AI-Powered Wheels Adjust Stiffness in Real-Time for Terrain
Researchers at KIMM created wheels that autonomously adjust their stiffness based on terrain. On smooth ground, they stay rigid for efficiency; on rough terrain, they soften and deform to conform to obstacles.
Meta's 'Model as Computer' Paper Explores LLM OS-Level Integration
A new research paper from Meta explores a paradigm where the language model acts as the computer's kernel, directly managing processes and memory. This could fundamentally change how AI agents are architected and interact with systems.
Demis Hassabis: AI Tools Enable Billion-Dollar Startups by 'Kids'
Demis Hassabis stated that current AI tools are so powerful that young entrepreneurs could build multi-billion dollar businesses by discovering novel applications, as labs focus on model development, not exhausting use cases.
X Post Reveals Audible Quality Differences in GPU vs. NPU AI Inference
A developer demonstrated audible quality differences in AI text-to-speech output when run on GPU, CPU, and NPU hardware, highlighting a key efficiency vs. fidelity trade-off for on-device AI.
Google's AI Infrastructure Strategy: What Retail Leaders Should Watch in 2026
Google's evolving AI infrastructure and compute strategy, including data center investments and model compression techniques, will directly impact how retail brands deploy and scale AI applications by 2026. The company's focus on efficiency and real-time capabilities signals a shift toward more accessible, powerful retail AI tools.
REWE Expands Pick&Go Cashierless Store Test to Seventh Location in Hanover
German retailer REWE has launched its seventh Pick&Go cashierless convenience store test location in Hanover. This expansion signals continued investment in frictionless retail technology, a space where AI-powered computer vision and sensor fusion are critical.
CanViT: First Active-Vision Foundation Model Hits 45.9% mIoU on ADE20K with Sequential Glimpses
Researchers introduce CanViT, the first task- and policy-agnostic Active-Vision Foundation Model (AVFM). It achieves 38.5% mIoU on ADE20K segmentation with a single low-resolution glimpse, outperforming prior active models while using 19.5x fewer FLOPs.
OpenAI Winds Down Sora App, Reallocates Compute to Next-Gen 'Spud' LLM Development
OpenAI has completed initial development of its next major AI model, codenamed 'Spud,' and is winding down the Sora video app, which was reportedly a compute resource drain. The move reallocates critical infrastructure toward core LLM competition with Anthropic and Google.
From Warehouses to Luxury Rentals: AI's Impact on Commercial Real Estate Is Accelerating
AI is transforming commercial real estate (CRE) across the value chain, from logistics optimization in warehouses to dynamic pricing and tenant experience in luxury retail spaces. This signals a shift from pilot projects to production-scale implementation.
arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference
A comprehensive arXiv review categorizes five principal KV cache optimization techniques—eviction, compression, hybrid memory, novel attention, and combinations—to address the linear memory scaling bottleneck in long-context LLM inference. The analysis finds no single dominant solution, with optimal strategy depending on context length, hardware, and workload.
Claude Desktop Gains 'Use My Computer' Feature for Direct App and Browser Control
Anthropic's Claude Desktop app now includes an experimental 'Use My Computer' feature that allows Claude AI to directly interact with local applications, browsers, and files when explicitly enabled by users.
OpenClaw AI Agent Adds Real-Time Vision to Meta Ray-Ban Smart Glasses via Gemini Live API
An open-source project enables Meta Ray-Ban smart glasses to function as a real-time AI assistant. It streams the glasses' camera feed (~1fps) to Gemini Live for visual context, then delegates actions via the OpenClaw agent framework.
Stanford's OpenJarvis: The Open-Source Framework Bringing Personal AI Agents to Your Device
Stanford researchers have released OpenJarvis, an open-source framework for building personal AI agents that operate entirely on-device. This local-first approach prioritizes privacy and autonomy while providing tools, memory, and learning capabilities.
Goldman Sachs Predicts AI Agents Will Dominate Software Profits and Expand Market Value
Goldman Sachs Research forecasts AI agents will capture a significant portion of software industry profits while simultaneously expanding the total market value. This dual impact suggests AI will reshape economic dynamics rather than simply replace existing revenue streams.
Gartner's Framework for Evaluating and Implementing AI Agents in Business
Gartner outlines a three-step process for organizations to maximize AI agent value: identify candidate agents, evaluate against business needs, and implement governance. This structured approach helps prioritize use cases with measurable business impact.
Tencent's Penguin-VL: A New Approach to Compact Multimodal AI
Tencent has launched Penguin-VL, a compact vision-language model that replaces traditional CLIP/SigLIP pretraining with an LLM-initialized vision encoder. The model achieves strong multimodal reasoning capabilities with just 2B and 8B parameter versions, potentially changing how smaller AI systems process images and text.
Qualcomm's Arduino Ventuno Q: A Powerhouse Single-Board Computer for the Next Wave of Physical AI
Qualcomm and Arduino have launched the Ventuno Q, a high-performance single-board computer designed specifically for robotics and physical AI applications. Powered by the Dragonwing IQ8 processor with a dedicated NPU and paired with a low-latency microcontroller, it enables complex, offline AI tasks like object tracking and gesture recognition for systems that interact with the real world.
The Desktop AI Revolution: Seven Powerful Models That Run Offline on Your Laptop
A new wave of specialized AI models now runs locally on consumer laptops, offering coding, vision, and automation without subscriptions or data sharing. These tools promise greater privacy, customization, and independence from cloud services.
LeCun's NYU Team Unveils Breakthrough in Efficient Transformer Architecture
Yann LeCun and NYU collaborators have published new research offering significant improvements to Transformer efficiency. The work addresses critical computational bottlenecks in current architectures while maintaining performance.
From Agency Exit to AI Innovation: Tech Founder Bets on SMS-Based AI Assistant for ICP Ecosystem
After selling his digital agency for nine figures, a tech entrepreneur is launching an AI executive assistant that operates entirely via SMS, targeting the Internet Computer Protocol ecosystem with a frictionless, accessible approach to AI productivity.