smartphones
30 articles about smartphones in AI news
The Next Frontier: AI Agents Take Direct Control of Smartphones and Apps
AI systems are gaining the ability to directly control smartphones and applications, moving beyond simple assistants to become autonomous digital agents. This breakthrough promises to revolutionize how we interact with technology but raises significant questions about privacy, security, and the future of human-computer interaction.
Mobile AI Revolution: Full LLMs Now Run Natively on Smartphones
A new React Native binding called llama rn enables developers to run full large language models like Llama, Qwen, and Mistral directly on mobile devices with just 4GB RAM. The framework leverages Metal and NPU acceleration for performance surpassing cloud APIs while maintaining complete offline functionality.
Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market
Qualcomm CEO revealed dedicated CPU for agentic AI, custom silicon deal with hyperscaler shipping Dec 2026, and agentic smartphones. Pivot challenges GPU-centric AI infrastructure consensus.
Swiss AI Lab Ships Pixel-Based Agents That Control Real Phones
A Swiss AI lab has developed agents that interact with smartphones by processing screen pixels and simulating touch, eliminating the need for app-specific APIs or integrations. This approach mirrors human interaction and could generalize across any app interface.
ModelBest Hits $1B+ Valuation for On-Device Foundation Models
ModelBest, a Chinese developer of on-device AI foundation models, raised several hundred million RMB, reaching a valuation exceeding $1 billion. The funding will accelerate its push to deploy efficient models directly on smartphones and IoT devices.
Emergent's Mobile App Launch: Building Native Apps Directly from Your Smartphone
Emergent has launched a mobile app that enables users to build and publish full iOS and Android applications directly from their smartphones, potentially democratizing mobile app development.
Huawei Hits 1.5µm Bond Pitch in Kirin 2026 Chips, Beats TSMC
Huawei's 2026 Kirin chips achieve 1.5µm hybrid bonding pitch, 16-36x denser than TSMC. Next year targets 1µm.
Apple's 'Attention to Mamba' Paper Proposes Cross-Architecture Transfer
Apple researchers introduced a two-stage recipe for transferring capabilities from Transformer models to Mamba-based architectures. This could enable efficient models that retain the performance of larger, attention-based predecessors.
HONOR's Lightning Robot Runs 21km in 50:26, Beating Human World Record
At Beijing's 2026 humanoid robot half-marathon, HONOR's 'Lightning' robot finished the 21 km course in 50 minutes and 26 seconds. This time surpasses the current human men's world record of 57:20, marking a massive leap from last year's winning robot time of over 2 hours 40 minutes.
GPT-5.4 Spends 3 Hours Optimizing Embedding Model for Qualcomm NPU
An X user observed GPT-5.4 working for three hours to optimize an embedding model specifically for the Qualcomm NPU. This suggests a practical application of advanced AI for hardware-specific model tuning.
Nvidia to Ship 1.19 Exabytes of HBM in 2026, Apple iPhone Memory 2x Larger
An analysis projects Nvidia will ship ~1.19 exabytes of HBM memory in 2026 for AI infrastructure, while Apple will ship ~2.4 exabytes of LPDDR5 for iPhones, putting AI's massive hardware scale in consumer market perspective.
Stanford 2026 AI Index: Models Beat Human Baselines, U.S.-China Gap Narrows
The 423-page Stanford 2026 AI Index Report reveals frontier AI models now match or exceed human baselines on hard coding, science, and math tests. Global AI adoption has hit ~53% in just three years, while the U.S.-China capability gap shrinks.
Demis Hassabis: AI Tools Enable Billion-Dollar Startups by 'Kids'
Demis Hassabis stated that current AI tools are so powerful that young entrepreneurs could build multi-billion dollar businesses by discovering novel applications, as labs focus on model development, not exhausting use cases.
Google's Gemma 4B Model Runs on Nintendo Switch at 1.5 Tokens/Second
A developer successfully ran Google's 4-billion parameter Gemma language model on a Nintendo Switch, achieving 1.5 tokens/second inference. This demonstrates the increasing feasibility of running small LLMs on consumer-grade edge hardware.
AI Model Decodes Silent Speech from Phone Sensors, No Microphone Needed
A new AI model can reconstruct speech by analyzing imperceptible facial movements captured by smartphone sensors, effectively enabling silent speech recognition without a microphone. This represents a significant leap in sensor fusion and on-device AI.
Study: 10 Minutes with ChatGPT Cuts Problem-Solving Rate from 73% to 57%
Researchers from Carnegie Mellon, Oxford, MIT, and UCLA found that just 10 minutes of ChatGPT use reduced participants' independent problem-solving success from 73% to 57%. The effect was strongest in users who sought direct answers, whose performance fell below their original baseline.
Efficient Universal Perception Encoder (EUPE) Family Challenges DINOv2
Researchers introduced the Efficient Universal Perception Encoder (EUPE), a family of compact vision models that achieve performance rivaling the larger DINOv2. This could enable high-quality visual understanding on resource-constrained devices.
Ethan Mollick: Gemma 4 Impressive On-Device, But Agentic Workflows Doubted
Wharton professor Ethan Mollick finds Google's Gemma 4 powerful for on-device use but is skeptical about its ability to execute true agentic workflows, citing limitations in judgment and self-correction.
Neuromorphic Computing Patents Surge 401% in 2025, Hits 596 by 2026
Patent filings for neuromorphic computing—hardware that mimics the brain's architecture—surged 401% in 2025, reaching 596 by early 2026. This indicates the technology is transitioning from lab prototypes to commercial products.
Developer Ranks NPU Model Compilation Ease: Apple 1st, AMD Last
Developer @mweinbach ranked the ease of using AI coding agents to compile ML models for NPUs. Apple's ecosystem was rated easiest, while AMD's tooling was ranked most difficult.
X Post Reveals Audible Quality Differences in GPU vs. NPU AI Inference
A developer demonstrated audible quality differences in AI text-to-speech output when run on GPU, CPU, and NPU hardware, highlighting a key efficiency vs. fidelity trade-off for on-device AI.
Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal Capabilities
Google has released the Gemma 4 family of open-weight models, derived from Gemini 3 technology. The four models, ranging from 2B to 31B parameters and including a Mixture-of-Experts variant, are available under a permissive Apache 2.0 license and feature multimodal processing.
Apple Hires Former Google Exec Lilian Rincon as VP of AI Product Marketing
Apple has appointed Lilian Rincon, a former Google executive, as its Vice President of Product Marketing for Artificial Intelligence. This is a key strategic hire as Apple intensifies its push into consumer-facing AI products.
Niantic's Pokémon GO Dataset of 30B Images Now Powers Centimeter-Precise Robotics Vision
Niantic's Lightship VPS, trained on 30 billion images from Pokémon GO players, now enables delivery robots to navigate with centimeter precision. The dataset represents the largest real-world visual positioning system ever created.
Memory Market Squeeze Threatens iPhone Price Hikes as AI Demands Strain Supply
A global RAM shortage and price increases could force Apple to raise iPhone prices by up to $250, according to industry analysis. The tech giant is reportedly unwilling to absorb the cost, passing it directly to consumers amid surging memory demands from AI applications.
Stanford's OpenJarvis: The Open-Source Framework Bringing Personal AI Agents to Your Device
Stanford researchers have released OpenJarvis, an open-source framework for building personal AI agents that operate entirely on-device. This local-first approach prioritizes privacy and autonomy while providing tools, memory, and learning capabilities.
Nvidia's NemoClaw: The Open-Source Platform Poised to Democratize AI Agent Development
Nvidia is preparing to launch NemoClaw, an open-source platform designed specifically for building and deploying AI agents. This move aims to accelerate the development of autonomous systems that can perform complex, multi-step tasks.
Fully Autonomous Humanoid Robots: The Next Leap Beyond Teleoperation
A breakthrough in robotics demonstrates fully autonomous humanoid capabilities without teleoperation, signaling rapid progress toward household robots by 2027.
WiFi Signals Now Track Human Movement Through Walls: The Privacy Revolution You Didn't See Coming
A groundbreaking open-source project called WiFi-DensePose uses ordinary WiFi signals to track human movement through walls without cameras or special equipment. This technology transforms standard home routers into motion sensors capable of detecting poses and activities.
AI's Insatiable Appetite: Nvidia's Rubin Chip Demands 288GB Memory, Sparking Global Shortage Crisis
Nvidia's upcoming Rubin AI chip requires 288GB of RAM—800% more than top desktop computers—creating unprecedented memory demand. Massive purchases by OpenAI and Alphabet have depleted supply, driving DDR4 prices up 2352% and causing a global memory chip shortage.