Qualcomm

30 articles about Qualcomm in AI news

Qualcomm Taps TSMC for 3nm/2nm Dragonfly C100 CPUs, AI300 Accelerators

TSMC to fab Qualcomm's Dragonfly C100 and AI300 chips on 3nm/2nm nodes. The move challenges NVIDIA in data center AI, but timelines and performance remain undisclosed.

Jun 26, 2026 · 100% relevant

Qualcomm in Talks to Acquire Modular for $4B, Landing Lattner

Qualcomm nears $4B acquisition of Modular, Chris Lattner's AI infra startup. Deal targets inference software for edge and data center AI chips.

Jun 22, 2026 · 82% relevant

Qualcomm Launches AI Data Center Program With Hyperscaler Customer

Qualcomm launched an AI data center program with a major hyperscaler customer, targeting inference workloads. Financial terms and the partner's identity were not disclosed.

Jun 17, 2026 · 85% relevant

Qualcomm Ships Hyperscaler Custom Silicon by December 2026

Qualcomm is developing custom silicon for an unnamed hyperscaler, with shipments expected December 2026, marking its most concrete data-center comeback move.

May 1, 2026 · 76% relevant

Qualcomm Builds Dedicated CPU for Agentic AI, Enters Hyperscale Silicon Market

Qualcomm's CEO revealed a dedicated CPU for agentic AI, a custom silicon deal with a hyperscaler shipping in December 2026, and agentic smartphones. The pivot challenges the GPU-centric consensus on AI infrastructure.

May 1, 2026 · 100% relevant

GPT-5.4 Spends 3 Hours Optimizing Embedding Model for Qualcomm NPU

An X user observed GPT-5.4 working for three hours to optimize an embedding model specifically for the Qualcomm NPU. This suggests a practical application of advanced AI for hardware-specific model tuning.

Apr 15, 2026 · 85% relevant

ASUS Zenbook A16 Launches with Qualcomm X2 Elite Extreme AI Chip

ASUS announced the Zenbook A16 laptop featuring the Qualcomm Snapdragon X2 Elite Extreme processor. This marks a significant push for premium Windows on Arm laptops optimized for local AI tasks.

Apr 13, 2026 · 87% relevant

Snap & Qualcomm Partner on Snapdragon XR for Future Spectacles

Snap has entered a strategic agreement with Qualcomm to power future generations of its Spectacles AR glasses with Snapdragon XR platforms. This hardware partnership is critical for Snap's long-term bet on AI-driven augmented reality.

Apr 10, 2026 · 85% relevant

Qualcomm X2 Elite Matches Apple M5 in Efficiency Test

In a mixed-use laptop test simulating office work, Qualcomm's Snapdragon X2 Elite system-on-chip matched the power efficiency of Apple's latest M5 chip. This marks a significant milestone for Windows on Arm in its competition with Apple Silicon.

Apr 7, 2026 · 75% relevant

Qualcomm NPU Shows 6-8x OCR Speed-Up Over CPU in Mobile Workload

A benchmark shows Qualcomm's dedicated NPU processing OCR workloads 6-8 times faster than the device's CPU. This highlights the growing efficiency gap for AI tasks on mobile silicon.

Apr 5, 2026 · 85% relevant

Qualcomm's Arduino Ventuno Q: A Powerhouse Single-Board Computer for the Next Wave of Physical AI

Qualcomm and Arduino have launched the Ventuno Q, a high-performance single-board computer designed specifically for robotics and physical AI applications. Powered by the Dragonwing IQ8 processor with a dedicated NPU and paired with a low-latency microcontroller, it enables complex, offline AI tasks like object tracking and gesture recognition for systems that interact with the real world.

Mar 9, 2026 · 80% relevant

Nvidia Unveils New Windows SoC, Targeting AI PCs

Nvidia announced a Windows SoC for AI PCs, per @mweinbach. Chip targets on-device inference, competing with Qualcomm and Intel.

Jun 1, 2026 · 100% relevant

MCP's Enterprise Auth Standard Goes Stable: Okta Provisions 2,000 Ramp Employees in One Policy

Anthropic and Okta launched Enterprise-Managed Authorization (EMA) for MCP on June 18, 2026, provisioning Ramp's 2,000 employees with zero per-user OAuth steps. Seven MCP servers — Asana, Atlassian, Canva, Figma, Granola, Linear, Supabase — support the standard at launch; VS Code and Azure AD users

Jun 19, 2026 · 85% relevant

llada.cpp Cuts LLaDA-8B Latency 17-42x on Mobile NPU

llada.cpp, the first NPU-aware dLLM inference framework, cuts LLaDA-8B latency 17-42x on smartphones, enabling real-time on-device generation.

Jun 15, 2026 · 84% relevant

SemiAnalysis Calls Jensen Computex Keynote 'F Tier' Over No AI DC News

SemiAnalysis rated Jensen Huang's Computex keynote 'F Tier' for containing no AI data center news, and revealed a delayed NVIDIA Arm chip with broken video output.

Jun 1, 2026 · 82% relevant

Nvidia N1X Arm Laptop Chip Nears Reveal at Computex

Nvidia, Microsoft, Arm tease N1X Arm laptop chip debut at Computex. Nvidia enters Windows-on-Arm without owning the architecture it tried to buy.

May 30, 2026 · 85% relevant

WiFi routers can identify individuals with near-perfect accuracy, KIT shows

KIT researchers show WiFi routers can identify individuals with near-perfect accuracy via beamforming feedback, tested on 197 subjects.

May 24, 2026 · 75% relevant

Snapdragon X2 Elite Beats Intel Arrow Lake for AI Coding Agents

Snapdragon X2 Elite beat Intel Arrow Lake for Windows AI coding agents. The CPU, not inference speed, was the bottleneck limiting performance, per @mweinbach.

May 11, 2026 · 92% relevant

Horizon Launches Full-Stack AI Platform for Autonomous Driving

Horizon Robotics launched a trio of products—a new chip, an open-source OS, and a smart driving system—aiming to push cars closer to becoming autonomous AI agents. The platform integrates hardware and software for enhanced perception and decision-making.

Apr 23, 2026 · 82% relevant

John Ternus Takes Over Apple AI Leadership as Era Ends

Apple's AI leadership transitions to John Ternus, marking a new era following Steve Jobs' vision and Tim Cook's operational success. This comes as Apple accelerates its generative AI push with Apple Intelligence.

Apr 20, 2026 · 91% relevant

Prefill-as-a-Service Paper Claims to Decouple LLM Inference Bottleneck

A research paper proposes a 'Prefill-as-a-Service' architecture to separate the heavy prefill computation from the lighter decoding phase in LLM inference. This could enable new deployment models where resource-constrained devices handle only the decoding step.

Apr 20, 2026 · 85% relevant

MLX-VLM Adds Continuous Batching, OpenAI API, and Vision Cache for Apple Silicon

The next release of MLX-VLM will introduce continuous batching, an OpenAI-compatible API, and vision feature caching for multimodal models running locally on Apple Silicon. These optimizations promise up to 228x speedups on cache hits for models like Gemma4.

Apr 16, 2026 · 95% relevant

Meta Expands Broadcom Partnership for Next-Gen AI Infrastructure

Meta is expanding its partnership with semiconductor giant Broadcom to co-develop its next-generation AI infrastructure. This move signals a continued, long-term commitment to custom silicon for AI training and inference.

Apr 14, 2026 · 85% relevant

Microsoft Raises Surface PC Prices Amid AI Copilot+ PC Push

Microsoft has implemented substantial price increases for its entire Surface PC portfolio. This move likely reflects the higher component and development costs associated with integrating next-generation AI capabilities into the Copilot+ PC platform.

Apr 13, 2026 · 75% relevant

MLX Enables Local Grounded Reasoning for Satellite, Security, Robotics AI

Apple's MLX framework is enabling 'local grounded reasoning' for AI applications in satellite imagery, security systems, and robotics, moving complex tasks from the cloud to on-device processing.

Apr 11, 2026 · 85% relevant

Google's Gemma 4B Model Runs on Nintendo Switch at 1.5 Tokens/Second

A developer successfully ran Google's 4-billion parameter Gemma language model on a Nintendo Switch, achieving 1.5 tokens/second inference. This demonstrates the increasing feasibility of running small LLMs on consumer-grade edge hardware.

Apr 8, 2026 · 89% relevant

Dell XPS 14 with Core Ultra X7 Outlasts Snapdragon X Elite in Battery Test

A new battery test shows the Dell XPS 14 with Intel Core Ultra X7 lasting 11.7 hours, ahead of the 11.3 hours of Microsoft's Arm-based Surface Laptop 15 with Snapdragon X Elite. The 0.4-hour margin is a win for Intel's latest chip on battery life, a critical metric for mobile PCs.

Apr 7, 2026 · 85% relevant

ModelBest Hits $1B+ Valuation for On-Device Foundation Models

ModelBest, a Chinese developer of on-device AI foundation models, raised several hundred million RMB, reaching a valuation exceeding $1 billion. The funding will accelerate its push to deploy efficient models directly on smartphones and IoT devices.

Apr 7, 2026 · 95% relevant

Developer Ranks NPU Model Compilation Ease: Apple 1st, AMD Last

Developer @mweinbach ranked the ease of using AI coding agents to compile ML models for NPUs. Apple's ecosystem was rated easiest, while AMD's tooling was ranked most difficult.

Apr 5, 2026 · 75% relevant

X Post Reveals Audible Quality Differences in GPU vs. NPU AI Inference

A developer demonstrated audible quality differences in AI text-to-speech output when run on GPU, CPU, and NPU hardware, highlighting a key efficiency vs. fidelity trade-off for on-device AI.

Apr 5, 2026 · 75% relevant
