edge ai

30 articles about edge ai in AI news

Edge AI Breakthrough: Qwen3.5 2B Runs Locally on iPhone 17 Pro, Redefining On-Device Intelligence

Alibaba's Qwen3.5 2B model now runs locally on iPhone 17 Pro devices, marking a significant breakthrough in edge AI. This development enables sophisticated language processing without cloud dependency, potentially transforming mobile AI applications and user privacy paradigms.

Mar 2, 202685% relevant

Edge AI for Loss Prevention: Adaptive Pose-Based Detection for Luxury Retail Security

A new periodic adaptation framework enables edge devices to autonomously detect shoplifting behaviors from pose data, offering a scalable, privacy-preserving solution for luxury retail security with 91.6% outperformance over static models.

Mar 6, 202685% relevant

Google's Nano-Banana 2: The Edge AI Revolution That Puts 4K Image Generation in Your Pocket

Google has officially unveiled Nano-Banana 2, a specialized AI model delivering sub-second 4K image synthesis with advanced subject consistency entirely on-device. This breakthrough represents a strategic pivot toward edge computing, challenging the cloud-centric paradigm of current generative AI.

Feb 26, 202675% relevant

Google's TensorFlow 2.21 Revolutionizes Edge AI with Unified LiteRT Framework

Google has launched TensorFlow 2.21, marking LiteRT's transition to a production-ready universal on-device inference framework. This major update delivers faster GPU performance, new NPU acceleration, and seamless PyTorch edge deployment, effectively replacing TensorFlow Lite for mobile and edge applications.

Mar 7, 202675% relevant

NSA Uses Anthropic's Claude Mythos Despite 'Supply Chain Risk' Label

The National Security Agency is using Anthropic's Claude Mythos Preview for its capabilities, despite having labeled Anthropic itself as a potential supply chain risk. This highlights the tension between security concerns and the operational need for cutting-edge AI.

Apr 19, 202697% relevant

AI Developer Tools Shift to Mac-First, Excluding Windows/Linux Users

AI developers report a growing trend of cutting-edge AI tools being released exclusively or primarily for macOS, making it difficult for Windows and Linux users to access the latest innovations. This platform shift creates a hardware-based barrier to entry in the AI development ecosystem.

Apr 17, 202675% relevant

NVIDIA Advances AI Robotics with Simulation-First Training, Isaac & Jetson

NVIDIA showcased AI robotics advances using foundation models and synthetic environments for training, enabling scalable deployment in real-world sectors like agriculture and solar. Key platforms are the Isaac simulator and Jetson edge AI hardware.

Apr 8, 202685% relevant

EdgeConneX Claims 10GW Pipeline, Australia Gigawatt Campus Proposed

EdgeConneX claims 10GW pipeline; gigawatt behind-the-meter campus proposed in Australia's NT. Chevron signs 20-year Microsoft AI power deal.

Jul 1, 202694% relevant

DeepSeek Raises $7B, Ends No-Funding Pledge, Doubles Staff

DeepSeek raised $7B, abandoning its no-funding pledge, to double headcount and launch a coding agent team competing with Claude Code.

Jun 27, 2026100% relevant

Apple Ditches Apple Silicon Pledge, Routes AI Queries to Google Cloud

Apple routes AI queries to Google Cloud, breaking 2024 Apple silicon pledge. Distilled Gemini runs locally; heavier queries use Nvidia tech in Google Cloud.

May 31, 202694% relevant

Neo4j's agent-memory: Open-source unified memory for AI agents via knowledge graphs

Neo4j releases agent-memory, an open-source unified memory layer for AI agents using knowledge graphs, enabling persistent structured recall.

May 19, 202675% relevant

Omar Sarayra Builds LLM Artifact Generator for AI Knowledge Discovery

Omar Sarayra created a system that transforms dense LLM knowledge bases into consumable visual artifacts, like a pulse on HN AI discussions. He argues this format could become a new medium for staying current.

Apr 19, 202687% relevant

Andrej Karpathy's LLM-Wiki Framework Solves AI Amnesia with Persistent Knowledge

Andrej Karpathy published a two-page framework called LLM-Wiki that transforms how AI systems handle accumulated knowledge. Instead of retrieving from raw documents each time, the AI compiles sources into its own structured wiki that persists across sessions.

Apr 19, 202685% relevant

Claude AI Prompts Claim to Build Hedge Fund-Level Trading Strategies

A prompt collection claims to enable Claude to build and backtest hedge fund-level trading strategies. The prompts aim to automate quantitative analysis tasks typically performed by high-paid analysts.

Apr 14, 202687% relevant

FORGE Benchmark Reveals Domain Knowledge

Researchers introduced FORGE, a multimodal dataset with 2D/3D data and fine-grained annotations for manufacturing. Evaluating 18 MLLMs revealed domain knowledge, not visual grounding, is the key bottleneck, with fine-tuning offering a clear path forward.

Apr 10, 202672% relevant

MedGemma 1.5 Technical Report Released, Details Multimodal Medical AI

Google DeepMind has published the technical report for MedGemma 1.5, detailing the architecture and capabilities of its open-source, multimodal medical AI model. This follows the initial Med-PaLM 2 release and represents a significant step in making specialized medical AI more accessible.

Apr 9, 202685% relevant

Google Launches AI Edge Eloquent: Free, Offline-First Dictation App on iOS

Google has quietly launched AI Edge Eloquent, a free, subscription-less dictation app for iOS. It uses a Gemma-based speech recognition model to process audio locally, removing filler words and self-corrections to produce cleaner text.

Apr 6, 202697% relevant

GitNexus Open Sources Codebase Knowledge Graph Engine for AI Agents

GitNexus, an open-source knowledge graph engine, autonomously indexes codebases to map dependencies and execution flows. It integrates with Claude Code, Cursor, and Windsurf via MCP to give AI agents architectural awareness, preventing breaking changes.

Apr 4, 202699% relevant

Zero-Shot Cross-Domain Knowledge Distillation: A YouTube-to-Music Case Study

Google researchers detail a case study transferring knowledge from YouTube's massive video recommender to a smaller music app, using zero-shot cross-domain distillation to boost ranking models without training a dedicated teacher. This offers a practical blueprint for improving low-traffic AI systems.

Apr 1, 202696% relevant

New Research Proposes FilterRAG and ML-FilterRAG to Defend Against Knowledge Poisoning Attacks in RAG Systems

Researchers propose two novel defense methods, FilterRAG and ML-FilterRAG, to mitigate 'PoisonedRAG' attacks where adversaries inject malicious texts into a knowledge source to manipulate an LLM's output. The defenses identify and filter adversarial content, maintaining performance close to clean RAG systems.

Mar 30, 202692% relevant

Microsoft's Satya Nadella Details Internal 'Lean for Knowledge Work' AI Initiative

Microsoft CEO Satya Nadella described the company's internal application of AI to streamline knowledge work, framing it as a 'Lean' manufacturing-style efficiency push for cognitive tasks. The initiative focuses on using AI to reduce process friction and improve productivity across internal operations.

Mar 29, 202685% relevant

Edge Computing in Retail 2026: Examples, Benefits, and a Guide

Shopify outlines the strategic shift toward edge computing in retail, detailing its benefits—real-time personalization, inventory management, and enhanced in-store experiences—and providing a practical implementation guide for 2026.

Mar 19, 202682% relevant

Future-Proof Your AI Search: Why Static Knowledge Bases Fail Luxury Retail

New research reveals AI retrieval benchmarks degrade over time as information changes. For luxury brands using AI for product recommendations and clienteling, this means static knowledge bases become stale, hurting customer experience and sales.

Mar 6, 202660% relevant

Anthropic's Stealth Education Revolution: Free AI Curriculum Democratizes Technical Knowledge

Anthropic has launched a comprehensive, completely free AI curriculum designed to make technical AI education accessible to everyone. The curriculum covers fundamentals to advanced topics without tuition, waitlists, or prerequisites, potentially reshaping how AI knowledge is distributed.

Mar 3, 202685% relevant

Multimodal Knowledge Graphs Unlock Next-Generation AI Training Data

Researchers have developed MMKG-RDS, a novel framework that synthesizes high-quality reasoning training data by mining multimodal knowledge graphs. The system addresses critical limitations in existing data synthesis methods and improves model reasoning accuracy by 9.2% with minimal training samples.

Mar 2, 202680% relevant

NullClaw: The 1MB AI Agent Revolutionizing Edge Computing

NullClaw, a fully autonomous AI agent written in Zig, runs on just 1MB RAM and 678KB binary size, enabling AI deployment on $5 hardware with <2ms startup times. This breakthrough eliminates traditional runtime bloat and opens new possibilities for edge computing.

Mar 1, 202695% relevant

Google's AI Edge Gallery Arrives on iPhone: A Privacy-First Revolution in On-Device Intelligence

Google AI Edge Gallery has launched on iOS, bringing true on-device function calling to iPhones for the first time. Powered by the compact 270M parameter FunctionGemma model, it enables natural voice commands to trigger phone actions like calendar events and flashlight toggles—completely offline.

Feb 28, 202675% relevant

Trump's AI Energy Summit: Tech Giants Pledge to Self-Generate Power Amid Grid Concerns

Former President Donald Trump is convening Amazon, Google, Meta, Microsoft, xAI, Oracle, and OpenAI at the White House to sign a 'Rate Payer Protection Pledge,' committing them to generate or purchase their own electricity for new AI data centers, signaling a major shift in how tech's energy demands are addressed.

Feb 25, 202685% relevant

GitNexus Revolutionizes Code Exploration: Browser-Based AI Transforms GitHub Repositories into Interactive Knowledge Graphs

A new tool called GitNexus transforms any GitHub repository into an interactive knowledge graph with AI chat capabilities, running entirely in the browser without backend infrastructure. This breakthrough enables developers to visualize and query complex codebases through intuitive graph interfaces and natural language conversations.

Feb 25, 202685% relevant

FAERec: A New Framework for Fusing LLM Knowledge with Collaborative Signals for Tail-Item Recommendations

A new paper introduces FAERec, a framework designed to improve recommendations for niche items by better fusing semantic knowledge from LLMs with collaborative filtering signals. It addresses structural inconsistencies between embedding spaces to enhance model accuracy.

Apr 7, 202688% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety