computer architecture

30 articles about computer architecture in AI news

AI Agents Get a Memory Upgrade: New Framework Treats Multi-Agent Memory as Computer Architecture

A new paper proposes treating multi-agent memory systems as a computer architecture problem, introducing a three-layer hierarchy and identifying critical protocol gaps. This approach could significantly improve reasoning, skills, and tool usage in collaborative AI systems.

Mar 13, 202685% relevant

How a 50-Year-Old Computer Science Concept Just Outperformed Anthropic's Claude Code

A small startup has outperformed Anthropic's flagship Claude Code using a novel architecture based on persistent memory systems. This breakthrough demonstrates how classic computer science principles can solve modern AI limitations in context retention and reasoning.

Feb 24, 202670% relevant

Instacart Acquires Computer Vision Firm Arpalus for Real-Time Grocery

Instacart acquired computer vision firm Arpalus to add real-time shelf intelligence for grocery retailers. The technology automates inventory monitoring, product placement, and pricing verification.

Jul 16, 202694% relevant

Anthropic Leases xAI's Colossus 1 After Mixed-Architecture Flaw Blocked

Anthropic leased xAI's 220K-GPU Colossus 1 after its mixed architecture failed to train Grok. Musk builds Blackwell-only Colossus 2 for training and IPO.

May 15, 2026100% relevant

Albertsons Launches AI Supply Chain Tool With Computer Vision

Albertsons launched a patent-pending AI supply chain tool using computer vision to reduce food waste and improve inventory across 2,200+ stores.

May 13, 2026100% relevant

NVIDIA, DOE Build 100K-GPU Supercomputer for Science

DOE and NVIDIA announced Solstice, a 100K-GPU Vera Rubin supercomputer delivering 5,000 exaflops, and Equinox with 10K Blackwell GPUs.

May 7, 202680% relevant

Large Memory Models: New Architecture Beyond RAG and Vector Search

Researchers with 160+ Nature and ICLR publications have built Large Memory Models (LMMs), a new architecture designed to emulate human memory processes, offering an alternative to RAG and vector search paradigms.

Apr 29, 202687% relevant

GPT-5.4 Launches with Computer Control API

OpenAI launched GPT-5.4, featuring a 'Computer Use' API that lets the model control a user's desktop. Despite improvements, it scores 78.5% on SWE-Bench, behind Claude 3.5 Sonnet's 81.2%.

Apr 18, 202677% relevant

Perplexity AI Launches 'Personal Computer' for Mac App Orchestration

Perplexity AI has released 'Personal Computer', a feature that integrates with its Mac app to securely orchestrate local files and applications. This move expands its AI assistant from web search to direct desktop interaction.

Apr 16, 202687% relevant

Meta's 'Model as Computer' Paper Explores LLM OS-Level Integration

A new research paper from Meta explores a paradigm where the language model acts as the computer's kernel, directly managing processes and memory. This could fundamentally change how AI agents are architected and interact with systems.

Apr 11, 202689% relevant

Meta's Neural Computers: Learned Runtimes Replace External OS for AI Agents

Meta AI and KAUST research introduces Neural Computers, a paradigm where AI models internalize computation, memory, and I/O. Early prototypes show 98.7% GUI cursor control and an 83% arithmetic accuracy boost via reprompting.

Apr 10, 202697% relevant

ASI-Evolve: This AI Designs Better AI Than Humans Can — 105 New Architectures, Zero Human Guidance

Researchers built an AI that runs the entire research cycle on its own — reading papers, designing experiments, running them, and learning from results. It discovered 105 architectures that beat human-designed models, and invented new learning algorithms. Open-sourced.

Apr 5, 202698% relevant

Memory Systems for AI Agents: Architectures, Frameworks, and Challenges

A technical analysis details the multi-layered memory architectures—short-term, episodic, semantic, procedural—required to transform stateless LLMs into persistent, reliable AI agents. It compares frameworks like MemGPT and LangMem that manage context limits and prevent memory drift.

Apr 5, 202695% relevant

Anthropic's Claude Desktop Apps Gain Windows Support for Computer Use Feature

Anthropic has released Windows versions of Claude Code Desktop and Claude Cowork, bringing the 'computer use' feature—which allows the AI to interact with files and applications on a user's computer—to the platform. This follows the macOS release and marks a key step in Anthropic's desktop strategy.

Apr 4, 202687% relevant

Claude AI Gains Computer Control Feature: Opens Apps, Navigates Browser, Fills Spreadsheets

Anthropic's Claude AI can now be enabled to directly control a user's computer to perform tasks like opening applications, browser navigation, and spreadsheet work. This represents a significant shift from chat-based interaction to direct system automation.

Mar 23, 202687% relevant

China's DeepSeek-R1: Open-Source AI Agent Runs Locally with Web Search, Code Generation, and Built-In Computer

Chinese AI company DeepSeek has released DeepSeek-R1, a fully open-source AI agent that runs locally on personal computers with web search capabilities, code generation, and built-in computer functionality. The model represents a significant move toward accessible, self-contained AI systems outside the dominant U.S. ecosystem.

Mar 23, 202699% relevant

8 AI Model Architectures Visually Explained: From Transformers to CNNs and VAEs

A visual guide maps eight foundational AI model architectures, including Transformers, CNNs, and VAEs, providing a clear reference for understanding specialized models beyond LLMs.

Mar 21, 202685% relevant

RF-DETR: A Real-Time Transformer Architecture That Surpasses 60 mAP on COCO

RF-DETR is a new lightweight detection transformer using neural architecture search and internet-scale pre-training. It's the first real-time detector to exceed 60 mAP on COCO, addressing generalization issues in current models.

Mar 10, 202685% relevant

Qualcomm's Arduino Ventuno Q: A Powerhouse Single-Board Computer for the Next Wave of Physical AI

Qualcomm and Arduino have launched the Ventuno Q, a high-performance single-board computer designed specifically for robotics and physical AI applications. Powered by the Dragonwing IQ8 processor with a dedicated NPU and paired with a low-latency microcontroller, it enables complex, offline AI tasks like object tracking and gesture recognition for systems that interact with the real world.

Mar 9, 202680% relevant

Privacy-First Computer Vision: Transforming Luxury Retail Analytics from Showroom to Boutique

Privacy-first computer vision platforms enable luxury retailers to analyze in-store customer behavior, optimize merchandising, and enhance clienteling without compromising personal data. This transforms physical retail intelligence with ethical data collection.

Mar 5, 202685% relevant

TimeGS: How Computer Graphics Techniques Are Revolutionizing Time Series Forecasting

Researchers have introduced TimeGS, a novel AI framework that treats time series forecasting as a 2D rendering problem. By adapting Gaussian splatting techniques from computer graphics, the approach achieves state-of-the-art performance while maintaining temporal continuity.

Mar 4, 202675% relevant

DualPath Architecture Shatters KV-Cache Bottleneck, Doubling LLM Throughput for AI Agents

Researchers have developed DualPath, a novel architecture that eliminates the KV-cache storage bottleneck in agentic LLM inference. By implementing dual-path loading with RDMA transfers, the system achieves nearly 2× throughput improvements for both offline and online scenarios.

Feb 28, 202685% relevant

Perplexity Computer: The AI Agent That Works While You Sleep

Perplexity has launched 'Computer,' an AI agent that autonomously logs into user tools, executes workflows, and operates continuously without human prompting. This represents a fundamental shift from conversational AI to proactive task automation.

Feb 25, 202695% relevant

FDM-1: The AI That Learned to Use Computers by Watching 11 Million Hours of Screen Recordings

Standard Intelligence has unveiled FDM-1, an AI system trained on 11 million hours of screen recordings that can perform complex computer tasks like CAD design, web navigation, and even simulated driving with minimal fine-tuning.

Feb 24, 202695% relevant

Expert Pyramid Tuning: A New Parameter-Efficient Fine-Tuning Architecture for Multi-Task LLMs

Researchers propose Expert Pyramid Tuning (EPT), a novel PEFT method that uses multi-scale feature pyramids to better handle tasks of varying complexity. It outperforms existing MoE-LoRA variants while using fewer parameters, offering more efficient multi-task LLM deployment.

Mar 16, 202679% relevant

OpenAI's WebSocket Revolution: The End of AI Voice Lag and What It Means for Human-Computer Interaction

OpenAI has introduced WebSocket mode for its API, dramatically reducing latency in voice AI interactions. This technical breakthrough enables near-real-time conversations by eliminating the sequential processing bottlenecks that plagued previous voice AI systems.

Feb 23, 202675% relevant

JUPITER Exascale Maps Brain at Cellular Scale on 4,096 Grace Hopper Nodes

JUPITER, Europe's first exascale supercomputer, trained CytoNet brain model on 6.5 PB in 5 days and runs climate, 6G, and quantum simulations.

Jun 22, 202685% relevant

LANL Taps NVIDIA Vera CPUs for 7x Agentic AI Speed on Scientific Workloads

LANL selects NVIDIA Vera CPUs for three supercomputers, claiming 7x performance on agentic AI workloads over x86. Systems Mission, Vision, Veritas deploy by 2027.

Jun 22, 202698% relevant

Microsoft World-R1: RL Aligns Text-to-Video with 3D Physics

Microsoft's World-R1 framework applies reinforcement learning with feedback from pre-trained 3D foundation models to align text-to-video outputs with physical 3D constraints, improving structural coherence without modifying the underlying video diffusion architecture.

Apr 28, 202685% relevant

Columbia Prof: LLMs Can't Generate New Science, Only Map Known Data

Columbia CS Professor Vishal Misra argues LLMs cannot generate new scientific ideas because they learn structured maps of known data and fail outside those boundaries. True discovery requires creating new conceptual maps, a capability current architectures lack.

Apr 21, 202687% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety