AI Memory Systems
30 articles about AI memory systems in AI news
Beyond RAG: How AI Memory Systems Are Creating Truly Adaptive Agents
AI development is shifting from static retrieval systems to dynamic memory architectures that enable continual learning. This evolution from RAG to agent memory represents a fundamental change in how AI systems accumulate and utilize knowledge over time.
AI Memory Survey: Three Systems Needed for Human-Like Recall
A new survey paper proposes that modern AI requires three distinct memory systems—parametric, retrieval, and agent memory—to achieve human-like cognition, highlighting control as the key bottleneck.
Hindsight AI: How Biomimetic Memory Systems Are Revolutionizing Agent Intelligence
Hindsight, an open-source AI memory system, achieves state-of-the-art performance on the LongMemEval benchmark by mimicking human memory structures. Unlike traditional RAG approaches, it employs parallel retrieval strategies to enable agents that don't just remember—they learn.
Memory Systems for AI Agents: Architectures, Frameworks, and Challenges
A technical analysis details the multi-layered memory architectures—short-term, episodic, semantic, procedural—required to transform stateless LLMs into persistent, reliable AI agents. It compares frameworks like MemGPT and LangMem that manage context limits and prevent memory drift.
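The four layers listed above can be sketched as a single data structure. This is a minimal illustrative sketch, not the actual interface of MemGPT or LangMem; the class and method names are invented for the example.

```python
from collections import deque
from dataclasses import dataclass, field

@dataclass
class AgentMemory:
    """Toy model of a four-layer agent memory (hypothetical API)."""
    short_term: deque = field(default_factory=lambda: deque(maxlen=8))  # rolling context window
    episodic: list = field(default_factory=list)    # timestamped event log
    semantic: dict = field(default_factory=dict)    # distilled facts, keyed by topic
    procedural: dict = field(default_factory=dict)  # named skills / how-to routines

    def observe(self, event: str) -> None:
        self.short_term.append(event)  # recent turns, bounded to limit drift
        self.episodic.append(event)    # full history persists

    def learn_fact(self, topic: str, fact: str) -> None:
        self.semantic[topic] = fact

    def recall(self, topic: str):
        return self.semantic.get(topic)

mem = AgentMemory()
mem.observe("user asked about deploy scripts")
mem.learn_fact("deploy", "project deploys via GitHub Actions")
print(mem.recall("deploy"))  # -> project deploys via GitHub Actions
```

The bounded `deque` stands in for a context-window cap: short-term memory forgets, while the episodic log keeps everything for later consolidation.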
The Unix Philosophy Returns: How File Systems Could Solve AI's Memory Crisis
A new research paper proposes treating AI context management like a Unix file system, with OpenClaw demonstrating that storing memory, tools, and knowledge as files creates traceable, auditable AI systems. This approach could solve fragmentation and transparency issues plaguing current agent frameworks.
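The "memory as files" idea can be shown in a few lines: each memory entry lives as a plain JSON file in a directory tree, so it is greppable, diffable, and auditable with ordinary tools. The paths and schema below are illustrative, not OpenClaw's actual layout.

```python
import json
import time
from pathlib import Path

ROOT = Path("agent_home")  # hypothetical agent home directory

def remember(kind: str, name: str, content: dict) -> Path:
    """Persist one memory entry as a standalone JSON file."""
    path = ROOT / kind / f"{name}.json"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps({"ts": time.time(), **content}, indent=2))
    return path

def recall(kind: str, name: str) -> dict:
    """Read a memory entry back from disk."""
    return json.loads((ROOT / kind / f"{name}.json").read_text())

remember("memory", "project_goal", {"text": "ship v2 by March"})
print(recall("memory", "project_goal")["text"])  # -> ship v2 by March
```

Because every entry is an ordinary file, traceability comes for free: `ls`, `grep`, and version control all work on the agent's memory.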
Google's TITANS Architecture: A Neuroscience-Inspired Revolution in AI Memory
Google's TITANS architecture marks a departure from standard transformer designs, implementing cognitive-neuroscience principles for adaptive memory. The approach enables test-time learning and addresses the quadratic scaling problem that constrains transformer attention.
Anthropic Democratizes AI Memory: Claude's Free Tier Gets Contextual Recall
Anthropic has expanded access to Claude's memory feature, making it available to all free users. This strategic move coincides with new tools to import conversations from rival chatbots, positioning Claude as a more personalized and sticky alternative in the competitive AI assistant market.
Physics-Inspired AI Memory: How Continuous Fields Could Solve AI's Forgetting Problem
Researchers have developed a revolutionary memory system for AI agents that treats information as continuous fields governed by physics-inspired equations rather than discrete database entries. The approach shows dramatic improvements in long-context reasoning, with +116% performance on multi-session tasks and near-perfect collective intelligence in multi-agent scenarios.
AI Agents Get a Memory Upgrade: New Framework Treats Multi-Agent Memory as Computer Architecture
A new paper proposes treating multi-agent memory systems as a computer architecture problem, introducing a three-layer hierarchy and identifying critical protocol gaps. This approach could significantly improve reasoning, skills, and tool usage in collaborative AI systems.
The Unlearning Illusion: New Research Exposes Critical Flaws in AI Memory Removal
Researchers reveal that current methods for making AI models 'forget' information are surprisingly fragile. A new dynamic testing framework shows that simple query modifications can recover supposedly erased knowledge, exposing significant safety and compliance risks.
AI Agents Get a Memory Upgrade: New Research Tackles Long-Horizon Task Challenges
Researchers have developed new methods to scale AI agent memory for complex, long-horizon tasks. The work addresses one of the biggest limitations in current agent systems: their inability to retain and utilize information over extended sequences of actions.
The File Paradigm: How Simple File Systems Could Revolutionize AI Context Management
New research proposes treating all AI context as files within a unified system, potentially solving memory and organization challenges in complex AI workflows. This approach could dramatically simplify how AI systems access and manage information.
Nous Research's Hermes Agent Features Self-Improving Skills, Persistent Memory
A new evaluation of Nous Research's Hermes Agent highlights its self-improving ability to build reusable tools from experience and a smarter persistent memory system that conserves token usage. The agent reportedly improves with continued use, representing a shift towards more adaptive AI systems.
Memory Sparse Attention (MSA) Achieves 100M Token Context with Near-Linear Complexity
A new attention architecture, Memory Sparse Attention (MSA), breaks the 100M token context barrier while maintaining 94% accuracy at 1M tokens. It uses document-wise RoPE and end-to-end sparse attention to outperform RAG systems and frontier models.
Alibaba DAMO Academy Releases AgentScope: A Python Framework for Multi-Agent Systems with Visual Design
Alibaba's DAMO Academy has open-sourced AgentScope, a Python framework for building coordinated AI agent systems with visual design, MCP tools, memory, RAG, and reasoning. It provides a complete architecture rather than just building blocks.
Memory Sparse Attention (MSA) Enables 100M Token Context Windows with Minimal Performance Loss
Memory Sparse Attention (MSA) is a proposed architecture that allows AI models to store and reason over massive long-term memory directly within their attention mechanism, eliminating the need for external retrieval systems. The approach reportedly enables context windows of up to 100 million tokens with minimal performance degradation.
Aehr Test Systems Lands $41M AI Chip Order; H2 Bookings Top $92M
Aehr Test Systems received a record $41 million production order from a key hyperscale AI customer. Total bookings for the second half of its fiscal year exceeded $92 million, highlighting surging demand for semiconductor test and burn-in equipment.
Cognee Open-Source Framework Unifies Vector, Graph, and Relational Memory for AI Agents
Developer Akshay Pachaar argues AI agent memory requires three data stores—vector, graph, and relational—to handle semantics, relationships, and provenance. His open-source project Cognee unifies them behind a simple API.
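The three-store argument can be sketched as a single facade over three backends. This is not Cognee's actual API; the in-memory dictionaries below are stand-ins for real vector, graph, and relational databases.

```python
class Memory:
    """Illustrative facade over three hypothetical stores."""

    def __init__(self):
        self.vectors = {}  # id -> embedding        (semantics)
        self.graph = {}    # id -> set of related ids (relationships)
        self.rows = {}     # id -> metadata record    (provenance)

    def add(self, item_id, embedding, related=(), **meta):
        # One write fans out to all three stores behind a single call.
        self.vectors[item_id] = embedding
        self.graph.setdefault(item_id, set()).update(related)
        self.rows[item_id] = meta

    def neighbors(self, item_id):
        return self.graph.get(item_id, set())

    def provenance(self, item_id):
        return self.rows[item_id]

m = Memory()
m.add("doc1", [0.1, 0.9], related={"doc2"}, source="wiki", added_by="alice")
print(m.neighbors("doc1"))             # -> {'doc2'}
print(m.provenance("doc1")["source"])  # -> wiki
```

The point of the facade is that callers never choose a store; a single `add` keeps semantics, relationships, and provenance in sync.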
Karpathy's LLM Wiki Hits 5k Stars, Gains Memory Lifecycle Extension
Andrej Karpathy's LLM Wiki repository gained 5,000 GitHub stars in two days. A developer has now extended it with memory lifecycle features, addressing a noted gap.
Mind: Open-Source Persistent Memory for AI Coding Agents
An open-source tool called Mind creates a shared memory layer for AI coding agents, allowing them to remember project context across sessions and different interfaces like Claude Code, Cursor, and Windsurf.
Snapchat Details Production Use of Semantic IDs for Recommender Systems
A technical paper from Snapchat details its application of Semantic IDs (SIDs) in production recommender systems. SIDs are ordered lists of codes derived from item semantics, offering lower cardinality and built-in semantic clustering compared with atomic IDs. The team reports overcoming practical challenges to achieve positive online-metrics impact in multiple models.
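An "ordered list of codes derived from item semantics" is typically produced by multi-level quantization of an item embedding. The residual-quantization sketch below illustrates the idea only; Snapchat's production pipeline and codebooks are not public, and the tiny codebooks here are made up.

```python
def nearest(vec, centroids):
    """Index of the centroid closest to vec (squared Euclidean distance)."""
    return min(range(len(centroids)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(vec, centroids[i])))

def semantic_id(embedding, codebooks):
    """Quantize an embedding into an ordered code list, coarse to fine."""
    residual, codes = list(embedding), []
    for book in codebooks:  # one code per quantization level
        c = nearest(residual, book)
        codes.append(c)
        residual = [r - b for r, b in zip(residual, book[c])]
    return codes

codebooks = [
    [[1.0, 0.0], [0.0, 1.0]],   # level 1: coarse clusters
    [[0.1, 0.1], [-0.1, 0.1]],  # level 2: refines the residual
]
print(semantic_id([0.9, 0.2], codebooks))  # -> [0, 1]
```

Because earlier codes capture coarser structure, items sharing a prefix are semantically close, which is the clustering property the paper exploits.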
OpenAI Reallocates Compute and Talent Toward 'Automated Researchers' and Agent Systems
OpenAI is reallocating significant compute resources and engineering talent toward developing 'automated researchers' and agent-based systems capable of executing complex tasks end-to-end, signaling a strategic pivot away from some existing projects.
Google DeepMind Maps Six 'AI Agent Traps' That Can Hijack Autonomous Systems in the Wild
Google DeepMind has published a framework identifying six categories of 'traps'—from hidden web instructions to poisoned memory—that can exploit autonomous AI agents. This research provides the first systematic taxonomy for a growing attack surface as agents gain web access and tool-use capabilities.
MemFactory Framework Unifies Agent Memory Training & Inference, Reports 14.8% Gains Over Baselines
Researchers introduced MemFactory, a unified framework treating agent memory as a trainable component. It supports multiple memory paradigms and shows up to 14.8% relative improvement over baseline methods.
Throughput Optimization as a Strategic Lever in Large-Scale AI Systems
A new arXiv paper argues that optimizing data pipeline and memory throughput is now a strategic necessity for training large AI models, citing specific innovations like OVERLORD and ZeRO-Offload that deliver measurable efficiency gains.
Google's TurboQuant AI Research Report Sparks Sell-Off in Micron, Samsung, and SK Hynix Memory Stocks
Google's TurboQuant research blog publication triggered an immediate market reaction, with shares of major memory manufacturers dropping 2-4% as investors anticipate AI-driven efficiency gains reducing future memory demand.
Multi-Agent AI Systems: Architecture Patterns and Governance for Enterprise Deployment
A technical guide outlines four primary architecture patterns for multi-agent AI systems and proposes a three-layer governance framework. This provides a structured approach for enterprises scaling AI agents across complex operations.
Did You Check the Right Pocket? A New Framework for Cost-Sensitive Memory Routing in AI Agents
A new arXiv paper frames memory retrieval in AI agents as a 'store-routing' problem. It shows that selectively querying specialized data stores, rather than all stores for every request, significantly improves efficiency and accuracy, formalizing a cost-sensitive trade-off.
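The cost-sensitive trade-off can be made concrete with a toy router: each store has a query cost, and a store is only queried when its predicted relevance clears that cost within a budget. The keyword heuristics below stand in for the paper's learned routing; store names and costs are invented.

```python
STORES = {
    "vector": 3.0,    # expensive semantic search
    "keyvalue": 0.5,  # cheap exact lookup
    "graph": 2.0,     # relationship traversal
}

def relevance(store, query):
    """Crude heuristic score; a real router would be learned."""
    q = query.lower()
    if store == "keyvalue" and q.startswith("get "):
        return 5.0
    if store == "graph" and "related" in q:
        return 4.0
    if store == "vector":
        return 2.5  # decent default for fuzzy queries
    return 0.0

def route(query, budget=4.0):
    """Pick stores whose relevance exceeds their cost, best value first."""
    picks, spent = [], 0.0
    for store, cost in sorted(STORES.items(),
                              key=lambda kv: relevance(kv[0], query) / kv[1],
                              reverse=True):
        if relevance(store, query) > cost and spent + cost <= budget:
            picks.append(store)
            spent += cost
    return picks

print(route("get user:42 profile"))           # -> ['keyvalue']
print(route("anything related to billing"))   # -> ['graph']
```

An exact-lookup query never pays for a vector search, which is precisely the efficiency win the paper formalizes.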
How a GPU Memory Leak Nearly Cost an AI Team a Major Client During a Live Demo
A detailed post-mortem of a critical AI inference failure during a client demo reveals how silent GPU memory leaks, inadequate health checks, and missing circuit breakers can bring down a production pipeline. The author shares the architectural fixes implemented to prevent recurrence.
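One of the missing safeguards the post-mortem names, a circuit breaker in front of the inference path, fits in a few lines. This is a minimal sketch: the thresholds are illustrative, and in production `check_memory` would poll real GPU telemetry (for example via NVML) rather than a plain callable.

```python
import time

class CircuitBreaker:
    """Shed load when GPU memory pressure is high instead of crashing."""

    def __init__(self, check_memory, max_used_fraction=0.9, cooldown_s=30.0):
        self.check_memory = check_memory          # returns used fraction in [0, 1]
        self.max_used_fraction = max_used_fraction
        self.cooldown_s = cooldown_s
        self.tripped_at = None

    def allow(self) -> bool:
        if self.tripped_at is not None:
            if time.monotonic() - self.tripped_at < self.cooldown_s:
                return False                      # open: reject requests during cooldown
            self.tripped_at = None                # half-open: probe again
        if self.check_memory() > self.max_used_fraction:
            self.tripped_at = time.monotonic()    # trip on memory pressure
            return False
        return True

breaker = CircuitBreaker(check_memory=lambda: 0.95, cooldown_s=60)
print(breaker.allow())  # -> False: memory pressure trips the breaker
print(breaker.allow())  # -> False: still within cooldown
```

Rejecting a few requests is recoverable; an out-of-memory crash mid-demo, as the post-mortem shows, is not.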
Beyond the Model: New Framework Evaluates Entire AI Agent Systems, Revealing Framework Choice as Critical as Model Selection
Researchers introduce MASEval, a framework-agnostic evaluation library that shifts focus from individual AI models to entire multi-agent systems. Their systematic comparison reveals that implementation choices—like topology and orchestration logic—impact performance as much as the underlying language model itself.