systems architecture

30 articles about systems architecture in AI news

UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems

A new arXiv paper introduces UniMixer, a unified scaling architecture for recommender systems. It bridges attention-based, TokenMixer-based, and factorization-machine-based methods into a single theoretical framework, aiming to improve parameter efficiency and scaling return on investment (ROI).

Apr 2, 2026 · 96% relevant

AI Agent Types and Communication Architectures: From Simple Systems to Multi-Agent Ecosystems

A guide to designing scalable AI agent systems, detailing agent types, multi-agent patterns, and communication architectures for real-world enterprise production. This represents the shift from reactive chatbots to autonomous, task-executing AI.

Mar 19, 2026 · 72% relevant

Multi-Agent AI Systems: Architecture Patterns and Governance for Enterprise Deployment

A technical guide outlines four primary architecture patterns for multi-agent AI systems and proposes a three-layer governance framework. This provides a structured approach for enterprises scaling AI agents across complex operations.

Mar 18, 2026 · 70% relevant

Beyond Self-Play: The Triadic Architecture for Truly Self-Evolving AI Systems

New research reveals why AI self-play systems plateau and proposes a triadic architecture with three key design principles that enable sustainable self-evolution through measurable information gain across iterations.

Mar 4, 2026 · 85% relevant

Memory Systems for AI Agents: Architectures, Frameworks, and Challenges

A technical analysis details the multi-layered memory architectures—short-term, episodic, semantic, procedural—required to transform stateless LLMs into persistent, reliable AI agents. It compares frameworks like MemGPT and LangMem that manage context limits and prevent memory drift.

Apr 5, 2026 · 95% relevant

Beyond Architecture: How Training Tricks Make or Break AI Fraud Detection Systems

New research reveals that weight initialization and normalization techniques—often overlooked in AI development—are critical for graph neural networks detecting financial fraud on blockchain networks. The study shows these training practices affect different GNN architectures in dramatically different ways.

Mar 2, 2026 · 75% relevant

Building Production-Ready Agentic AI Systems with Docker and FastAPI

Towards AI published a practical guide on deploying production-ready agentic AI systems with FastAPI and Docker. The article covers scalable architecture, orchestration, and enterprise considerations for AI agents.

Jun 26, 2026 · 66% relevant

RIFT-Bench Tests 45 Agentic Systems With Dynamic Red-Teaming

RIFT-Bench evaluates 45 agentic AI systems via a graph-driven two-phase pipeline, enabling unified security comparison across heterogeneous architectures.

Jun 24, 2026 · 85% relevant

8 RAG Architectures Explained for AI Engineers: From Naive to Agentic Retrieval

A technical thread explains eight distinct RAG architectures with specific use cases, from basic vector similarity to complex agentic systems. This provides a practical framework for engineers choosing the right approach for different retrieval tasks.

Apr 3, 2026 · 85% relevant

Alibaba DAMO Academy Releases AgentScope: A Python Framework for Multi-Agent Systems with Visual Design

Alibaba's DAMO Academy has open-sourced AgentScope, a Python framework for building coordinated AI agent systems with visual design, MCP tools, memory, RAG, and reasoning. It provides a complete architecture rather than just building blocks.

Mar 24, 2026 · 97% relevant

AI Agents Get a Memory Upgrade: New Framework Treats Multi-Agent Memory as Computer Architecture

A new paper proposes treating multi-agent memory systems as a computer architecture problem, introducing a three-layer hierarchy and identifying critical protocol gaps. This approach could significantly improve reasoning, skills, and tool usage in collaborative AI systems.

Mar 13, 2026 · 85% relevant

Google DeepMind Unveils 'Intelligent AI Delegates': A Paradigm Shift in Autonomous Agent Architecture

Google DeepMind has introduced a framework called 'Intelligent AI Delegates', an agent architecture in which AI systems delegate tasks dynamically. The stated aim is more autonomous, efficient, and collaborative problem-solving.

Mar 12, 2026 · 97% relevant

Beyond Simple Retrieval: The Rise of Agentic RAG Systems That Think for Themselves

Traditional RAG systems are evolving into 'agentic' architectures where AI agents actively control the retrieval process. A new 5-layer evaluation framework helps developers measure when these intelligent pipelines make better decisions than static systems.

Mar 11, 2026 · 81% relevant

OpenDev Paper Formalizes the Architecture for Next-Generation Terminal AI Coding Agents

A comprehensive 81-page research paper introduces OpenDev, a systematic framework for building terminal-based AI coding agents. The work details specialized model routing, dual-agent architectures, and safety controls that address reliability challenges in autonomous coding systems.

Mar 8, 2026 · 95% relevant

Subagent AI Architecture: The Key to Reliable, Scalable Retail Technology Development

Subagent AI architectures break complex development tasks into specialized roles, enabling more reliable implementation of retail systems like personalization engines, inventory APIs, and clienteling tools. This approach prevents context collapse in large codebases.

Mar 5, 2026 · 65% relevant

Beyond RAG: How AI Memory Systems Are Creating Truly Adaptive Agents

AI development is shifting from static retrieval systems to dynamic memory architectures that enable continual learning. This evolution from RAG to agent memory represents a fundamental change in how AI systems accumulate and utilize knowledge over time.

Mar 1, 2026 · 85% relevant

Ecolab Acquires CoolIT Systems for $4.75B in AI Cooling Bet

Ecolab acquires CoolIT Systems for $4.75B, a bet on liquid cooling for AI hardware. An Nvidia endorsement signals strategic alignment with high-density GPU racks.

Jul 9, 2026 · 84% relevant

Foxconn and Intel Partner on AI Data Center Rack Systems

Foxconn and Intel are partnering on AI rack systems, integrating Intel components into Foxconn's manufacturing for hyperscale customers. No financial terms were disclosed.

Jun 4, 2026 · 90% relevant

Multi-Agent Systems Hit Diminishing Returns Past 4 Agents

Adding more agents to LLM-driven multi-agent systems degrades performance past a task-dependent optimum, with weaker models peaking at 4 agents and stronger ones at 2.

Jun 2, 2026 · 100% relevant

Anthropic Publishes Zero-Trust Architecture for AI Agents

Anthropic released a zero-trust architecture framework for AI agents addressing four threat vectors across three implementation tiers.

May 30, 2026 · 85% relevant

Multi-Agent LLM Systems Fail to Outperform Single Models, Study Finds

New paper finds multi-agent LLM systems underperform single models by 2.3% on reasoning benchmarks, challenging a core assumption in AI engineering.

May 13, 2026 · 89% relevant

Recursive Multi-Agent Systems Top Hugging Papers; Eywa Bridges LLMs and Scientific Models

Recursive Multi-Agent Systems leads Hugging Papers with 242 upvotes. Eywa and OneManCompany signal a move from chat-based to structural agent collaboration.

May 3, 2026 · 89% relevant

Large Memory Models: New Architecture Beyond RAG and Vector Search

Researchers with 160+ Nature and ICLR publications have built Large Memory Models (LMMs), a new architecture designed to emulate human memory processes, offering an alternative to RAG and vector search paradigms.

Apr 29, 2026 · 87% relevant

AI Memory Survey: Three Systems Needed for Human-Like Recall

A new survey paper proposes that modern AI requires three distinct memory systems—parametric, retrieval, and agent memory—to achieve human-like cognition, highlighting control as the key bottleneck.

Apr 28, 2026 · 80% relevant

Layers on Layers — How You Can Improve Your Recommendation Systems

An IBM article critiques monolithic recommendation engines for trying to do too much with one score. It proposes a layered architecture—candidate generation, ranking, and business logic—to improve performance and adaptability. This is a direct, practical framework for engineering teams.

Apr 21, 2026 · 82% relevant
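
As a rough illustration of the three layers named in the entry above (candidate generation, ranking, business logic), here is a minimal Python sketch. It is not taken from the IBM article: the `Item` fields, the function names, and the "drop out-of-stock items" rule are all hypothetical placeholders, chosen only to show how the stages can be kept separate instead of being folded into one score.

```python
from dataclasses import dataclass


@dataclass
class Item:
    item_id: str
    relevance: float  # hypothetical score from some ranking model
    in_stock: bool    # hypothetical business attribute


def generate_candidates(catalog, limit):
    """Layer 1: cheaply narrow the catalog to a candidate pool.

    Placeholder logic: take the first `limit` items.
    """
    return catalog[:limit]


def rank(candidates):
    """Layer 2: order candidates by relevance score, highest first."""
    return sorted(candidates, key=lambda item: item.relevance, reverse=True)


def apply_business_logic(ranked, top_k):
    """Layer 3: apply business rules after ranking, then truncate.

    Assumed rule for illustration: drop items that are out of stock.
    """
    return [item for item in ranked if item.in_stock][:top_k]


def recommend(catalog, pool_size=100, top_k=2):
    """Run the three layers in sequence."""
    candidates = generate_candidates(catalog, pool_size)
    return apply_business_logic(rank(candidates), top_k)


catalog = [
    Item("a", 0.2, True),
    Item("b", 0.9, False),
    Item("c", 0.7, True),
    Item("d", 0.5, True),
]
result_ids = [item.item_id for item in recommend(catalog)]
print(result_ids)  # ['c', 'd']
```

Because each layer is its own function, a business rule can change without touching the ranking step, which is the adaptability argument the entry attributes to the layered design.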

A Reference Architecture for Agentic Hybrid Retrieval in Dataset Search

A new research paper presents a reference architecture for 'agentic hybrid retrieval' that orchestrates BM25, dense embeddings, and LLM agents to handle underspecified queries against sparse metadata. It introduces offline metadata augmentation and analyzes two architectural styles for quality attributes like governance and performance.

Apr 21, 2026 · 84% relevant

Poisoned RAG: 5 Documents Can Corrupt 'Hallucination-Free' AI Systems

Researchers showed that planting just five poisoned documents in a RAG system's database can cause it to generate confident, incorrect answers. This exposes a critical vulnerability in systems marketed as 'hallucination-free'.

Apr 20, 2026 · 85% relevant

Akshay Pachaar Inverts LLM Agent Architecture with 'Harness' Design

AI engineer Akshay Pachaar outlined a novel 'harness' architecture for LLM agents that externalizes intelligence into memory, skills, and protocols. He is building a minimal, didactic open-source implementation of this design.

Apr 18, 2026 · 89% relevant

Claude Code's Architecture Revealed

An analysis of Claude Code's source code shows its core is a simple loop, but its power comes from systems like a 5-layer compaction pipeline and a 7-mode permission system, which developers can leverage for better performance.

Apr 18, 2026 · 100% relevant

A Practical Guide to Building Real-Time Recommendation Systems

This article provides a practical overview of building real-time recommendation systems, covering core components like data ingestion, feature stores, and model serving. It matters because real-time personalization is becoming a baseline expectation in digital commerce.

Apr 17, 2026 · 78% relevant
