agent systems

30 articles about agent systems in AI news

Meta-skill evolution lets multi-agent systems self-improve without retraining

Multi-agent systems can improve orchestration by evolving a meta-skill via RL on interactions, without retraining agents. Demonstrated on a simulated benchmark.

Jun 20, 2026 · 80% relevant

Multi-Agent Systems Hit Diminishing Returns Past 4 Agents

Adding more agents to LLM-driven multi-agent systems degrades performance past a task-dependent optimum, with weaker models peaking at 4 agents and stronger ones at 2.

Jun 2, 2026 · 100% relevant

Recursive Multi-Agent Systems Top Hugging Papers; Eywa Bridges LLMs and Scientific Models

Recursive Multi-Agent Systems leads Hugging Papers with 242 upvotes. Eywa and OneManCompany signal a move from chat-based to structural agent collaboration.

May 3, 2026 · 89% relevant

Alibaba DAMO Academy Releases AgentScope: A Python Framework for Multi-Agent Systems with Visual Design

Alibaba's DAMO Academy has open-sourced AgentScope, a Python framework for building coordinated AI agent systems with visual design, MCP tools, memory, RAG, and reasoning. It provides a complete architecture rather than just building blocks.

Mar 24, 2026 · 97% relevant

Beyond the Model: New Framework Evaluates Entire AI Agent Systems, Revealing Framework Choice as Critical as Model Selection

Researchers introduce MASEval, a framework-agnostic evaluation library that shifts focus from individual AI models to entire multi-agent systems. Their systematic comparison reveals that implementation choices—like topology and orchestration logic—impact performance as much as the underlying language model itself.

Mar 11, 2026 · 75% relevant

MASFactory: A Graph-Centric Framework for Orchestrating LLM-Based Multi-Agent Systems

Researchers introduce MASFactory, a framework that uses 'Vibe Graphing' to compile natural-language intent into executable multi-agent workflows. This addresses implementation complexity and reuse challenges in LLM-based agent systems.

Mar 9, 2026 · 75% relevant

The 'Black Box' of AI Collaboration: How Dynamic Graphs Could Revolutionize Multi-Agent Systems

Researchers have developed a novel framework called Dynamic Interaction Graph (DIG) that makes emergent collaboration between AI agents observable and explainable. This breakthrough addresses critical challenges in scaling truly autonomous multi-agent systems by enabling real-time identification and correction of collaboration failures.

Mar 3, 2026 · 75% relevant

OpenAI Reallocates Compute and Talent Toward 'Automated Researchers' and Agent Systems

OpenAI is reallocating significant compute resources and engineering talent toward developing 'automated researchers' and agent-based systems capable of executing complex tasks end-to-end, signaling a strategic pivot away from some existing projects.

Apr 3, 2026 · 89% relevant

LangChain Releases DeepAgents: Open-Source Framework for Hierarchical AI Agent Systems

LangChain has open-sourced DeepAgents, a framework for building AI agents that can plan tasks, spawn sub-agents, and manage files. It aims to enable more complex, autonomous workflows by structuring agents hierarchically.

Mar 15, 2026 · 85% relevant

Debug Multi-Agent Systems Locally with the A2A Simulator

Test and debug AI agents that communicate via Google's A2A protocol using a local simulator that shows both sides of the conversation.

Mar 28, 2026 · 95% relevant

AI Agent Types and Communication Architectures: From Simple Systems to Multi-Agent Ecosystems

A guide to designing scalable AI agent systems, detailing agent types, multi-agent patterns, and communication architectures for real-world enterprise production. This represents the shift from reactive chatbots to autonomous, task-executing AI.

Mar 19, 2026 · 72% relevant

Preventing AI Team Meltdowns: How to Stop Error Cascades in Multi-Agent Retail Systems

New research shows how minor errors in AI agent teams can snowball into systemic failures, and describes a governance layer that stops cascading mistakes without disrupting workflows. The findings apply to luxury retailers deploying multi-agent systems for personalization and operations.

Mar 6, 2026 · 70% relevant

Google Launches Free 5-Day AI Agents Course, 1.5M Enrolled Last Run

Google launched a free 5-day AI Agents course after the prior edition drew 1.5M learners. The curriculum covers vibe coding, multi-agent systems, and production deployment on Kaggle.

May 31, 2026 · 87% relevant

New Protocol Enables Self-Improving AI Agents with Auditable Lineage

Researchers have proposed a formal protocol for creating self-improving AI agent systems. The framework enables agents to autonomously evaluate and implement upgrades while maintaining auditable lineage and safe rollback options.

Apr 19, 2026 · 85% relevant

OpenAI Open-Sources Agents SDK, Supports 100+ LLMs

OpenAI has open-sourced its internal Agents SDK, a lightweight framework for building multi-agent systems. It features three core primitives, works with over 100 LLMs, and quickly gained 18.9k GitHub stars.

Apr 18, 2026 · 95% relevant

Microsoft Launches Free 'AI Agent Course' for Developers, Covers Design Patterns to Production

Microsoft has released a comprehensive, hands-on course for building AI agents, covering design patterns, RAG, tools, and multi-agent systems. It's a practical resource aimed at moving developers from theory to deployment.

Mar 31, 2026 · 85% relevant

Google Researchers Challenge Singularity Narrative: Intelligence Emerges from Social Systems, Not Individual Minds

Google researchers argue AI's intelligence explosion will be social, not individual, observing frontier models like DeepSeek-R1 spontaneously develop internal 'societies of thought.' This reframes scaling strategy from bigger models to richer multi-agent systems.

Mar 27, 2026 · 87% relevant

Open-Source 'AI Office' Platform Lets Users Walk Through 3D Space to Monitor Autonomous Agents

An open-source project called AI Office creates a 3D virtual workspace where AI agents are visualized as avatars performing tasks. Users can navigate the space instead of reading logs, offering a novel interface for multi-agent systems.

Mar 23, 2026 · 85% relevant

Research Paper 'Can AI Agents Agree?' Finds LLM-Based Groups Fail at Simple Coordination

A new study demonstrates that groups of LLM-based AI agents cannot reliably reach consensus on simple decisions, with failure rates increasing with group size. This challenges the common developer assumption that multi-agent systems will naturally converge through discussion.

Mar 21, 2026 · 87% relevant

Researchers Apply Distributed Systems Theory to LLM Teams, Revealing O(n²) Communication Bottlenecks

A new paper applies decades-old distributed computing principles to LLM multi-agent systems, finding identical coordination problems: O(n²) communication bottlenecks, straggler delays, and consistency conflicts.

Mar 15, 2026 · 85% relevant
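
For readers wondering where an O(n²) figure like the one in the item above can come from, here is a minimal sketch. It assumes every agent can message every other agent directly (a fully connected team); that topology and the helper name `pairwise_channels` are illustrative assumptions, not details taken from the paper.

```python
def pairwise_channels(n: int) -> int:
    """Count distinct agent-to-agent links in a fully connected team of n agents.

    Assumption for illustration: every agent talks to every other agent,
    so the count is n choose 2 = n * (n - 1) / 2, which grows as O(n^2).
    """
    if n < 0:
        raise ValueError("n must be non-negative")
    return n * (n - 1) // 2


if __name__ == "__main__":
    # Doubling the team size roughly quadruples the number of links.
    for n in (2, 4, 8, 16):
        print(f"{n} agents -> {pairwise_channels(n)} links")
```

Under this assumption, going from 4 to 16 agents raises the link count from 6 to 120, which is why adding agents can increase coordination cost faster than it adds capacity.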

Google DeepMind's Intelligent Delegation Framework: The Missing Infrastructure for AI Agents

Google DeepMind has introduced a groundbreaking framework called Intelligent AI Delegation that enables AI agents to safely hand off tasks to other agents and humans. The system addresses critical issues of accountability, transparency, and reliability in multi-agent systems.

Mar 11, 2026 · 95% relevant

AI Agents Get a Memory Upgrade: New Research Tackles Long-Horizon Task Challenges

Researchers have developed new methods to scale AI agent memory for complex, long-horizon tasks. The breakthrough addresses one of the biggest limitations in current agent systems—their inability to retain and utilize information over extended sequences of actions.

Mar 10, 2026 · 87% relevant

When AI Agents Disagree: New Research Tests Whether LLMs Can Reach Consensus

New research explores whether LLM-based AI agents can effectively communicate and reach agreement in multi-agent systems. The study reveals surprising patterns in how AI agents negotiate, disagree, and sometimes fail to find common ground.

Mar 4, 2026 · 85% relevant

Graph-Based AI Agents Are Revolutionizing Software Development

Researchers are developing graph-based multi-agent systems that dynamically adapt their collaboration patterns to solve complex coding problems more effectively than traditional fixed architectures.

Feb 27, 2026 · 85% relevant

ARLArena Framework Solves Critical Stability Problem in AI Agent Training

Researchers have developed ARLArena, a unified framework that addresses the persistent instability problem in agentic reinforcement learning. The framework provides standardized testing and introduces SAMPO, a stable optimization method that prevents training collapse in complex AI agent systems.

Feb 26, 2026 · 70% relevant

OpenAI Backs AI "Bot Army" Startup Isara in $94M Funding Round at $650M Valuation

OpenAI has led a $94 million investment in Isara, a startup developing autonomous AI agents that can collaborate in large groups. The deal values the company at $650 million and signals OpenAI's strategic push into multi-agent systems.

Mar 26, 2026 · 95% relevant

Google's Gemini API Goes Free: A Game-Changer for AI Development and Experimentation

Google has removed rate limits and introduced free access to its Gemini API, enabling developers to experiment with AI prompts in CI/CD pipelines and agent systems without billing concerns. This move democratizes access to advanced language models and encourages innovation.

Mar 12, 2026 · 89% relevant

Building Production-Ready Agentic AI Systems with Docker and FastAPI

Towards AI published a practical guide on deploying production-ready agentic AI systems with FastAPI and Docker. The article covers scalable architecture, orchestration, and enterprise considerations for AI agents.

Jun 26, 2026 · 66% relevant

RIFT-Bench Tests 45 Agentic Systems With Dynamic Red-Teaming

RIFT-Bench evaluates 45 agentic AI systems via a graph-driven two-phase pipeline, enabling unified security comparison across heterogeneous architectures.

Jun 24, 2026 · 85% relevant

Multi-Agent LLM Systems Fail to Outperform Single Models, Study Finds

New paper finds multi-agent LLM systems underperform single models by 2.3% on reasoning benchmarks, challenging a core assumption in AI engineering.

May 13, 2026 · 89% relevant
