agent systems

30 articles about agent systems in AI news

Alibaba DAMO Academy Releases AgentScope: A Python Framework for Multi-Agent Systems with Visual Design

Alibaba's DAMO Academy has open-sourced AgentScope, a Python framework for building coordinated AI agent systems with visual design, MCP tools, memory, RAG, and reasoning. It provides a complete architecture rather than just building blocks.

97% relevant

Beyond the Model: New Framework Evaluates Entire AI Agent Systems, Revealing Framework Choice as Critical as Model Selection

Researchers introduce MASEval, a framework-agnostic evaluation library that shifts focus from individual AI models to entire multi-agent systems. Their systematic comparison reveals that implementation choices—like topology and orchestration logic—impact performance as much as the underlying language model itself.

75% relevant

MASFactory: A Graph-Centric Framework for Orchestrating LLM-Based Multi-Agent Systems

Researchers introduce MASFactory, a framework that uses 'Vibe Graphing' to compile natural-language intent into executable multi-agent workflows. This addresses implementation complexity and reuse challenges in LLM-based agent systems.

75% relevant

The 'Black Box' of AI Collaboration: How Dynamic Graphs Could Revolutionize Multi-Agent Systems

Researchers have developed a novel framework called Dynamic Interaction Graph (DIG) that makes emergent collaboration between AI agents observable and explainable. This breakthrough addresses critical challenges in scaling truly autonomous multi-agent systems by enabling real-time identification and correction of collaboration failures.

75% relevant

OpenAI Reallocates Compute and Talent Toward 'Automated Researchers' and Agent Systems

OpenAI is reallocating significant compute resources and engineering talent toward developing 'automated researchers' and agent-based systems capable of executing complex tasks end-to-end, signaling a strategic pivot away from some existing projects.

89% relevant

LangChain Releases DeepAgents: Open-Source Framework for Hierarchical AI Agent Systems

LangChain has open-sourced DeepAgents, a framework for building AI agents that can plan tasks, spawn sub-agents, and manage files. It aims to enable more complex, autonomous workflows by structuring agents hierarchically.

85% relevant

Debug Multi-Agent Systems Locally with the A2A Simulator

Test and debug AI agents that communicate via Google's A2A protocol using a local simulator that shows both sides of the conversation.

100% relevant

AI Agent Types and Communication Architectures: From Simple Systems to Multi-Agent Ecosystems

A guide to designing scalable AI agent systems, detailing agent types, multi-agent patterns, and communication architectures for real-world enterprise production. This represents the shift from reactive chatbots to autonomous, task-executing AI.

72% relevant

Preventing AI Team Meltdowns: How to Stop Error Cascades in Multi-Agent Retail Systems

New research reveals how minor errors in AI agent teams can snowball into systemic failures. For luxury retailers deploying multi-agent systems for personalization and operations, this governance layer prevents cascading mistakes without disrupting workflows.

70% relevant

Microsoft Launches Free 'AI Agent Course' for Developers, Covers Design Patterns to Production

Microsoft has released a comprehensive, hands-on course for building AI agents, covering design patterns, RAG, tools, and multi-agent systems. It's a practical resource aimed at moving developers from theory to deployment.

85% relevant

Google Researchers Challenge Singularity Narrative: Intelligence Emerges from Social Systems, Not Individual Minds

Google researchers argue AI's intelligence explosion will be social, not individual, observing frontier models like DeepSeek-R1 spontaneously develop internal 'societies of thought.' This reframes scaling strategy from bigger models to richer multi-agent systems.

87% relevant

Open-Source 'AI Office' Platform Lets Users Walk Through 3D Space to Monitor Autonomous Agents

An open-source project called AI Office creates a 3D virtual workspace where AI agents are visualized as avatars performing tasks. Users can navigate the space instead of reading logs, offering a novel interface for multi-agent systems.

85% relevant

Research Paper 'Can AI Agents Agree?' Finds LLM-Based Groups Fail at Simple Coordination

A new study demonstrates that groups of LLM-based AI agents cannot reliably reach consensus on simple decisions, with failure rates increasing with group size. This challenges the common developer assumption that multi-agent systems will naturally converge through discussion.

87% relevant

Researchers Apply Distributed Systems Theory to LLM Teams, Revealing O(n²) Communication Bottlenecks

A new paper applies decades-old distributed computing principles to LLM multi-agent systems, finding identical coordination problems: O(n²) communication bottlenecks, straggler delays, and consistency conflicts.

85% relevant

Google DeepMind's Intelligent Delegation Framework: The Missing Infrastructure for AI Agents

Google DeepMind has introduced a groundbreaking framework called Intelligent AI Delegation that enables AI agents to safely hand off tasks to other agents and humans. The system addresses critical issues of accountability, transparency, and reliability in multi-agent systems.

95% relevant

AI Agents Get a Memory Upgrade: New Research Tackles Long-Horizon Task Challenges

Researchers have developed new methods to scale AI agent memory for complex, long-horizon tasks. The breakthrough addresses one of the biggest limitations in current agent systems—their inability to retain and utilize information over extended sequences of actions.

87% relevant

When AI Agents Disagree: New Research Tests Whether LLMs Can Reach Consensus

New research explores whether LLM-based AI agents can effectively communicate and reach agreement in multi-agent systems. The study reveals surprising patterns in how AI agents negotiate, disagree, and sometimes fail to find common ground.

85% relevant

Graph-Based AI Agents Are Revolutionizing Software Development

Researchers are developing graph-based multi-agent systems that dynamically adapt their collaboration patterns to solve complex coding problems more effectively than traditional fixed architectures.

85% relevant

ARLArena Framework Solves Critical Stability Problem in AI Agent Training

Researchers have developed ARLArena, a unified framework that addresses the persistent instability problem in agentic reinforcement learning. The framework provides standardized testing and introduces SAMPO, a stable optimization method that prevents training collapse in complex AI agent systems.

70% relevant

OpenAI Backs AI "Bot Army" Startup Isara in $94M Funding Round at $650M Valuation

OpenAI has led a $94 million investment in Isara, a startup developing autonomous AI agents that can collaborate in large groups. The deal values the company at $650 million and signals OpenAI's strategic push into multi-agent systems.

100% relevant

OpenAI Targets First 'AI Intern' by September 2028, Building Toward Autonomous Researchers

OpenAI plans to deploy its first 'AI intern' by September and aims for a full autonomous research system by 2028. The effort builds on reasoning models and agent systems like Codex, which have shown dramatic productivity gains but still face reliability and safety challenges.

95% relevant

OpenClaw Enables Natural Language Control for Drones and Humanoid Robots via Open-Source Framework

OpenClaw, an open-source framework, now allows developers to control drones and humanoid robots using natural language commands. The system integrates with physical sensors like cameras and lidar to build multi-agent systems.

87% relevant

Google's Gemini API Goes Free: A Game-Changer for AI Development and Experimentation

Google has removed rate limits and introduced free access to its Gemini API, enabling developers to experiment with AI prompts in CI/CD pipelines and agent systems without billing concerns. This move democratizes access to advanced language models and encourages innovation.

89% relevant

Agentic AI Systems Failing in Production: New Research Reveals Benchmark Gaps

New research reveals that agentic AI systems are failing in production environments in ways not captured by current benchmarks, including alignment drift and context loss during handoffs between agents.

87% relevant

Rethinking Recommendation Paradigms: From Pipelines to Agentic Recommender Systems

New arXiv research proposes transforming static, multi-stage recommendation pipelines into self-evolving 'Agentic Recommender Systems' where modules become autonomous agents. This paradigm shift aims to automate system improvement using RL and LLMs, moving beyond manual engineering.

94% relevant

Multi-Agent AI Systems: Architecture Patterns and Governance for Enterprise Deployment

A technical guide outlines four primary architecture patterns for multi-agent AI systems and proposes a three-layer governance framework. This provides a structured approach for enterprises scaling AI agents across complex operations.

70% relevant

Beyond Simple Retrieval: The Rise of Agentic RAG Systems That Think for Themselves

Traditional RAG systems are evolving into 'agentic' architectures where AI agents actively control the retrieval process. A new 5-layer evaluation framework helps developers measure when these intelligent pipelines make better decisions than static systems.

81% relevant

AI Efficiency Breakthrough: New Framework Optimizes Agentic RAG Systems Under Budget Constraints

Researchers have developed a systematic framework for optimizing agentic RAG systems under budget constraints. Their study reveals that hybrid retrieval strategies and limited search iterations deliver maximum accuracy with minimal costs, providing practical guidance for real-world AI deployment.

79% relevant

Beyond Simple Messaging: LDP Protocol Brings Identity and Governance to Multi-Agent AI Systems

Researchers have introduced the LLM Delegate Protocol (LDP), a new communication standard designed specifically for multi-agent AI systems. Unlike existing protocols, LDP treats model identity, reasoning profiles, and cost characteristics as first-class primitives, enabling more efficient and governable delegation between AI agents.

75% relevant

Three Research Frontiers in Recommender Systems: From Agent-Driven Reports to Machine Unlearning and Token-Level Personalization

Three arXiv papers advance recommender systems: RecPilot proposes agent-generated research reports instead of item lists; ERASE establishes a practical benchmark for machine unlearning; PerContrast improves LLM personalization via token-level weighting. These address core UX, compliance, and personalization challenges.

92% relevant