Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution
arXiv researchers introduce a verification-driven framework for coordinating specialized AI agents to solve complex queries with measurable quality improvements.
What Happened
On March 12, 2026, researchers published a paper on arXiv introducing Verified Multi-Agent Orchestration (VMAO), a novel framework for coordinating multiple specialized large language model (LLM) agents to resolve complex queries. The system employs a Plan-Execute-Verify-Replan iterative loop that represents a significant advancement in multi-agent orchestration architectures.
The core innovation lies in using LLM-based verification as an orchestration-level coordination signal rather than just a final quality check. This verification-driven approach enables adaptive replanning when results are incomplete or insufficient, creating a self-correcting system that improves over traditional sequential or parallel agent execution patterns.
Technical Details
The VMAO Framework Architecture

VMAO operates through four distinct phases that form a continuous loop:
Plan: The system decomposes a complex query into a directed acyclic graph (DAG) of sub-questions, identifying dependencies between components that must be resolved sequentially versus those that can be executed in parallel.
Execute: Domain-specific agents process sub-questions in parallel where possible, with automatic context propagation ensuring that dependent questions receive necessary information from their prerequisites.
Verify: An LLM-based verifier evaluates result completeness and quality at the orchestration level, providing a coordination signal that determines whether the current execution path is sufficient or requires adjustment.
Replan: Based on verification feedback, the system adaptively replans to address gaps, either by refining existing sub-questions, adding new ones, or adjusting execution order.
Key Technical Contributions
The paper highlights three primary contributions:
Dependency-aware parallel execution: Unlike simpler multi-agent systems that either execute all agents sequentially or in uncoordinated parallel, VMAO intelligently maps dependencies into a DAG structure, enabling maximum parallelism while respecting logical constraints.
Verification-driven adaptive replanning: The LLM-based verifier serves as more than a quality gate—it provides actionable feedback that drives system adaptation, creating a closed-loop improvement mechanism.
Configurable stop conditions: The framework includes tunable parameters that allow balancing answer quality against computational resource usage, enabling practical deployment considerations.
Experimental Results
The researchers evaluated VMAO on 25 expert-curated market research queries, comparing it against a single-agent baseline. Results showed substantial improvements:
- Answer completeness: Increased from 3.1 to 4.2 on a 1-5 scale (35% improvement)
- Source quality: Increased from 2.6 to 4.1 on a 1-5 scale (58% improvement)
These metrics demonstrate that orchestration-level verification provides an effective mechanism for multi-agent quality assurance, addressing a critical challenge in deploying complex AI systems for real-world tasks.
Retail & Luxury Implications
Strategic Market Intelligence Applications

While the paper evaluates VMAO on general market research queries, the framework has direct applicability to competitive intelligence, trend analysis, and consumer insight generation in retail and luxury sectors. Complex questions like "How will changing sustainability regulations in the EU affect our supply chain costs and brand perception among Gen Z consumers in France and Italy over the next 18 months?" require precisely the kind of multi-faceted analysis VMAO enables.
The system could coordinate specialized agents for:
- Regulatory analysis (scanning legal documents and policy announcements)
- Supply chain modeling (analyzing cost structures and alternative sourcing)
- Consumer sentiment tracking (monitoring social media and review platforms)
- Competitive benchmarking (comparing sustainability initiatives across luxury brands)
- Market forecasting (projecting adoption rates and price sensitivity)
Enhanced Customer Service and Personalization
Beyond market research, VMAO's architecture could transform complex customer service scenarios where multiple information sources and decision paths intersect. For high-value luxury clients with intricate requests—such as planning a multi-city shopping itinerary that considers inventory availability, personal styling preferences, local events, and logistical constraints—VMAO could coordinate:
- Inventory agents checking real-time stock across locations
- Personalization agents accessing client history and preferences
- Logistics agents coordinating transportation and timing
- Experience agents suggesting complementary activities or services
Product Development and Creative Direction
The verification-driven replanning mechanism could assist in creative and product development processes that typically involve multiple specialized teams. When exploring a new product concept, VMAO could help answer complex questions like "What materials, manufacturing techniques, design aesthetics, and marketing narratives would create a coherent luxury product addressing sustainability concerns while maintaining exclusivity and craftsmanship associations?"
Implementation Considerations for Retail
While promising, implementing VMAO in production retail environments requires addressing several practical considerations:
Domain-specific agent training: The framework's effectiveness depends on having well-trained specialized agents for retail-specific domains (inventory, CRM, pricing, etc.).
Latency vs. quality trade-offs: The iterative replanning process introduces computational overhead that must be balanced against response time requirements, particularly for customer-facing applications.
Integration with existing systems: VMAO would need robust APIs to connect with retail ERP, CRM, PIM, and other enterprise systems to access accurate, real-time data.
Explainability and auditability: For high-stakes decisions in luxury retail (pricing, inventory allocation, etc.), the system's reasoning process must be transparent and auditable.
The Verification Challenge in Luxury Contexts
A particularly interesting challenge for luxury applications is defining verification criteria that capture subjective qualities like brand alignment, aesthetic coherence, and exclusivity maintenance. While the paper's evaluation used expert-curated market research queries with relatively objective completeness criteria, luxury applications often involve more nuanced quality assessments that may resist straightforward LLM-based verification.
Looking Forward
The VMAO framework represents an important step toward more reliable, verifiable multi-agent AI systems—a critical requirement for enterprise adoption in sectors like luxury retail where decision quality carries significant financial and reputational consequences. By making verification an integral, driving component of the orchestration process rather than an afterthought, the approach addresses fundamental concerns about AI system reliability.

For retail AI leaders, the key insight is that orchestration architecture matters as much as individual agent capabilities. As LLM agents become more specialized and capable, the coordination layer becomes the critical determinant of overall system performance. VMAO's verification-driven approach offers a promising direction for building systems that can handle the complex, multi-faceted queries characteristic of strategic decision-making in competitive retail environments.
The research paper "Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution" is available on arXiv under identifier 2603.11445v1.



