RIFT-Bench evaluates 45 agentic AI systems using a graph-driven red-teaming pipeline. The benchmark, published on arXiv June 22, 2026, automates security assessment across heterogeneous agent architectures.
Key facts
- Published on arXiv June 22, 2026.
- Evaluates 45 agentic AI systems.
- Two phases: Discovery and Scanning.
- Supports adaptive adversarial attacks.
- Also evaluates mitigation strategies.
Agentic AI systems—LLM-powered autonomous decision-makers—introduce attack surfaces beyond those of traditional large language models. Existing security evaluations are typically domain-specific or implementation-tied, making cross-system comparison impossible. According to RIFT-Bench, a new benchmark published June 22, 2026 on arXiv, addresses this gap with a graph representation-driven methodology for dynamic red-teaming.
Two-Phase Automated Pipeline
RIFT-Bench operates in two automated phases: Discovery, which extracts system structure into a hierarchical NodeSpec representation, and Scanning, which deploys adaptive adversarial attacks against that representation. The framework evaluates the system itself rather than just the underlying LLM, enabling unified comparison across 45 agentic systems spanning diverse implementations. The authors demonstrate that the approach generalizes effectively to heterogeneous agentic architectures.
Attack Taxonomy and Mitigation Testing
Beyond systems and attacks, RIFT-Bench supports direct evaluation of mitigation strategies. The proposed attack taxonomy organizes adversarial influence along an attack-surface axis and a failure-objective axis, allowing the same attack to be instantiated with different goals. This makes RIFT-Bench a scalable foundation for security evaluation, according to the paper.

Why This Matters for the Field
RIFT-Bench treats the agentic system itself as the evaluation target, not just the LLM behind it. This mirrors the shift in the industry from model-level safety to system-level security—a gap that existing benchmarks like SciRisk-Bench (testing risk dimensions) or the NVIDIA Blackwell Ultra agentic benchmark (performance-focused) do not address. RIFT-Bench is the first to provide a unified, automated red-teaming framework for agentic architectures.

What to watch
Watch for the release of RIFT-Bench's code and dataset on GitHub, and for third-party validations that compare its findings against manual red-teaming results. Adoption by AI safety labs and enterprise security teams will signal whether the benchmark becomes a de facto standard.

Source: arxiv.org









