A 111-page survey from top US and China labs proposes 5 levels of AI progress toward AGI, from responder to ecosystem. The paper argues that epistemic exploration — agents actively reducing uncertainty — is the missing ingredient, not better answer generation.
Key facts
- 111-page survey paper from top US and China labs
- 5 AGI levels: responder, reasoner, agent, prospector, ecosystem
- Epistemic exploration has 3 needs: info, skill, avoid stuck
- Exploration is 'disciplined act of asking' what changes beliefs
- Current models operate at bottom 2 levels, per framework
A 111-page survey paper from leading US and China labs — posted on X by @rohanpaul_ai According to @rohanpaul_ai — argues that the path to AGI requires agents that actively explore what they do not know, not just models that answer better. The paper, titled "Agent Exploration Toward Artificial General Intelligence," introduces a 5-level framework for AI progress: responder, reasoner, agent, prospector, and ecosystem.
The 5 Levels of AGI Progress
The authors organize AI progress into 5 levels: responder, reasoner, agent, prospector, and ecosystem, where each level explores a wider space than the last. A responder mostly gives an answer, a reasoner searches through possible thoughts, an agent tests the outside world, a prospector simulates futures, and an ecosystem uses many agents working together. This hierarchy reframes the dominant narrative that scaling compute and data alone yields AGI — instead, it centers exploration breadth as the key metric.
Epistemic Exploration as Core Mechanism
The paper breaks epistemic exploration into 3 needs: seek useful information, turn hard-but-learnable experiences into better ability, and avoid getting stuck in one narrow strategy too early. "Exploration is not randomness; it is the disciplined act of asking which observation would change your beliefs, which attempt would improve your skill, and which path must remain open before it closes," the authors write. This contrasts with current RLHF-based approaches that optimize for safe, predictable responses rather than curiosity-driven exploration.
Unique Take: The Exploration Gap
The survey's structural insight is that the AI industry's current focus on benchmark-chasing and answer accuracy may be counterproductive for AGI. By defining AGI progress in terms of exploration space rather than answer quality, the paper implies that models like GPT-4 and Claude — despite their impressive responder and reasoner capabilities — operate at the bottom two levels. Real AGI requires agents that can actively test hypotheses in the world, simulate counterfactual futures, and coordinate across multiple specialized agents. This is a direct challenge to the "scale is all you need" orthodoxy.
Key Takeaways
- 111-page survey from US/China labs defines 5 AGI levels, argues epistemic exploration — not better answering — is key.
- Challenges scaling orthodoxy.
What to watch

Watch for follow-up papers from these labs that operationalize the 5-level framework into measurable benchmarks — specifically, whether any team publishes an exploration-coverage metric that correlates with downstream generalization on held-out tasks. Also track if frontier labs like OpenAI or DeepMind publicly adopt the framework.








