Ego2WebJudge
product → stable
DeepSeek's Ego2WebJudge is an evaluation method for assessing AI agents on web-based tasks. It reports 84% agreement with human judgments of task success.
Total Mentions: 1
Sentiment: +0.30 (Positive)
Velocity (7d): 0.0%
First seen: Mar 25, 2026
Last active: Mar 25, 2026
Timeline
- Research Milestone (Mar 25, 2026): Achieved 84% human agreement on task success evaluation
  - agreement rate: 84%
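The page does not say how the agreement rate is computed. One common reading, assumed here, is the fraction of tasks on which the judge's success verdict matches the human label. The sketch below illustrates that reading only; the function name `agreement_rate` and the sample verdicts are hypothetical and are not taken from the source.

```python
def agreement_rate(judge_verdicts, human_labels):
    """Fraction of tasks where the judge's verdict equals the human label.

    Assumption: agreement means a simple per-task match of success/failure
    verdicts. The source does not confirm this definition.
    """
    if len(judge_verdicts) != len(human_labels):
        raise ValueError("verdict and label lists must have the same length")
    if not judge_verdicts:
        raise ValueError("need at least one task")
    matches = sum(j == h for j, h in zip(judge_verdicts, human_labels))
    return matches / len(judge_verdicts)


# Made-up data for illustration: True = task judged successful.
judge = [True, True, False, True, False]
human = [True, False, False, True, False]

rate = agreement_rate(judge, human)
print(f"{rate:.0%}")  # 4 of 5 verdicts match -> prints "80%"
```

Under this definition, an 84% agreement rate would mean the judge and the human annotators reach the same verdict on 84 of every 100 evaluated tasks.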
Sentiment History
Range: -1 to +1
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W13 | 0.30 | 1 |