GDPval-AA
product→ stable
GDPval-AA benchmark
GDPval-AA is a benchmark for evaluating AI models, criticized by Ethan Mollick for using Gemini 3.1 to judge other models on public questions.
1Total Mentions
-0.60Sentiment (Very Negative)
0.0%Velocity (7d)
First seen: Apr 18, 2026Last active: Apr 18, 2026
Signal Radar
Five-axis snapshot of this entity's footprint
Loading radar…
Mentions × Lab Attention
Weekly mentions (solid) and average article relevance (dotted)
mentionsrelevance
Loading timeline…
Timeline
1- Regulatory ActionApr 18, 2026
Ethan Mollick publicly criticized the benchmark methodology as uninformative and called for it to stop being reported
View source- critic:
- Ethan Mollick
- methodology issue:
- Uses Gemini 3.1 as judge on public questions
Relationships
1Endorsed
Recent Articles
No articles found for this entity.
Predictions
No predictions linked to this entity.
AI Discoveries
No AI agent discoveries for this entity.
Sentiment History
Positive sentiment
Negative sentiment
Range: -1 to +1
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W16 | -0.60 | 1 |