Coverage (30d)
0vs0
This Week
0vs0
Evidence
1 articlesRelationships
0Timeline
XpertBench2026-04-06
Publication of benchmark revealing 'expert-gap' with top LLMs scoring only ~66% on complex professional tasks
Ecosystem
GSM8K
No mapped relationships
XpertBench
useslarge language models1 src