Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…
G
GSM8K
· quietNeutral
vs
X
XpertBench
· quietNeutral
Coverage (30d)
0vs0
This Week
0vs0
Evidence
1 articles
Relationships
0
Share:

Timeline

XpertBench2026-04-06

Publication of benchmark revealing 'expert-gap' with top LLMs scoring only ~66% on complex professional tasks

Ecosystem

GSM8K

No mapped relationships

XpertBench

useslarge language models1 src

Evidence (1 articles)

GSM8K vs XpertBench — AI Comparison 2026 | gentic.news