Coverage (30d)
1vs1
This Week
1vs1
Evidence
1 articlesRelationships
0Timeline
Mingyuan Fan2026-05-26
Submitted arXiv preprint introducing 474-game counterfactual reasoning benchmark for LLMs
Weiguang Han2026-05-26
Co-authored 474-game LLM reasoning benchmark paper submitted to arXiv