KWBench
product→ stable
Knowledge Work Bench
KWBench is a 223-task benchmark, developed by researchers, that tests whether large language models can autonomously identify the underlying game-theoretic problem in professional scenarios.
Total Mentions: 1
Sentiment: +0.10 (Neutral)
Velocity (7d): 0.0%
First seen: Apr 20, 2026
Last active: Apr 20, 2026
Timeline
- Research Milestone (Apr 17, 2026)
Researchers introduced KWBench, a 223-task benchmark for testing LLMs' unprompted problem recognition in complex professional scenarios.
- performance: 27.9% pass rate for best model
- task count: 223
Sentiment History
Range: -1 to +1
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W17 | 0.10 | 1 |