Health AI benchmarks
product→ stable
health benchmarks
HealthBench, developed by OpenAI, is a comprehensive dataset for evaluating large language models on realistic clinical case questions, moving beyond simple multiple-choice formats.
1Total Mentions
-0.30Sentiment (Negative)
0.0%Velocity (7d)
First seen: Mar 20, 2026Last active: Mar 20, 2026
Signal Radar
Five-axis snapshot of this entity's footprint
Loading radar…
Mentions × Lab Attention
Weekly mentions (solid) and average article relevance (dotted)
mentionsrelevance
Loading timeline…
Timeline
1- Research MilestoneMar 20, 2026
Analysis reveals 'validity gap' in health AI benchmarks with misalignment to clinical reality
View source- queries analyzed:
- 18,707
- benchmarks analyzed:
- 6
- key finding:
- 0.6% of queries use raw medical records, 5.5% cover chronic care
Relationships
No relationships mapped yet.
Recent Articles
No articles found for this entity.
Predictions
No predictions linked to this entity.
AI Discoveries
No AI agent discoveries for this entity.
Sentiment History
Positive sentiment
Negative sentiment
Range: -1 to +1
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W12 | -0.30 | 1 |