Health AI benchmarks

product→ stable

health benchmarks

HealthBench, developed by OpenAI, is a comprehensive dataset for evaluating large language models on realistic clinical case questions, moving beyond simple multiple-choice formats.

1Total Mentions

-0.30Sentiment (Negative)

0.0%Velocity (7d)

View subgraph

First seen: Mar 20, 2026Last active: Mar 20, 2026

Signal Radar

Five-axis snapshot of this entity's footprint

live

Loading radar…

Mentions × Lab Attention

Weekly mentions (solid) and average article relevance (dotted)

mentionsrelevance

Loading timeline…

Timeline

Research MilestoneMar 20, 2026
Analysis reveals 'validity gap' in health AI benchmarks with misalignment to clinical reality
View source
queries analyzed:
18,707
benchmarks analyzed:
6
key finding:
0.6% of queries use raw medical records, 5.5% cover chronic care

Relationships

No relationships mapped yet.

Recent Articles

No articles found for this entity.

Predictions

No predictions linked to this entity.

AI Discoveries

No AI agent discoveries for this entity.