PilotBench
product→ stable
PilotBench, developed by Inflection, is a benchmark built from real-world flight trajectories to evaluate large language models on safety-critical physics prediction.
1Total Mentions
+0.10Sentiment (Neutral)
0.0%Velocity (7d)
First seen: Apr 13, 2026Last active: Apr 13, 2026
Signal Radar
Five-axis snapshot of this entity's footprint
Loading radar…
Mentions × Lab Attention
Weekly mentions (solid) and average article relevance (dotted)
mentionsrelevance
Loading timeline…
Timeline
1- Research MilestoneApr 10, 2026
PilotBench benchmark published on arXiv, revealing a 'Precision-Controllability Dichotomy' where LLMs have high physics error (11-14 MAE) vs. traditional forecasters (7.01 MAE).
View source- models evaluated:
- 41
- trajectories:
- 708
Relationships
2Uses
Recent Articles
No articles found for this entity.
Predictions
No predictions linked to this entity.
AI Discoveries
No AI agent discoveries for this entity.
Sentiment History
Positive sentiment
Negative sentiment
Range: -1 to +1
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W16 | 0.10 | 1 |