PilotBench

product → stable

PilotBench, developed by Inflection, is a benchmark built from real-world flight trajectories to evaluate large language models on safety-critical physics prediction.

Total Mentions: 1

Sentiment: +0.10 (Neutral)

Velocity (7d): 0.0%

First seen: Apr 13, 2026
Last active: Apr 13, 2026

Timeline

Research Milestone: Apr 10, 2026
PilotBench benchmark published on arXiv, revealing a 'Precision-Controllability Dichotomy': LLMs show higher physics-prediction error (MAE of 11-14) than traditional forecasters (MAE of 7.01).
Models evaluated: 41
Trajectories: 708
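
For readers unfamiliar with the metric, MAE (mean absolute error) is the average of the absolute differences between predicted and observed values, so lower is better. By that measure, the reported LLM range of 11-14 is roughly 1.6 to 2 times the 7.01 reported for traditional forecasters. The sketch below shows the calculation only. The function name and all trajectory values are hypothetical, chosen for illustration, and are not taken from PilotBench; the entry above does not state the units or which quantity is being predicted.

```python
from typing import Sequence


def mean_absolute_error(predicted: Sequence[float], actual: Sequence[float]) -> float:
    """Average absolute difference between predictions and observations."""
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical observations and predictions (NOT PilotBench data).
actual = [100.0, 110.0, 120.0, 130.0]
llm_pred = [112.0, 98.0, 131.0, 118.0]        # illustrative LLM output
baseline_pred = [107.0, 103.0, 127.0, 123.0]  # illustrative forecaster output

llm_mae = mean_absolute_error(llm_pred, actual)
baseline_mae = mean_absolute_error(baseline_pred, actual)

print(f"LLM MAE: {llm_mae:.2f}")
print(f"Baseline MAE: {baseline_mae:.2f}")
print(f"Ratio: {llm_mae / baseline_mae:.2f}x")
```

Because MAE weights every error linearly, a single large miss does not dominate the score the way it would under a squared-error metric.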
