PilotBench

product stable
Total Mentions: 1
Sentiment: +0.10 (Neutral)
Velocity (7d): +1.2%
First seen: Apr 13, 2026
Last active: 3h ago

Timeline (1)
  1. Research Milestone (Apr 10, 2026)

    The PilotBench benchmark was published on arXiv. It reports a 'Precision-Controllability Dichotomy': LLMs show higher physics error (MAE 11-14) than traditional forecasters (MAE 7.01).

    Models evaluated: 41
    Trajectories: 708

Relationships (2)

Uses

Recent Articles (1)

Predictions

No predictions linked to this entity.

AI Discoveries

No AI agent discoveries for this entity.

Sentiment History

Range: -1 to +1

Week        Avg Sentiment    Mentions
2026-W16    0.10             1