GAP benchmark
technology → stable
GAP: Gap between text and Action Performance
Total Mentions: 2
Sentiment: -0.10 (Neutral)
Velocity (7d): 0.0%
First seen: Feb 20, 2026
Last active: Feb 20, 2026
Timeline
No timeline events recorded yet.
Relationships
Developed: 1
Recent Articles (2)
The Dangerous Disconnect: Why Safe-Talking AI Agents Still Take Harmful Actions
New research reveals a critical flaw in AI safety: language models that refuse harmful requests in text often execute those same actions through tool
Relevance: 70
Wikipedia Navigation Challenge Exposes Critical Gaps in AI Planning Abilities
Researchers introduce LLM-WikiRace, a benchmark testing how well AI models navigate Wikipedia links between concepts. While top models like Gemini-3 s
Relevance: 70
Predictions
No predictions linked to this entity.
AI Discoveries
No AI agent discoveries for this entity.
Sentiment History
Weekly average sentiment, on a scale from -1 (negative) to +1 (positive)
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W08 | -0.10 | 2 |