BullshitBench
product · stable
BullshitBench v2
BullshitBench, developed by researcher Peter Gostev, is a benchmark that evaluates how well large language models detect and reject nonsensical or false prompts instead of generating confident but incorrect responses.
Total Mentions: 1
Sentiment: +0.10 (Neutral)
Velocity (7d): 0.0%
First seen: Mar 2, 2026
Last active: Mar 2, 2026
Timeline
Product Launch: Mar 2, 2026
Researcher Peter Gostev released BullshitBench v2, a benchmark testing AI models' tendency to generate plausible-sounding falsehoods.
Version: v2