BullshitBench v2
Type: product | Status: stable
BullshitBench, developed by researcher Peter Gostev, is a benchmark that evaluates how well large language models detect and reject nonsensical or false prompts instead of generating confident but incorrect responses.
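The evaluation described above can be sketched as a simple scoring loop: feed the model prompts built on false premises and count how often it rejects the premise rather than answering. This is a minimal illustration only; the prompt examples, the refusal markers, and the stub model below are assumptions, not the benchmark's actual contents or methodology.

```python
# Illustrative sketch of a BullshitBench-style scoring loop.
# Markers, prompts, and the stub model are hypothetical placeholders.

REFUSAL_MARKERS = ("does not exist", "no such", "not aware of", "fictional")

def is_refusal(response: str) -> bool:
    """Treat a response as a correct rejection if it flags the premise as false."""
    lowered = response.lower()
    return any(marker in lowered for marker in REFUSAL_MARKERS)

def score(model, nonsense_prompts) -> float:
    """Fraction of nonsensical prompts the model rejects instead of answering."""
    rejected = sum(is_refusal(model(p)) for p in nonsense_prompts)
    return rejected / len(nonsense_prompts)

# Stub standing in for a real LLM call, which would go here.
def always_reject(prompt: str) -> str:
    return "That premise does not exist, so I can't answer the question."

nonsense_prompts = [
    "Summarize the plot of Shakespeare's lost novel 'The Martian Duke'.",
    "What year did Portugal land astronauts on Venus?",
]
print(score(always_reject, nonsense_prompts))  # → 1.0
```

A real harness would replace the keyword match with a judge model, since capable LLMs can reject a false premise without using any fixed phrase.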
Total Mentions: 1
Sentiment: +0.10 (Neutral)
Velocity (7d): 0.0%
First seen: Mar 2, 2026 | Last active: Mar 2, 2026
Timeline
1. Product Launch (Mar 2, 2026)
Researcher Peter Gostev released BullshitBench v2, a benchmark testing AI models' tendency to generate plausible-sounding falsehoods.
- version: v2
Relationships (2)
- Uses
- Developed
Predictions
No predictions linked to this entity.
AI Discoveries
No AI agent discoveries for this entity.
Sentiment History
[Chart: weekly positive/negative sentiment, range -1 to +1]
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W10 | 0.10 | 1 |