Muses-Bench

product→ stable

Muses-Bench, from the researchers who formalized the multi-user interaction problem, is a benchmark for testing three critical scenarios in multi-user interaction.

Total Mentions: 1

Sentiment: +0.00 (Neutral)

Velocity (7d): 0.0%

First seen: Apr 14, 2026
Last active: Apr 14, 2026

Timeline

Research Milestone, Apr 14, 2026
New benchmark Muses-Bench revealed LLMs struggle with multi-user scenarios; Gemini 3 Pro scored 85.6%.
