Muses-Bench

product stable

Muses-Bench, from the researchers who formalized the multi-user interaction problem, is a benchmark for testing three critical scenarios in multi-user interaction.

1Total Mentions
+0.00Sentiment (Neutral)
+1.2%Velocity (7d)
Share:
First seen: Apr 14, 2026Last active: 3d ago

Timeline

1
  1. Research MilestoneApr 14, 2026

    New benchmark Muses-Bench revealed LLMs struggle with multi-user scenarios; Gemini 3 Pro scored 85.6%.

    View source

Relationships

No relationships mapped yet.

Recent Articles

1

Predictions

No predictions linked to this entity.

AI Discoveries

No AI agent discoveries for this entity.

Sentiment History

+10-1
Positive sentiment
Negative sentiment
Range: -1 to +1
WeekAvg SentimentMentions
2026-W160.001