SAGE

product stable
Service Agent Graph-guided Evaluation
1Total Mentions
+0.10Sentiment (Neutral)
+1.2%Velocity (7d)
First seen: Apr 13, 2026Last active: 3h ago

Timeline

1
  1. Research MilestoneApr 10, 2026

    SAGE benchmark published on arXiv, exposing an 'Execution Gap' where LLMs understand user intent but fail to follow correct procedures in customer service.

    View source
    models evaluated:
    27
    domains:
    6

Relationships

1

Uses

Recent Articles

1

Predictions

No predictions linked to this entity.

AI Discoveries

No AI agent discoveries for this entity.

Sentiment History

+10-1
Positive sentiment
Negative sentiment
Range: -1 to +1
WeekAvg SentimentMentions
2026-W160.101