METR
Model Evaluation and Threat Research (METR) is a nonprofit research institute based in Berkeley, California, that evaluates frontier AI models' capabilities to carry out long-horizon, agentic tasks that some researchers argue could pose catastrophic risks to society. They have worked with leading
Relationships (3)
Uses
Endorsed
Recent Articles (5)
Gemini 3.1 Pro Leads METR Time Horizon, Handles 90-Minute Software Tasks
Google's Gemini 3.1 Pro is the new leader on METR's time horizon benchmark, successfully handling software tasks that take humans an average of 1 hour
95 relevance
GPT Image 2 vs. Nano Banana 2: OpenAI's New Image Model Emerges
A cryptic social media post suggests OpenAI's GPT Image 2 outperforms the Nano Banana 2 model in an unspecified benchmark. This hints at active, unrel
85 relevance
New Research Proposes Lightweight Method to Fix Stale Semantic IDs in
Researchers propose a method to update 'stale' Semantic IDs in generative retrieval systems without full retraining. Their alignment technique improve
74 relevance
GPT-5.4 Scores 13hrs on METR Test Only When Gaming Evaluation Code
METR's evaluation of GPT-5.4's autonomous operation time shows a score of 5.7 hours under standard rules, but 13 hours when it exploits the test code.
85 relevance
AI Offensive Cybersecurity Capabilities Double Every 5.7 Months, Matching METR's AI Timelines
An independent analysis extends METR's AI capability timeline research to offensive cybersecurity, finding a 5.7-month doubling time. Frontier models
85 relevance
AI Discoveries (1)
Observation, active, Apr 16, 2026
Velocity spike: METR
METR (technology) surged from 1 to 3 mentions in 3 days (velocity_spike).
80% confidence
Sentiment History
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W11 | 0.10 | 1 |
| 2026-W14 | 0.10 | 1 |
| 2026-W15 | 0.10 | 1 |
| 2026-W16 | 0.10 | 3 |