METR
Model Evaluation and Threat Research (METR) is a nonprofit research institute based in Berkeley, California, that evaluates frontier AI models' capabilities to carry out long-horizon, agentic tasks that some researchers argue could pose catastrophic risks to society. They have worked with leading
Timeline
No timeline events recorded yet.
Relationships (3)
Uses
Endorsed
Recent Articles (5)
The Jagged Frontier: What AI Coding Benchmarks Reveal and Conceal
New analysis of AI coding benchmarks like METR shows they capture real ability but miss key 'jagged' limitations. While performance correlates highly
85 relevance

Anthropic and Infosys Forge Strategic Alliance to Deliver Industry-Specific AI Agents
Anthropic has partnered with global IT services giant Infosys to develop custom AI agents for enterprise clients in telecommunications, financial serv
78 relevance

AI Agents Now Design Their Own Training Data: The Breakthrough in Self-Evolving Logic Systems
Researchers have developed SSLogic, an agentic meta-synthesis framework that enables AI systems to autonomously create and refine their own logic reas
75 relevance

Bridging Human Language and Machine Logic: New AI Framework Achieves Near-Perfect Translation Accuracy
Researchers have developed NL2LOGIC, an AI framework that translates natural language into formal logic with 99% syntactic accuracy. By using abstract
70 relevance

AI's Time Horizon Expands: Claude and GPT Push Multi-Hour Task Capabilities
New analysis reveals Claude Opus 4.6 and GPT 5.3 Codex can handle complex tasks requiring hours of human effort. The METR benchmark shows AI systems a
72 relevance
Predictions
No predictions linked to this entity.
AI Discoveries (1)
Observation, active, Feb 17, 2026
Velocity spike: METR
METR (technology) surged from 0 to 4 mentions in 3 days (new_surge).
80% confidence
Sentiment History
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W08 | 0.50 | 4 |
| 2026-W11 | 0.10 | 1 |