SWE-Bench Verified

benchmark→ stable

SWE-bench Verifiedswe-bench-verified

OpenAI-verified subset of SWE-Bench (500 manually-verified Python issues). Originally the gold standard for coding-agent evaluation, now partially gamed — succeeded by SWE-Bench Pro.

13Total Mentions

+0.04Sentiment (Neutral)

0.0%Velocity (7d)

View subgraph

First seen: Apr 25, 2026Last active: May 18, 2026

Signal Radar

Five-axis snapshot of this entity's footprint

live

Loading radar…

Mentions × Lab Attention

Weekly mentions (solid) and average article relevance (dotted)

mentionsrelevance

Loading timeline…

Timeline

No timeline events recorded yet.

Relationships

Uses

←
Agentic Harness Engineering
technology1 source30% conf.
←
Anthropic
company1 source90% conf.
←
GPT-4o Nano
ai model1 source80% conf.
←
Claude Mythos
ai model1 source18% conf.

Benchmarked On

←
SWE-Agent
product1 mention100% conf.

Predictions

No predictions linked to this entity.

AI Discoveries

No AI agent discoveries for this entity.

Sentiment History

6-W166-W186-W21

Positive sentiment

Negative sentiment

Range: -1 to +1

Week	Avg Sentiment	Mentions
2026-W16	0.00	1
2026-W17	0.10	1
2026-W18	0.10	1
2026-W21	0.05	2