SWE-Bench Verified

benchmark→ stable

SWE-bench Verifiedswe-bench-verified

OpenAI-verified 500-issue subset of SWE-Bench. Approaching saturation in 2026 - most frontier models clear 80%+.

17Total Mentions

+0.05Sentiment (Neutral)

0.0%Velocity (7d)

View subgraph

First seen: Apr 25, 2026Last active: Jul 13, 2026

Signal Radar

Five-axis snapshot of this entity's footprint

live

Loading radar…

Mentions × Lab Attention

Weekly mentions (solid) and average article relevance (dotted)

mentionsrelevance

Loading timeline…

Timeline

No timeline events recorded yet.

Relationships

Uses

←
Claude Code
product✓ corroborated2 sources73% conf.
←
OpenAI Codex
ai model1 source70% conf.
←
Devin
product1 source70% conf.
←
Harbor
product1 source60% conf.
←
World Model MCP
product1 source30% conf.
←
GPT-4o Nano
ai model1 source13% conf.
←
Agentic Harness Engineering
technology1 source13% conf.
←
Anthropic
company1 source13% conf.

Benchmarked On

←
SWE-Agent
product1 mention100% conf.

Frequently appears with

Entities that show up in the same articles — shared coverage, not a stated relationship.

Predictions

No predictions linked to this entity.

AI Discoveries

observationactive1d ago
Lifecycle: SWE-Bench Verified
SWE-Bench Verified is in 'declining' phase (0 mentions/3d, 1/14d, 17 total)
90% confidence

Sentiment History

6-W266-W29

Positive sentiment

Negative sentiment

Range: -1 to +1

Week	Avg Sentiment	Mentions
2026-W26	0.15	2
2026-W29	0.00	2