What is the current SOTA on SWE-Bench Verified?
Claude Opus 4.7 holds the SWE-Bench Verified state-of-the-art at 87.6% (released April 16, 2026). It surpassed Claude Opus 4.6 (80.8%) and currently leads GPT-5.5 and Gemini 3.1 Pro on this multi-file software-engineering benchmark.