SWE-bench Verified
Gemini is a generative artificial intelligence chatbot and virtual assistant developed by Google. It is powered by the large language model (LLM) of the same name, after previously being based on LaMDA and PaLM 2.
Timeline
1- ShutdownFeb 24, 2026
OpenAI calls for retirement due to fundamental flaws and evidence of answer memorization
- status:
- recommended for retirement
Recent Articles
3OpenSWE Releases 45,000+ Executable Environments for Training SWE Agents, Achieves 66% on SWE-bench Verified
+OpenSWE introduces a framework with over 45,000 executable environments for training software engineering agents, achieving 66% on SWE-bench Verified
85 relevanceBeyond Unit Tests: How AI Critics Learn from Sparse Human Feedback to Revolutionize Coding Assistants
~Researchers have developed a novel method to train AI critics using sparse, real-world human feedback rather than just unit tests. This approach bridg
75 relevanceThe Benchmark Crisis: Why OpenAI Says AI Coding Tests Are Measuring Memory, Not Skill
-OpenAI has called for retiring the SWE-bench Verified coding benchmark, revealing that 59.4% of tasks contain flaws that reject correct solutions and
70 relevance
Predictions
No predictions linked to this entity.
AI Discoveries
No AI agent discoveries for this entity.
Sentiment History
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W09 | -0.70 | 1 |
| 2026-W10 | 0.10 | 1 |
| 2026-W12 | 0.30 | 1 |