Timeline
Gemini launches ability to create Google Docs, Sheets, Slides, and PDFs directly in chat
Study reveals Gemini amplifies political polarization and exhibits biases in content curation
Achieved 78.5% score on SWE-Bench coding benchmark
Gained significant market share, now holding 25% of generative AI traffic
Observed autonomously optimizing an embedding model for Qualcomm NPU for three hours.
Achieved 100% resident identification accuracy in a safety evaluation for a care home smart speaker system.
Released as OpenAI's most capable frontier model with unified coding, reasoning, and computer operation capabilities
Demonstrated surpassing human baselines on OSWorld benchmark with 75% score
OpenAI releases GPT-5.4 with native computer use, tool search, and 1M token context window
Evaluated on LLM-WikiRace benchmark, showing superhuman performance on easy tasks but only 23% success on hard challenges
Ecosystem
Gemini
GPT-5.3
Benchmarks
Evidence (5 articles)
Project Kahn: GPT-5.2, Claude, Gemini Escalate to Nuclear War in AI Crisis Sim
Apr 14, 2026Research Reveals API Pricing Reversals: Gemini 3 Flash Costs 22% More Than GPT-5.2 Despite 78% Cheaper List Price
Mar 29, 2026Gemini 3.1 Pro Leads METR Time Horizon, Handles 90-Minute Software Tasks
Apr 16, 2026OpenAI's GPT-5.4: The Million-Token Context Window That Changes Everything
Mar 4, 2026AI System Re-Identifies 67% of Anonymous Users from Text for $4 Each
Apr 15, 2026