gemini
30 articles about gemini in AI news
Google's AICore Beta Enables On-Device Gemini Nano 4 Downloads for Android Phones
A new beta of Google's AICore system service enables users to download Gemini Nano 4 Full and Gemini Nano 4 Fast models directly onto compatible Android phones, including those with Snapdragon 8 Elite Gen 5 chips. This moves beyond pre-installed AI to user-initiated model management.
Google Gemini Introduces Data Import Feature from Other AI Platforms
Google's Gemini AI now allows users to transfer their chat history, memories, and preferences from other AI assistants, eliminating the need to start fresh when switching platforms. This interoperability feature represents a strategic move to attract users from competing services by reducing migration friction.
Glass AI Coding Editor Expands to Windows, Bundles Claude Opus 4.6, GPT-5.4 & Gemini 3.1 Pro Access
The Glass AI coding editor is now available on Windows, offering developers a single subscription that includes usage of Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro without additional API costs. This expansion significantly broadens its potential user base beyond the Mac ecosystem.
Google Launches Gemini API 'Flex' & 'Turbo' Tiers, With Flex Priced 50% Below Standard
Google has added 'Flex' and 'Turbo' service tiers to its Gemini API, with Flex offering a 50% reduction in cost compared to Standard. This move provides developers with more granular control over cost versus latency for their AI applications.
Claude Code, Gemini, and 50+ Dev Tools Dockerized into Single AI Coding Workstation
A developer packaged Claude Code's browser UI, Gemini, Codex, Cursor, TaskMaster CLIs, Playwright with Chromium, and 50+ development tools into a single Docker Compose setup, creating a pre-configured AI coding environment that uses existing Claude subscriptions.
Google Gemini Launches Manual Memory & Chat Import to Ease Switching from ChatGPT, Claude
Google Gemini is rolling out 'Import Memory' and 'Import Chat History' features for desktop users. The manual tools provide prompts and a .zip upload to transfer data from other AI assistants, aiming to lower the barrier for users to switch from competitors like ChatGPT or Claude.
Apple iOS 27 to Introduce 'Extensions' for Siri, Allowing Users to Link to ChatGPT, Gemini, or Claude
Apple's iOS 27 will reportedly let users choose third-party AI chatbots like Google Gemini or Anthropic Claude to power Siri responses via a new 'Extensions' feature. This follows Apple's confirmed deal with Google to power its overhauled Siri, signaling a major shift from a closed to an open AI assistant ecosystem.
Google Announces Gemini 3.1 Flash Live: A New Real-Time AI Model
Google has announced Gemini 3.1 Flash Live, a new model variant focused on real-time, low-latency AI interactions. The announcement came via a developer tweet, indicating a potential push for faster, more responsive AI applications.
Open-Source Code Editor 'Cline' Integrates Claude Opus, GPT-4, and Gemini Pro via Single API
Developer Hasan Tohar announced 'Cline', an open-source code editor that integrates multiple top-tier AI models through a unified interface. The tool allows switching between Claude Opus, GPT-4, and Gemini Pro without managing separate API keys or subscriptions.
Apple Reportedly Gains Full Internal Access to Google's Gemini for On-Device Model Distillation
A report claims Apple's AI deal with Google includes full internal model access, enabling distillation of Gemini's reasoning into smaller, on-device models. This would allow Apple to build specialized, efficient AI without relying solely on cloud APIs.
Glass AI IDE Emerges, Claims to Offer Free Access to Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro
A new AI-powered coding editor called Glass claims to provide free access to multiple top-tier LLMs, including Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro, without API fees. This positions it as a direct, cost-free competitor to established paid AI IDEs like Cursor and Windsurf.
Google DeepMind Unveils Gemini-Powered Browser That Generates Websites in Real-Time
Google DeepMind has demonstrated a browser prototype powered by Gemini 3.1 Flash-Lite that generates complete HTML/CSS websites dynamically based on user prompts and navigation context, shifting from static page retrieval to on-demand interface generation.
Tessera Launches Open-Source Framework for 32 OWASP AI Security Tests, Benchmarks GPT-4o, Claude, Gemini, Llama 3
Tessera introduces the first open-source framework to run all 32 OWASP AI security tests against any model with one CLI command. It provides benchmark results for GPT-4o, Claude, Gemini, Llama 3, and Mistral across 21 model-specific security tests.
Origin CLI: Open-Source Git Blame for AI Agents Tracks Claude Code, Cursor, and Gemini Contributions
Origin is a new open-source CLI tool that adds AI attribution to git commits, tagging each line with which agent wrote it, the prompt, model, and cost. It works offline with Claude Code, Cursor, and Gemini, storing data in git notes.
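As a rough illustration of storing attribution in git notes, the sketch below builds a JSON record and the `git notes` command that would attach it to a commit. It is not Origin's implementation: the notes ref name, the field names, and the per-commit (rather than per-line) granularity are all assumptions made for the example.

```python
import json
import subprocess

# Hypothetical notes ref; the summary does not say which ref Origin uses.
NOTES_REF = "ai-attribution"


def build_note(agent: str, model: str, prompt: str, cost_usd: float) -> str:
    """Serialize one attribution record as JSON (field names are illustrative)."""
    record = {"agent": agent, "model": model, "prompt": prompt, "cost_usd": cost_usd}
    return json.dumps(record, sort_keys=True)


def note_command(commit: str, note: str) -> list[str]:
    """Build the `git notes` invocation that attaches `note` to `commit`."""
    return ["git", "notes", f"--ref={NOTES_REF}", "add", "-f", "-m", note, commit]


def attach_note(commit: str, note: str) -> None:
    """Run the command inside a git repository; raises if git reports an error."""
    subprocess.run(note_command(commit, note), check=True)


if __name__ == "__main__":
    # Placeholder values; nothing here is real attribution data.
    note = build_note("claude-code", "example-model", "Refactor the parser", 0.04)
    print(" ".join(note_command("HEAD", note)))
```

A note written this way can be read back with `git notes --ref=ai-attribution show <commit>`, and because notes live in the repository itself, no network access is needed.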
Apple Siri Rebuilt as System-Wide AI Agent in iOS 27, Powered by Apple Foundation Models and Google Gemini
Apple is rebuilding Siri into a conversational system-wide AI agent with deep app integration and personal data access, launching in iOS 27. The overhaul includes a standalone app, web browsing, and writing tools, powered by Apple's models and a Google Gemini partnership.
OpenClaw AI Agent Adds Real-Time Vision to Meta Ray-Ban Smart Glasses via Gemini Live API
An open-source project enables Meta Ray-Ban smart glasses to function as a real-time AI assistant. It streams the glasses' camera feed (~1fps) to Gemini Live for visual context, then delegates actions via the OpenClaw agent framework.
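The roughly one-frame-per-second streaming described above can be pictured as a simple throttle over a faster camera feed. This is a minimal sketch under stated assumptions: the `Frame` tuple, the `throttle_frames` helper, and the simulated feed are hypothetical, and the project's actual capture and upload code is not shown in the summary.

```python
from typing import Iterable, Iterator, Tuple

# Hypothetical frame shape: (capture time in seconds, encoded image bytes).
Frame = Tuple[float, bytes]


def throttle_frames(frames: Iterable[Frame], min_interval: float = 1.0) -> Iterator[Frame]:
    """Yield a frame only if at least `min_interval` seconds have passed
    since the previously yielded frame."""
    last_sent = None
    for timestamp, image in frames:
        if last_sent is None or timestamp - last_sent >= min_interval:
            last_sent = timestamp
            yield timestamp, image


if __name__ == "__main__":
    # Simulated 4 fps camera feed covering three seconds.
    feed = [(i * 0.25, b"jpeg-bytes") for i in range(12)]
    kept = [t for t, _ in throttle_frames(feed)]
    print(kept)
```

Dropping frames on the device side keeps bandwidth and per-frame model cost bounded, at the price of missing anything that happens between samples.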
ReXInTheWild Benchmark Reveals VLMs Struggle with Medical Photos: Gemini-3 Leads at 78%, MedGemma Trails at 37%
Researchers introduced ReXInTheWild, a benchmark of 955 clinician-verified questions based on 484 real medical photographs. Leading multimodal models show wide performance gaps, with Gemini-3 scoring 78% accuracy while the specialized MedGemma model achieved only 37%.
RedNote's 3B-Parameter Multimodal OCR Model Ranks Second to Gemini 3 Pro on Document Parsing Benchmarks
RedNote has released a 3-billion parameter multimodal OCR model that converts text, charts, diagrams, and tables into structured formats like Markdown and HTML. It reportedly ranks second only to Google's Gemini 3 Pro on OCR benchmarks.
MiniMax M2.7 Achieves 30% Internal Benchmark Gain via Self-Improvement Loops, Ties Gemini 3.1 on MLE Bench Lite
MiniMax had its M2.7 model run 100+ autonomous development cycles (analyzing failures, modifying code, and evaluating changes), resulting in a 30% gain on internal benchmarks. The model now handles 30-50% of the research workflow and tied Gemini 3.1 on MLE Bench Lite.
Claude Octopus: GitHub Tool Enables Claude Code to Run Gemini and Codex Simultaneously
A developer discovered Claude Octopus, a GitHub repository that allows Anthropic's Claude Code to execute prompts across Google's Gemini and OpenAI's Codex models concurrently. The tool appears to enable parallel code generation from multiple AI assistants.
How to Orchestrate Claude Code with GPT and Gemini Using CLI Calls and Shared Context Files
A developer describes a system in which Claude Code orchestrates GPT and Gemini via CLI calls, using shared markdown files for persistent context and a session-closer agent that compounds knowledge across sessions.
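The shared-context-file pattern might look something like the sketch below: each delegated call receives the accumulated markdown as a prompt prefix, and its reply is appended for later calls. The function names, the markdown layout, and the `run_cli` callable are assumptions for illustration; the developer's actual scripts and CLI invocations are not given in the summary.

```python
import tempfile
from pathlib import Path
from typing import Callable


def append_context(path: Path, agent: str, note: str) -> None:
    """Append one agent's reply to the shared markdown file, creating it if needed."""
    with path.open("a", encoding="utf-8") as f:
        f.write(f"## {agent}\n{note.strip()}\n\n")


def build_prompt(path: Path, task: str) -> str:
    """Prefix the task with everything recorded so far."""
    context = path.read_text(encoding="utf-8") if path.exists() else ""
    return f"{context}# Task\n{task}\n"


def delegate(path: Path, agent: str, task: str, run_cli: Callable[[str], str]) -> str:
    """Send the prompt through `run_cli` and record the reply for later calls."""
    reply = run_cli(build_prompt(path, task))
    append_context(path, agent, reply)
    return reply


if __name__ == "__main__":
    shared = Path(tempfile.mkdtemp()) / "context.md"
    # Stand-in for a real model CLI; in practice this would wrap subprocess.run.
    fake_cli = lambda prompt: f"(reply to {len(prompt)} characters of prompt)"
    delegate(shared, "gemini", "Review the API design", fake_cli)
    delegate(shared, "gpt", "Write tests for the API", fake_cli)
    print(shared.read_text(encoding="utf-8"))
```

Passing the CLI wrapper in as a callable keeps the file handling testable without any model installed. A session-closer step could reuse `append_context` to write a summary at the end of each run.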
Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI Applications
Google has released Gemini Embedding 2, a second-generation multimodal embedding model designed to process text, images, and audio simultaneously. This technical advancement creates more unified AI representations, potentially improving search, recommendation, and personalization systems.
Google's Gemini API Goes Free: A Game-Changer for AI Development and Experimentation
Google has removed rate limits and introduced free access to its Gemini API, enabling developers to experiment with AI prompts in CI/CD pipelines and agent systems without billing concerns. This move democratizes access to advanced language models and encourages innovation.
Google Maps Gets an AI Brain: How Gemini Transforms Navigation from Directions to Dialogue
Google is fundamentally reshaping Maps by integrating its Gemini AI, launching 'Ask Maps' for conversational discovery and 'Immersive Navigation' for a complete visual and data-driven route overhaul. This represents a shift from static maps to intelligent, proactive travel assistants.
Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI
Google has launched Gemini Embedding 2, a second-generation multimodal embedding model. This technical release, alongside the removal of API rate limits, provides developers with a more powerful and accessible tool for building AI applications that understand text, images, and other data types.
Google's Gemini Embedding 2 Unifies All Media Types in Single AI Framework
Google has launched Gemini Embedding 2, its first fully multimodal embedding model that maps text, images, video, audio, and documents into a single shared vector space. The breakthrough supports 100+ languages and flexible vector sizing for optimized performance.
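To make "a single shared vector space" and "flexible vector sizing" concrete, the sketch below compares toy vectors standing in for a text query, an image, and an audio clip, first at full size and then truncated. The vectors are invented for illustration, and the assumption that resizing works by keeping a prefix and renormalizing (as in Matryoshka-style embeddings) is not confirmed by the summary. No Gemini API call is made.

```python
import math
from typing import List, Sequence


def normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def resize(vec: Sequence[float], dims: int) -> List[float]:
    """Keep the first `dims` components and renormalize (assumed sizing scheme)."""
    if not 0 < dims <= len(vec):
        raise ValueError("dims must be between 1 and len(vec)")
    return normalize(vec[:dims])


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum(x * y for x, y in zip(normalize(a), normalize(b)))


if __name__ == "__main__":
    # Toy embeddings, not model output.
    text_query = [0.9, 0.1, 0.3, 0.2]
    image_item = [0.8, 0.2, 0.4, 0.1]
    audio_item = [0.1, 0.9, 0.2, 0.7]
    for dims in (4, 2):
        q = resize(text_query, dims)
        print(dims, cosine(q, resize(image_item, dims)), cosine(q, resize(audio_item, dims)))
```

Because every media type lands in the same space, one similarity function ranks items of any type against a query. Smaller vectors trade some accuracy for lower storage and faster search.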
Google's Gemini AI Agents Deploy to Pentagon in Major Defense Tech Shift
Google is providing the Pentagon with Gemini-powered AI agents for over 3 million personnel, automating tasks like meeting summaries and budget building on unclassified networks, with plans to expand to classified systems.
Google's Gemini AI Integrates Deeply Into Workspace, Creating Unified Productivity Ecosystem
Google has integrated its Gemini AI assistant directly into Docs, Sheets, Slides, and Drive, creating a unified AI-powered workflow across its core productivity suite. This move represents a significant step toward seamless AI assistance in everyday work tasks.
AI Models Investigate Prehistoric Mysteries: How GPT-5.4, Claude Opus, and Gemini DeepThink Tackled the Dinosaur Civilization Question
Leading AI models including GPT-5.4 Pro, Claude Opus, and Gemini DeepThink were challenged to investigate whether advanced dinosaur civilizations existed. The experiment reveals how modern AI systems approach complex historical questions with original analysis and data gathering capabilities.
Capgemini Joins OpenAI's Elite Alliance to Bridge the AI Deployment Gap
Capgemini has become a founding partner in OpenAI's Frontier Alliance, a strategic initiative designed to accelerate enterprise AI deployment. The collaboration aims to transform AI experimentation into scalable, real-world business solutions across industries.