gemini

30 articles about gemini in AI news

Google's AICore Beta Enables On-Device Gemini Nano 4 Downloads for Android Phones

A new beta of Google's AICore system service enables users to download Gemini Nano 4 Full and Gemini Nano 4 Fast models directly onto compatible Android phones, including those with Snapdragon 8 Elite Gen 5 chips. This moves beyond pre-installed AI to user-initiated model management.

85% relevant

Google Gemini Introduces Data Import Feature from Other AI Platforms

Google's Gemini AI now allows users to transfer their chat history, memories, and preferences from other AI assistants, eliminating the need to start fresh when switching platforms. This interoperability feature represents a strategic move to attract users from competing services by reducing migration friction.

78% relevant

Glass AI Coding Editor Expands to Windows, Bundles Claude Opus 4.6, GPT-5.4 & Gemini 3.1 Pro Access

The Glass AI coding editor is now available on Windows, offering developers a single subscription that includes usage of Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro without additional API costs. This expansion significantly broadens its potential user base beyond the Mac ecosystem.

87% relevant

Google Launches Gemini API 'Flex' & 'Turbo' Tiers, Cuts Standard Pricing by 50%

Google has added 'Flex' and 'Turbo' service tiers to its Gemini API, with Flex offering a 50% reduction in cost compared to Standard. This move provides developers with more granular control over cost versus latency for their AI applications.

87% relevant

Claude Code, Gemini, and 50+ Dev Tools Dockerized into Single AI Coding Workstation

A developer packaged Claude Code's browser UI, Gemini, Codex, Cursor, TaskMaster CLIs, Playwright with Chromium, and 50+ development tools into a single Docker Compose setup, creating a pre-configured AI coding environment that uses existing Claude subscriptions.

100% relevant

Google Gemini Launches Manual Memory & Chat Import to Ease Switching from ChatGPT, Claude

Google Gemini is rolling out 'Import Memory' and 'Import Chat History' features for desktop users. The manual tools provide prompts and a .zip upload to transfer data from other AI assistants, aiming to lower the barrier for users to switch from competitors like ChatGPT or Claude.

79% relevant

Apple iOS 27 to Introduce 'Extensions' for Siri, Allowing Users to Link to ChatGPT, Gemini, or Claude

Apple's iOS 27 will reportedly let users choose third-party AI chatbots like Google Gemini or Anthropic Claude to power Siri responses via a new 'Extensions' feature. This follows Apple's confirmed deal with Google to power its overhauled Siri, signaling a major shift from a closed to an open AI assistant ecosystem.

100% relevant

Google Announces Gemini 3.1 Flash Live: A New Real-Time AI Model

Google has announced Gemini 3.1 Flash Live, a new model variant focused on real-time, low-latency AI interactions. The announcement came via a developer tweet, indicating a potential push for faster, more responsive AI applications.

100% relevant

Open-Source Code Editor 'Cline' Integrates Claude Opus, GPT-4, and Gemini Pro via Single API

Developer Hasan Tohar announced 'Cline', an open-source code editor that integrates multiple top-tier AI models through a unified interface. The tool allows switching between Claude Opus, GPT-4, and Gemini Pro without managing separate API keys or subscriptions.

85% relevant

Apple Reportedly Gains Full Internal Access to Google's Gemini for On-Device Model Distillation

A report claims Apple's AI deal with Google includes full internal model access, enabling distillation of Gemini's reasoning into smaller, on-device models. This would allow Apple to build specialized, efficient AI without relying solely on cloud APIs.

95% relevant

Glass AI IDE Emerges, Claims to Offer Free Access to Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro

A new AI-powered coding editor called Glass claims to provide free access to multiple top-tier LLMs, including Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro, without API fees. This positions it as a direct, cost-free competitor to established paid AI IDEs like Cursor and Windsurf.

89% relevant

Google DeepMind Unveils Gemini-Powered Browser That Generates Websites in Real-Time

Google DeepMind has demonstrated a browser prototype powered by Gemini 3.1 Flash-Lite that generates complete HTML/CSS websites dynamically based on user prompts and navigation context, shifting from static page retrieval to on-demand interface generation.

95% relevant

Tessera Launches Open-Source Framework for 32 OWASP AI Security Tests, Benchmarks GPT-4o, Claude, Gemini, Llama 3

Tessera introduces the first open-source framework to run all 32 OWASP AI security tests against any model with one CLI command. It provides benchmark results for GPT-4o, Claude, Gemini, Llama 3, and Mistral across 21 model-specific security tests.

97% relevant

Origin CLI: Open-Source Git Blame for AI Agents Tracks Claude Code, Cursor, and Gemini Contributions

Origin is a new open-source CLI tool that adds AI attribution to git commits, tagging each line with which agent wrote it, the prompt, model, and cost. It works offline with Claude Code, Cursor, and Gemini, storing data in git notes.

86% relevant

Apple Siri Rebuilt as System-Wide AI Agent in iOS 27, Powered by Apple Foundation Models and Google Gemini

Apple is rebuilding Siri into a conversational system-wide AI agent with deep app integration and personal data access, launching in iOS 27. The overhaul includes a standalone app, web browsing, and writing tools, powered by Apple's models and a Google Gemini partnership.

95% relevant

OpenClaw AI Agent Adds Real-Time Vision to Meta Ray-Ban Smart Glasses via Gemini Live API

An open-source project enables Meta Ray-Ban smart glasses to function as a real-time AI assistant. It streams the glasses' camera feed (~1fps) to Gemini Live for visual context, then delegates actions via the OpenClaw agent framework.

85% relevant

ReXInTheWild Benchmark Reveals VLMs Struggle with Medical Photos: Gemini-3 Leads at 78%, MedGemma Trails at 37%

Researchers introduced ReXInTheWild, a benchmark of 955 clinician-verified questions based on 484 real medical photographs. Leading multimodal models show wide performance gaps, with Gemini-3 scoring 78% accuracy while the specialized MedGemma model achieved only 37%.

75% relevant

RedNote's 3B-Parameter Multimodal OCR Model Ranks Second to Gemini 3 Pro on Document Parsing Benchmarks

RedNote has released a 3-billion parameter multimodal OCR model that converts text, charts, diagrams, and tables into structured formats like Markdown and HTML. It reportedly ranks second only to Google's Gemini 3 Pro on OCR benchmarks.

91% relevant

MiniMax M2.7 Achieves 30% Internal Benchmark Gain via Self-Improvement Loops, Ties Gemini 3.1 on MLE Bench Lite

MiniMax had its M2.7 model run 100+ autonomous development cycles—analyzing failures, modifying code, and evaluating changes—resulting in a 30% performance improvement. The model now handles 30-50% of the research workflow and tied Gemini 3.1 in ML competition trials.

95% relevant

Claude Octopus: GitHub Tool Enables Claude Code to Run Gemini and Codex Simultaneously

A developer discovered Claude Octopus, a GitHub repository that allows Anthropic's Claude Code to execute prompts across Google's Gemini and OpenAI's Codex models concurrently. The tool appears to enable parallel code generation from multiple AI assistants.

89% relevant

How to Orchestrate Claude Code with GPT and Gemini Using CLI Calls and Shared Context Files

A developer's system for making Claude Code orchestrate GPT and Gemini via CLI calls, using shared markdown files for persistent context and a session closer agent for compounding knowledge.

95% relevant

Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI Applications

Google has released Gemini Embedding 2, a second-generation multimodal embedding model designed to process text, images, and audio simultaneously. This technical advancement creates more unified AI representations, potentially improving search, recommendation, and personalization systems.

77% relevant

Google's Gemini API Goes Free: A Game-Changer for AI Development and Experimentation

Google has removed rate limits and introduced free access to its Gemini API, enabling developers to experiment with AI prompts in CI/CD pipelines and agent systems without billing concerns. This move democratizes access to advanced language models and encourages innovation.

89% relevant

Google Maps Gets an AI Brain: How Gemini Transforms Navigation from Directions to Dialogue

Google is fundamentally reshaping Maps by integrating its Gemini AI, launching 'Ask Maps' for conversational discovery and 'Immersive Navigation' for a complete visual and data-driven route overhaul. This represents a shift from static maps to intelligent, proactive travel assistants.

99% relevant

Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI

Google has launched Gemini Embedding 2, a second-generation multimodal embedding model. This technical release, alongside the removal of API rate limits, provides developers with a more powerful and accessible tool for building AI applications that understand text, images, and other data types.

99% relevant

Google's Gemini Embedding 2 Unifies All Media Types in Single AI Framework

Google has launched Gemini Embedding 2, its first fully multimodal embedding model that maps text, images, video, audio, and documents into a single shared vector space. The breakthrough supports 100+ languages and flexible vector sizing for optimized performance.

100% relevant

Google's Gemini AI Agents Deploy to Pentagon in Major Defense Tech Shift

Google is providing the Pentagon with Gemini-powered AI agents for over 3 million personnel, automating tasks like meeting summaries and budget building on unclassified networks, with plans to expand to classified systems.

75% relevant

Google's Gemini AI Integrates Deeply Into Workspace, Creating Unified Productivity Ecosystem

Google has integrated its Gemini AI assistant directly into Docs, Sheets, Slides, and Drive, creating a unified AI-powered workflow across its core productivity suite. This move represents a significant step toward seamless AI assistance in everyday work tasks.

85% relevant

AI Models Investigate Prehistoric Mysteries: How GPT-5.4, Claude Opus, and Gemini DeepThink Tackled the Dinosaur Civilization Question

Leading AI models including GPT-5.4 Pro, Claude Opus, and Gemini DeepThink were challenged to investigate whether advanced dinosaur civilizations existed. The experiment reveals how modern AI systems approach complex historical questions with original analysis and data gathering capabilities.

85% relevant

Capgemini Joins OpenAI's Elite Alliance to Bridge the AI Deployment Gap

Capgemini has become a founding partner in OpenAI's Frontier Alliance, a strategic initiative designed to accelerate enterprise AI deployment. The collaboration aims to transform AI experimentation into scalable, real-world business solutions across industries.

75% relevant