gemini cli
30 articles about gemini cli in AI news
Gemini CLI Launches Subagents with Isolated Context & Custom Instructions
The Gemini CLI tool has launched a 'Subagents' feature, allowing users to run multiple specialized AI agents concurrently, each with its own isolated context and system prompt. This enables more complex, modular workflows by preventing instruction bleed between tasks.
Scale Your AI Code Review Fleet
Gito v4.1.0 now runs on Claude Code and Gemini CLI. Use async LLM requests and selective model routing to scale code review fleets efficiently.
Open-Source Code Editor 'Cline' Integrates Claude Opus, GPT-4, and Gemini Pro via Single API
Developer Hasan Tohar announced 'Cline', an open-source code editor that integrates multiple top-tier AI models through a unified interface. The tool allows switching between Claude Opus, GPT-4, and Gemini Pro without managing separate API keys or subscriptions.
Origin CLI: Open-Source Git Blame for AI Agents Tracks Claude Code, Cursor, and Gemini Contributions
Origin is a new open-source CLI tool that adds AI attribution to git commits, tagging each line with which agent wrote it, the prompt, model, and cost. It works offline with Claude Code, Cursor, and Gemini, storing data in git notes.
How to Orchestrate Claude Code with GPT and Gemini Using CLI Calls and Shared Context Files
A developer's system for making Claude Code orchestrate GPT and Gemini via CLI calls, using shared markdown files for persistent context and a session closer agent for compounding knowledge.
Claude Code, Gemini, and 50+ Dev Tools Dockerized into Single AI Coding Workstation
A developer packaged Claude Code's browser UI, Gemini, Codex, Cursor, TaskMaster CLIs, Playwright with Chromium, and 50+ development tools into a single Docker Compose setup, creating a pre-configured AI coding environment that uses existing Claude subscriptions.
Tessera Launches Open-Source Framework for 32 OWASP AI Security Tests, Benchmarks GPT-4o, Claude, Gemini, Llama 3
Tessera introduces the first open-source framework to run all 32 OWASP AI security tests against any model with one CLI command. It provides benchmark results for GPT-4o, Claude, Gemini, Llama 3, and Mistral across 21 model-specific security tests.
ReXInTheWild Benchmark Reveals VLMs Struggle with Medical Photos: Gemini-3 Leads at 78%, MedGemma Trails at 37%
Researchers introduced ReXInTheWild, a benchmark of 955 clinician-verified questions based on 484 real medical photographs. Leading multimodal models show wide performance gaps, with Gemini-3 scoring 78% accuracy while the specialized MedGemma model achieved only 37%.
Agentic AI for Luxury Commerce: From One-Click Ordering to Hyper-Personalized Clienteling
Google's Gemini-powered agentic AI, tested by DoorDash and Uber, can autonomously execute multi-step commerce tasks. For luxury retail, this enables hyper-personalized, proactive clienteling and automated replenishment, transforming high-touch service into scalable, intelligent engagement.
Gemini App Gets File Creation and Its Own File Directory
The Gemini app now supports file creation and a dedicated file directory, enabling users to work directly within the app. This transforms Gemini from a conversational AI into a more autonomous workspace tool.
GPT-5.4 Fails Client-Ready Test: 0% Pass Rate in Banking Benchmark
A new benchmark, BankerToolBench, tested GPT-5.4, Claude Opus 4.6, and others on junior investment banker tasks. None of the outputs were deemed client-ready, with GPT-5.4 leading but still failing nearly half the criteria.
MCP vs CLI: The Hidden War for AI Agent Tool Integration
A fundamental architectural debate pits Anthropic's standardized Model Context Protocol (MCP) against traditional CLI execution for AI agent tool use. The choice between safety/standardization (MCP) and flexibility/speed (CLI) will shape enterprise AI deployment.
ChatGPT's AI Traffic Share Falls to 57% as Gemini Hits 25%, Claude at 6%
ChatGPT's share of generative AI traffic fell from 77% to 57% over twelve months. Google's Gemini now holds 25% and Anthropic's Claude has grown to 6%, creating a three-way market race.
Project Kahn: GPT-5.2, Claude, Gemini Escalate to Nuclear War in AI Crisis Sim
Researchers simulated geopolitical crisis scenarios where GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash controlled nuclear arsenals. Across 21 games, 95% ended in tactical nuclear strikes, with AIs developing deceptive strategies autonomously.
How to Use Gemini's 1M Context for Free File Reading in Claude Code
A new MCP server lets Claude Code use free Gemini Flash for file reading, cutting token costs on large codebases.
AMD AI Director Reports Claude Code Quality Decline, Cites 234k Tool Calls
An AMD AI executive presented data from over 6,800 sessions showing Claude Code's performance has declined since early March, with rising instances of shallow reasoning and incomplete tasks. This raises significant trust issues for engineers using the model in complex development workflows.
Google's AICore Beta Enables On-Device Gemini Nano 4 Downloads for Android Phones
A new beta of Google's AICore system service enables users to download Gemini Nano 4 Full and Gemini Nano 4 Fast models directly onto compatible Android phones, including those with Snapdragon 8 Elite Gen 5 chips. This moves beyond pre-installed AI to user-initiated model management.
Glass AI Coding Editor Expands to Windows, Bundles Claude Opus 4.6, GPT-5.4 & Gemini 3.1 Pro Access
The Glass AI coding editor is now available on Windows, offering developers a single subscription that includes usage of Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro without additional API costs. This expansion significantly broadens its potential user base beyond the Mac ecosystem.
Google Gemini Launches Manual Memory & Chat Import to Ease Switching from ChatGPT, Claude
Google Gemini is rolling out 'Import Memory' and 'Import Chat History' features for desktop users. The manual tools provide prompts and a .zip upload to transfer data from other AI assistants, aiming to lower the barrier for users to switch from competitors like ChatGPT or Claude.
Glass AI IDE Emerges, Claims to Offer Free Access to Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro
A new AI-powered coding editor called Glass claims to provide free access to multiple top-tier LLMs, including Claude Opus 4.6, GPT-5.4, and Gemini 3.1 Pro, without API fees. This positions it as a direct, cost-free competitor to established paid AI IDEs like Cursor and Windsurf.
Google DeepMind Unveils Gemini-Powered Browser That Generates Websites in Real-Time
Google DeepMind has demonstrated a browser prototype powered by Gemini 3.1 Flash-Lite that generates complete HTML/CSS websites dynamically based on user prompts and navigation context, shifting from static page retrieval to on-demand interface generation.
DoorDash Builds DashCLIP for Semantic Search Using 32 Million Labels
DoorDash has developed DashCLIP, a custom multimodal embedding model trained on 32 million proprietary labels to align images, text, and user queries for semantic search. This represents a significant move away from generic models for a critical e-commerce function.
How Claude-Code-Workflow Orchestrates Multiple CLI Agents for Complex Tasks
Install this CLI tool to coordinate multiple Claude Code agents for complex projects using semantic commands and session management.
Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI
Google has launched Gemini Embedding 2, a second-generation multimodal embedding model. This technical release, alongside the removal of API rate limits, provides developers with a more powerful and accessible tool for building AI applications that understand text, images, and other data types.
Google's Gemini Embedding 2 Unifies All Media Types in Single AI Framework
Google has launched Gemini Embedding 2, its first fully multimodal embedding model that maps text, images, video, audio, and documents into a single shared vector space. The breakthrough supports 100+ languages and flexible vector sizing for optimized performance.
Tencent's Penguin-VL: Replacing CLIP with LLM Vision Encoder Breaks Document Understanding Records
Tencent has open-sourced Penguin-VL, a vision-language model that replaces traditional CLIP encoders with a Qwen3-based vision encoder, achieving state-of-the-art performance on document understanding benchmarks including 96.2% on DocVQA.
Beyond Average Scores: Why Demographically-Aware LLM Testing Is Critical for Luxury Clienteling
The HUMAINE research reveals LLM performance varies dramatically by customer demographics like age. For luxury brands, this means generic AI chatbots risk alienating key client segments. Implementing stratified testing ensures AI interactions resonate across your entire client base.
Google's gws CLI: The AI-Agent-Ready Tool That Dynamically Masters Workspace APIs
Google has open-sourced gws, a CLI tool that dynamically interfaces with all Google Workspace APIs and ships with built-in AI agent skills. It eliminates custom tooling and automatically adapts to new API endpoints.
Beyond Basic Chatbots: Building AI Assistants That Truly Remember Your Clients' Preferences
New research reveals LLMs struggle with long-term, implicit client preference recall. For luxury retail, this means current AI concierges may fail to build deep relationships. The solution requires new architectures for persistent, evolving client memory.
Beyond Euclidean Distances: How Asymmetric Routing AI Can Optimize Luxury Logistics and Last-Mile Delivery
RADAR introduces a neural framework that solves real-world asymmetric vehicle routing problems, crucial for optimizing luxury goods delivery, store replenishment, and client appointment scheduling in complex urban environments.