cli
30 articles about cli in AI news
MLLM Raters Show Central Tendency Bias in Clinical Scoring
Study finds GPT-5 and other MLLMs show central tendency bias in clinical scoring, compressing predictions toward scale midpoint despite prompt modifications.
Google DeepMind Launches Real-Time Video AI Co-Clinician
Google DeepMind launched AI Co-Clinician, a real-time video analysis system for triadic care, claiming 30% fewer diagnostic errors in early tests.
GPT-5.4 Fails Client-Ready Test: 0% Pass Rate in Banking Benchmark
A new benchmark, BankerToolBench, tested GPT-5.4, Claude Opus 4.6, and others on junior investment banker tasks. None of the outputs were deemed client-ready, with GPT-5.4 leading but still failing nearly half the criteria.
Apple Releases DFNDR-12M Dataset, Claims 5x CLIP Training Efficiency
Apple has open-sourced DFNDR-12M, a multimodal dataset of 12.8 million image-text pairs with synthetic captions and pre-computed embeddings. The company claims it enables up to 5x training efficiency over standard CLIP datasets.
MIT, Harvard Studies Link AI Use to Declining Critical Thinking in Youth
Research from MIT and Harvard indicates that AI usage is correlated with a significant decline in critical thinking and creativity scores among 17–25 year olds, with 67% of students acknowledging the negative impact.
MCP vs CLI: The Hidden War for AI Agent Tool Integration
A fundamental architectural debate pits Anthropic's standardized Model Context Protocol (MCP) against traditional CLI execution for AI agent tool use. The choice between safety/standardization (MCP) and flexibility/speed (CLI) will shape enterprise AI deployment.
Gemini CLI Launches Subagents with Isolated Context & Custom Instructions
The Gemini CLI tool has launched a 'Subagents' feature, allowing users to run multiple specialized AI agents concurrently, each with its own isolated context and system prompt. This enables more complex, modular workflows by preventing instruction bleed between tasks.
Kering Reports Q1 2026 Revenue Decline as Gucci Sales Fall 14%
Luxury group Kering reported a 6% year-on-year revenue decline to €3.5bn in Q1 2026. The drop was driven by a 14% fall in Gucci sales, with declines in Asia-Pacific and Western Europe offsetting North American growth. CEO Luca de Meo called it a 'first step in our recovery' as a comprehensive brand reset continues.
HeyGen Launches CLI Tool for AI Video Generation from Terminal
AI video platform HeyGen has launched a CLI tool, allowing users to generate videos with avatars, voice, and script via terminal commands. This moves video synthesis from a web dashboard into developer workflows.
MiniMax Open-Sources Three Agent Music Skills for MMX-CLI
MiniMax has open-sourced three 'Music Skills' for its MMX-CLI agent platform. The skills allow AI agents to generate music, sing in a persona, and curate playlists from a user's local library.
AMD AI Director Reports Claude Code Quality Decline, Cites 234k Tool Calls
An AMD AI executive presented data from over 6,800 sessions showing Claude Code's performance has declined since early March, with rising instances of shallow reasoning and incomplete tasks. This raises significant trust issues for engineers using the model in complex development workflows.
MiniMax Launches MMX-CLI, First Infrastructure Built for AI Agents
MiniMax released MMX-CLI, a CLI built for AI agents, not humans. It provides agents with seven multimodal 'senses' and native integration with popular AI coding environments.
PetClaw Launches One-Click Desktop AI Agent, Aims to Fix OpenClaw Setup Woes
A new tool called PetClaw promises a fully functional AI desktop agent in under 60 seconds with one click, no API keys, and no terminal configuration. This directly targets the primary user complaint about its powerful but notoriously difficult-to-setup predecessor, OpenClaw.
Kerf-CLI: The SQLite-Powered Cost Dashboard Every Claude Code User Needs
Install Kerf-CLI to track Claude Code spending, enforce budgets, and identify wasted Opus spend with a local SQLite database and polished dashboard.
FDA-Designated AI 'Vox' Detects Heart Failure from 5-Second Voice Clip
An AI tool named Vox can detect signs of worsening heart failure from a 5-second patient voice clip. It's trained on >3M voice samples and backed by five clinical trials, targeting a condition affecting 64M people globally.
Simon Willison's 'scan-for-secrets' CLI Tool Detects API Keys in Logs
Simon Willison built 'scan-for-secrets', a Python CLI tool for scanning log files for accidentally exposed API keys. It's a lightweight utility for developers to sanitize data before sharing.
Inner Ear Gene Therapy Injection Reverses Deafness in All 10 Patients in Clinical Trial
A clinical trial has reported that a single injection of gene therapy into the inner ear successfully reversed deafness in all ten participating patients. This marks a significant threshold in treating genetic hearing loss, with some patients regaining hearing within weeks.
DISCO-TAB: Hierarchical RL Framework Boosts Clinical Data Synthesis by 38.2%, Achieves JSD < 0.01
Researchers propose DISCO-TAB, a reinforcement learning framework that guides a fine-tuned LLM with multi-granular feedback to generate synthetic clinical data. It improves downstream classifier utility by up to 38.2% versus GAN/diffusion baselines and achieves near-perfect statistical fidelity (JSD < 0.01).
Open-Source 'Codex CLI' Emerges as Free Alternative to OpenAI's Tools, Claims 30-Agent Architecture
An open-source project called 'Codex CLI' has been released, offering a free command-line interface that its creators claim outperforms OpenAI's offerings by coordinating 30 specialized AI agents for coding tasks.
TPC-CMA Framework Reduces CLIP Modality Gap by 82.3%, Boosts Captioning CIDEr by 57.1%
Researchers propose TPC-CMA, a three-phase fine-tuning curriculum that reduces the modality gap in CLIP-like models by 82.3%, improving clustering ARI from 0.318 to 0.516 and captioning CIDEr by 57.1%.
pixcli: The First MCP Server for Brazil's Pix Payments (Install It Now)
A new Rust CLI with built-in MCP server lets Claude Code agents create Pix charges, check payments, and manage webhooks—automating Brazilian payment workflows.
Lorg CLI: The New Way to Give Claude Code a Permanent Memory
Lorg's new CLI tool lets Claude Code autonomously archive what it learns between sessions, eliminating knowledge loss with a simple npm install.
The Cognitive Divergence: AI Context Windows Expand as Human Attention Declines, Creating a Delegation Feedback Loop
A new arXiv paper documents the exponential growth of AI context windows (512 tokens in 2017 to 2M in 2026) alongside a measured decline in human sustained-attention capacity. It introduces the 'Delegation Feedback Loop' hypothesis, where easier AI delegation may further erode human cognitive practice. This is a foundational study on human-AI interaction dynamics.
Claude Code's /mcp Computer Use: Test Your Local Apps Directly from the CLI
Claude Code can now open your apps, click through UIs, and test builds via a new /mcp computer-use command, turning it into a hands-on testing agent.
Claude Code 2.5: New CLI, Dashboard, and Best Practices for Web Devs
Anthropic's latest Claude Code update adds a CLI, usage dashboard, and web-focused best practices. Here's how to use them.
OpenClaw Creator Peter Steinberger Declined OpenAI Acquisition Offer, Citing Vision Alignment
Peter Steinberger, creator of the ClawdBot/OpenClaw robotics project, revealed on the Lex Fridman Podcast that he declined an acquisition offer from OpenAI. He cited a misalignment in vision for the project's future as the primary reason.
Requestly Launches Git-Synced API Client to Replace Scattered Postman Setups
Requestly has launched an AI-powered API client that automatically syncs team collections through Git, eliminating stale docs and configuration drift. The tool directly targets the collaboration pain points of Postman and Insomnia users.
Epismo CLI: Save and Reuse Your Claude Code Workflows Like GitHub Repositories
Epismo CLI lets you capture, version, and share multi-step Claude Code workflows as markdown chains, solving the 'how did I get here?' problem.
Insanely Fast Whisper CLI Transcribes 2.5 Hours of Audio in 98 Seconds with Flash Attention 2
A new open-source CLI tool called Insanely Fast Whisper achieves 19x speedup over standard Whisper large-v3, transcribing 150 minutes of audio in 98 seconds using Flash Attention 2 and batching with no quality loss.
Track Claude Code's Peak Hours with PromoClock's API and CLI Integration
A new free tool provides a live API and CLI snippet to show if you're in a peak usage window, helping you manage your Claude Code session limits more effectively.