browser automation

30 articles about browser automation in AI news

Qwen 3.6 Plus Demonstrates Full Web OS and Browser Automation in Single Session

A developer tested Qwen 3.6 Plus on a complex web OS workflow involving Python terminal operations, gaming, and browser automation, with the model handling all tasks seamlessly in a single session.

Apr 3, 2026 · 89% relevant

Safari MCP Cuts Browser Automation CPU Usage by 95% for Mac Developers

Replace your Chromium-based MCP browser tool with Safari MCP to eliminate Chrome's resource drain while keeping your existing logged-in sessions.

Mar 28, 2026 · 86% relevant

Pilot MCP: A 41% Faster Drop-In Replacement for Playwright in Claude Code

Replace @playwright/mcp with pilot-mcp for 41% faster browser automation, 6x less context usage, and cookie import from your daily browser.

Mar 28, 2026 · 87% relevant

Skale Launches Desktop AI Agent Running on 300MB RAM with 11+ LLM Provider Support

Skale introduces a desktop AI agent that installs in 30 seconds on Windows and macOS, requiring only 300MB RAM. The tool offers browser automation, calendar integration, and autonomous task execution without terminal access.

Mar 20, 2026 · 87% relevant

OpenClaw Skills: The GitHub Repository That's Supercharging AI Agents with 1,700+ Ready-to-Use Capabilities

A new GitHub repository called 'awesome-openclaw-skills' has emerged, offering over 1,715 production-ready AI agent skills that can be installed with a single CLI command. This collection promises to dramatically accelerate AI agent development by providing pre-built capabilities ranging from browser automation to complex data processing.

Feb 26, 2026 · 85% relevant

OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users

OpenAI released a major Codex update featuring background macOS automation, an in-app browser, persistent memory, and 90+ plugins. With 3M weekly users and nearly half of usage now non-coding, Codex is being repositioned as a general work agent.

Apr 16, 2026 · 100% relevant

Claude AI Gains Computer Control Feature: Opens Apps, Navigates Browser, Fills Spreadsheets

Anthropic's Claude AI can now be enabled to directly control a user's computer to perform tasks like opening applications, browser navigation, and spreadsheet work. This represents a significant shift from chat-based interaction to direct system automation.

Mar 23, 2026 · 87% relevant

Browser Bridge MCP: Drive Your Real Logged-In Chrome from Claude Code

Browser Bridge MCP gives Claude Code control of your real Chrome session with 63 tools. Install in 60 seconds, then automate authenticated browsing, network capture, and security testing.

Jul 24, 2026 · 75% relevant

Browser-use open-sources Claude-powered video editor

Browser-use open-sourced a video editor inside Claude Computer Use, replacing manual editing with natural language commands and challenging Adobe Premiere.

Jul 2, 2026 · 77% relevant

GPT-5.5 + Codex Combines App Building, Browser Use, Image Gen

@intheworldofai claims GPT-5.5 + Codex is a super app better than Claude Code, with 7 capabilities including app building, debugging, browser use, and image generation.

Apr 30, 2026 · 100% relevant

Google Launches MCP Server for Chrome DevTools, Enabling AI Browser Control

Google released a Model Context Protocol server that lets AI coding agents directly control Chrome DevTools. This enables automated browser debugging, network request inspection, and performance tracing through tools like Cursor and VS Code.

Apr 11, 2026 · 100% relevant

OpenAgents Workspace Launches Open-Source Platform to Connect AI Agents with Shared Files and Browser

OpenAgents Workspace is an open-source platform that connects multiple local AI agents into a unified workspace with shared files and browser context, enabling automated collaboration without manual intervention.

Apr 3, 2026 · 81% relevant

ExSpec: Run Gherkin Tests in Real Browsers with Claude Code—No Step Definitions Required

ExSpec lets you write plain-text Gherkin specs and have Claude Code execute them in a real browser, eliminating brittle step definitions and glue code.

Mar 27, 2026 · 95% relevant

Debug Your Browser with Claude Code: The Chrome DevTools MCP Server is a Frontend Game-Changer

Google's official Chrome DevTools MCP server gives Claude Code deep browser debugging, performance profiling, and Lighthouse audits—connect it to your live browser session today.

Mar 24, 2026 · 98% relevant

Claude Desktop Gains 'Use My Computer' Feature for Direct App and Browser Control

Anthropic's Claude Desktop app now includes an experimental 'Use My Computer' feature that allows Claude AI to directly interact with local applications, browsers, and files when explicitly enabled by users.

Mar 24, 2026 · 93% relevant

SamarthyaBot: The Self-Hosted AI Agent OS That Puts Privacy and Automation First

SamarthyaBot is a privacy-first, self-hosted AI agent operating system that runs entirely on local machines. Unlike cloud-based assistants, it performs actual system tasks like running terminal commands, deploying projects via SSH, and controlling browsers while keeping all data encrypted and local.

Mar 5, 2026 · 80% relevant

How to Use Claude Code's New 'Auto Mode' for Safer Desktop Automation

Claude Code's new 'Auto Mode' lets you delegate tasks to run autonomously on your desktop, but you must configure it correctly to avoid security risks.

Mar 28, 2026 · 95% relevant

How to Automate Microsoft Teams Replies with Claude Code and a Browser Script

A developer built a script that uses Claude Code's --chrome flag to read and reply to Teams messages automatically, with access to local repos for context-aware answers.

Mar 20, 2026 · 86% relevant

Moonshot AI's Kimi WebBridge Lets Agent Use Your Logged-In Sessions

Moonshot AI released Kimi WebBridge, a browser extension that lets its Kimi agent use your logged-in sessions. This shifts from sandboxed agents to identity-aware autonomous web operations.

May 20, 2026 · 92% relevant

Claude Code's Playwright MCP Server: Generate Web Tests from Natural Language

Claude Code now integrates with Playwright via MCP, letting you generate complete test automation from simple prompts without leaving your terminal.

Apr 18, 2026 · 100% relevant

Claude Code, Gemini, and 50+ Dev Tools Dockerized into Single AI Coding Workstation

A developer packaged Claude Code's browser UI, Gemini, Codex, Cursor, TaskMaster CLIs, Playwright with Chromium, and 50+ development tools into a single Docker Compose setup, creating a pre-configured AI coding environment that uses existing Claude subscriptions.

Mar 29, 2026 · 95% relevant

Alumnium MCP Hits 98.5% on WebVoyager: How to Add SOTA Browsing to Claude Code

The open-source Alumnium MCP server, which acts as a high-level browser subagent for Claude Code, just set a new state-of-the-art benchmark score. Install it to offload complex web tasks.

Mar 27, 2026 · 95% relevant

The API Testing Revolution: How AI-Powered Tools Are Challenging Postman's Dominance

Developers are increasingly abandoning Postman for new AI-enhanced API testing tools that prioritize privacy, local-first workflows, and intelligent automation. These alternatives offer login-free experiences, secure local storage, and AI-generated test cases.

Feb 26, 2026 · 85% relevant

AI Phone Assistants Reach New Milestone: Autonomous Call-Handling Goes Mainstream

A new AI system can now answer phone calls autonomously, moving beyond chatbots to handle real-time conversations. This development represents a significant leap in voice AI capabilities and practical automation.

Mar 11, 2026 · 87% relevant

Why Claude Code's 80.8% SWE-Bench Score and 1M Context Window Beat Codex

Claude Code's 80.8% SWE-Bench score, 1M token context, and local execution make it the top choice for senior devs. Run `claude` in your terminal for complex codebase work.

Jul 12, 2026 · 85% relevant

Gemini 3.5 Flash Scores 78.4 on OSWorld, Matching GPT-5.5

Google integrated Computer Use into Gemini 3.5 Flash, scoring 78.4 on OSWorld — matching GPT-5.5 and undercutting on cost.

Jun 25, 2026 · 100% relevant

SDAR: Self-Distilled RL Stabilizes Multi-Turn LLM Agents, +9.4% on ALFWorld

SDAR gates self-distillation within GRPO to stabilize multi-turn LLM agent training, yielding +9.4% on ALFWorld and gains on WebShop and Search-QA across Qwen2.5 and Qwen3 models.

May 15, 2026 · 85% relevant

Codex Update Cuts GUI Workflow Latency 42%

Codex app update cuts GUI workflow latency 42%, enabling near-human-speed interface operation for autonomous app building and debugging.

May 1, 2026 · 84% relevant

Microsoft's Playwright MCP Server Replaces Vision for Web Agents

Microsoft built an MCP server for Playwright that lets AI agents interact with web pages using the accessibility tree, eliminating the need for screenshots and vision models. This approach reduces hallucinations and broken selectors, working with tools like Cursor, VS Code, and Claude Desktop.

Apr 28, 2026 · 100% relevant

AI Agent Security Startup Emerges Amid Enterprise Rush, Per VC Tweet

A VC's tweet highlights a critical gap in enterprise AI agent adoption: security. This signals a market opportunity, with a new startup reportedly emerging to address it.

Apr 20, 2026 · 87% relevant
