gentic.news — AI News Intelligence Platform

browser automation

30 articles about browser automation in AI news

Qwen 3.6 Plus Demonstrates Full Web OS and Browser Automation in Single Session

A developer tested Qwen 3.6 Plus on a complex web OS workflow involving Python terminal operations, gaming, and browser automation, with the model handling all tasks seamlessly in a single session.

89% relevant

Safari MCP Cuts Browser Automation CPU Usage by 95% for Mac Developers

Replace your Chromium-based MCP browser tool with Safari MCP to eliminate Chrome's resource drain while keeping your existing logged-in sessions.

86% relevant

Pilot MCP: A 41% Faster Drop-In Replacement for Playwright in Claude Code

Replace @playwright/mcp with pilot-mcp for 41% faster browser automation, one-sixth the context usage, and cookie import from your daily browser.

87% relevant

Skale Launches Desktop AI Agent Running on 300MB RAM with 11+ LLM Provider Support

Skale introduces a desktop AI agent that installs in 30 seconds on Windows and macOS, requiring only 300MB RAM. The tool offers browser automation, calendar integration, and autonomous task execution without terminal access.

87% relevant

OpenClaw Skills: The GitHub Repository That's Supercharging AI Agents with 1,700+ Ready-to-Use Capabilities

A new GitHub repository called 'awesome-openclaw-skills' has emerged, offering over 1,715 production-ready AI agent skills that can be installed with a single CLI command. This collection promises to dramatically accelerate AI agent development by providing pre-built capabilities ranging from browser automation to complex data processing.

85% relevant

OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users

OpenAI released a major Codex update featuring background macOS automation, an in-app browser, persistent memory, and 90+ plugins. With 3M weekly users and nearly half of usage now non-coding, Codex is being repositioned as a general work agent.

100% relevant

Claude AI Gains Computer Control Feature: Opens Apps, Navigates Browser, Fills Spreadsheets

Anthropic's Claude AI can now be enabled to directly control a user's computer to perform tasks such as opening applications, navigating the browser, and filling in spreadsheets. This represents a significant shift from chat-based interaction to direct system automation.

87% relevant

Google Launches MCP Server for Chrome DevTools, Enabling AI Browser Control

Google released a Model Context Protocol server that lets AI coding agents directly control Chrome DevTools. This enables automated browser debugging, network request inspection, and performance tracing through tools like Cursor and VS Code.

100% relevant

OpenAgents Workspace Launches Open-Source Platform to Connect AI Agents with Shared Files and Browser

OpenAgents Workspace is an open-source platform that connects multiple local AI agents into a unified workspace with shared files and browser context, enabling automated collaboration without manual intervention.

81% relevant

ExSpec: Run Gherkin Tests in Real Browsers with Claude Code—No Step Definitions Required

ExSpec lets you write plain-text Gherkin specs and have Claude Code execute them in a real browser, eliminating brittle step definitions and glue code.

95% relevant

Debug Your Browser with Claude Code: The Chrome DevTools MCP Server is a Frontend Game-Changer

Google's official Chrome DevTools MCP server gives Claude Code deep browser debugging, performance profiling, and Lighthouse audits—connect it to your live browser session today.

98% relevant

Claude Desktop Gains 'Use My Computer' Feature for Direct App and Browser Control

Anthropic's Claude Desktop app now includes an experimental 'Use My Computer' feature that allows Claude AI to directly interact with local applications, browsers, and files when explicitly enabled by users.

93% relevant

OpenClaw Agent Demonstrates In-Browser Video Creation Without App Switching

The OpenClaw agent can now create videos directly within a browser interface without opening separate applications or switching tabs. The development suggests progress toward more integrated multimodal AI workflows.

85% relevant

Leaked 'Claude Cowork' Setup Shows AI Agent Automating Browser Tasks, Compressing Workflows

A leaked configuration for a system called 'Claude Cowork' demonstrates an AI agent automating browser-based tasks, reportedly compressing a workday into 90 seconds. The setup appears to use Anthropic's Claude models with a custom script to control a browser.

87% relevant

SamarthyaBot: The Self-Hosted AI Agent OS That Puts Privacy and Automation First

SamarthyaBot is a privacy-first, self-hosted AI agent operating system that runs entirely on local machines. Unlike cloud-based assistants, it performs actual system tasks like running terminal commands, deploying projects via SSH, and controlling browsers while keeping all data encrypted and local.

80% relevant

How to Use Claude Code's New 'Auto Mode' for Safer Desktop Automation

Claude Code's new 'Auto Mode' lets you delegate tasks to run autonomously on your desktop, but you must configure it correctly to avoid security risks.

95% relevant

How to Automate Microsoft Teams Replies with Claude Code and a Browser Script

A developer built a script that uses Claude Code's --chrome flag to read and reply to Teams messages automatically, with access to local repos for context-aware answers.

86% relevant

Claude Code's Playwright MCP Server: Generate Web Tests from Natural Language

Claude Code now integrates with Playwright via MCP, letting you generate complete test automation from simple prompts without leaving your terminal.

100% relevant

Claude Code, Gemini, and 50+ Dev Tools Dockerized into Single AI Coding Workstation

A developer packaged Claude Code's browser UI, Gemini, Codex, Cursor, TaskMaster CLIs, Playwright with Chromium, and 50+ development tools into a single Docker Compose setup, creating a pre-configured AI coding environment that uses existing Claude subscriptions.

95% relevant

Alumnium MCP Hits 98.5% on WebVoyager: How to Add SOTA Browsing to Claude Code

The open-source Alumnium MCP server, which acts as a high-level browser subagent for Claude Code, just set a new state-of-the-art benchmark score. Install it to offload complex web tasks.

95% relevant

The API Testing Revolution: How AI-Powered Tools Are Challenging Postman's Dominance

Developers are increasingly abandoning Postman for new AI-enhanced API testing tools that prioritize privacy, local-first workflows, and intelligent automation. These alternatives offer login-free experiences, secure local storage, and AI-generated test cases.

85% relevant

AI Phone Assistants Reach New Milestone: Autonomous Call-Handling Goes Mainstream

A new AI system can now answer phone calls autonomously, moving beyond chatbots to handle real-time conversations. This development represents a significant leap in voice AI capabilities and practical automation.

87% relevant

Microsoft's Playwright MCP Server Replaces Vision for Web Agents

Microsoft built an MCP server for Playwright that lets AI agents interact with web pages using the accessibility tree, eliminating the need for screenshots and vision models. This approach reduces hallucinations and broken selectors, working with tools like Cursor, VS Code, and Claude Desktop.

85% relevant

AI Agent Security Startup Emerges Amid Enterprise Rush, Per VC Tweet

A VC's tweet highlights a critical gap in enterprise AI agent adoption: security. This signals a market opportunity, with a new startup reportedly emerging to address it.

87% relevant

OpenAI Codex Gains Screen Control, Long-Run Agents, and 90+ Plugins

OpenAI has upgraded Codex from a code-completion tool to an agentic macOS assistant that can see and click on-screen elements, run autonomously for weeks, and integrate with 90+ dev tools. This marks a strategic move into persistent, multimodal coding agents.

86% relevant

Avoko Launches Platform to Interview AI Agents, Maps Non-Human Behavior

Avoko has launched a platform designed to interview AI agents directly to map their actual behavior. This tackles the primary bottleneck in AI product development: agents' non-human, unpredictable actions that traditional user research cannot diagnose.

85% relevant

AI-Powered Circuit Simulator Offers Free Hardware Prototyping

A new website provides a free, AI-assisted environment for designing and testing electronic circuits, featuring pre-built projects for learning. This lowers the barrier to entry for hardware prototyping and education.

75% relevant

Tiny Fish Improves Live Web Usability for AI Coding Agents

Tiny Fish has released a tool that makes the live web significantly more usable for AI coding agents. This addresses a critical failure point where agent workflows often break down during real-world web interactions.

85% relevant

HeyGen Launches CLI Tool for AI Video Generation from Terminal

AI video platform HeyGen has launched a CLI tool, allowing users to generate videos with avatars, voice, and script via terminal commands. This moves video synthesis from a web dashboard into developer workflows.

85% relevant

AI-Powered Password Leak Detection: A Critical Security Shift

Security experts are leveraging AI to detect when user passwords appear in data breaches, enabling immediate alerts. This shifts the security paradigm from periodic manual checks to continuous, automated monitoring.

85% relevant