web automation
30 articles about web automation in AI news
Skales AI Agent Runs Locally on 300MB RAM, Enables Desktop Automation Without Terminal
Skales, a new desktop AI agent, runs locally on just 300MB of RAM and enables full automation workflows without terminal interaction. The agent can execute tasks like file management, application control, and web automation through a visual interface.
Qwen 3.6 Plus Demonstrates Full Web OS and Browser Automation in Single Session
A developer tested Qwen 3.6 Plus on a complex web OS workflow involving Python terminal operations, gaming, and browser automation, with the model handling all tasks seamlessly in a single session.
RiskWebWorld: A New Benchmark Exposes the Limits of AI for E-commerce Risk
Researchers introduced RiskWebWorld, a realistic benchmark for testing GUI agents on 1,513 authentic e-commerce risk management tasks. It reveals a major capability gap, showing even the best models fail over 50% of the time, highlighting the immaturity of AI for high-stakes operational automation.
Anthropic Ships Official Claude Code Plugin for Project Automation
Anthropic shipped an official Claude Code plugin for one-click project automation. Most users haven't explored half its capabilities.
Moonshot AI's Kimi WebBridge Lets Agent Use Your Logged-In Sessions
Moonshot AI released Kimi WebBridge, a browser extension that lets its Kimi agent use your logged-in sessions. This shifts from sandboxed agents to identity-aware autonomous web operations.
Microsoft's Playwright MCP Server Replaces Vision for Web Agents
Microsoft built an MCP server for Playwright that lets AI agents interact with web pages using the accessibility tree, eliminating the need for screenshots and vision models. This approach reduces hallucinations and broken selectors, working with tools like Cursor, VS Code, and Claude Desktop.
Claude Code's Playwright MCP Server: Generate Web Tests from Natural Language
Claude Code now integrates with Playwright via MCP, letting you generate complete test automation from simple prompts without leaving your terminal.
Tiny Fish Improves Live Web Usability for AI Coding Agents
Tiny Fish has released a tool that makes the live web significantly more usable for AI coding agents. This addresses a critical failure point where agent workflows often break down during real-world web interactions.
AI Agent 'Business OS' Emerges, Claims Full GUI-Based Business Automation
A developer announced an AI agent that operates a business through a GUI, not just chat. The claim suggests a shift from task-specific AI to full-process automation.
Alumnium MCP Hits 98.5% on WebVoyager: How to Add SOTA Browsing to Claude Code
The open-source Alumnium MCP server, which acts as a high-level browser subagent for Claude Code, just set a new state-of-the-art benchmark score. Install it to offload complex web tasks.
Open-Source 'Manus Alternative' Emerges: Fully Local AI Agent with Web Browsing, Code Execution, and Voice Input
An open-source project has been released that replicates core features of AI agent platforms like Manus—autonomous web browsing, multi-language code execution, and voice input—while running entirely locally on user hardware with no external API dependencies.
AI2's MolmoWeb: Open 8B-Parameter Web Agent Navigates Using Screenshots, Challenges Proprietary Systems
The Allen Institute for AI released MolmoWeb, a fully open web agent that operates websites using only screenshots. The 8B-parameter model outperforms other open models and approaches proprietary performance, with all training data and weights publicly released.
Cloudflare CEO Predicts AI Bot Traffic Will Surpass Human Web Traffic by 2027
Cloudflare CEO Matthew Prince forecasts that automated bot traffic will exceed human web traffic within three years, driven by the proliferation of AI agents. This projection highlights a fundamental shift in internet infrastructure demands.
WebMCP: Turn Any Web Page into a Claude Code Tool with This Chrome Flag
WebMCP lets Claude Code interact directly with web pages via a Chrome extension, turning browsing sessions into structured data sources without scraping.
AI Agents Learn to Plan Like Humans: New Framework Solves Complex Web Tasks
Researchers have developed STRUCTUREDAGENT, a hierarchical planning framework that enables AI web agents to tackle complex, multi-step tasks by using dynamic AND/OR trees and structured memory. The system achieves 46.7% success on challenging shopping tasks, outperforming existing methods.
Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail
Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.
ByteDance Enters the AI Agent Arena: Open-Source 'SuperAgent' Promises Multi-Task Automation
ByteDance has open-sourced a new AI 'SuperAgent' capable of performing complex tasks like research, coding, and content creation. This move signals a major push into the competitive AI agent space, challenging established players by making advanced automation tools publicly available.
Build Durable Jira Automation with MCP + Temporal
Pair MCP for Jira/Confluence tool access with Temporal for durable execution to build agentic workflows that survive crashes, retries, and long-running approvals.
OpenAI Launches ChatGPT Workspace Agents for Team Automation
OpenAI has introduced workspace agents within ChatGPT, powered by Codex, designed to automate complex, multi-step workflows for teams across shared environments like Slack. These agents can gather context, execute tasks, request approvals, and run continuously in the cloud.
Safari MCP Cuts Browser Automation CPU Usage by 95% for Mac Developers
Replace your Chromium-based MCP browser tool with Safari MCP to eliminate Chrome's resource drain while keeping your existing logged-in sessions.
SamarthyaBot: The Self-Hosted AI Agent OS That Puts Privacy and Automation First
SamarthyaBot is a privacy-first, self-hosted AI agent operating system that runs entirely on local machines. Unlike cloud-based assistants, it performs actual system tasks like running terminal commands, deploying projects via SSH, and controlling browsers while keeping all data encrypted and local.
Movable Ink Launches Programmatic CRM With AI Agents for Personalized
Movable Ink launched Programmatic CRM with AI agents on June 18, 2026, automating personalized content creation and customer engagement for brands. The platform leverages real-time data to generate tailored content across email, web, and mobile, reducing manual effort while scaling personalization.
SDAR: Self-Distilled RL Stabilizes Multi-Turn LLM Agents, +9.4% on ALFWorld
SDAR gates self-distillation within GRPO to stabilize multi-turn LLM agent training, yielding +9.4% on ALFWorld and gains on WebShop and Search-QA across Qwen2.5 and Qwen3 models.
Shopify Engineering details 'Flow generation through natural language'
Shopify Engineering describes a 2026 approach to generating complex workflows (flows) from natural language prompts using an agentic modeling framework, enabling non-technical users to create automation.
Manycore Tech Pivots from Real Estate to AI Robotics, Hits $1B Valuation
Manycore Tech Inc., a Chinese software company previously focused on real estate, has raised $150 million to pivot into AI and robotics, achieving a $1 billion valuation. The move is led by an Nvidia alumnus and capitalizes on China's strategic push into automation.
OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users
OpenAI released a major Codex update featuring background macOS automation, an in-app browser, persistent memory, and 90+ plugins. With 3M weekly users and nearly half of usage now non-coding, Codex is being repositioned as a general work agent.
Meta Deploys Unified AI Agents to Manage Hyperscale Infrastructure
Meta's engineering team has built and deployed a system of unified AI agents to autonomously manage capacity and performance across its hyperscale infrastructure. This represents a significant shift from rule-based automation to AI-driven orchestration for one of the world's largest computing fleets.
Perplexity AI Launches 'Personal Computer' for Mac App Orchestration
Perplexity AI has released 'Personal Computer', a feature that integrates with its Mac app to securely orchestrate local files and applications. This move expands its AI assistant from web search to direct desktop interaction.
Anthropic Disables Claude Max for 24/7 Autonomous Agent Workflows
Anthropic has disabled the 'Claude Max' feature that allowed for 24/7 autonomous agent operation, a move affecting developers running persistent coding and automation tasks on the platform.
How Claude Routines Could Automate Your Dev Workflow (And What's Still Missing)
Claude Routines are cloud-based AI automations that run on triggers/schedules. While not directly in Claude Code yet, they hint at future workflow automation possibilities developers should prepare for.