Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

web automation

30 articles about web automation in AI news

Skales AI Agent Runs Locally on 300MB RAM, Enables Desktop Automation Without Terminal

Skales, a new desktop AI agent, runs locally on just 300MB of RAM and enables full automation workflows without terminal interaction. The agent can execute tasks like file management, application control, and web automation through a visual interface.

85% relevant

Qwen 3.6 Plus Demonstrates Full Web OS and Browser Automation in Single Session

A developer tested Qwen 3.6 Plus on a complex web OS workflow involving Python terminal operations, gaming, and browser automation, with the model handling all tasks seamlessly in a single session.

89% relevant

RiskWebWorld: A New Benchmark Exposes the Limits of AI for E-commerce Risk

Researchers introduced RiskWebWorld, a realistic benchmark for testing GUI agents on 1,513 authentic e-commerce risk management tasks. It reveals a major capability gap, showing even the best models fail over 50% of the time, highlighting the immaturity of AI for high-stakes operational automation.

92% relevant

Anthropic Ships Official Claude Code Plugin for Project Automation

Anthropic shipped an official Claude Code plugin for one-click project automation. Most users haven't explored half its capabilities.

100% relevant

Moonshot AI's Kimi WebBridge Lets Agent Use Your Logged-In Sessions

Moonshot AI released Kimi WebBridge, a browser extension that lets its Kimi agent use your logged-in sessions. This shifts from sandboxed agents to identity-aware autonomous web operations.

92% relevant

Microsoft's Playwright MCP Server Replaces Vision for Web Agents

Microsoft built an MCP server for Playwright that lets AI agents interact with web pages using the accessibility tree, eliminating the need for screenshots and vision models. This approach reduces hallucinations and broken selectors, working with tools like Cursor, VS Code, and Claude Desktop.

100% relevant

Claude Code's Playwright MCP Server: Generate Web Tests from Natural Language

Claude Code now integrates with Playwright via MCP, letting you generate complete test automation from simple prompts without leaving your terminal.

100% relevant

Tiny Fish Improves Live Web Usability for AI Coding Agents

Tiny Fish has released a tool that makes the live web significantly more usable for AI coding agents. This addresses a critical failure point where agent workflows often break down during real-world web interactions.

85% relevant

AI Agent 'Business OS' Emerges, Claims Full GUI-Based Business Automation

A developer announced an AI agent that operates a business through a GUI, not just chat. The claim suggests a shift from task-specific AI to full-process automation.

89% relevant

Alumnium MCP Hits 98.5% on WebVoyager: How to Add SOTA Browsing to Claude Code

The open-source Alumnium MCP server, which acts as a high-level browser subagent for Claude Code, just set a new state-of-the-art benchmark score. Install it to offload complex web tasks.

95% relevant

Open-Source 'Manus Alternative' Emerges: Fully Local AI Agent with Web Browsing, Code Execution, and Voice Input

An open-source project has been released that replicates core features of AI agent platforms like Manus—autonomous web browsing, multi-language code execution, and voice input—while running entirely locally on user hardware with no external API dependencies.

85% relevant

AI2's MolmoWeb: Open 8B-Parameter Web Agent Navigates Using Screenshots, Challenges Proprietary Systems

The Allen Institute for AI released MolmoWeb, a fully open web agent that operates websites using only screenshots. The 8B-parameter model outperforms other open models and approaches proprietary performance, with all training data and weights publicly released.

100% relevant

Cloudflare CEO Predicts AI Bot Traffic Will Surpass Human Web Traffic by 2027

Cloudflare CEO Matthew Prince forecasts that automated bot traffic will exceed human web traffic within three years, driven by the proliferation of AI agents. This projection highlights a fundamental shift in internet infrastructure demands.

87% relevant

WebMCP: Turn Any Web Page into a Claude Code Tool with This Chrome Flag

WebMCP lets Claude Code interact directly with web pages via a Chrome extension, turning browsing sessions into structured data sources without scraping.

87% relevant

AI Agents Learn to Plan Like Humans: New Framework Solves Complex Web Tasks

Researchers have developed STRUCTUREDAGENT, a hierarchical planning framework that enables AI web agents to tackle complex, multi-step tasks by using dynamic AND/OR trees and structured memory. The system achieves 46.7% success on challenging shopping tasks, outperforming existing methods.

85% relevant

Amazon's AI Agent Incident Highlights Critical Risks of Unsupervised Automation in Retail

Amazon's retail website suffered multiple high-severity outages linked to an engineer acting on inaccurate advice from an AI agent that sourced information from an outdated internal wiki. This incident underscores the operational risks of deploying autonomous AI agents without proper human oversight and data governance in critical retail systems.

95% relevant

ByteDance Enters the AI Agent Arena: Open-Source 'SuperAgent' Promises Multi-Task Automation

ByteDance has open-sourced a new AI 'SuperAgent' capable of performing complex tasks like research, coding, and content creation. This move signals a major push into the competitive AI agent space, challenging established players by making advanced automation tools publicly available.

85% relevant

Build Durable Jira Automation with MCP + Temporal

Pair MCP for Jira/Confluence tool access with Temporal for durable execution to build agentic workflows that survive crashes, retries, and long-running approvals.

92% relevant

OpenAI Launches ChatGPT Workspace Agents for Team Automation

OpenAI has introduced workspace agents within ChatGPT, powered by Codex, designed to automate complex, multi-step workflows for teams across shared environments like Slack. These agents can gather context, execute tasks, request approvals, and run continuously in the cloud.

97% relevant

Safari MCP Cuts Browser Automation CPU Usage by 95% for Mac Developers

Replace your Chromium-based MCP browser tool with Safari MCP to eliminate Chrome's resource drain while keeping your existing logged-in sessions.

86% relevant

SamarthyaBot: The Self-Hosted AI Agent OS That Puts Privacy and Automation First

SamarthyaBot is a privacy-first, self-hosted AI agent operating system that runs entirely on local machines. Unlike cloud-based assistants, it performs actual system tasks like running terminal commands, deploying projects via SSH, and controlling browsers while keeping all data encrypted and local.

80% relevant

Movable Ink Launches Programmatic CRM With AI Agents for Personalized

Movable Ink launched Programmatic CRM with AI agents on June 18, 2026, automating personalized content creation and customer engagement for brands. The platform leverages real-time data to generate tailored content across email, web, and mobile, reducing manual effort while scaling personalization.

98% relevant

SDAR: Self-Distilled RL Stabilizes Multi-Turn LLM Agents, +9.4% on ALFWorld

SDAR gates self-distillation within GRPO to stabilize multi-turn LLM agent training, yielding +9.4% on ALFWorld and gains on WebShop and Search-QA across Qwen2.5 and Qwen3 models.

85% relevant

Shopify Engineering details 'Flow generation through natural language'

Shopify Engineering describes a 2026 approach to generating complex workflows (flows) from natural language prompts using an agentic modeling framework, enabling non-technical users to create automation.

98% relevant

Manycore Tech Pivots from Real Estate to AI Robotics, Hits $1B Valuation

Manycore Tech Inc., a Chinese software company previously focused on real estate, has raised $150 million to pivot into AI and robotics, achieving a $1 billion valuation. The move is led by an Nvidia alumnus and capitalizes on China's strategic push into automation.

70% relevant

OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users

OpenAI released a major Codex update featuring background macOS automation, an in-app browser, persistent memory, and 90+ plugins. With 3M weekly users and nearly half of usage now non-coding, Codex is being repositioned as a general work agent.

100% relevant

Meta Deploys Unified AI Agents to Manage Hyperscale Infrastructure

Meta's engineering team has built and deployed a system of unified AI agents to autonomously manage capacity and performance across its hyperscale infrastructure. This represents a significant shift from rule-based automation to AI-driven orchestration for one of the world's largest computing fleets.

70% relevant

Perplexity AI Launches 'Personal Computer' for Mac App Orchestration

Perplexity AI has released 'Personal Computer', a feature that integrates with its Mac app to securely orchestrate local files and applications. This move expands its AI assistant from web search to direct desktop interaction.

87% relevant

Anthropic Disables Claude Max for 24/7 Autonomous Agent Workflows

Anthropic has disabled the 'Claude Max' feature that allowed for 24/7 autonomous agent operation, a move affecting developers running persistent coding and automation tasks on the platform.

89% relevant

How Claude Routines Could Automate Your Dev Workflow (And What's Still Missing)

Claude Routines are cloud-based AI automations that run on triggers/schedules. While not directly in Claude Code yet, they hint at future workflow automation possibilities developers should prepare for.

100% relevant