web

30 articles about web in AI news

Moonshot AI's Kimi WebBridge Lets Agent Use Your Logged-In Sessions

Moonshot AI released Kimi WebBridge, a browser extension that lets its Kimi agent use your logged-in sessions. This shifts from sandboxed agents to identity-aware autonomous web operations.

May 20, 202692% relevant

Gemini 3.5 Flash Generates Full Web OS in One Shot

Gemini 3.5 Flash generated a full web OS from one prompt in a single HTML file, showcasing one-shot generation of complex UI.

May 18, 202685% relevant

Claude Security Public Beta Launches in Claude Code on Web

Anthropic launched Claude Security in public beta for Claude Code on web, letting developers validate and fix vulnerabilities without leaving the editor.

Apr 30, 2026100% relevant

Microsoft's Playwright MCP Server Replaces Vision for Web Agents

Microsoft built an MCP server for Playwright that lets AI agents interact with web pages using the accessibility tree, eliminating the need for screenshots and vision models. This approach reduces hallucinations and broken selectors, working with tools like Cursor, VS Code, and Claude Desktop.

Apr 28, 2026100% relevant

Webcam Head-Tracking Wallpaper Uses AI for Parallax Effect

A developer built a dynamic wallpaper that tracks a user's head via webcam to shift the background perspective in real-time. It demonstrates a novel, accessible application of computer vision for interactive desktop environments.

Apr 18, 202675% relevant

WebAI's Open-Source Model Hits #1 on MTEB Retrieval Leaderboard

WebAI has open-sourced a document retrieval model that currently holds the #1 position on the Massive Text Embedding Benchmark (MTEB) leaderboard. This provides a high-performance, free alternative to closed-source embedding APIs used in Retrieval-Augmented Generation (RAG) pipelines.

Apr 17, 202687% relevant

Open-Source FaceSwap Tool Enables Real-Time Webcam Swaps

Developer Gurisingh has released a free, open-source tool for real-time face-swapping on webcams. It works with live video calls and requires only a single source photo.

Apr 17, 202685% relevant

RiskWebWorld: A New Benchmark Exposes the Limits of AI for E-commerce Risk

Researchers introduced RiskWebWorld, a realistic benchmark for testing GUI agents on 1,513 authentic e-commerce risk management tasks. It reveals a major capability gap, showing even the best models fail over 50% of the time, highlighting the immaturity of AI for high-stakes operational automation.

Apr 17, 202692% relevant

Tiny Fish Improves Live Web Usability for AI Coding Agents

Tiny Fish has released a tool that makes the live web significantly more usable for AI coding agents. This addresses a critical failure point where agent workflows often break down during real-world web interactions.

Apr 14, 202685% relevant

ClawIDE: A Web-Based IDE for Managing Multiple Claude Code Sessions

ClawIDE is a free, open-source web IDE that enables developers to manage multiple concurrent Claude Code sessions, addressing a core limitation of the terminal-based workflow.

Apr 10, 2026100% relevant

Keygraph Launches Shannon AI to Automate Web App Security Testing

Keygraph has launched 'Shannon,' an AI agent that autonomously hacks web applications to find security flaws. This positions AI as an offensive security tool for proactive defense.

Apr 7, 202687% relevant

Google DeepMind: Web Environment, Not Model Weights, Is Key AI Agent Attack Surface

Google DeepMind researchers present a systematic framework showing that the web environment itself—not just the model—is a primary attack surface for AI agents. In benchmarks, hidden prompt injections hijacked agents in up to 86% of scenarios, with memory poisoning attacks exceeding 80% success.

Apr 6, 202697% relevant

OpenSCAD Web: Open-Source Text-to-CAD Tool Runs Fully In-Browser via WebAssembly

A developer has released an open-source text-to-CAD tool that runs entirely in a web browser using WebAssembly. Users describe a 3D object in plain English, optionally upload a reference image, and receive a parametric model with adjustable dimensions that exports directly to 3D printer formats.

Apr 4, 202685% relevant

Qwen 3.6 Plus Demonstrates Full Web OS and Browser Automation in Single Session

A developer tested Qwen 3.6 Plus on a complex web OS workflow involving Python terminal operations, gaming, and browser automation, with the model handling all tasks seamlessly in a single session.

Apr 3, 202689% relevant

Emergence WebVoyager: A New Benchmark Exposes Inconsistencies in Web Agent Evaluation

A new study introduces Emergence WebVoyager, a standardized benchmark for evaluating web-based AI agents. It reveals significant performance inconsistencies, showing OpenAI Operator's success rate is 68.6%, not 87%. This highlights a critical need for rigorous, transparent testing in agent development.

Apr 1, 202672% relevant

Claude Code 2.5: New CLI, Dashboard, and Best Practices for Web Devs

Anthropic's latest Claude Code update adds a CLI, usage dashboard, and web-focused best practices. Here's how to use them.

Mar 29, 202695% relevant

Agent Reach: Open-Source Tool Gives AI Agents Free Access to Twitter, YouTube, Reddit, and Web Content

Agent Reach is an open-source Python toolkit that enables AI agents to scrape and read content from Twitter, YouTube, Reddit, Xiaohongshu, and the web without paid APIs. It solves the persistent problem of agents hitting authentication walls and anti-scraping blocks when trying to access online information.

Mar 29, 202685% relevant

Alumnium MCP Hits 98.5% on WebVoyager: How to Add SOTA Browsing to Claude Code

The open-source Alumnium MCP server, which acts as a high-level browser subagent for Claude Code, just set a new state-of-the-art benchmark score. Install it to offload complex web tasks.

Mar 27, 202695% relevant

Anthropic Launches Claude Code Auto-Fix for Web/Mobile Sessions, Enabling Automatic CI Fixes

Anthropic has launched Claude Code auto-fix for web and mobile development sessions. The feature allows Claude to automatically follow pull requests and fix CI failures in the cloud.

Mar 27, 202689% relevant

AI Coding Agent Rewrites Canon Webcam Software in Rust, Fixes Persistent Crashes

A developer used an AI coding agent to rewrite Canon's official, crash-prone webcam software. The agent produced a fully functional Rust application overnight, solving a problem that had persisted for years.

Mar 26, 202685% relevant

Open-Source 'Manus Alternative' Emerges: Fully Local AI Agent with Web Browsing, Code Execution, and Voice Input

An open-source project has been released that replicates core features of AI agent platforms like Manus—autonomous web browsing, multi-language code execution, and voice input—while running entirely locally on user hardware with no external API dependencies.

Mar 26, 202685% relevant

Stop Claude Code's Web Fetches from Burning 700K Tokens on HTML Junk

A new MCP server, token-enhancer, strips scripts, nav bars, and ads from web pages before they hit Claude's context, cutting token waste by 90%+.

Mar 25, 202684% relevant

AI2's MolmoWeb: Open 8B-Parameter Web Agent Navigates Using Screenshots, Challenges Proprietary Systems

The Allen Institute for AI released MolmoWeb, a fully open web agent that operates websites using only screenshots. The 8B-parameter model outperforms other open models and approaches proprietary performance, with all training data and weights publicly released.

Mar 25, 2026100% relevant

Stop Pasting Secrets to Websites: How mcp-devutils Secures Your API Debugging

Install mcp-devutils to run 44 developer tools locally through Claude Code—no more leaking JWTs or API keys to third-party websites.

Mar 25, 202682% relevant

Ego2Web Benchmark Bridges Egocentric Video and Web Agents, Exposing Major Performance Gaps

Researchers introduce Ego2Web, the first benchmark requiring AI agents to understand real-world first-person video and execute related web tasks. Their novel Ego2WebJudge evaluation method achieves 84% human agreement, while state-of-the-art agents perform poorly across all task categories.

Mar 25, 202695% relevant

Google DeepMind Unveils Gemini-Powered Browser That Generates Websites in Real-Time

Google DeepMind has demonstrated a browser prototype powered by Gemini 3.1 Flash-Lite that generates complete HTML/CSS websites dynamically based on user prompts and navigation context, shifting from static page retrieval to on-demand interface generation.

Mar 25, 202695% relevant

Firecrawl MCP Server: When to Upgrade from Fetch MCP for Web Scraping

Firecrawl's MCP server offers 12+ tools for advanced web scraping, but its 500-credit free tier and complex pricing mean you should only install it for specific, complex data extraction tasks.

Mar 24, 202672% relevant

OpenCSF: A 1.5TB Free Computer Science Library Emerges from Unstructured Web Data

A new open-source dataset called OpenCSF has been compiled, containing 1.5TB of computer science materials scraped from public web sources. It provides a massive, free corpus for AI training and research in software engineering and CS education.

Mar 24, 202685% relevant

China's DeepSeek-R1: Open-Source AI Agent Runs Locally with Web Search, Code Generation, and Built-In Computer

Chinese AI company DeepSeek has released DeepSeek-R1, a fully open-source AI agent that runs locally on personal computers with web search capabilities, code generation, and built-in computer functionality. The model represents a significant move toward accessible, self-contained AI systems outside the dominant U.S. ecosystem.

Mar 23, 202699% relevant

MiRA Framework Boosts Gemma3-12B to 43% Success Rate on WebArena-Lite, Surpassing GPT-4 and WebRL

Researchers propose MiRA, a milestone-based RL framework that improves long-horizon planning in LLM agents. It boosts Gemma3-12B's web navigation success from 6.4% to 43%, outperforming GPT-4-Turbo (17.6%) and the previous SOTA WebRL (38.4%).

Mar 23, 202677% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety