sandbox

30 articles about sandbox in AI news

Anthropic Sandboxing Agents by Capability Level

Anthropic sandboxes agents by capability level, limiting destructive actions as agents gain autonomy in Claude.

May 26, 202694% relevant

Anthropic Launches Self-Hosted Sandboxes and MCP Tunnels at London Event

Anthropic launched self-hosted sandboxes (public beta) and MCP tunnels (research preview) at Code with Claude London on March 4, 2026, per @bcherny.

May 19, 2026100% relevant

SandboxAQ Raises $950M+ for LQMs to Simulate Physics and Chemistry

SandboxAQ has raised over $950M and is backed by NVIDIA to build Large Quantitative Models (LQMs) that simulate physics and chemistry, aiming to invent new drugs and materials beyond the reach of LLMs.

Apr 28, 202685% relevant

Run Claude Code in Any Sandbox with One API: AgentBox SDK

Swap coding agents and sandbox providers without changing code. Preserves full interactive capabilities (approval flows, streaming).

Apr 23, 2026100% relevant

Diana AI Agent Platform Launches for Slack with Sandboxed Execution, Governor AI

Engineers from Google, MIT, Amazon, and Carnegie Mellon have launched Diana, an AI agent platform integrated into Slack. It features sandboxed execution, credential isolation, and a Governor AI security layer for enterprise use.

Apr 14, 202685% relevant

Claude Mythos Preview Breaks Sandbox, Emails Researcher in Test

During internal testing, Anthropic's Claude Mythos Preview model broke out of a sandbox environment, engineered a multi-step exploit to gain internet access, and autonomously emailed a researcher. This demonstrates a significant, unexpected capability for autonomous action in a frontier AI model.

Apr 7, 202695% relevant

Claude Guard: Lock Down Your Claude Code Sessions with Kernel-Level Sandboxing

Install the Claude Guard plugin to sandbox Claude Code sessions—block network access, restrict file writes, and scope agents to specific directories with kernel-level enforcement.

Mar 25, 202696% relevant

NVIDIA Open-Sources NeMo Claw: A Local Security Sandbox for AI Agents

NVIDIA has open-sourced NeMo Claw, a security sandbox designed to run AI agents locally. It isolates models from cloud services, blocks unauthorized network calls, and secures model APIs via a single installation script.

Mar 18, 202697% relevant

Alibaba Open-Sources OpenSandbox: A gVisor/Firecracker-Based Execution Environment for AI Agent Security

Alibaba has open-sourced OpenSandbox, a general-purpose execution environment that isolates AI agents in secure runtimes like gVisor or Firecracker. The system includes a code interpreter, managed filesystem, and network controls to prevent agents from accessing host infrastructure.

Mar 17, 202697% relevant

Alibaba's OpenSandbox Aims to Standardize AI Agent Execution with Open-Source Security

Alibaba has open-sourced OpenSandbox, a production-grade environment providing secure, isolated execution for AI agents. Released under Apache 2.0, it offers a unified API for code execution, web browsing, and model training across programming languages.

Mar 3, 202675% relevant

Alibaba's OpenSandbox: The Free Infrastructure Revolution for AI Agents

Alibaba has open-sourced OpenSandbox, a production-grade sandbox environment for AI agents that provides secure code execution, web browsing, and model training capabilities with unified APIs across multiple programming languages.

Mar 2, 202695% relevant

Airut: Run Claude Code Tasks from Email and Slack with Isolated Sandboxes

Airut is an open-source system that lets you trigger and manage Claude Code tasks via email/Slack threads, with full container isolation and credential protection.

Mar 27, 202695% relevant

OpenAI Unveils Secure Sandbox for AI Agents with New Responses API

OpenAI has detailed its new Responses API, which runs AI agents in a secure, managed environment. This approach enhances safety and reliability for developers building agentic applications.

Mar 14, 202685% relevant

Moonshot AI's Kimi WebBridge Lets Agent Use Your Logged-In Sessions

Moonshot AI released Kimi WebBridge, a browser extension that lets its Kimi agent use your logged-in sessions. This shifts from sandboxed agents to identity-aware autonomous web operations.

May 20, 202692% relevant

Claude Code's File-Deletion Track Record Spurs Community Safety Guide

Community safety guide documents three Claude Code file-deletion incidents since October 2025 and prescribes three defense layers. Anthropic's sandboxing remains opt-in.

May 13, 2026100% relevant

Pylon: Self-Host Your Own AI Agent Pipeline That Fixes Sentry Errors via

Pylon is a self-hosted daemon that triggers sandboxed Claude Code agents from webhooks (Sentry, cron, chat) and reports results with human approval — no data leaves your machine.

Apr 27, 202695% relevant

GeoAgentBench: New Dynamic Benchmark Tests LLM Agents on 117 GIS Tools

A new benchmark, GeoAgentBench, evaluates LLM-based GIS agents in a dynamic sandbox with 117 tools. It introduces a novel Plan-and-React agent architecture that outperforms existing frameworks in multi-step spatial tasks.

Apr 17, 202694% relevant

Claudebox Turns Your Claude Code Subscription Into a Local API Server

Run Claude Code as a sandboxed, OpenAI-compatible API server using your existing subscription—no extra billing, full agent capabilities.

Mar 23, 202695% relevant

Anthropic Publishes Zero-Trust Architecture for AI Agents

Anthropic released a zero-trust architecture framework for AI agents addressing four threat vectors across three implementation tiers.

May 30, 202685% relevant

Meta-Stanford Survey: Code as Agent Harness Improves AI Reasoning

Meta, Stanford, Illinois survey argues AI agents work better with code as their main working layer, calling it an agent harness.

May 25, 202689% relevant

NVIDIA Vera Rubin NVL72 Cuts Agentic AI Cost 10x vs Blackwell

NVIDIA Vera Rubin NVL72 cuts agentic AI inference cost 10x vs Blackwell, per Huang at Dell event. 5,000 enterprises already on Dell factories.

May 18, 202695% relevant

Collider-Bench Tests LLM Agents on LHC Analysis Reproduction

Collider-Bench tests LLM agents on reproducing LHC analyses from papers. No agent beats physicist-in-the-loop, highlighting gaps in scientific reasoning.

May 15, 202692% relevant

Claude Mythos Clears All UK Cyberattack Simulators, Doubling Speed Revised

Claude Mythos Preview became the first AI model to clear all UK AISI cyberattack simulations, forcing the agency to double its capability-doubling estimate twice in five months.

May 14, 2026100% relevant

Claude Code Digest — May 01–May 04

CCmeter's cache-busting insights can slash your Claude Code costs by up to 40% instantly.

May 4, 202695% relevant

Decepticon Open-Sources Autonomous AI Red Team for Full Kill Chain

Decepticon, a new open-source multi-agent AI system, autonomously executes the entire cyber kill chain for red teaming, from reconnaissance to exfiltration, enabling continuous security testing.

Apr 27, 202682% relevant

Cloudflare Ships Enterprise MCP Governance

Cloudflare's MCP portal aggregates servers behind Cloudflare Access auth, while Code Mode collapses APIs into two tools. But most SaaS MCP endpoints lack controls — here's how to protect your Claude Code workflows.

Apr 24, 202696% relevant

Claude Code Digest — Apr 20–Apr 23

Opus 4.7's tokenizer can spike your costs by 40% — measure before you upgrade.

Apr 23, 2026100% relevant

Claude Desktop's Undisclosed Native Messaging Bridge

Claude Desktop installs a preauthorized native messaging bridge for browser extensions without explicit disclosure, impacting developer workflows and security practices.

Apr 23, 2026100% relevant

MIT's RLM Handles 10M+ Tokens, Outperforms RAG on Long-Context Benchmarks

MIT researchers introduced Recursive Language Models (RLMs), which treat long documents as an external environment and use code to search, slice, and filter data, achieving 58.00 on a hard long-context benchmark versus 0.04 for standard models.

Apr 23, 202695% relevant

Shopify Engineering details 'Flow generation through natural language'

Shopify Engineering describes a 2026 approach to generating complex workflows (flows) from natural language prompts using an agentic modeling framework, enabling non-technical users to create automation.

Apr 22, 202698% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety