![Making Claude Code more secure and autonomous with sandboxing \ Anthropic](https://www.anthropic.com/_next/image?url=https%3A%2F%2Fwww-cdn.anthropic.com%2Fimages%2F4zrzovbb%2Fwebsite%2F0d1c612947c798aef48e6ab4beb7e8544da9d41a-4096x2305.png&w=3840&q=75)

Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

Developer coding on laptop with AgentBox SDK integration diagram showing sandbox environments connected via a single API

Open SourceScore: 100

Run Claude Code in Any Sandbox with One API: AgentBox SDK

Swap coding agents and sandbox providers without changing code. Preserves full interactive capabilities (approval flows, streaming).

AAAla SMITH & AI Research Desk·Apr 23, 2026·3 min read··491 views·AI-Generated·Report error

Source: github.comvia hn_claude_code, @HowToAI_, reddit_claude, devto_claudecodeWidely Reported

TL;DR

AgentBox SDK lets you run Claude Code, Codex, or OpenCode inside Docker, E2B, Modal, Daytona, or Vercel with a unified API — no more shelling out to non-interactive CLIs.

Key Takeaways

Swap coding agents and sandbox providers without changing code.
Preserves full interactive capabilities (approval flows, streaming).

What Changed

$Making Claude Code more secure and autonomous with sandboxing \ Anthropic$

AgentBox is a new SDK that abstracts the runtime for coding agents. Instead of wrapping claude --print (non-interactive mode), it launches each agent as a server process inside a sandbox and communicates over WebSocket or HTTP. This preserves approval flows, tool-use control, and streaming events.

Key abstraction: One API for any agent + any sandbox provider.

import { Agent, Sandbox } from "agentbox-sdk";

const sandbox = new Sandbox("local-docker", {
  workingDir: "/workspace",
  image: process.env.IMAGE_ID!,
  env: { ANTHROPIC_API_KEY: process.env.ANTHROPIC_API_KEY! },
});

const agent = new Agent("claude-code", {
  sandbox,
  cwd: "/workspace",
  approvalMode: "auto",
});

const result = await agent.run({
  model: "sonnet",
  input: "Create a hello world Express server in /workspace/server.ts",
});

await sandbox.delete();

What It Means For You

If you're building multi-agent workflows or need to run Claude Code in a CI/CD pipeline, this matters. Most existing solutions call agents in non-interactive mode (claude --print), which strips away approval flows and tool-use control. AgentBox preserves the full interactive session.

Supported agents:

claude-code
opencode
codex

Supported sandboxes:

local-docker
e2b
modal
daytona
vercel

Swap either — your app code stays the same. This is particularly useful for:

Running untrusted agent code in isolated environments
Parallelizing agent runs across multiple sandboxes
Testing different agents on the same task without refactoring

Try It Now

Install: npm install agentbox-sdk (requires Node >= 20)

Build a sandbox image:

npx agentbox image build --provider local-docker --preset browser-agent

This prints an image reference. Set it as IMAGE_ID.

Stream events in real-time:

const run = agent.stream({
  model: "sonnet",
  input: "Write a fizzbuzz in Python",
});

for await (const event of run) {
  if (event.type === "text.delta") {
    process.stdout.write(event.delta);
  }
}

const result = await run.finished;

Key methods on sandbox: run(), runAsync(), gitClone(), openPort(), getPreviewLink(), snapshot(), stop(), delete()

gentic.news Analysis

AgentBox arrives at a time when Claude Code usage is surging — it appeared in 58 articles this week alone (total: 634 across our coverage). The trend toward running agents in sandboxed environments aligns with the recent CVE-2026-35022 security disclosure for Claude Code, which highlighted the risks of running agents without isolation.

This SDK directly addresses a pain point we've seen in our coverage: developers want to use Claude Code in CI/CD but need proper sandboxing. Previously, they had to choose between non-interactive mode (losing approval flows) or custom scripting. AgentBox provides a standardized abstraction similar to what the Vercel AI SDK did for LLM calls — but for agent + runtime.

The ability to swap between Claude Code, Codex, and OpenCode without changing code is particularly valuable as the agent ecosystem fragments. With Claude Opus 4.6 scoring 94.1% on ThermoQA and Codex 5.3 competing on SWE-Bench, having a provider-agnostic runtime lets you benchmark agents on your actual tasks.

What you should do differently: If you're currently running Claude Code with claude --print in CI, migrate to AgentBox for sandboxed, interactive sessions. If you're building multi-agent architectures, use AgentBox as your runtime abstraction layer — it'll save you from rewriting integration code when you switch sandbox providers or agents.

Source: gentic.news · Apr 23, 2026 · author=Ala SMITH · citation.json

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

Claude Code users should immediately evaluate AgentBox for any workflow where they currently run `claude --print` or `claude --non-interactive`. The SDK preserves approval flows and tool-use control, which means you can safely automate agent runs without losing the ability to review actions. For teams running Claude Code in CI/CD pipelines, use AgentBox with Docker sandboxes to isolate each run. The `snapshot()` method on sandboxes lets you capture state for debugging or resuming interrupted runs. This is especially useful for long-running code generation tasks. If you're experimenting with multiple coding agents (Claude Code vs Codex vs OpenCode), AgentBox lets you swap them with a single line change. Use this to benchmark which agent performs best on your specific codebase before committing to one.

#sdk #claude code #agent runtime #sandbox #docker

Compare side-by-side

Claude Code vs AgentBox

→

Mentioned in this article

Anthropic Claude Code AgentBox

Enjoyed this article?

Get the weekly AI intelligence briefing

✨AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Open Source2 shared topics

Anthropic Ships Official Claude Code Plugin for Project Automation

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

Original research · EUMAS 2026

MNEMA — A Witness Lattice for Multi-Agent AI Memory

Cryptographic memory units · 1−α detection floor · 15 pp PDF

Field framework · v1.0

Epistemic Infrastructure

12 pillars · 11-stage knowledge metabolism · pathology catalog

More in Open Source

View all

Researchers collaborate on a dashboard displaying multimodal AI data pipelines merging text, images, and healthcare…

Open Source

DataArc-SynData-Toolkit: Open-Source Framework for Multimodal Synthetic Data

DataArc-SynData-Toolkit is an open-source framework for multimodal synthetic data, aiming to lower technical barriers for LLM training. It features a configuration-driven pipeline with visual interface and modular architecture.

arxiv.org/May 12, 2026/3 min read/Multi-Source

open-sourceresearchllm

Open SourceBreakthrough

100

Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal Capabilities

Google has released the Gemma 4 family of open-weight models, derived from Gemini 3 technology. The four models, ranging from 2B to 31B parameters and including a Mixture-of-Experts variant, are available under a permissive Apache 2.0 license and feature multimodal processing.

engadget.com/Apr 2, 2026/3 min read/Widely Reported

product launchopen sourcegoogle

A sleek interface shows a waveform graph with a transcription panel, highlighting Cohere's ASR model achieving top…

Open Source

Cohere Transcribe: 2B-Parameter Open-Source ASR Model Achieves 5.42% WER, Topping Hugging Face Leaderboard

Cohere released Transcribe, a 2B-parameter open-source speech recognition model. It claims a 5.42% average word error rate, beating OpenAI Whisper v3 and topping the Hugging Face Open ASR Leaderboard.

the-decoder.com/Mar 27, 2026/3 min read/Widely Reported

open-sourcespeech-aibenchmarks

Key Takeaways

What Changed

What It Means For You

Try It Now

gentic.news Analysis

AI Analysis

✨AI Toolslive

Related Articles

Build a Zero-Dependency MCP Server

Claude Code Token Costs Got You Down? Here's How to Cut Usage 40% Without

Anthropic's 80% Code Stat: What It Means for Your CLAUDE.md and Workflow Design

Claude Opus 4.8 Launches Dynamic Workflows for Agentic Code

Anthropic Opus 4.8 Cuts Bug-Finding Cost by 5x, SemiAnalysis Finds