Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

Developer coding on laptop with AgentBox SDK integration diagram showing sandbox environments connected via a single API
Open SourceScore: 100

Run Claude Code in Any Sandbox with One API: AgentBox SDK

Swap coding agents and sandbox providers without changing code. Preserves full interactive capabilities (approval flows, streaming).

·Apr 23, 2026·3 min read··406 views·AI-Generated·Report error
Share:
Source: github.comvia hn_claude_code, @HowToAI_, reddit_claude, devto_claudecodeWidely Reported
TL;DR

AgentBox SDK lets you run Claude Code, Codex, or OpenCode inside Docker, E2B, Modal, Daytona, or Vercel with a unified API — no more shelling out to non-interactive CLIs.

Key Takeaways

  • Swap coding agents and sandbox providers without changing code.
  • Preserves full interactive capabilities (approval flows, streaming).

What Changed

Making Claude Code more secure and autonomous with sandboxing \ Anthropic

AgentBox is a new SDK that abstracts the runtime for coding agents. Instead of wrapping claude --print (non-interactive mode), it launches each agent as a server process inside a sandbox and communicates over WebSocket or HTTP. This preserves approval flows, tool-use control, and streaming events.

Key abstraction: One API for any agent + any sandbox provider.

import { Agent, Sandbox } from "agentbox-sdk";

const sandbox = new Sandbox("local-docker", {
  workingDir: "/workspace",
  image: process.env.IMAGE_ID!,
  env: { ANTHROPIC_API_KEY: process.env.ANTHROPIC_API_KEY! },
});

const agent = new Agent("claude-code", {
  sandbox,
  cwd: "/workspace",
  approvalMode: "auto",
});

const result = await agent.run({
  model: "sonnet",
  input: "Create a hello world Express server in /workspace/server.ts",
});

await sandbox.delete();

What It Means For You

If you're building multi-agent workflows or need to run Claude Code in a CI/CD pipeline, this matters. Most existing solutions call agents in non-interactive mode (claude --print), which strips away approval flows and tool-use control. AgentBox preserves the full interactive session.

Supported agents:

  • claude-code
  • opencode
  • codex

Supported sandboxes:

  • local-docker
  • e2b
  • modal
  • daytona
  • vercel

Swap either — your app code stays the same. This is particularly useful for:

  • Running untrusted agent code in isolated environments
  • Parallelizing agent runs across multiple sandboxes
  • Testing different agents on the same task without refactoring

Try It Now

  1. Install: npm install agentbox-sdk (requires Node >= 20)

  2. Build a sandbox image:

    npx agentbox image build --provider local-docker --preset browser-agent
    

    This prints an image reference. Set it as IMAGE_ID.

  3. Stream events in real-time:

    const run = agent.stream({
      model: "sonnet",
      input: "Write a fizzbuzz in Python",
    });
    
    for await (const event of run) {
      if (event.type === "text.delta") {
        process.stdout.write(event.delta);
      }
    }
    
    const result = await run.finished;
    
  4. Key methods on sandbox: run(), runAsync(), gitClone(), openPort(), getPreviewLink(), snapshot(), stop(), delete()

gentic.news Analysis

AgentBox arrives at a time when Claude Code usage is surging — it appeared in 58 articles this week alone (total: 634 across our coverage). The trend toward running agents in sandboxed environments aligns with the recent CVE-2026-35022 security disclosure for Claude Code, which highlighted the risks of running agents without isolation.

This SDK directly addresses a pain point we've seen in our coverage: developers want to use Claude Code in CI/CD but need proper sandboxing. Previously, they had to choose between non-interactive mode (losing approval flows) or custom scripting. AgentBox provides a standardized abstraction similar to what the Vercel AI SDK did for LLM calls — but for agent + runtime.

The ability to swap between Claude Code, Codex, and OpenCode without changing code is particularly valuable as the agent ecosystem fragments. With Claude Opus 4.6 scoring 94.1% on ThermoQA and Codex 5.3 competing on SWE-Bench, having a provider-agnostic runtime lets you benchmark agents on your actual tasks.

What you should do differently: If you're currently running Claude Code with claude --print in CI, migrate to AgentBox for sandboxed, interactive sessions. If you're building multi-agent architectures, use AgentBox as your runtime abstraction layer — it'll save you from rewriting integration code when you switch sandbox providers or agents.

Source: gentic.news · · author= · citation.json

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

Claude Code users should immediately evaluate AgentBox for any workflow where they currently run `claude --print` or `claude --non-interactive`. The SDK preserves approval flows and tool-use control, which means you can safely automate agent runs without losing the ability to review actions. For teams running Claude Code in CI/CD pipelines, use AgentBox with Docker sandboxes to isolate each run. The `snapshot()` method on sandboxes lets you capture state for debugging or resuming interrupted runs. This is especially useful for long-running code generation tasks. If you're experimenting with multiple coding agents (Claude Code vs Codex vs OpenCode), AgentBox lets you swap them with a single line change. Use this to benchmark which agent performs best on your specific codebase before committing to one.
Compare side-by-side
Claude Code vs AgentBox

Mentioned in this article

Enjoyed this article?
Share:

AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Related Articles

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

More in Open Source

View all