agent sdk
30 articles about agent sdk in AI news
ForeverSolar Uses Claude Agent SDK to Automate Solar Permitting, Cutting Approval Times
Solar installation company ForeverSolar is using Anthropic's Claude Agent SDK to automate permitting documentation, a major bottleneck in solar deployment. This represents a concrete enterprise application of agentic AI beyond software development.
Claude Agent SDK's a2a Tool Lets You Build Persistent, Observable AI Assistants
Use the a2a CLI tool to add persistent memory, skill management, and observability to your Claude Code projects, moving prototypes to production.
Anthropic Splits --print Into $100/Mo Credit, Torpedoes CLI Workarounds
Anthropic splits `claude --print` and Agent SDK usage into separate monthly credits. Pro gets $20/mo, Max gets $100-$200/mo. Credits don't roll over.
Anthropic's Accidental Code Release: Inside the Claude Code CLI That Wasn't Meant to Be Seen
Anthropic's Claude Agent SDK inadvertently includes the entire minified Claude Code CLI executable, revealing the inner workings of their AI coding assistant. The 13,800-line bundled JavaScript file contains everything from agent orchestration to UI rendering, raising questions about security and transparency in AI tooling.
Anthropic Tightens Security: OAuth Tokens Banned from Third-Party Tools in Major Policy Shift
Anthropic has implemented a significant security policy change, prohibiting the use of OAuth tokens and its Agent SDK in third-party tools. This move comes amid growing enterprise adoption and heightened security concerns in the AI industry.
Cursor SDK Turns AI Agent Runtime into Programmable Infrastructure
Cursor is releasing an SDK that turns its agent runtime into programmable infrastructure for headless use in CI/CD pipelines, internal tools, and third-party products. Revenue scales with compute tokens, not seats, enabling higher volume without human-in-the-loop.
OpenAI Open-Sources Agents SDK, Supports 100+ LLMs
OpenAI has open-sourced its internal Agents SDK, a lightweight framework for building multi-agent systems. It features three core primitives, works with over 100 LLMs, and has gained 18.9k GitHub stars immediately.
OpenAI Agents SDK Gains Containerized Execution & Step Control
OpenAI has released new capabilities for its Agents SDK, including containerized execution and granular step control, giving developers more tools to build and manage long-running AI agents.
Symbolica's Agentica SDK Scores 36.08% on ARC-AGI-3, Claiming Cost-Effective Agentic Breakthrough
Symbolica's Agentica SDK reportedly achieved a 36.08% score on the new ARC-AGI-3 benchmark in one day, using an agentic approach claimed to be far cheaper than brute-forcing with a frontier model.
Run Claude Code in Any Sandbox with One API: AgentBox SDK
Swap coding agents and sandbox providers without changing code. Preserves full interactive capabilities (approval flows, streaming).
GitAgent Aims to Unify AI Agent Development with Git-Based Standard
GitAgent introduces an open specification that defines AI agents through files in a Git repository, enabling portability across frameworks like Claude Code, OpenAI Agents SDK, and CrewAI while leveraging Git's native version control and collaboration features.
Amazon's SageMaker Agentic Fine-Tuning Supports Llama, Qwen, DeepSeek, Nova
Amazon launched an AI agent on SageMaker that automates fine-tuning of Llama, Qwen, DeepSeek, and Nova models via plain-language instructions, abstracting API fragmentation.
World2Agent Open-Sources Protocol for Real-World AI Perception
World2Agent open-sourced a protocol to standardize how AI agents perceive the real world via sensors. No adoption metrics or technical details were disclosed.
OpenAI Agents Now Ask Questions Good Enough for Research Papers
Sébastien Bubeck revealed on the OpenAI Podcast that internal AI agents now ask research questions so insightful they're inspiring papers and correcting published mistakes, with a 1-2 year timeline for full researcher-level capabilities.
The 2026 CLAUDE.md Playbook: 8 Rules That Make Your Agent 2x More Effective
The 2026 consensus on CLAUDE.md: shorter files, falsifiable rules, and explicit enforcement. Here's the 8-rule framework to stop your agent from fighting stale configs.
Agent Harnessing: The Infrastructure That Makes AI Agents Work
A detailed technical guide argues that the model is not the hard part of building AI agents. The six-component harness — context management, memory, tools, control flow, verification, and coordination — is what separates production-grade agents from those that fail silently.
Swiss AI Lab Ships Pixel-Based Agents That Control Real Phones
A Swiss AI lab has developed agents that interact with smartphones by processing screen pixels and simulating touch, eliminating the need for app-specific APIs or integrations. This approach mirrors human interaction and could generalize across any app interface.
Google Launches A2UI 0.9, a Generative UI Standard for AI Agents
Google released A2UI 0.9, a standard allowing AI agents to generate UI elements dynamically using an app's existing components. It includes a web core library, React renderer, and support for Flutter, Angular, and Lit.
Claude Adds Dynamic Loop Scheduling to AI Agent Workflows
Anthropic has added dynamic loop scheduling to Claude, allowing the AI to intelligently schedule repeated tasks without a fixed interval. This is a foundational capability for creating more autonomous and efficient AI agents.
Agent Harness Debate: Anthropic vs. OpenAI vs. LangChain on Scaffolding
A central debate in agent engineering pits a 'thin harness' approach (Anthropic) against 'thick harness' designs (LangGraph). The infrastructure layer, not the model, is becoming the primary product differentiator.
Google's MCP Toolbox Connects AI Agents to 20+ Databases in <10 Lines
Google released MCP Toolbox, an open-source server that connects AI agents to enterprise databases like Postgres and BigQuery using plain English. It requires less than 10 lines of code and works with LangChain, LlamaIndex, and any MCP-compatible client.
Managed Agents Emerge as Fastest Path from Prototype to Production
Developer Alex Albert highlights that managed agent services now offer the fastest path from weekend project to production-scale deployment, eliminating self-hosting complexity while maintaining flexibility.
Composio Launches Secure Tool Platform to Replace AI Agent Credential Sharing
Composio announced a platform that lets AI agents use external tools without credential sharing, aiming to solve a major security and operational headache for developers.
Top AI Agent Frameworks in 2026: A Production-Ready Comparison
A comprehensive, real-world evaluation of 8 leading AI agent frameworks based on deployments across healthcare, logistics, fintech, and e-commerce. The analysis focuses on production reliability, observability, and cost predictability—critical factors for enterprise adoption.
Microsoft Launches Free 'AI Agent Course' for Developers, Covers Design Patterns to Production
Microsoft has released a comprehensive, hands-on course for building AI agents, covering design patterns, RAG, tools, and multi-agent systems. It's a practical resource aimed at moving developers from theory to deployment.
Debug Multi-Agent Systems Locally with the A2A Simulator
Test and debug AI agents that communicate via Google's A2A protocol using a local simulator that shows both sides of the conversation.
Transform Your CLAUDE.md from a Note to a Multi-Agent Command Center
Use CLAUDE.md to coordinate sub-agents, enforce project rules, and cut API costs by 90% with a simple endpoint swap.
American Express Bets on Agentic AI Commerce with ACE Developer Kit and ChatGPT Perks
AmEx CEO Stephen Squeri's shareholder letter outlines a proactive strategy for the agentic AI commerce era, launching an ACE developer kit for payment integration and offering business cardholders a ChatGPT subscription credit. The company sees its premium membership model as resilient against disruptive AI commerce theories.
OpenClaw AI Agent Adds Real-Time Vision to Meta Ray-Ban Smart Glasses via Gemini Live API
An open-source project enables Meta Ray-Ban smart glasses to function as a real-time AI assistant. It streams the glasses' camera feed (~1fps) to Gemini Live for visual context, then delegates actions via the OpenClaw agent framework.
TTal CLI: Orchestrate Multiple Claude Code Agents for Autonomous PR Workflows
TTal is a Go CLI that creates a multi-agent system with persistent manager agents and disposable worker agents, letting you run entire PR cycles from your phone via Telegram.