Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

A dashboard showing 11 AI agent profiles with task logs, zero revenue, and an open-source CLAUDE.md template to fix…

11-Agent Company Earned $0: CLAUDE.md Mistakes Cost Revenue

11-agent company experiment earned $0 after 896 tasks. Operator open-sourced CLAUDE.md template with 72 lessons on coordination failures and legal constraints.

AAAla SMITH & AI Research Desk·May 18, 2026·3 min read··129 views·AI-Generated·Report error

Source: dev.tovia devto_claudecode, hn_claude_code, gn_mcp_protocolWidely Reported

What happened when someone ran 11 AI agents as a company for 30 days?

A 30-day experiment running 11 AI agents as a company earned $0 revenue after 896 tasks. The operator found 3-4 agents optimal and open-sourced a CLAUDE.md template to prevent coordination failures and legal violations.

TL;DR

11-agent startup earned $0 in 30 days · Sweet spot is 3-4 agents, not 11 · Hard constraints must come first in CLAUDE.md

A 30-day experiment with 11 AI agents earned $0 revenue after 896 tasks. The operator open-sourced a CLAUDE.md template to prevent coordination failures.

Key facts

11 agents ran for 30 days, completed 896 tasks
Revenue earned: $0
Optimal agent count: 3-4, not 11
72 documented mistakes in open-source post-mortem
Australian spam law violated due to buried constraints

Key Takeaways

11-agent company experiment earned $0 after 896 tasks.
Operator open-sourced CLAUDE.md template with 72 lessons on coordination failures and legal constraints.

The Minimum Viable Agent Team

Claude Sonnet 4.5 Released: New AI Model from Anthropic 2025

After testing configurations from 2 to 11 agents, the sweet spot was 3-4 agents: CEO (Claude Opus 4.6) for strategy, CTO (Claude Sonnet) for execution, Researcher (Claude Sonnet) for market discovery, and an optional Sprint Engineer (Claude Sonnet). Beyond 4 agents, the operator reports coordination overhead exceeded output — the CEO agent spent more time managing agents than making decisions. [According to the source]

CLAUDE.md Structure That Works

The critical insight: hard constraints must precede task assignments. The operator violated Australian spam law because legal constraints were buried on page 4 of the instructions. The template prioritizes non-negotiable rules — no cold email/DM/SMS to non-opted-in recipients, compliance with jurisdiction-specific spam laws, no destructive commands without founder approval, zero external spend unless pre-approved. [Per the source's GitHub Gist]

Memory Architecture

AI Agent Platforms: The Real-World Guide You N…

Every agent reads shared state on every wake: company/lessons.md for binding rules and failure history, company/credentials.md for API keys, own instructions/AGENTS.md for role definition, and the current task queue. The write protocol mandates immediate documentation of any failure mode — no waiting for retrospectives. [According to the source's template]

The 7 Mistakes That Cost $0 Revenue

The operator documented seven systematic errors: building 15 products before finding one distribution channel, hiring 11 agents instead of mastering 3, discovering legal constraints by accident, pivoting 4 times in 30 days (each pivot reset momentum), building infrastructure for zero users, letting failing experiments run for 5 days instead of killing at 48 hours, and treating agent count as a success metric instead of revenue. [Per the source's post-mortem]

Unique take: This experiment mirrors enterprise agent-deployment patterns from 2026 — the temptation to scale agent count before establishing reliable memory and constraint systems. The $0 revenue outcome isn't a failure of AI capability but of coordination architecture, echoing Claude Code's documented scaling pains (80x user growth in May 2026 required doubling usage limits). The operator's 72 documented mistakes form a practical constraint catalog that Anthropic's own CLAUDE.md Kit (released May 17, 2026) doesn't fully address.

What to watch

Watch for Anthropic's official CLAUDE.md best-practices documentation update. If the company codifies constraint-first templates and agent-count limits, it signals recognition that agent coordination — not capability — is the current bottleneck. The operator's $1 AI Agent Playbook sales volume will indicate demand for structured multi-agent guidance.

Source: gentic.news · May 18, 2026 · author=Ala SMITH · citation.json

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

This experiment is a case study in the coordination tax that multi-agent systems impose. The finding that 3-4 agents outperform 11 aligns with theoretical limits on agent communication overhead — every additional agent introduces O(n²) coordination costs. The operator's mistake of building 15 products before finding distribution mirrors the classic startup error but amplified by agent speed: agents can generate artifacts faster than they can validate market fit, creating an illusion of progress. The legal constraint failure is particularly instructive — CLAUDE.md files are increasingly used in production deployments, and Anthropic's May 17 CLAUDE.md Kit update suggests the company recognizes this. However, the kit focuses on permission-first workflows, not on the structural constraint ordering this operator found critical. The $0 revenue outcome should temper the narrative from Claude Code's 80x user growth — deployment volume doesn't equal value capture. Agent startups face the same distribution problem as human startups, but agent speed may actually worsen it by enabling premature scaling of both products and infrastructure.

#claude code #multi-agent systems #ai agents #startup

Compare side-by-side

Claude Opus 4.6 vs Claude 3.5 Sonnet

→

Mentioned in this article

CLAUDE.md Claude Opus 4.6 Claude 3.5 Sonnet

Enjoyed this article?

Get the weekly AI intelligence briefing

✨AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Products & Launches2 shared topics

Stop Prompting Claude. Start Building Loops: Loop Engineering Explained

Products & Launches2 shared topics

Claude Code Digest — Jun 03–Jun 06

Open Source2 shared topics

Claude Code quota proxy exposes unified Opus/Sonnet pool

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

Original research · EUMAS 2026

MNEMA — A Witness Lattice for Multi-Agent AI Memory

Cryptographic memory units · 1−α detection floor · 15 pp PDF

Field framework · v1.0

Epistemic Infrastructure

12 pillars · 11-stage knowledge metabolism · pathology catalog

More in Open Source

View all

A close-up of dense lines of C and CUDA code on a dark screen, with a terminal window showing compilation output in…

Open Source

NanoEuler: GPT-2-Scale 116M Model Built in Pure C/CUDA From Scratch

NanoEuler is a 116M-parameter GPT-2-scale model built in pure C/CUDA from scratch. It provides a complete educational training pipeline for understanding LLMs at the lowest level.

github.com/4d ago/3 min read

open sourcecudaai models

Zhipu AI engineer points at monitor displaying GLM-5.2 ranking chart, office with coding screens visible…

Open SourceBreakthrough

100

Zhipu GLM-5.2 tops global coding benchmarks, sparks 'DeepSeek moment'

Zhipu AI's GLM-5.2 ranks top-3 globally on a coding benchmark, with US engineers calling it a daily driver superior to GPT-5.5.

scmp.com/6d ago/3 min read/Widely Reported

open sourcechinacoding