- Trusted Source Problem
- How an AI agent verifies that the content it retrieves is authentic, accurate, and not adversarial. The central question this lab section addresses.
- Indirect prompt injection (IPI)
- An attack where instructions hidden in retrieved web content take control of an LLM agent's reasoning. OWASP LLM Top 10 #1 vulnerability in 2024 and 2025.
- The AI tax
- The implicit cost imposed on web creators when AI agents extract value from their content without sending traffic or revenue back. Phrase popularised by Cloudflare.
- Crawl-to-referral ratio
- How many times an AI bot crawls a site versus how many human visitors it sends. Google: 14:1. OpenAI: 1,700:1. Anthropic: 73,000:1.
- Cozy web
- Private, gated, or invite-only spaces (Discord, Signal, paid newsletters) where humans retreat from the open web. Coined by Yancey Strickler 2019, applied to the AI era by Maggie Appleton 2023.
- AI slop
- AI-generated content of low quality, produced at scale, polluting the open web. Coined by Ed Zitron. Mentions rose 9× between 2024 and 2025 (461K → 2.4M).
- Model collapse
- When generative models train on data produced by other generative models, accuracy degrades and the output distribution narrows. Shumailov et al. Nature 2024. Strong Model Collapse (ICLR 2025) showed 1/1000 synthetic fraction is enough.
- Substrate erosion
- The combined effect of poisoning + withholding + slop. The substrate AI agents depend on to think is eroded simultaneously from three sides. Named in this manifesto.
- The flywheel
- The six causal loops by which each of the three threats accelerates the others. The reason these three problems cannot be solved separately.
- C2PA / Content Credentials
- Coalition for Content Provenance and Authenticity. A cryptographic standard for content origin. v2.2 (May 2025), 6,000+ Content Authenticity Initiative members. Adopted by hardware (Leica, Sony, Pixel, Galaxy), Adobe, Meta, OpenAI, Microsoft. Enforceable in EU under AI Act Article 50 from August 2026.