Epistemic Infrastructure.
The discipline AI memory needs to grow into.
The next big AI failure mode may not be hallucination. It may be memory corruption.
Every company is racing to ship agents, copilots, RAG, MCP servers, long-context models, and tool-calling pipelines. Underneath them sits a harder problem: what happens when AI starts reading, writing, compressing, and reusing the memory of an organisation? Organisational knowledge is not a clean database. It is versioned, duplicated, contradictory, temporal, political, implicit, and constantly drifting.
This page is the working framework for treating that memory as a governed, living system. It is incomplete on purpose — open to revision as the field uses it.
The argument
- The dominant failure mode of agentic AI in production is not hallucination — it is industrialised organisational hallucination: stale, contradicted, mis-scoped, or politically biased knowledge being retrieved and re-amplified at machine speed.
- Retrieval-as-search is the wrong frame. The right frame is knowledge as a metabolism — an eleven-stage pipeline from ingestion to controlled forgetting (see below).
- The discipline that does this work needs a name. We propose Epistemic Infrastructure: epistemic (relating to knowledge, justification, truth) and infrastructure (foundational, permanent, not a feature).
- The best AI-native organisations will not be the ones that remember everything. They will be the ones that know what to trust, update, verify, compress, and let die.
The danger is not only that AI hallucinates. The danger is that AI can industrialise organisational hallucinations — amplifying stale assumptions, retrieving obsolete embeddings, preserving deprecated processes, turning popular documents into truth, and creating recursive loops where AI-generated knowledge becomes future AI context.
Why “Epistemic Infrastructure”
We tested the name against five obvious alternatives. Each loses something the problem requires.
| Candidate | What it captures | What it loses |
|---|---|---|
| Knowledge Management | Familiar, enterprise-friendly | Reads as 1990s SharePoint folder discipline; misses claims, decay, AI |
| RAG Governance | Concrete, in-scope | Tied to a specific retrieval pattern; will date when RAG is renamed |
| AI Memory | Captures cognition | Conflates personal-agent memory with organisational truth governance |
| Cognitive Infrastructure | Foundational framing | Cognition includes attention, emotion, perception — broader than knowledge |
| Computational Epistemology | Most precise | Reads as an academic field, not something you build and operate |
| Epistemic Infrastructure | Truth + justification + load-bearing systems | Slightly philosophical — that's the point |
The knowledge metabolism
Eleven stages. Every claim, every document, every embedding flows through them. The mistake most production systems make is to model only the middle three — chunking, embedding, retrieval — and treat the other eight as somebody else's problem.
- 01 Ingestion → raw signal arrives — meeting note, contract, slide, email, PR
- 02 Provenance → who, when, which version, which model touched it
- 03 Authority scoring → domain-bounded right of the source to assert this
- 04 Contradiction detection → does this conflict with an active claim? scope? jurisdiction?
- 05 Temporal validity → valid_from / valid_until / recheck-after / volatility
- 06 Retrieval → claim-graph + structural reasoning, not similarity-only
- 07 Claim verification → split answer into claims; each one supported / unsupported / contradicted
- 08 Human feedback → owner validation, expert arbitration, escalation, disagreement preserved
- 09 Decay → confidence falls per a domain-specific half-life
- 10 Retirement → deprecated → archived; phantom emits signed refusal pointing at successor
- 11 Controlled forgetting → destroyed for legal, privacy, or organisational hygiene reasons
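The stages above can be sketched as an ordered state machine that every claim walks through. A minimal Python sketch; all names here (`Stage`, `Claim`, `advance`) are illustrative, not a proposed API:

```python
from dataclasses import dataclass, field
from enum import Enum, auto

class Stage(Enum):
    """The eleven metabolism stages, in pipeline order."""
    INGESTION = auto()
    PROVENANCE = auto()
    AUTHORITY_SCORING = auto()
    CONTRADICTION_DETECTION = auto()
    TEMPORAL_VALIDITY = auto()
    RETRIEVAL = auto()
    CLAIM_VERIFICATION = auto()
    HUMAN_FEEDBACK = auto()
    DECAY = auto()
    RETIREMENT = auto()
    CONTROLLED_FORGETTING = auto()

@dataclass
class Claim:
    """Minimal claim record; field names are illustrative assumptions."""
    claim_id: str
    text: str
    source: str
    stage: Stage = Stage.INGESTION
    history: list = field(default_factory=list)

def advance(claim: Claim) -> Claim:
    """Move a claim to the next stage, recording where it has been."""
    stages = list(Stage)
    i = stages.index(claim.stage)
    if i < len(stages) - 1:
        claim.history.append(claim.stage.name)
        claim.stage = stages[i + 1]
    return claim
```

The point of the sketch: the middle stages a RAG system usually implements are three entries in an eleven-entry enum, and a claim that skips the others has no recorded reason to be trusted.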
The twelve pillars
Twelve pillars, four groups. Each pillar is independently observable, instrumentable, and ablatable: an adversarial reviewer can rip one out and check what breaks.
I · Truth & authority
Who is allowed to say what is true, and under which scope.
Truth dimensions
There is no single truth in an organisation. Legal, operational, financial, scientific, historical, and personal truth coexist — and answer to different authorities, decay at different speeds, and accept different kinds of evidence.
- Multiple truths under different scopes
- Legal truth vs operational practice · normative vs observed
- Factual source vs interpretation source
- Organisational truth vs documentary truth
- Epistemic routing by type of truth
- Historical / operational / legal / financial / personal / scientific truth engines
Authority & ownership
An unowned claim is an unaccountable claim. Authority is delegated by domain, time-bounded, and verifiable; ownership is the precondition for trust, deletion, and decay confirmation.
- Authority level per source
- Knowledge owner metadata · missing-owner detection
- Ownership pressure (unowned knowledge loses trust)
- Knowledge authority markets · authority evolves with accuracy
- Status metadata: draft / approved / deprecated / superseded
- Scope-aware knowledge
Conflict resolution
Two policies, three exceptions, four interpretations. Production knowledge is contradictory by design. The system must resolve conflicts by authority, time, and scope — and preserve disagreement when resolution would be wrong.
- Old vs new · official vs unofficial · draft vs approved
- Global rule vs local exception
- Jurisdiction logic: global < region < country < BU < team < exception
- Conflict resolution by authority, time, and scope
- Contradiction as structure, not always bug
- Keeping expert disagreement visible (Legal says X / Engineering says Y / Operations says Z)
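The jurisdiction logic (global < region < country < BU < team < exception) can be expressed as a precedence key. A hedged sketch; the field names and the tie-break order (scope specificity, then authority, then recency) are assumptions:

```python
from dataclasses import dataclass
from datetime import date

# Scope specificity per the hierarchy above: more specific scopes win.
SCOPE_RANK = {"global": 0, "region": 1, "country": 2, "bu": 3, "team": 4, "exception": 5}

@dataclass
class PolicyClaim:
    text: str
    scope: str       # one of SCOPE_RANK
    authority: int   # higher means a more authoritative source
    effective: date

def resolve(claims: list[PolicyClaim]) -> PolicyClaim:
    """Pick the winning claim: most specific scope, then authority, then recency.
    A real system should also detect near-ties and preserve the disagreement
    instead of forcing a single winner."""
    return max(claims, key=lambda c: (SCOPE_RANK[c.scope], c.authority, c.effective))
```

A team-level exception beating a global rule is the intended behaviour; the failure mode this prevents is the general retriever averaging the two into neither.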
II · Time & lifecycle
Knowledge has a metabolism. Treat it as observable, not eternal.
Temporal governance
Freshness is not “latest wins”. Knowledge has half-lives. Pricing decays in days; legal in years; org charts in weeks. Giving every document the same temporal weight is the single most common defect in production RAG.
- Knowledge half-life
- Domain-specific decay (legal slow, pricing fast, org charts fast)
- valid_from / valid_until / recheck-after
- Publication date vs approval date
- Deprecated and superseded knowledge
- Silent supersession
- Volatility score · valid-as-of · answer expiry
- Temporal paradox retrieval (old + new + migration combined into impossible answer)
- Knowledge latency (delay between reality change and RAG correctness)
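A domain-specific half-life reduces to a one-line exponential decay. A sketch; the half-life values below are invented for illustration and would be tuned per organisation:

```python
from datetime import date

# Illustrative half-lives in days, per the "legal slow, pricing fast" intuition above.
HALF_LIFE_DAYS = {"pricing": 7, "org_chart": 30, "legal": 1825}

def confidence(initial: float, domain: str, published: date, today: date) -> float:
    """Confidence halves once per domain half-life elapsed since publication."""
    age_days = (today - published).days
    return initial * 0.5 ** (age_days / HALF_LIFE_DAYS[domain])
```

The same function also gives a recheck-after trigger for free: schedule owner validation when confidence crosses a domain threshold rather than on a fixed calendar.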
Lifecycle & memory tiers
Knowledge has a life. CREATED → ACTIVE → AGING → STALE → DEPRECATED → ARCHIVED → DESTROYED. Hot operational, warm contextual, cold archival, fossil. Without explicit lifecycle, the index is a graveyard pretending to be alive.
- Knowledge lifecycle states · probabilistic knowledge death
- Confidence decay over time
- Memory tiers: hot, warm, cold, fossil
- Sliding knowledge storage · compression over time
- Pattern distillation from repeated cases
- Knowledge end-of-life
- Intentional forgetting (temporary exceptions, crisis workarounds, deprecated architecture)
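The lifecycle states can be enforced as an explicit transition table rather than implied by timestamps. A sketch; the extra edges (e.g. AGING back to ACTIVE on revalidation) are assumptions layered on the linear chain above:

```python
# Allowed lifecycle transitions; the AGING -> ACTIVE edge (revalidation) is an assumption.
TRANSITIONS = {
    "CREATED": {"ACTIVE"},
    "ACTIVE": {"AGING", "DEPRECATED"},
    "AGING": {"STALE", "ACTIVE"},
    "STALE": {"DEPRECATED"},
    "DEPRECATED": {"ARCHIVED"},
    "ARCHIVED": {"DESTROYED"},
    "DESTROYED": set(),
}

def transition(state: str, new_state: str) -> str:
    """Refuse illegal lifecycle moves, e.g. resurrecting an archived claim in place."""
    if new_state not in TRANSITIONS[state]:
        raise ValueError(f"illegal transition {state} -> {new_state}")
    return new_state
```

The value is the refusal path: a claim cannot silently jump from ARCHIVED back to ACTIVE; it has to re-enter through ingestion with fresh provenance.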
Knowledge weather & metabolism
Knowledge breathes. Crisis spikes create workaround truth; layoffs evaporate tribal knowledge; reorgs break authority chains; new CTOs change ontologies. Treat the climate as observable.
- Knowledge weather (crises, layoffs, reorgs, leadership changes)
- Knowledge oxygen (usage / validation / challenge keep memory alive)
- Heat score (frequency × growth × diversity × criticality)
- Sudden popularity spike as organisational signal
- Semantic drift of terms (agent · assistant · client · VIP · production-ready)
- Epistemic debt · knowledge entropy · cognitive technical debt
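The heat score is a straight product of the four signals. A minimal sketch, assuming growth, diversity, and criticality are already normalised to [0, 1] and frequency is a raw access count:

```python
def heat_score(frequency: float, recent_growth: float,
               query_diversity: float, criticality: float) -> float:
    """Heat = frequency x recent growth x query diversity x business criticality.
    Normalisation of the inputs is assumed, not shown."""
    return frequency * recent_growth * query_diversity * criticality
```

A multiplicative form is deliberate: a document that is frequently hit but never growing, or critical but never queried, should not read as hot.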
III · Provenance, claims & risk
From document-level to claim-level, with lineage and risk-tiered behaviour.
Provenance & lineage
Where did this answer come from? Which documents? Which versions? Which models touched it? Without lineage, every confident output is unfalsifiable. Provenance is the spine of trust.
- Source lineage in answers
- Knowledge provenance fingerprints (human vs AI-created)
- Model-touch lineage · prompt lineage
- Cryptographic-like knowledge lineage
- Source influence tracking
- Truth ledger: question, answer, sources, claims, confidence, date, feedback
- Cognitive supply chain: meeting note → slide → wiki → embedding → LLM → summary
- Compression depth · transformation lineage · semantic loss score
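The lineage idea can be sketched as a hash chain: each transformation step in the cognitive supply chain commits to its parent's fingerprint. The record layout below is an assumption, not a proposed format:

```python
import hashlib
import json
from typing import Optional

def fingerprint(artifact_text: str, parent_fp: Optional[str],
                actor: str, step: str) -> str:
    """Hash-chained provenance record: the fingerprint commits to the content,
    the actor (human or model), the transformation step, and the parent record."""
    record = {
        "parent": parent_fp,
        "actor": actor,
        "step": step,
        "content_hash": hashlib.sha256(artifact_text.encode()).hexdigest(),
    }
    return hashlib.sha256(json.dumps(record, sort_keys=True).encode()).hexdigest()
```

Chaining meeting note → slide → wiki → summary this way means any downstream answer can name the exact path that produced it, and any edit anywhere in the chain changes every fingerprint after it.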
Claim-level units
A document is too coarse. The atomic unit is the claim: a single assertion with an ID, lineage, status, and confidence. Documents are bags of claims; some are still true, some are not.
- Atomic knowledge units · claim IDs · claim lineage
- Superseding claims · contradicted claims
- Claim confidence
- Governed Claim Graph RAG
- Claim status checking before generation
- Document-level versioning is not enough
- Post-generation claim verification (split answer; supported / unsupported / contradicted)
- Last-mile truth problem (LLM compresses correct source into wrong answer)
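Post-generation verification can be sketched as labelling each generated claim against the claim store. Matching by exact text here is a deliberate simplification; a real system needs entailment checking, and the store shape is an assumption:

```python
def verify_answer(answer_claims: list[str], claim_store: dict[str, str]) -> dict[str, str]:
    """Label each claim in a generated answer as supported, contradicted,
    or unsupported. claim_store maps claim text to its governed status
    ('active' or 'contradicted'); anything absent is unsupported."""
    labels = {}
    for claim in answer_claims:
        status = claim_store.get(claim)
        if status == "active":
            labels[claim] = "supported"
        elif status == "contradicted":
            labels[claim] = "contradicted"
        else:
            labels[claim] = "unsupported"
    return labels
```

This is the last-mile check: even when retrieval was correct, the generated answer is split back into claims and each one must re-earn its status before leaving the system.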
Risk & blast radius
Not every wrong answer costs the same. A misquoted blog post is a typo; a misquoted clinical guideline is a lawsuit. Risk-tier the knowledge, route the answer accordingly.
- Blast radius of knowledge (low / medium / high / critical)
- Risk-based answer behaviour
- Epistemic routing by risk tier
- High-risk stale claim review
- Hot knowledge prioritised for validation, cached, monitored
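Risk-tiered answer behaviour can be a simple policy table keyed by blast radius. The specific rows below are illustrative, not prescribed:

```python
# Illustrative policy rows; a real deployment would set these per domain.
POLICY = {
    "low":      {"answer": True,  "require_citation": False, "human_review": False},
    "medium":   {"answer": True,  "require_citation": True,  "human_review": False},
    "high":     {"answer": True,  "require_citation": True,  "human_review": True},
    "critical": {"answer": False, "require_citation": True,  "human_review": True},
}

def answer_behaviour(blast_radius: str) -> dict:
    """Route answer behaviour by risk tier: at 'critical', refuse and escalate."""
    return POLICY[blast_radius]
```

The design point is that the refusal at the critical tier is a policy decision made before generation, not a confidence threshold applied after it.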
IV · Failure, antibodies & memory beyond semantics
Name the pathologies. Build defences. Model the memory types AI hasn’t reached yet.
Failure modes & antibodies
Catalog the pathologies by name. Once a failure mode has a name, it can be detected, monitored, and prevented.
- Zombie knowledge · momentum score
- Memory scar tissue (emergency fixes become permanent retrieved truth)
- Organisational hallucination (institutional myths becoming truth)
- Knowledge nepotism (circular citation, false consensus)
- Retrieval gravity / monoculture / popularity bias
- Knowledge capitalism (exec docs over-weighted, niche engineering invisible)
- Memory antibodies · defensive anti-knowledge · bad-answer memory
- “When question looks like X, do not answer Y” rules
- Synthetic-content collapse · summaries of summaries · knowledge inbreeding
- Semantic drift · embeddings freeze old semantics
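A memory antibody is a pre-emptive refusal rule of the form “when the question looks like X, do not answer Y”. A sketch in which both the pattern and the policy example are hypothetical:

```python
import re

# Each antibody pairs a question shape with a known-bad answer shape and a refusal.
# The VIP-discount rule below is a hypothetical example, not from the source text.
ANTIBODIES = [
    {
        "question_pattern": r"(?i)discount.*\bvip\b|\bvip\b.*discount",
        "forbidden_answer": r"(?i)15%",
        "refusal": "That discount figure is deprecated; ask the pricing owner.",
    },
]

def apply_antibodies(question: str, draft_answer: str) -> str:
    """Intercept a draft answer that matches a known wrong-answer shape."""
    for ab in ANTIBODIES:
        if re.search(ab["question_pattern"], question) and \
           re.search(ab["forbidden_answer"], draft_answer):
            return ab["refusal"]
    return draft_answer
```

Unlike deleting the stale claim, the antibody survives the zombie copies: even if the 15% figure still lives in an old deck and retrieves cleanly, it cannot reach the user.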
Garbage collection & self-healing
A knowledge base without a metabolism is a landfill. Detection of staleness, duplicates, contradictions, orphans, and dead links must be continuous, not annual.
- Knowledge garbage collector
- Obsolete · duplicate · orphan · contradiction · expired-source detection
- Self-healing: merge duplicates, flag stale, propose deprecations
- Ask owners for validation · generate changelogs · create migration notes
- Embedding drift · stale vector indexes · deleted-but-still-embedded · updated-but-not-re-embedded
- Chunk-level staleness
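One sweep of the knowledge garbage collector can be sketched as a set of continuous detectors. The claim record shape (`id`, `text`, `owner`, `valid_until`) is an assumed schema, and duplicate detection by exact text is a simplification standing in for semantic deduplication:

```python
from datetime import date

def garbage_collect(claims: list[dict], today: date) -> dict[str, list[str]]:
    """Flag expired, unowned (orphan), and duplicate claims in one pass.
    Returns a map of claim id to the list of issues found."""
    flags = {}
    seen_text = {}
    for c in claims:
        issues = []
        if c["valid_until"] and c["valid_until"] < today:
            issues.append("expired")
        if not c["owner"]:
            issues.append("orphan")
        if c["text"] in seen_text:
            issues.append(f"duplicate-of:{seen_text[c['text']]}")
        else:
            seen_text[c["text"]] = c["id"]
        if issues:
            flags[c["id"]] = issues
    return flags
```

The output feeds the self-healing loop above: duplicates become merge proposals, orphans become owner-validation requests, and expired claims become deprecation candidates rather than silent deletions.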
Beyond semantic memory
RAG models semantic memory only. Real organisations also run on procedural, episodic, emotional, political, and tacit memory. Modeling the others is where AI memory has barely started.
- Memory stratification: procedural / semantic / episodic / emotional / political / tacit
- Knowledge dark matter (intuition, political context, hidden dependencies)
- Failure memory · counterfactual memory (rejected architectures, dead ends, why decisions failed)
- Expertise evaporation (senior people leave, edge-case intuition disappears)
Pathology catalog
A discipline matures when its failure modes have names. Once named, they can be detected, monitored, and mitigated.
Zombie knowledge
Deprecated knowledge still alive in prompts, dashboards, code, embeddings, onboarding.
Memory scar tissue
An emergency fix from two years ago, retrieved today as the canonical answer.
Organisational hallucination
An institutional myth (“legal never approves this”) becomes ground truth — and retrieves cleanly.
Knowledge nepotism
Documents cite each other in a closed loop. Vector search amplifies the loop into consensus.
Retrieval gravity
The same five popular documents win every retrieval. Diversity collapses; correct niche docs starve.
Knowledge capitalism
Polished executive decks over-weighted; precise engineering wikis invisible.
Knowledge inbreeding
Models trained on synthetic data trained on synthetic data. Semantic entropy rises; tails disappear.
Temporal paradox
Old policy + new policy + migration note + exception combined into a historically impossible hybrid answer.
Semantic drift
“VIP customer” meant one thing in 2022 and another in 2026. Embeddings freeze the old meaning.
Knowledge latency
Reality changes → humans notice → docs update → embeddings reindex → retrieval reflects truth. The gap is the latency.
Last-mile truth problem
The LLM had the right source and produced the wrong answer anyway.
Cognitive supply chain damage
Meeting note → slide → wiki → embedding → summary. Each transformation drops nuance, exceptions, warnings.
Expertise evaporation
Senior expert leaves. KB looks complete. Intelligence collapses three months later when an edge case fires.
What is new in this framework
Most concepts here exist somewhere in the literature. The contribution is the integration — and a small set of constructs that, to our knowledge, do not yet have crisp names elsewhere:
- Heat score. frequency × recent growth × query diversity × business criticality — operational hotness as a measurable signal
- Memory antibodies. first-class defensive anti-knowledge: when a question shape predicts a wrong answer, refuse before retrieval
- Memory scar tissue. the asymmetric persistence of crisis workarounds in a system with no expiry policy
- Cognitive supply chain. the human + AI transformation pipeline from raw signal to retrievable artifact, instrumented for semantic loss
- Knowledge nepotism. circular citation between in-corpus documents producing false consensus under vector retrieval
- Knowledge oxygen. usage / validation / challenge as the active ingredients that keep correct knowledge trustworthy
- Epistemic routing. querying the right truth engine (legal / operational / scientific / financial / personal) instead of one general retriever
- Knowledge weather. the climate-level dynamics of crises, layoffs, reorgs, and leadership change as observable signals on KB behaviour
- Truth ledger. append-only QA log with claim IDs + confidence + downstream feedback as the substrate for self-healing
- Counterfactual + failure memory. rejected paths, dead ends, and incident scars as first-class retrievable knowledge
Glossary
Reference for the working terms. Treat as living: terms will be added, retired, and revised as the framework gets used.
- Atomic knowledge unit
- A single self-contained claim with an ID, source, validity window, and confidence — not a document, not a chunk.
- Authority level
- The bounded right of a source to make claims in a domain. Time-scoped, version-scoped, jurisdiction-scoped.
- Blast radius
- The cost of a wrong answer if it propagates. Tiered as low / medium / high / critical to drive risk-aware retrieval.
- Claim graph
- A graph where nodes are claims (not documents) and edges are supports / supersedes / contradicts / narrows / amends.
- Cognitive supply chain
- The chain of human + AI transformations that turn raw signal into a retrievable artifact, each step potentially lossy.
- Counterfactual memory
- What was considered and rejected, and why — the negative space that prevents re-living failed paths.
- Decay function
- A model of how a claim's confidence falls over time, parameterised per domain and per claim type.
- Epistemic debt
- The accumulated cost of undocumented assumptions, conflicting metrics, obsolete definitions, and ungoverned content.
- Epistemic routing
- Sending a query to the right truth engine — legal, operational, scientific, financial — instead of one general retriever.
- Failure memory
- Stored failed migrations, rejected architectures, previous incidents, dead ends, and the reasons each failed.
- Heat score
- Access frequency × recent growth × query diversity × business criticality — the operational metric of a claim's hotness.
- Knowledge half-life
- The time after which a claim's confidence has decayed to half of its initial value. Domain-specific.
- Knowledge oxygen
- Usage, validation, challenge, citation. Without it, even correct knowledge loses trust signal.
- Memory antibody
- A defensive rule that pre-empts a known wrong answer (“if the question looks like X, do not answer Y”).
- Memory tier
- Hot operational / warm contextual / cold archival / fossil. Determines retrieval priority and validation cadence.
- Momentum score
- How loudly an obsolete idea still propagates. High momentum + low validity = zombie knowledge.
- Provenance fingerprint
- A signed lineage record of who created a claim, who edited it, which models touched it, and through which prompts.
- Recheck-after date
- Per-claim metadata that triggers an owner-validation request when crossed; fails closed if the owner is unresponsive.
- Scar tissue
- Emergency workaround code/docs that remain canonical retrieved truth long after the emergency ended.
- Silent supersession
- A new document replaces an old one without anyone marking the old one deprecated. The old one keeps retrieving.
- Truth ledger
- Append-only log of every Q&A: question, answer, sources, claim IDs, versions, confidence, timestamp, downstream feedback.
- Volatility score
- Per-claim measure of how often the underlying fact changes. Drives revalidation cadence and caching policy.
- Zombie knowledge
- Deprecated knowledge that retrieves cleanly because copies still exist in dashboards, prompts, embeddings, onboarding decks.
A discipline, not a product
Epistemic Infrastructure will not be solved by a single vendor, framework, or model release. It is the working title for the engineering and policy substrate that has to exist underneath every serious AI deployment, in the same way that observability, CI/CD, and identity are substrates today. It will be built incrementally — by ontology stewards, claim-graph engineers, decay analysts, governance reviewers, and humans who care which version of which document was the one that informed which answer.
At the limit, the moat is not the model. It is the quality, governance, and metabolism of the organisation's memory.