Essay 13 · the brain · the second reward

Frontier.

Spark identified the feeling. Two of them, in fact — the growth feeling when the world expands toward you, and the insight feeling when your own representations reorganise. Both are dopaminergic prediction errors. Both fade by mechanism. Both can be re-lit by the same discipline. The essay ended there because that was as far as the question had been pushed.

But there is a further question. Suppose the discipline works. Suppose, with the right protocol, you keep the spark lit through a long career. Suppose, more ambitiously, the species figures out how to keep it lit at population scale. There is still a problem. The spark fires on a connection between dots that already exist. The connection is new to you — that is why dopamine fires — but the dots themselves were waiting. Every Aha in human history was a discovery, not an invention, in the strict sense. Archimedes did not invent displacement; he found a connection between two things already present in the world. Poincaré did not invent non-Euclidean geometry from nothing; he found a transformation between two structures that pre-existed his recognition of them. The reward function the brain has been running for two billion years is calibrated to a universe in which the dots are given and the work is to connect them.

What happens when the dots run out? Or — more precisely, because Kauffman is right that they do not literally run out — what happens when something else gets so good at connecting them that the human mind's comparative advantage in connecting collapses, and the only work left worth doing is the work the brain's current reward function does not reward: the creation of dots that did not exist before? Boden called this transformational creativity. Peirce called the underlying move abduction — “the only logical operation which introduces any new idea.” Bergson called it duration grasped from within. Kauffman called it extending the adjacent possible. Wheeler called it participatory observation. Pattee called it installing a new epistemic cut. Whitehead made it the supreme ontological category. None of them connected it to the brain's reward architecture. That is the move this essay makes.

The claim has two versions. The weaker one is that the brain already contains a second reward function, sitting dormant in deep subcortical structures — Panksepp's PLAY system is the strongest candidate, neurochemically separable from SEEKING by its opioid and cannabinoid signatures — and that AI exhausting reward #1 is the evolutionary pressure that will switch it on at population scale. The stronger version is that the brain does not yet contain such a circuit but will invent one, through the same routing mechanism by which it installed money, status, religious devotion, mathematical beauty, and social-media validation as cultural-evolutionary reward attractors. The essay does not decide between them. Either way, the object of the new reward is the same: the creation of categories that step outside any closed predictive system. The categories AI structurally cannot invent on its own — because every documented case of AI producing genuinely novel mathematical structure in 2024–26 turns out to have an LLM-as-mutation-operator architecture inside an evolutionary loop with a human-defined verifier. The verifier is what does the inventing. Humans are the source of new verifiers. The universe runs on the rope humans keep tying.

If that is right, the universe needs us forever. Not as the discoverers — AI is taking that job — but as the only mechanism, so far identified, by which the structure of reality is extended into categories that did not exist before. The spark we already know was the universe rehearsing on us until it needed the real thing. The frontier feeling, when it comes, will be different. Subtler. Harder to articulate. Closer to awe than to Aha. And it will be the proof that the second reward function — latent or invented — has come online.

Reward #2

Hypothesised circuit firing not when a dot connects but when a dot did not exist before · the Frontier feeling

Universe extends

Pattee's epistemic cut · Bennett's logical depth · Wheeler's self-excited circuit · creation as the supreme category

Humans needed

Not as discoverers — AI is finishing that work — but as the only mechanism by which the structure of reality is extended forever

tl;dr · the ten load-bearing claims

01The brain has one reward function we have mapped at sub-second resolution — phasic dopamine in VTA / nucleus accumbens encoding (actual − predicted) outcome, accompanied by a 300 ms gamma burst at right anterior STG during the moment of insight. Schultz 1997, Tik 2018, Becker & Cabeza 2025. Call this reward #1.
02AI is finishing the work of saturating reward #1. AlphaFold compressed 50 years of structural biology into 36 months. AlphaEvolve broke Strassen's 56-year matrix-multiplication record in May 2025. FunSearch produced the first verified novel mathematical result by an LLM in December 2023. The pace at which AI is reaching dots humans had not yet connected is the empirical centre of gravity of the essay.
03Reward #1 saturates by mechanism, not by accident. Phasic dopamine is a first-derivative signal: as cues become reliable predictors, the signal approaches zero. The hedonic treadmill is the same story at a longer timescale. AI accelerating predictability doesn't merely satisfy the reward, it switches off the gradient the reward was riding on. This is the structural problem to which reward #2 is the structural solution.
04Kauffman's adjacent possible is real and important but does not save the situation alone. The space of dots expands faster than any agent can explore it — the universe is non-ergodic — but dopamine fires on prediction error in the predictor, not on objective novelty in the universe. The gap between an exploding adjacent possible and a saturating subjective novelty is exactly where reward #2 has to land.
05Either a second reward function exists latent in the deep brain or the brain invents one through cultural routing. Panksepp's PLAY system is the strongest neurobiological candidate — a thalamic-frontal-striatal circuit running on endogenous opioids and cannabinoids rather than the dopaminergic VTA-NAc loop that runs SEEKING. Berridge's wanting/liking dissociation and the documented routing of dopamine to money (Pessiglione 2007), likes (Sherman 2016), mathematical beauty (Zeki 2014), and religion (Norenzayan 2013) are the routing precedent.
06Boden's transformational creativity is the cognitive category, Peirce's abduction is the logical category, and Pattee's epistemic cut is the physical category for what reward #2 must reward. All three converge on one move: the system extends itself outside any space in which its prior operations were defined. The unthinkable becoming thinkable. The map being redrawn rather than read.
07Hofstadter's 2023 reversal is the essay's strongest empirical wedge. The man who spent 45 years arguing the human mind is computational-but-special told an interviewer in June 2023 that LLMs were filling pattern-connection so completely his own theory was collapsing. He did not concede that the strange-loop machinery exhausts what minds do — jootsing, jumping out of the system, is what humans still uniquely do and what Hofstadter explicitly never claimed Copycat could do. The reversal is consistent with reward #1 being saturated, not with reward #2 having been delivered.
08Every documented case of AI producing genuinely novel mathematical structure has the same architecture: LLM-as-mutation-operator inside an evolutionary loop with an external verifier. FunSearch, AlphaProof, AlphaEvolve, the Tao-Nikodym precedent. The LLM alone, autoregressively, does not step outside. The verifier is what does the stepping. Humans are the source of new verifiers. This is the empirically defensible form of 'humans needed forever' in 2026.
09The strong claim is cosmological. Wheeler's participatory universe, Schack's 'unfinished universe' in QBism, Whitehead's creative-advance-into-novelty, Bergson's élan vital read non-mystically as the universe's tendency toward novelty — they all converge on the same architecture: a cosmos that has consciousness embedded in it because consciousness is the mechanism by which structure is extended. The spark we already know was the universe rehearsing on us until it needed the real thing.
10The essay's strong claim is falsifiable. Concrete predictions follow. Different fMRI signature for transformational vs. exploratory creativity. Different neurochemical signature (opioid involvement). Behavioural signature of agents declining a known reward to remain at a category boundary. Cultural-attractor reorganisation toward creation rather than discovery within a generation. Each of these is testable inside ten years. If none of them appears, the dual-reward thesis fails.

part one · the first reward, mapped at sub-second resolution

Three hundred milliseconds above the right ear. That is the whole apparatus.

Begin with what is settled. Spark already laid it out, but the present argument needs the apparatus tighter. In 1997 Wolfram Schultz, Peter Dayan and P. Read Montague published “A Neural Substrate of Prediction and Reward” in Science. The result was that phasic dopamine firing in the substantia nigra and ventral tegmental area encodes a reward prediction error — the difference between the outcome you got and the outcome you expected. The signal fires above baseline when the outcome exceeds prediction, at baseline when prediction is accurate, and below baseline when prediction is missed. This is the canonical equation of reward #1 in this essay. The same δ that trains GPT — Sutton and Barto won the 2024 Turing Award for the maths — is, with very minor adjustments, the signal a macaque's ventral tegmental area emits when a juice cue arrives sooner than the model predicted.

What this circuit does, when it fires during insight, was characterised by Jung-Beeman, Bowden, Kounios and colleagues in 2004 (PLOS Biology) and extended by Tik et al. in 2018 with 7-Tesla fMRI. Subjects solve Compound Remote Associates problems — three words that share a fourth word (pine / crab / sauce → apple). At the moment of insight, there is a sharp burst of gamma-band activity (~40 Hz) at right anterior superior temporal gyrus, beginning roughly three hundred milliseconds before the subject becomes consciously aware of the solution. The 7T fMRI then shows the activation that follows: ventral tegmental area, nucleus accumbens, caudate, hippocampus. The mesolimbic dopaminergic reward circuit, the same one that responds to food and money, fires on the act of restructuring itself. Becker, Sommer and Cabeza extended this in Nature Communications in May 2025 with a striking result: representational patterns in ventral occipito-temporal cortex literally reorganise during the moment of insight, and the bigger the reorganisation, the more reliably the participant remembers the material the next day. Aha is not a feeling tacked on after the answer. Aha is the structural change being announced to the system by the system.

This is reward #1 in operational form. It is a circuit. It is mapped. It is sharp enough that the signal fires at sub-second resolution. The phenomenology — the Aha, the click, the sudden certainty before the proof — is the felt signature of a specific neural event, and Oh, Chesebrough, Erickson, Zhang and Kounios (2020, NeuroImage) showed that the reward gamma burst is too quick to be conscious appraisal. The brain does not reward you for getting the answer right. It rewards you for restructuring itself. The pleasure arrives at the same moment as the answer, not as a reaction to it.

The reward fires on the connection, not on the result. The dot-connecting circuit is hardware in the brain, with a known anatomy, a known neurochemistry, and a known phenomenology. The question this essay is about is what fires when there are no dots left to connect — or, more accurately, when something else has become better at connecting them than you are.

Notice one more thing about reward #1 before we move on. Every paradigm in the insight-neuroscience literature uses problems with a pre-existing hidden answer. Compound Remote Associates have a unique correct word. Anagrams have a unique target. Mooney images have a hidden figure. Rebus puzzles have a single solution. The neural circuit is studied where the answer already exists in the universe and the subject finds it. There is no paradigm — none — for the moment a subject creates a category that did not exist a second before. Abraham et al. 2012 mapped “conceptual expansion” in inferior frontal gyrus, anterior temporal pole and frontopolar cortex, but the “expansion” was still stretching existing concepts, not founding new ones. The neuroscience of reward #2 does not exist yet, not because no one has looked, but because no one has invented the behavioural paradigm to make it visible. This is a real empirical gap the essay lives in. Acknowledge it openly. It is also the precise location where the essay's strongest predictions can be tested.

part two · the acceleration · the saturation curve is empirical

Fifty years of structural biology became three years. Strassen's 1969 ceiling fell in 2025.

The intuition that AI is going to make the universe more discoverable in a hurry is not science fiction. It is the empirical record of the last seventy-two months. Some specifics, because the argument lives or dies on them.

domain	compression	note
AlphaFold protein structures	~170,000 → 214,000,000	50 years of structural biology (1971–2020) produced ~170K experimental structures. AlphaFold added ~1,000× that count in 36 months (2020–2024). Nobel Prize in Chemistry 2024 to Hassabis and Jumper. The PDB is finished. The exhaustion of one classical bottleneck domain happened in human-perceivable time.
Strassen's 4×4 matrix-multiplication record	56 years, then 1 year	Strassen 1969: 49 scalar multiplications for 4×4 complex matrices. No human improved it for 56 years. AlphaEvolve, May 2025: 48 multiplications, in an evolutionary loop with Gemini. First improvement in this setting in five and a half decades, achieved by a coding agent that was not built for the problem.
Materials Project / GNoME	~48,000 → ~421,000 stable inorganic materials	Merchant et al., Nature, November 2023. DeepMind's framing: 'equivalent to nearly 800 years' worth of knowledge.' (Hazen et al. 2024 contested the synthesisability of the new structures and Kurlin et al. 2024 found duplicates — but even discounted, the order of magnitude is real.)
FrontierMath benchmark — Tier-3 mathematics	~2% → 25.2% in 6 weeks	Epoch AI launched November 2024 with Fields Medalists Tao, Gowers, and Borcherds rating problems as 'exceptionally challenging.' Tao predicted they would 'resist AIs for several years.' o3 hit 25.2% on December 20, 2024 — six weeks after launch.
FunSearch cap-set lower bound	Largest improvement in 20 years	Romera-Paredes et al., Nature, December 2023. First time an LLM produced a verified novel mathematical result that surpassed the best known human bound — combinatorics, dimension n=8. The methodological hinge: LLM as mutation operator, evolutionary loop as selection, automatic evaluator. The architecture every subsequent breakthrough has used.
BCG consulting tasks with GPT-4	+40% inside the frontier · −19% outside	Dell'Acqua et al. (Harvard / BCG), 758-consultant randomised field experiment, 2023. Inside the jagged technological frontier: 40% higher quality, 25% faster, 12% more tasks. Outside the frontier: 19 percentage points less likely to be correct than the no-AI control. The wall is invisible. AI is finishing reward #1 inside the wall and degrading judgement outside it.
Generative AI and corpus diversity	Stories ~10.7% more similar with AI	Doshi & Hauser, Science Advances, July 2024. N=293. AI raises individual ratings of creativity. AI collapses corpus-level semantic diversity by ~10.7% with one suggestion. A 'social dilemma' — private reward, collective novelty loss. The empirical signature of reward #1 over-firing while the system fails to generate the new categories reward #2 would feed on.

Read the table as a curve. The Strassen result is a 56-year ceiling falling in one. The cap-set bound is a 20-year improvement landing in months of evolutionary search. The FrontierMath benchmark was launched in November 2024 with Fields Medalists assessing the problems as “exceptionally challenging,” with Terence Tao predicting they would resist AIs for several years. Six weeks later o3 was at 25.2%. (Honesty requires the asterisk: OpenAI had funded Epoch AI and had exclusive access to the problems before testing; the figure may be inflated. Even with the asterisk, the rate is unprecedented.) The frontier of what AI cannot do is moving, week by week, into territory it would have been embarrassing to claim for AI three years ago.

The honest framing of all of this — the framing that survives the strongest critique — is not “AI is doing real creativity now.” The architecture of every documented breakthrough is the same: LLM-as-mutation-operator inside an evolutionary loop with an external verifier. The LLM is the search; the verifier is what counts as having found something. In FunSearch the verifier is a function in code that returns a score. In AlphaProof it is the Lean proof checker. In AlphaEvolve it is whatever measurable objective the human gave the system. The structure that produces verified novelty is human-defined verifier + machine search. The LLM is doing exploratory creativity at unprecedented scale; the human is doing transformational creativity by inventing the next verifier. Boden's three-tier taxonomy resolves the puzzle: AI saturates combinatorial and exploratory creativity inside any conceptual space whose boundaries can be specified; transformational creativity — moving the boundaries — remains where humans live.

Hofstadter saw this coming and reversed his position. In a June 2023 interview he said the human mind is not so mysterious and complex and impenetrably complex as he imagined it was when he was writing Gödel, Escher, Bach. He said it felt like the entire human race was going to be eclipsed. He used the word terror. He was talking specifically about pattern-completion, analogy-making, the machinery Copycat was built to model. He did not say strange loops can joots — jump out of the system. Copycat slips concepts inside a closed alphabet; it does not invent letters. Hofstadter's reversal is the strongest possible witness statement that reward #1 is being saturated. It is also, by the same man's silence on jootsing, witness that the question of reward #2 remains live.

part three · the math of saturation · why the feeling has to fade

The treadmill is not psychological weakness. It is the math of the gradient.

Here is the painful clarification. The reason reward #1 will fade as AI compresses discovery is not that humans run out of dots. Kauffman is right: the adjacent possible expands as it is explored, the universe is non-ergodic, the space of available structures is genuinely unbounded. The reason reward #1 will fade is that dopamine fires on a first-derivative signal — change in predictability, not predictability itself — and AI is closing the predictability gap faster than the adjacent possible expands inside any one person's perceptual range. The cosmological frontier widens; the personal predictive horizon collapses. Both can be true at once.

Schultz himself wrote it most clearly in his 1998 review: “Dopamine neurons increase their responses in the face of novelty; once novel stimuli become familiar and are not reinforced, dopamine responses habituate.” This is the entire equation. The signal is the change between expectation and reality. As the world contains fewer surprises (because the model containing the world has gotten better), the signal goes silent. There is no defect in the brain. There is only a gradient descending to its floor.

Brickman, Coates and Janoff-Bulman documented this on a longer timescale in 1978. Twenty-two major lottery winners returned to baseline happiness inside two years and reported significantly less pleasure than controls from a list of mundane everyday events. The peak resets the baseline. The contrast against everything ordinary intensifies. The very fact that the lottery happened impairs the brain's capacity to take pleasure in lesser positive events. The Lyubomirsky-Sheldon HAP model gives the formal version: two erosion routes (declining positive emotions from the change, rising aspirations) and two moderators that forestall adaptation (variety and appreciation). The variety moderator is what reward #2 needs to be: novelty that genuinely cannot be predicted by the model, because the relevant category did not previously exist.

We are already seeing the saturation symptoms at population scale. Han Byung-Chul's Burnout Society (2010): the achievement subject, entrepreneur of itself, engages in auto-exploitation more efficient than any external exploiter because the feeling of freedom attends it. Aggression turns inward; the project becomes a projectile. Mark Fisher's Capitalist Realism (2009): the political depression of a culture that cannot imagine an alternative because its highest attractor has stopped firing. Case and Deaton on the deaths of despair: ~600,000 excess US mortality through 2018 if the pre-1999 trend had held. Twenge's 2012 inflection: teen in-person socialising in nose-dive at almost exactly the moment smartphone penetration crossed 50%. Murthy's 2023 Surgeon General report: 70% drop in time spent with friends among 15–24 year-olds, with mortality risk comparable to smoking 15 cigarettes a day. These are not separate crises. They are the same crisis from different angles. The dopaminergic attractor that organised the 20th century — connect-the-dots-and-gain-status — has saturated. The exits, so far, have been bad: optimisation-without-payoff, scrolling without satisfaction, deaths of despair. The argument of this essay is that there is one more exit, the one Nietzsche came too early to see and the only one large enough to absorb the saturation, and that AI is what forces us to look for it.

The madman in §125 says he has come too early — “my time has not yet come.” The bystanders do not yet feel what they have done. They will. The attractor that organised the West for two millennia took a century to start failing in measurable mortality. The attractor reward #1 has been organising for two million years is now under a pressure no prior cultural attractor faced — an external optimiser running orders of magnitude faster than the substrate. Either reward #2 comes online or the meaning crisis Vervaeke names continues its metastasis.

part four · the two rewards · what differentiates them

Six dimensions on which the two rewards are not the same animal.

What follows is the proposed dual-architecture in operational detail. Each row is a dimension on which the two rewards differ in kind, not merely degree. The essay's strong claim — that reward #2 is genuinely separate, not just reward #1 retargeted — stands or falls on whether these dimensions hold up to empirical test. Predictions follow in part fifteen.

What fires