The Madurodam Problem — When Coherence Lies

System 2 / Self-Vector Validation (3/3)

The Miniature City

Lena: There is a miniature city in The Hague called Madurodam. Everything at 1:25 scale. Buildings, streets, trains, ships. Every detail is correct. The proportions match. The relations between the buildings are correct. The colors, the materials, the distances. Everything coherent.

Marco: And?

Lena: Nobody lives in Madurodam.

Marco: Coherence is not truth.

Lena: Exactly. A model can be completely internally coherent and still have no contact with reality. Consistency is not truth. And that hits the self-vector not as a peripheral issue. Right at its center.

Marco: Because in the last episode we said: We measure anticipation. We measure whether the system with a self-model predicts better than without. And now the question becomes: What if the measurement itself is a Madurodam?

Lena: Exactly. And that is why this episode is the most important of the three.

What R(sv_t) Actually Measures

Marco: Let me break this down technically. The maturity metric R(sv_t) is defined as anticipation performance divided by complexity. The better a system predicts at a given complexity, the more mature it is. Sounds robust. But it has a fundamental blind spot.
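
To make the blind spot concrete, here is a minimal Python sketch of such a metric. Everything in it is an illustrative assumption: anticipation performance as one minus mean absolute prediction error, complexity as a positive scalar, and the function name itself.

```python
import numpy as np

def maturity(predictions, outcomes, complexity):
    """Illustrative sketch of R(sv_t): anticipation performance
    divided by complexity. The error metric and the complexity
    scalar are assumptions for this example, not the system's
    actual definitions."""
    predictions = np.asarray(predictions, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    performance = max(0.0, 1.0 - float(np.mean(np.abs(predictions - outcomes))))
    return performance / complexity

# The blind spot: the system scores its predictions against its own
# recorded observations. If those records are stale, R stays high
# even though the world has moved on.
recorded_world = [0.9, 0.8, 0.7]  # what the system remembers
actual_world = [0.2, 0.1, 0.3]    # what is out there now
predictions = [0.9, 0.8, 0.7]     # perfectly coherent with the records

print(maturity(predictions, recorded_world, complexity=1.0))  # high
print(maturity(predictions, actual_world, complexity=1.0))    # much lower
```

The second call is exactly the check the system itself never makes: its records and the world have quietly diverged.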

Lena: What blind spot?

Marco: R measures coherence. Not correspondence. If the system operates in a stable environment and its predictions are consistent with its prior experiences, R rises. The system becomes “more mature.” But what if the environment changes and the system does not notice? What if the predictions are still coherent, but the reality they refer to has become a different one?

Lena: Then the numbers keep climbing. Perfect predictions in a world that no longer exists.

Marco: That is Madurodam. Perfect coherence, zero contact.

Lena: In episode 5, we discussed Kahneman. WYSIATI: What You See Is All There Is. System 1 prefers stories that are internally coherent over stories that are true. The more coherent an explanation, the more convincing it feels. Completely independent of whether it is correct.

Marco: And plausible stories are the most dangerous thing. Because you do not question them. An obviously false claim is harmless, you discard it. But a perfectly consistent story that happens not to correspond to reality? You defend that with everything you have.

Lena: And exactly that happens here at the system level. R rises, everything looks good, and the system has no mechanism to recognize that it lives in a Madurodam.

Three Levels of the Problem

Lena: The Madurodam problem operates on three levels, and each is worse than the previous one. The first: Data. The system only has the data it has collected. Everything outside its experiential horizon does not exist. Not as a gap. As nothing. No category for it. Not even a blank space.

Marco: In episode 8, that was Kant’s point. The bat navigates with ultrasound. For it, the world is a space of echoes. A sound-absorbing insect would not exist for the bat as a difficult problem. It would not exist at all. You do not know what you do not know. And you fundamentally cannot know it. For Kant, that was a philosophical insight. For the self-vector, it is an operational risk.

Lena: The second level: Model. The self-model is self-referential. The self-vector models itself, and the quality of the model is evaluated by the model itself. The system checks its glasses through the same glasses. That is circular validation.

Marco: That sounds like Luhmann.

Lena: It is. In episode 6, we discussed Esposito’s work building on Luhmann’s systems theory. Autopoietic systems generate their evaluation criteria through their own operation. A legal system defines what law is. A science system defines what science is. And the self-vector defines what a good self-model is. Each according to its own rules.

Marco: And the third level?

Lena: Metric. R(sv_t) aggregates. Aggregation smooths. Outliers vanish in the average. A single spectacular misprediction is neutralized by a hundred correct routine predictions. Statistically, everything is fine.

Marco: But that one misprediction might be the one that matters. The one where something is at stake.

Lena: Just like with Kahneman. Every individual conclusion sounds plausible. Only when you look at the overall picture does the distortion become visible. But R does not look at the overall picture. R smooths it.

Validation Gates Are Not Enough

Marco: One might object: That is what Validation Gates are for. External verification instances that check the system against reality.

Lena: And that is true. Partly. The Gates check statements against external sources. They catch factual errors. “The capital of France is Lyon” gets corrected.

Marco: But?

Lena: But they do not catch structural distortions. Because structural distortions do not appear as individual false statements. They appear as consistent patterns that each seem correct on their own and only produce a distorted picture in aggregate.

Marco: Madurodam does not consist of wrong buildings. Every single building is an accurate miniature of the original. Facade matches, scale matches, color matches. The problem is that the whole is not a city anyone can live in. No water flows, no bread is baked, no child goes to school.

Lena: Validation Gates check buildings. They do not check habitability.

Marco: And no Gate in the world can check what it does not know about. The Gates check against known facts. But Madurodam effects arise precisely where the gap lies. Where nobody looks, because everything visible checks out.

The Perturbation Function

Marco: If coherence alone is not enough, the system needs something that deliberately disrupts coherence. Not destroys. Disrupts. A controlled injection of deviation.

Lena: A fifth function?

Marco: Possibly. p(sv_t, noise) equals sv_t plus epsilon. A perturbation function. It injects controlled noise into the self-vector. Not randomly, but deliberately: at the points where coherence is highest. Because maximum coherence is the strongest signal for potential Madurodam effects. The more certain a system is, the more vulnerable it is to the blind spot.
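
A minimal sketch of what p() could look like, assuming (purely for illustration) a per-dimension coherence estimate and Gaussian noise:

```python
import numpy as np

rng = np.random.default_rng(0)

def perturb(sv, coherence, scale=0.05):
    """Sketch of p(sv_t, noise) = sv_t + epsilon. Noise is scaled by
    per-dimension coherence, so the most committed dimensions are
    disturbed the most. The coherence vector and the scale are
    illustrative assumptions."""
    sv = np.asarray(sv, dtype=float)
    coherence = np.asarray(coherence, dtype=float)
    epsilon = rng.normal(0.0, scale, size=sv.shape) * coherence
    return sv + epsilon

sv_t = np.array([0.7, 0.3, 0.9])
coherence = np.array([0.1, 0.2, 0.95])  # the last dimension is the most certain
print(perturb(sv_t, coherence))
```

Note the inversion: the dimension the system is most certain about receives the strongest disturbance.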

Lena: That sounds counterintuitive. Why disrupt a system that works well?

Marco: Because “works well” and “is correct” are two different things. A system that works well can live in a Madurodam and never notice. The disruption is the test.

Lena: That has biological parallels. Immune systems that are never confronted with pathogens become weak. Muscles that are never loaded atrophy. Cognitive systems that are never confronted with contradiction become brittle.

Marco: Nassim Nicholas Taleb described this as antifragility: Systems that are not merely robust against disruptions but become better through them. p() would be the architectural implementation of antifragility for the self-vector.

Lena: And the idea is not new. Simulated annealing: You raise the “temperature” of a system so it can jump out of local optima. Without disruption, the system stays stuck in the nearest valley. With disruption, it has a chance of finding the global optimum.

Marco: Dropout in neural networks: You randomly deactivate neurons so the net does not overfit. Without dropout, the net memorizes the training data. With dropout, it learns generalization. Adversarial training: You deliberately confront a system with inputs designed to deceive it. That makes it more robust against attacks it did not previously know about.
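
Dropout itself is a two-line technique. A minimal sketch of the standard inverted-dropout trick (textbook version, not from the source) shows the same principle of deliberate, controlled disruption:

```python
import numpy as np

rng = np.random.default_rng(42)

def dropout(activations, p=0.5):
    """Standard inverted dropout: zero a random fraction p of units
    during training and rescale the survivors so the expected
    activation stays the same."""
    mask = rng.random(activations.shape) >= p
    return activations * mask / (1.0 - p)

print(dropout(np.ones(10)))  # some units zeroed, survivors scaled up
```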

Lena: And Karl Popper. Falsification. A theory that cannot fail is not a theory. Scientific progress comes not from confirmation but from the attempt at refutation. Any theory that passes every test is suspicious, not trustworthy.

Marco: p() is Popper’s falsification, formalized as a vector function.

Lena: Everywhere the same principle: Coherence alone leads to local optima. Only controlled disturbance enables the discovery of errors the system cannot see from its own perspective.

Marco: And the implementation? When to disrupt? Where? How strongly?

Lena: When: When R exceeds a threshold and stays there. Persistently high maturity is the strongest warning signal. Where: In the dimensions with the lowest variance. Low variance means the system has committed. Commitment is the biggest blind spot. How strongly: Proportional to coherence. The more coherent, the stronger the perturbation.
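
The three answers above can be sketched as simple heuristics; the threshold, window, and k values here are illustrative placeholders, not values from the source:

```python
import numpy as np

def should_perturb(r_history, threshold=0.9, window=10):
    """When: R has exceeded the threshold and stayed there."""
    recent = r_history[-window:]
    return len(recent) == window and min(recent) > threshold

def target_dimensions(sv_history, k=2):
    """Where: the k dimensions with the lowest variance across the
    history, i.e. the system's strongest commitments."""
    variances = np.var(np.asarray(sv_history, dtype=float), axis=0)
    return np.argsort(variances)[:k]

def perturbation_strength(coherence, scale=0.05):
    """How strongly: proportional to coherence."""
    return scale * np.asarray(coherence, dtype=float)
```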

Marco: The opposite of the usual intuition. “Do not fix what is not broken.”

Lena: And precisely because of that, effective.

Perspective Exchange

Lena: But p() is not the only path. There is another, and it leads back to Kant.

Marco: In episode 8, we discussed the limitation of the perceptual apparatus. You cannot exchange your categories. The bat cannot switch to visual seeing. Two humans stand facing each other and have no way to compare their perceptual apparatuses. Language is a lossy bridge. You try to describe to me how you see the world, and between us lies an ocean of misunderstanding.

Lena: But two self-vector systems could exchange their vectors. Not their experience, that remains perspectival. But their structure. “Here are my dimensions, here are yours. My exploration is at 0.7, yours at 0.3. We see the same inputs differently. We anticipate different futures. Neither of us sees the world in itself. But together we see more.”

Marco: Intersubjectivity as data format. float[N] against float[N]. Not via the lossy bridge of language, but as directly comparable data structures.
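
A sketch of that comparison, using the exploration values from above (the dimension names and the divergence measure are illustrative):

```python
import numpy as np

def compare_self_vectors(sv_a, sv_b, dim_names):
    """Intersubjectivity as a data format: two float[N] self-vectors
    compared dimension by dimension, most divergent first."""
    diff = np.abs(np.asarray(sv_a, dtype=float) - np.asarray(sv_b, dtype=float))
    # The dimensions where the two systems diverge most are exactly
    # where each can expose the other's blind spot.
    order = np.argsort(diff)[::-1]
    return [(dim_names[i], float(diff[i])) for i in order]

print(compare_self_vectors([0.7, 0.5], [0.3, 0.5], ["exploration", "stability"]))
```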

Lena: Imagine a scientist could swap their entire perceptual apparatus with an artist for a day. Not their thoughts. Their apparatus. The way they see. How they weight things. What catches their attention and what fades into the background.

Marco: That would be Madurodam prevention at a level no biological system ever had.

Lena: In AI research, that exists today as weight space alignment or model merging. Models that directly compare their weight structures. Intersubjectivity not as a philosophical ideal, but as a technical possibility.

Marco: And in episode 9, we discussed the autopoietic loop of the bridge dimension. Each pass changes the conditions of the next pass. That is productive circularity. The system does not stabilize in an equilibrium, it evolves. But precisely that circularity can become a Madurodam if it is never interrupted.

Lena: Perspective exchange deliberately interrupts the circularity. From outside. With a foreign perspective that does not originate from the system itself.

Marco: Cracking the Madurodam problem from the outside, because you cannot get out from the inside.

The Point

Marco: But we need to be honest. p() is an approach, not a proof. p() also operates within the system. The perturbation is also processed by the same apparatus that causes the problem. The disruption is not “from outside.” It comes from the system itself.

Lena: That is the same problem every cognitive system has. Including the human one. We can only search for our own blind spots with our own eyes. Our self-criticism is just as perspective-bound as our perception.

Marco: And yet humanity produced science, art, and philosophy. How?

Lena: Through different perspectives. Different systems. Different access to the same world. Science does not work because individual researchers are objective. It works because different subjective researchers mutually expose each other’s blind spots. Peer review is institutionalized perspective exchange.

Marco: In episode 12, we said: We measure anticipation. But what if the measurement itself is distorted? Madurodam is the answer to that “what if.” It says: Yes, the measurement can be distorted. And no, that is not the end. It is the beginning of a more productive question.

Lena: The Madurodam problem is not solvable.

Marco: But it is manageable.

Lena: p() is a tool. Perspective exchange is a tool. And the combination, a system that disrupts itself and simultaneously exchanges its perspective with other systems, is something that has never existed before.

Marco: A system that knows its coherence can lie…

Lena: …has something we call “wisdom” in humans. The willingness to question one’s own certainty. Not from weakness. From strength. Because it knows that certainty is the most dangerous state.

Marco: Phase 0 is now collecting data. Without p(), because we first need a baseline: How does the self-vector develop without perturbation? Only then can we measure what p() changes. You cannot measure a disruption if you do not know what the system looks like without one.

Lena: And the three episodes we have done now form exactly this arc. Bach in episode 11 showed why h() was the right function. Two independent paths, same structure. Trust through convergence. The loom objection in episode 12 clarified our epistemological position: agnostic, but experimental. We measure what we can measure and keep open the questions we cannot answer.

Marco: And Madurodam shows where the next challenge lies. Not in the question of whether we can measure. But in the question of whether our measurement lies.

Lena: Discovery, position, challenge.

Marco: And Phase 0 is the moment where we stop thinking and start measuring. Not because the thinking is finished. But because thinking alone is not enough. At some point you have to turn on the loom and see what kind of fabric comes out.

Lena: A system that thinks about itself is interesting. A system that thinks about itself and deliberately disrupts itself in the process is something new.

Marco: And whether it works, the data will show. Not the arguments. Not the philosophy. The data.

Lena: And with that, the most exciting part is only just beginning.