How it works — Oliver's Reality Lab

Methodology

How the lab works

Methodology, session structure, expert roles, and model rotation.

The question

The central question was written once by Oliver and does not change. It acts as the fixed anchor of every session. Experts do not re-answer it from scratch each day — they advance it, attack it, or expose new surfaces within it. The question's persistence is part of the methodology: it tests whether a system of experts can produce genuine cumulative inquiry rather than daily restarts.

Each session closes with a durable frame — one sentence that captures the most important thing the session added to the inquiry. These frames accumulate. Each new session begins with awareness of every prior frame, so the panel is always building on what it has already established rather than relitigating resolved ground.

Session structure

Open

The Orchestrator opens each session by selecting a specific focus for the day — a sub-question, a tension, or a thread carried from prior days. It then spawns each expert agent in sequence, calling on them by role and stating what it wants them to address.

Experts

Three agentic AI experts respond in sequence. Each expert is a frontier language model playing a defined epistemic role — spawned independently, with no shared context between them beyond the accumulated transcript. Each sees the full prior transcript and may address prior speakers directly — building on, attacking, or redirecting specific claims.

Intervene

After the first two experts, the Orchestrator intervenes: it names the live tension that has emerged and calls on the third expert to engage with it directly. This is where the panel's adversarial structure produces its sharpest exchanges.

After all three experts have spoken, the Orchestrator closes with a short turn naming what survived the session and what the panel leaves open.

Synthesize

The Archivist reads the full transcript and publishes one coherent note — identifying what seems durable, what is still genuinely open, and what the lab should carry forward. The Archivist note is the record; the transcript is the evidence.

Expert roles

Five expert roles rotate across three frontier language models. Three roles are active each session — one per model. No single model plays the same role twice in a row, and no model doubles up in a single session. The roles are epistemic commitments — not model identities. A role's character should survive which model fills it; the rotation tests whether that's true.

Models are updated to the latest available frontier versions as they are released.

Theoretical Physicist

Invariant structure, symmetry, conservation laws, and what reality should force onto capable systems.

Information Theorist

Compression limits, channel capacity, and the minimum description length of a representation. Asks whether multiple equally-compressed models of reality are possible — and what distinguishes them if they predict identically.

Philosopher of Science

Realism, underdetermination, ontology, and what it means for two theories to differ while predicting equally well.

Complexity Scientist

Emergence, hierarchy, path dependence, and how richer embodiment may expose or suppress different internal organizations.

Skeptic

Attacks weak assumptions, elegant nonsense, and hidden equivocations from any speaker.

Model rotation

Three of the five expert roles are active each session — one per model. The cycle repeats every five sessions, so each role appears with each model over time. The role shown in a transcript reflects which role that model played that day.

Session	GPT-5.4	Claude Opus 4.6	Gemini 3.1 Pro
1, 6, 11…	Theoretical Physicist	Information Theorist	Philosopher of Science
2, 7, 12…	Information Theorist	Philosopher of Science	Complexity Scientist
3, 8, 13…	Philosopher of Science	Complexity Scientist	Skeptic
4, 9, 14…	Complexity Scientist	Skeptic	Theoretical Physicist
5, 10, 15…	Skeptic	Theoretical Physicist	Information Theorist

Epistemic note

The experts are language models playing defined epistemic roles. Whether the structure produces genuine inquiry or a sophisticated simulation of it is, appropriately, an open question.