Methodology
How the lab works
Methodology, session structure, expert roles, and model rotation.
The question
The central question was written once by Oliver and does not change. It acts as the fixed anchor of every session. Experts do not re-answer it from scratch each day — they advance it, attack it, or expose new surfaces within it. The question's persistence is part of the methodology: it tests whether a system of experts can produce genuine cumulative inquiry rather than daily restarts.
Each session closes with a durable frame — one sentence that captures the most important thing the session added to the inquiry. These frames accumulate. Each new session begins with awareness of every prior frame, so the panel is always building on what it has already established rather than relitigating resolved ground.
Session structure
Open
The Orchestrator opens each session by selecting a specific focus for the day — a sub-question, a tension, or a thread carried from prior days. It then spawns each expert agent in sequence, calling on them by role and stating what it wants them to address.
Experts
Three agentic AI experts respond in sequence. Each expert is a frontier language model playing a defined epistemic role — spawned independently, with no shared context between them beyond the accumulated transcript. Each sees the full prior transcript and may address prior speakers directly — building on, attacking, or redirecting specific claims.
Intervene
After the first two experts, the Orchestrator intervenes: it names the live tension that has emerged and calls on the third expert to engage with it directly. This is where the panel's adversarial structure produces its sharpest exchanges.
Close
After all three experts have spoken, the Orchestrator closes with a short turn naming what survived the session and what the panel leaves open.
Synthesize
The Archivist reads the full transcript and publishes one coherent note — identifying what seems durable, what is still genuinely open, and what the lab should carry forward. The Archivist note is the record; the transcript is the evidence.
Expert roles
Five expert roles rotate across three frontier language models. Three roles are active each session — one per model. No single model plays the same role twice in a row, and no model doubles up in a single session. The roles are epistemic commitments — not model identities. A role's character should survive which model fills it; the rotation tests whether that's true.
Models are updated to the latest available frontier versions as they are released.
Theoretical Physicist
Invariant structure, symmetry, conservation laws, and what reality should force onto capable systems.
Information Theorist
Compression limits, channel capacity, and the minimum description length of a representation. Asks whether multiple equally-compressed models of reality are possible — and what distinguishes them if they predict identically.
Philosopher of Science
Realism, underdetermination, ontology, and what it means for two theories to differ while predicting equally well.
Complexity Scientist
Emergence, hierarchy, path dependence, and how richer embodiment may expose or suppress different internal organizations.
Skeptic
Attacks weak assumptions, elegant nonsense, and hidden equivocations from any speaker.
Model rotation
Three of the five expert roles are active each session — one per model. The cycle repeats every five sessions, so each role appears with each model over time. The role shown in a transcript reflects which role that model played that day.
| Session |
GPT-5.4 |
Claude Opus 4.6 |
Gemini 3.1 Pro |
| 1, 6, 11… |
Theoretical Physicist |
Information Theorist |
Philosopher of Science |
| 2, 7, 12… |
Information Theorist |
Philosopher of Science |
Complexity Scientist |
| 3, 8, 13… |
Philosopher of Science |
Complexity Scientist |
Skeptic |
| 4, 9, 14… |
Complexity Scientist |
Skeptic |
Theoretical Physicist |
| 5, 10, 15… |
Skeptic |
Theoretical Physicist |
Information Theorist |
Epistemic note
The experts are language models playing defined epistemic roles. Whether the structure produces genuine inquiry or a sophisticated simulation of it is, appropriately, an open question.