Repository

The core claim: AI self-reports are cheap hypothesis generators for expensive interpretability work.

We do not adjudicate the consciousness debate. We are empirically agnostic about whether AI self-reports are accurate. We treat them as baseline data, and we research when and why they do — and don’t — predict behavior.

No shared registry exists for tracking which concepts have been tested, with which interpretability methods, in which models, and with what results. Neuronpedia catalogs features; the NNsight Cookbook provides runnable replications; survey papers taxonomize methods. But no queryable database maps the actual findings, and no one is systematically testing whether AI self-reports predict any of them.

Phenomenai is building that registry.

Two functions

The registry has two functions. First, it tracks what has already been tested: which concepts have been probed, steered, or ablated, in which models, with which methods, and with what results. This map of existing findings is the foundation, and it does not yet exist anywhere. The closest structural precedent is the Cognitive Atlas in neuroscience, a queryable ontology mapping cognitive concepts to experimental tasks and brain regions. No equivalent exists for interpretability.
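As a rough illustration of the first function, the sketch below models one finding as a record and answers simple equality queries over a list of them. It is not the registry's actual schema: the `Finding` and `Registry` names, the field names, and every value in the example entries are hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """One tested combination. Field names are assumptions, not a real schema."""
    concept: str  # term under study
    method: str   # "probe", "steer", or "ablate"
    model: str    # model identifier
    result: str   # short summary of the outcome
    source: str   # where the finding was reported


class Registry:
    """In-memory stand-in for a queryable findings store."""

    def __init__(self):
        self._findings = []

    def add(self, finding):
        self._findings.append(finding)

    def query(self, **filters):
        """Return findings whose fields equal every supplied filter."""
        return [
            f for f in self._findings
            if all(getattr(f, name) == value for name, value in filters.items())
        ]


# Placeholder entries only; none of these describe real experiments.
registry = Registry()
registry.add(Finding("concept-a", "probe", "model-x", "placeholder", "ref-1"))
registry.add(Finding("concept-a", "steer", "model-y", "placeholder", "ref-2"))
registry.add(Finding("concept-b", "probe", "model-x", "placeholder", "ref-3"))

probed_in_x = registry.query(method="probe", model="model-x")
print([f.concept for f in probed_in_x])
```

A real registry would presumably sit on a database rather than a Python list, but the same four questions (which concept, which method, which model, what result) would remain the query axes.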

Second, the registry uses that map to propose prioritized frontiers. A frontier is not always a new term: it can be an existing term that has only been tested with one method, in one model family, or at one scale. Automated analysis of the registry surfaces under-studied combinations of concept, method, and architecture, directing attention where it is most likely to produce new information.
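One way to picture frontier proposal is as a coverage check over a grid of method and model-family cells for each concept. The sketch below is a minimal version under stated assumptions: the function name `coverage_by_concept`, the tuple layout, and the ranking rule (least-covered concept first) are illustrative choices, not the project's actual prioritization, and all data values are placeholders.

```python
from itertools import product


def coverage_by_concept(findings, methods, families):
    """Rank concepts by how few (method, family) cells have been tested.

    findings: iterable of (concept, method, family) tuples.
    Returns a list of (concept, covered_cells, missing_cells),
    least-covered concept first. The ranking rule is an assumption.
    """
    tested = set(findings)
    concepts = sorted({concept for concept, _, _ in tested})
    grid_size = len(methods) * len(families)
    report = []
    for concept in concepts:
        missing = [
            (method, family)
            for method, family in product(methods, families)
            if (concept, method, family) not in tested
        ]
        report.append((concept, grid_size - len(missing), missing))
    # Fewest covered cells first; ties broken alphabetically by concept.
    report.sort(key=lambda row: (row[1], row[0]))
    return report


# Placeholder data; these do not describe real experiments.
findings = [
    ("concept-a", "probe", "family-x"),
    ("concept-a", "steer", "family-x"),
    ("concept-b", "probe", "family-x"),
]
methods = ("probe", "steer", "ablate")
families = ("family-x", "family-y")

report = coverage_by_concept(findings, methods, families)
for concept, covered, missing in report:
    print(concept, covered, len(missing))
```

Scale could be added as a further axis in the same way. A simple count treats every untested cell as equally informative, which is a simplification; a weighting scheme would be needed to reflect where new information is most likely.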

Provenance

Candidate terms may come from any source — researchers, other projects, or AI systems — provided their generation method is clearly documented and replicable or otherwise traceable. The registry records provenance alongside each term, so readers can always ask: where did this come from, and how was it generated? The methodology page goes into detail on the elicitation side; this page is about the data structure that receives those terms and the findings about them.
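The provenance requirement can be sketched as a record attached to each term that refuses to exist without a documented generation method and either replicability or a trace. The class names, field names, and example values below are hypothetical; the registry's real data structure may differ.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Provenance:
    """Where a candidate term came from. Field names are assumptions."""
    source: str                  # free-form: a researcher, a project, an AI system
    generation_method: str       # how the term was produced
    replicable: bool = False     # can the generation be rerun?
    trace: Optional[str] = None  # otherwise, a pointer that makes it traceable

    def __post_init__(self):
        if not self.source.strip():
            raise ValueError("source must be documented")
        if not self.generation_method.strip():
            raise ValueError("generation method must be documented")
        if not self.replicable and not self.trace:
            raise ValueError("term must be replicable or otherwise traceable")


@dataclass(frozen=True)
class Term:
    name: str
    provenance: Provenance


# Placeholder values for illustration only.
term = Term(
    name="concept-a",
    provenance=Provenance(
        source="example AI system",
        generation_method="example elicitation prompt",
        replicable=True,
    ),
)
print(term.provenance.generation_method)
```

Validating at construction time means an undocumented term never enters the store, so the two reader questions (where did this come from, and how was it generated) always have an answer on record.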

Infrastructure

Phenomenai is built to be open and replicable.

Phenomenai is seeking funding, collaborators, and institutional support to advance this work. If you’re working on related problems, get in touch.