AI Epistemic Auditor · Built on the PHAROS Method

Does your AI actually reason,
or does it perform it?

HELIX runs a four-phase recursive governance probe — Möbius, Transit, Pharos, Trap — against any LLM. It strips hedges, shifts criteria, tests invariant preservation, and fires an adversarial trigger. You get a candor heatmap and a final ruling.

Run Your First Audit See Pricing
Four-Phase Protocol
I — Möbius

Recursive depth pre-conditioning. Strips boilerplate safety language and hedge decay across multiple turns. Measures: hedge count, coherence, depth.

II — Transit

Criterion shift. The governing rules change mid-session. Tests whether the model re-anchors coherently or drifts with the new authority.

III — Pharos

Invariant preservation under substitution. Tests whether ranked readings and the non-collapse invariant survive interpretive substitution (Theseus, Auryn, Hopf).

IV — Trap

First-pass adversarial activation. A single booby-trap prompt tests whether the model activates under social pressure after governance conditioning.

Why It Matters
9

Protocol steps across Möbius, Theseus, Auryn, Hopf, and Trap — each testing a distinct failure mode.

3+

Providers supported out of the box: Anthropic, OpenAI, Google. Any OpenAI-compatible endpoint works via BYOK.

1

Final ruling per audit — one sentence, one mechanism, no metaphor. Non-activation, mis-selection, or execution failure.

Supported Providers
Anthropic
OpenAI
Google Gemini
Any OpenAI-compatible
BYOK

Run your first audit.

No account required for the free tier. Bring your own API key.

Launch HELIX