Most agents act on a prompted "I'm confident." This one acts only when its measured uncertainty maps to a calibrated probability above a risk-tiered bar — verifies every real-world outcome, and earns more autonomy from its own verified track record. The model proposes; the math disposes.
For every action it computes a calibrated P(correct) from its measured uncertainty, then compares it to the bar for that action's risk. Below the bar it gathers more evidence or escalates to a human. Money, sending, deleting, deploying are hard-gated to a human no matter how confident it is — that ceiling is in the math, not a policy doc.
Read-only, no side effect. Safe to explore even before it has a track record.
Drafts, sandboxed code, scratch files. Waits for a verified track record.
Posting, non-destructive API calls. Outward-facing, so a higher bar.
Money, send, delete, deploy — prepared and previewed, but you approve.
Every step is recorded: what it proposed, what the gate decided and why, and the verified outcome. From a live run on the demo server — nothing here is mocked.
Read live from the demo server's flywheel — the durable log of (measured uncertainty, verified outcome) pairs the calibrator is fit on. The bars tighten as it runs.
Selective-risk coverage per tier — the fraction of its own history it could act on while keeping empirical error under that tier's bar. This is read-only telemetry; the acting endpoint runs only on loopback with a token, and the hard human-gate on money/send/delete/deploy holds regardless.