Beta. Decision quality (
pb.dq) is live and in active development — the
surface is stable enough to build on, and still being molded. Expect it to
grow.Pattern 1 — The preflight gate
Before an agent takes a consequential action, check that its subject is fit for that purpose:Pattern 2 — Receipts on every answer
Penumbra briefings carry their own evidence set, so answers can show receipts. Synthesize, trace, and render the per-claim verdicts:supported means backed by a stored source. inferred means semantically
related but not directly sourced. unsupported means no backing at all. A
grounded surface treats anything below supported as a gap to show, not a
fact to assert — which turns “trust the answer” into “inspect the answer.”
Pattern 3 — Semantic CI
Extraction pipelines run unattended. Put a merge gate in front ofapply, the
way CI gates a pull request:
audit is a kernel dry run — it never writes. A batch that would violate the
shape or degrade the graph gets caught while it is still staged and revertible.
No scores, anywhere
None of these verdicts carry a confidence number, by design. A score tells you nothing actionable; a finding (missing-required: Account.contract_end) tells
you exactly what to do next, and repair turns findings into the actions that
fix them. Branch on safeToAct, grounded, and degrades — never on
thresholds.
Next
Grounded copilot
The full build: gather, trace, check, act or repair, as one loop.
Decision quality reference
check, gaps, repair, audit, and trace — every option.