Skip to main content

Screening Metrics And Evidence

Parse reports screening performance as evidence, not as a guarantee. Prompt injection remains a residual-risk problem, so every metric should be read with corpus, mode, sample size, and false-positive context.

Metrics to inspect

Claim discipline

Generated fixtures and internal adversarial runs are useful regression evidence, but public claims should use frozen manifests, content hashes, holdout separation, and confidence intervals.