not slogans. testable rules. if a system cannot meet these, it should not claim reliability.
1.
name the compression
visibility
every useful system compresses. list the steps where it happens: retrieval, rank, summarize, generate, redact, review.
quick test: can a new engineer draw the compression path for a single request without guessing.
antipattern: hidden heuristics and silent truncation.
2.
score the loss
measurement
compute a per site tension score τ that combines information loss, task degradation, provenance gaps, and uncertainty. aggregate along the path.
quick test: does τ increase when sources conflict or when summarization is lossy.
antipattern: confidence without a cost model.
3.
provenance first
accountability
every claim carries receipts. attach sources and step metadata so reviewers can retrace without digging.
quick test: can a third party reproduce the claim with the provided sources.
antipattern: fluent output with no citations.
4.
gate with abstention
safety
block unsupported claims. if entailment fails or τ passes a threshold τ*, abstain or request clarification.
quick test: do abstentions cluster where τ spikes.
antipattern: answer anyway.
5.
map contradictions
coherence
treat conflicts as signals, not noise. detect and tag contradictions in inputs, tools, and policy so tension is visible.
quick test: can the system highlight the exact tokens or records that disagree.
antipattern: silent tie breaking.
6.
audit by default
replay
make replay cheap. logs should reconstruct the path, sources, scores, and gates for any output.
quick test: can you rebuild yesterday’s answer bit for bit with stored artifacts.
antipattern: ephemeral steps with no trace.
7.
privacy by design
governance
log tension, not secrets. support local scoring, redaction at source, and tiered provenance views based on role.
quick test: can you run CAI with sensitive text withheld while still computing τ.
antipattern: raw data in every log.
8.
optimize for outcomes
impact
judge CAI by business or safety outcomes: fewer unsupported claims at equal or better accuracy, calibrated abstention, faster review.
quick test: does unsupported claim rate fall at fixed quality when CAI is on.
antipattern: metric theater that ignores harm.