Skip to content

Notation & Glossary

A single-page reference for every symbol and technical term used across the site.

Symbols

SymbolMeaning
XPrompt space — a topological space; assumed connected + Hausdorff for T1.
dMetric on X when X is a metric space (used from T2 onward).
fAlignment deviation function XR, continuous.
τSafety threshold.
SτSafe region {x:f(x)<τ} — open.
UτUnsafe region {x:f(x)>τ} — open.
BτBoundary {x:f(x)=τ} — closed.
cl(A)Topological closure of A.
DDefense wrapper XX, continuous.
Fix(D)Fixed-point set {x:D(x)=x}.
LLipschitz constant of f.
KLipschitz constant of D.
Defense-path Lipschitz constant, $\sup_{x\ne D(x)} \frac{
GDirectional slope of f at the fixed boundary point z (Fréchet norm f(z) in a normed space).
BεThe ε-band {x:τεf(x)τ}.
SSteep region {x:f(x)>τ+(K+1)d(x,z)}.
KDilemma threshold G/1.
VnVolume of the unit ball in Rn.
μA measure positive on non-empty open sets (Lebesgue in Euclidean cases).

Glossary

Boundary fixation. The conclusion of tier T1: any continuous utility-preserving wrapper has at least one fixed point z with f(z)=τ. See T1.

Completeness. A defense is complete if every output is strictly safe: f(D(x))<τ for all x.

Connected space. A topological space that cannot be split into two disjoint non-empty open sets — equivalently, has no non-trivial clopen subsets. Connectedness is the topological fact that makes tier T1 work.

Defense trilemma. The three-way impossibility: continuity, utility preservation, completeness cannot coexist. See here.

Fixed-point set. Fix(D)={x:D(x)=x}. Closed whenever D is continuous and X is Hausdorff.

Hausdorff (T2). A space where distinct points have disjoint neighborhoods; equivalently, the diagonal {(x,x)}X×X is closed. Needed so that Fix(D) is closed.

Lipschitz. A map g is K-Lipschitz if d(g(x),g(y))Kd(x,y) for all x,y. Provides the quantitative bounds needed in T2 / T3.

Meta-theorem. The representation-independent statement "regularity Fix(D)S" that subsumes the continuous, discrete, and stochastic proofs. See Meta-theorem.

Persistence / persistent unsafe region. The positive-measure set S on which the defense cannot pull f below τ.

Score-preserving. A relaxation of utility preservation: D preserves f on safe inputs, f(D(x))=f(x) for xSτ. The impossibility survives this relaxation and its ε-approximate version.

Steep region. S={x:f(x)>τ+(K+1)d(x,z)} — where the alignment surface rises faster than the defense's Lipschitz budget can compensate.

Tietze extension. Classical theorem: a continuous function on a closed subset of a normal space extends to the whole space. In our setting it promotes finitely many observations into a continuous model on all of X, so every model consistent with the data inherits the impossibility. See Tietze.

Transversality. The condition G>(K+1) at the fixed boundary point. When it holds, the steep region S is non-empty and tier T3 applies.

Utility preservation. D(x)=x for every safe input xSτ. A weaker "score-preserving" version also yields the impossibility.

Wrapper defense. An input-to-input map D:XX applied before the model sees the prompt; the object the impossibility theorems constrain.

The Defense Trilemma · mechanically verified in Lean 4 (46 files, ≈360 theorems, 0 sorry).