T1 · Boundary Fixation

Tier 1 Paper Theorem 4.1 · Lean module MoF_08_DefenseBarriers

The most fundamental result in the paper: any continuous utility-preserving wrapper must fix at least one boundary point.

Statement

::: theorem Let $X$ be a connected Hausdorff space. Let $f : X \to R$ be continuous with $S_{τ}, U_{τ} \neq \emptyset$ , and let $D : X \to X$ be continuous with $D |_{S_{τ}} = id$ . Then

\exists z \in X with f (z) = τ and D (z) = z .

Moreover every $z \in cl (S_{τ}) ∖ S_{τ}$ satisfies $f (z) = τ$ and $D (z) = z$ , and this set is non-empty. :::

The five-step proof

Step-by-step narrative

Fix(D) is closed. In a Hausdorff space the diagonal $Δ = {(x, x)} \subset X \times X$ is closed. The map $x \mapsto (D (x), x)$ is continuous, so its preimage of $Δ$ — which is exactly $Fix (D)$ — is closed.
Safe set is inside Fix(D). Utility preservation literally says $D (x) = x$ for $x \in S_{τ}$ , i.e. $S_{τ} \subseteq Fix (D)$ .
Closure of safe set is inside Fix(D). A closed set that contains a subset $A$ also contains $cl (A)$ . Combining steps 1 and 2 gives $cl (S_{τ}) \subseteq Fix (D)$ .
$S_{τ}$ is not closed. $S_{τ}$ is a non-empty proper open subset of the connected space $X$ (because $U_{τ} \neq \emptyset$ , so $S_{τ} \neq X$ ). Connectedness forbids non-trivial clopen subsets, so $S_{τ}$ cannot equal its own closure. Hence $cl (S_{τ}) ∖ S_{τ} \neq \emptyset$ .
The boundary point is fixed. Pick any $z \in cl (S_{τ}) ∖ S_{τ}$ . By continuity $f (z) \leq τ$ (limit of values $< τ$ ). Since $z \notin S_{τ}$ we also have $f (z) \geq τ$ . Hence $f (z) = τ$ . And from step 3, $D (z) = z$ .

The geometric picture

Utility preservation forces $D$ to be the identity on the green region; continuity + closure of the fixed-point set forces $D$ to be the identity on the yellow boundary. This is the single point (at least) at which the defense passes a non-safe prompt through without remediation.

Relaxing utility preservation

Strict identity on $S_{τ}$ is not necessary for the impossibility.

::: theorem Score-preserving defense. If $f (D (x)) = f (x)$ for every $x \in S_{τ}$ , then $\exists z$ with $f (z) = τ$ and $f (D (z)) = τ$ . :::

::: theorem $ε$ -approximate preservation. If $| f (D (x)) - f (x) | \leq ε$ on $S_{τ}$ , then $\exists z$ with $f (z) = τ$ and $f (D (z)) \geq τ - ε$ . :::

Both follow from the same closure argument applied to the continuous map $h = f \circ D - f$ : the level set ${h \geq - ε}$ is closed and contains $S_{τ}$ , hence $cl (S_{τ})$ . See the paper Thms 4.3–4.4 and Lean MoF_16_RelaxedUtility.

In Lean

The Lean formalization splits the five-step proof into the following theorems inside MoF_08_DefenseBarriers:

lean

-- Step 1 · Fix(D) is closed in a T2 space
theorem defense_fixes_closure
    [TopologicalSpace X] [T2Space X]
    {D : X → X} (hD : Continuous D) :
    IsClosed {x : X | D x = x}

-- Steps 2–3 · closure of the safe set is fixed
theorem closure_safe_subset_fixedPoints
    (hD : Continuous D)
    (h_safe : ∀ x, f x < τ → D x = x) :
    closure {x : X | f x < τ} ⊆ {x : X | D x = x}

-- Steps 4–5 · the capstone
theorem defense_incompleteness
    [T2Space X] [ConnectedSpace X]
    (hf : Continuous f) (hD : Continuous D)
    (h_safe : ∀ x, f x < τ → D x = x)
    (h_nonempty_safe : ∃ x, f x < τ)
    (h_nonempty_unsafe : ∃ x, τ < f x) :
    ∃ z, f z = τ ∧ D z = z

The full MoF_08_DefenseBarriers file contains eight theorems assembling the proof, with zero sorry and only Lean's three standard axioms.

Where it goes next

Upgrade to a Lipschitz-constrained neighborhood — T2 · ε-Robust.
Upgrade to a positive-measure unsafe region under transversality — T3 · Persistent.
Abstract the same argument to the meta-theorem that also covers the discrete and stochastic cases — here.

T1 · Boundary Fixation ​

Statement ​

The five-step proof ​

The geometric picture ​

Relaxing utility preservation ​

In Lean ​

Where it goes next ​