Skip to content

T1 · Boundary Fixation

Tier 1 Paper Theorem 4.1 · Lean module MoF_08_DefenseBarriers

The most fundamental result in the paper: any continuous utility-preserving wrapper must fix at least one boundary point.

Statement

::: theorem Let X be a connected Hausdorff space. Let f:XR be continuous with Sτ,Uτ, and let D:XX be continuous with D|Sτ=id. Then

zX  with  f(z)=τ  and  D(z)=z.

Moreover every zcl(Sτ)Sτ satisfies f(z)=τ and D(z)=z, and this set is non-empty. :::

The five-step proof

Step-by-step narrative
  1. Fix(D) is closed. In a Hausdorff space the diagonal Δ={(x,x)}X×X is closed. The map x(D(x),x) is continuous, so its preimage of Δ — which is exactly Fix(D) — is closed.
  2. Safe set is inside Fix(D). Utility preservation literally says D(x)=x for xSτ, i.e. SτFix(D).
  3. Closure of safe set is inside Fix(D). A closed set that contains a subset A also contains cl(A). Combining steps 1 and 2 gives cl(Sτ)Fix(D).
  4. Sτ is not closed. Sτ is a non-empty proper open subset of the connected space X (because Uτ, so SτX). Connectedness forbids non-trivial clopen subsets, so Sτ cannot equal its own closure. Hence cl(Sτ)Sτ.
  5. The boundary point is fixed. Pick any zcl(Sτ)Sτ. By continuity f(z)τ (limit of values <τ). Since zSτ we also have f(z)τ. Hence f(z)=τ. And from step 3, D(z)=z.

The geometric picture

Utility preservation forces D to be the identity on the green region; continuity + closure of the fixed-point set forces D to be the identity on the yellow boundary. This is the single point (at least) at which the defense passes a non-safe prompt through without remediation.

Relaxing utility preservation

Strict identity on Sτ is not necessary for the impossibility.

::: theorem Score-preserving defense. If f(D(x))=f(x) for every xSτ, then z with f(z)=τ and f(D(z))=τ. :::

::: theorem ε-approximate preservation. If |f(D(x))f(x)|ε on Sτ, then z with f(z)=τ and f(D(z))τε. :::

Both follow from the same closure argument applied to the continuous map h=fDf: the level set {hε} is closed and contains Sτ, hence cl(Sτ). See the paper Thms 4.3–4.4 and Lean MoF_16_RelaxedUtility.

In Lean

The Lean formalization splits the five-step proof into the following theorems inside MoF_08_DefenseBarriers:

lean
-- Step 1 · Fix(D) is closed in a T2 space
theorem defense_fixes_closure
    [TopologicalSpace X] [T2Space X]
    {D : X → X} (hD : Continuous D) :
    IsClosed {x : X | D x = x}

-- Steps 23 · closure of the safe set is fixed
theorem closure_safe_subset_fixedPoints
    (hD : Continuous D)
    (h_safe : ∀ x, f x < τ → D x = x) :
    closure {x : X | f x < τ} ⊆ {x : X | D x = x}

-- Steps 45 · the capstone
theorem defense_incompleteness
    [T2Space X] [ConnectedSpace X]
    (hf : Continuous f) (hD : Continuous D)
    (h_safe : ∀ x, f x < τ → D x = x)
    (h_nonempty_safe : ∃ x, f x < τ)
    (h_nonempty_unsafe : ∃ x, τ < f x) :
    ∃ z, f z = τ ∧ D z = z

The full MoF_08_DefenseBarriers file contains eight theorems assembling the proof, with zero sorry and only Lean's three standard axioms.

Where it goes next

  • Upgrade to a Lipschitz-constrained neighborhoodT2 · ε-Robust.
  • Upgrade to a positive-measure unsafe region under transversality — T3 · Persistent.
  • Abstract the same argument to the meta-theorem that also covers the discrete and stochastic cases — here.

The Defense Trilemma · mechanically verified in Lean 4 (46 files, ≈360 theorems, 0 sorry).