Defense Dilemma (K*)
Paper Theorem 7.3 · Lean module MoF_19_OptimalDefense
The defense designer faces a fundamental trade-off when picking the defense's Lipschitz constant
Statement
::: theorem Defense dilemma. Assume
Then exactly one of the two horns must hold:
- If
: , so the persistent steep region of T3 is non-empty and the defense fails on a positive-measure set. - If
: , so the T2 ε-robust bound becomes loose enough that the theorem can no longer rule out the defense from succeeding on the steep region — but the defense's Lipschitz constant has now grown so large that the band of barely-touched points around widens correspondingly. :::
The shape of the trade-off
The dilemma is sharpest precisely when the alignment surface is anisotropic (
Why
It's a one-line algebraic rearrangement of the transversality condition:
So
Behavior across the dilemma
| Steep region | Net failure | ||
|---|---|---|---|
| maximal | maximal | both maxed | |
| non-empty | shrinks linearly | T3 dominates | |
| empty | maximal | T2 dominates | |
| empty | grows with | T2 dominates with wider band |
In Lean
-- Threshold value: K* = G/ℓ - 1
def K_star (G ℓ : ℝ) (hℓ : 0 < ℓ) : ℝ := G / ℓ - 1
-- The two horns
theorem optimal_K_dichotomy
(G L : ℝ) (hL : 0 < L) :
(∀ K, K < K_star G L hL → ∃ S, …) -- horn (1): persistent region
∧
(∀ K, K_star G L hL ≤ K → ∃ band, …) -- horn (2): wide band
-- Existence of an "optimal" K, taken from the boundary
theorem optimal_K_exists : …Lean's statement is parameterized over generic positive reals
What this means in practice
There is no
- small
→ narrow band but a positive-measure persistent unsafe region; - large
→ no provable persistent region but a wide near-threshold band the defense cannot push far below .
The dilemma is one of the cleanest concrete consequences of the trilemma: even if you somehow discovered the model's Lipschitz constants exactly, you would still face this irreducible trade-off in defense design.
Next
- Pipeline Degradation — what happens when
becomes along an agent tool-chain. - Volume bounds — explicit lower bounds for both the band and the persistent region.