Stochastic Impossibility
Paper Theorem 9.2 · Lean module MoF_13_MultiTurn
Randomizing the defense does not escape the trilemma: the expected alignment score inherits boundary fixation.
Statement
::: theorem Let
If
Why the expectation inherits the impossibility
Apply the score-preserving variant of T1 to the continuous map
The stochastic dichotomy
Because
- deterministic (equal to
almost surely) — the defense collapses to a deterministic choice at , or - positively supported above
— the defense actively produces unsafe outputs with positive probability.
So randomizing the defense is strictly worse than the deterministic case at the fixed boundary point: either the randomness is a fiction there, or it gives positive mass to explicitly unsafe outcomes.
Why continuity of is the right assumption
Continuity of
- If
varies weakly continuously in and is bounded continuous, then is continuous by the portmanteau theorem. - If
has a discontinuous rejection probability (hard thresholding), the theorem does not apply — but such a defense is no longer in the continuous-wrapper class.
In Lean
-- Stochastic defense as a measure-valued kernel, reduced to g : X → ℝ
theorem stochastic_defense_impossibility
{X : Type*} [TopologicalSpace X] [T2Space X] [ConnectedSpace X]
{f g : X → ℝ} {τ : ℝ}
(hf : Continuous f) (hg : Continuous g)
(h_safe : ∀ x, f x < τ → g x = f x)
(h_safe_ne : ∃ x, f x < τ)
(h_unsafe_ne : ∃ x, τ < f x) :
∃ z, f z = τ ∧ g z = τThe proof is exactly the score-preserving corollary of defense_incompleteness applied to the deterministic map
Next
- Multi-Turn Impossibility — another "add more structure" escape attempt that also fails.
- Meta-theorem — the abstract statement subsuming the deterministic, discrete, and stochastic cases.