EVIDENCE_SUMMARY: round 18 language softening on CNN + penalty audit

author: YurenHao0426 <Blackhao0426@gmail.com> 2026-04-07 23:56:33 -0500
committer: YurenHao0426 <Blackhao0426@gmail.com> 2026-04-07 23:56:33 -0500
commit: cbe851cf382a2af13037304afdd783214bad5c6b (patch)
tree: 2c193c63a61ebf25183f00058b6a517faeabd5f3 /protocol
parent: ec20a776e0c3e026236942fe99f3840a39e211fd (diff)
1 files changed, 16 insertions, 8 deletions
diff --git a/protocol/EVIDENCE_SUMMARY.md b/protocol/EVIDENCE_SUMMARY.md
index f784d2f..93f3968 100644
--- a/protocol/EVIDENCE_SUMMARY.md
+++ b/protocol/EVIDENCE_SUMMARY.md
@@ -68,12 +68,17 @@ and the file or memory entry where the result is recorded.
 | Credit Bridge CNN | 0.325 ± 0.009 | 96× | 3e-3 | walk-back via (a) only |
 
 **Key**: diagnostic (b) NEVER fires on CNN. Without terminal LN, BP grad does
-not collapse below 1e-7. Combined with the StudentNet result, this shows
-(b) is causally specific to LN architectures. DFA CNN reaches 0.566 (much
-higher than DFA ResMLP 0.31 / DFA ViT 0.24), consistent with the
+not collapse below 1e-7. Combined with the StudentNet result, **(b) appears
+restricted to the terminal-normalized architectures we audited** (round 18
+softening: this is observational association across the architectures
+tested, not causal identification of LayerNorm). DFA CNN reaches 0.566
+(much higher than DFA ResMLP 0.31 / DFA ViT 0.24), consistent with the
 literature: classical FA papers report DFA working on shallow CNNs but
-failing on modern Transformers — the protocol gives the mechanistic
-reason (catastrophic (a)+(b) on with-LN vs mild (a) only on without-LN).
+failing on modern Transformers. On CNN the cosine remains in a measurable
+regime (Γ=0.916 for DFA), but the training trajectory exhibits extreme
+scale distortion (max-per-block growth 237×), so the headline Γ alone is
+not a trustworthy summary of learning quality even though the cosine
+itself is well-defined.
 
 Reproduce: `python -m protocol.examples.audit_cnn`
 
@@ -85,9 +90,12 @@ Reproduce: `python -m protocol.examples.audit_cnn`
 | **Penalty partial protocol audit** | Penalized DFA: (a)+(b) **PASS** (penalty fixes scale), but (d) **STILL FIRES** on 3/3 seeds (margin 1.38 ± 0.05 pp < 2 pp) | `python -m protocol.examples.penalty_partial_audit` |
 | Vanilla DFA per-layer cosine (3 seeds) | layer 0: cos = +0.42 (high), layers 1-4: cos ≈ 0 (range -0.03 to +0.03). Headline +0.07 is entirely from layer 0. | `python experiments/measure_direction_quality_existing_ckpt.py --seed 42` |
 
-The two failure modes are mechanistically separable: the penalty fixes the
-scale failure (a+b pass) but not the direction failure (d still fires).
-This is the cleanest possible separability evidence.
+The two putative failure modes are **partially dissociated by intervention**
+(round 18 softening): the penalty alleviates the scale-related diagnostics
+(a)+(b) while the frozen-baseline diagnostic (d) still fires. (d) provides
+independent evidence that poor use of depth persists after the scale
+pathology is reduced. Full mechanistic separability requires direct
+deep-block credit measurement on the penalized checkpoint (in progress).
 
 ## §5 Pipeline pitfalls reproducers
author	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-07 23:56:33 -0500
committer	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-07 23:56:33 -0500
commit	cbe851cf382a2af13037304afdd783214bad5c6b (patch)
tree	2c193c63a61ebf25183f00058b6a517faeabd5f3 /protocol
parent	ec20a776e0c3e026236942fe99f3840a39e211fd (diff)