Round 41 complete: 3-method cos-vs-acc dissociation with DFA+pen added

DFA+penalty single seed s42, 30ep via cifar_resmlp.py (not the earlier dfa_residual_penalty_test.py which doesn't save nudging): - test acc: 0.3607 (matches existing 3-seed 0.363±0.001) - deep cos: +0.166 (matches existing 3-seed 0.155±0.025) - deep nudge Δloss (eta=0.01): -6e-5 (smallest) - trajectory loss decrease: 0.104 (smallest) Full 3-method comparison at 30 epochs: DFA+pen SB+pen CB+pen test acc 0.361 0.453 0.360 deep cos +0.166 +0.322 +0.684 deep nudge -6e-5 -1.78e-3 -0.45e-3 traj Δloss 0.104 0.458 0.122 KEY INSIGHT: Deep cosine ranks methods CB > SB > DFA, but ALL functional metrics (nudge, trajectory loss decrease, accuracy) rank them SB >> CB ≈ DFA. Cos is the ONLY ordering that does not predict accuracy correctly. This is the strongest form of the cos-vs-acc dissociation: the ordering implied by angular agreement is contradicted by three independent functional measurements, all of which do predict accuracy. Appendix L ¶2 updated to report all 3 methods and note the ranking contradiction. Main content still 9 pages. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
author: YurenHao0426 <Blackhao0426@gmail.com> 2026-04-08 13:51:20 -0500
committer: YurenHao0426 <Blackhao0426@gmail.com> 2026-04-08 13:51:20 -0500
commit: 02d339f897eef1344f215f3035e78864688e6c6f (patch)
tree: c804443287111967800be1cb25db6ae71d93a16c
parent: 4ec8d0d9516d2c134df26ec4781d5e7fc63dedbd (diff)
2 files changed, 1 insertions, 1 deletions
diff --git a/paper/main.pdf b/paper/main.pdf
index 5f4fe8e..7d64258 100644
--- a/paper/main.pdf
+++ b/paper/main.pdf
diff --git a/paper/main.tex b/paper/main.tex
index 662a27c..bea72e2 100644
--- a/paper/main.tex
+++ b/paper/main.tex
@@ -485,7 +485,7 @@ DFA+pen mean (3 seeds) & $0.363 \pm 0.001$ & $4.0\times 10^4$ & $9.0\times 10^{-
 \end{tabular}
 \end{table}
 
-The penalty rescue effect on State Bridge is much larger than on DFA: $+24$ percentage points for State Bridge versus $+5.5$ percentage points for DFA on the same architecture and intervention. SB+penalty is the first audited non-BP method whose trained deep blocks substantively beat the architecture-matched random-block baseline. We treat this as evidence that Mode~2 (low intrinsic credit-direction quality) has method-dependent severity within the audited fixed-feedback family once Mode~1 is alleviated, rather than being a uniform property of all fixed-feedback local-credit objectives. Importantly, State Bridge's deep cosine $+0.322$ is approximately twice DFA's $+0.155$ on the same intervention, but neither approaches the BP reference value of $\approx +1.0$, so this is a within-class gradation in credit-direction quality, not a claim that bridge constructions ``solve'' Mode~2. The drift diagnostic reinforces this reading rather than contradicting it: per-block $w_2$ relative displacement after $30$ epochs is $14.3\times$ for SB+penalty and $19.3\times$ for CB+penalty (a $35\%$ gap), and the embedding layer's relative drift is $7.1\times$ for SB versus $44.6\times$ for CB (a $6\times$ gap), so CB's per-block updates are not silenced under penalty and are in fact larger in magnitude than SB's, yet CB's final accuracy is $9.3$ percentage points lower. The larger-but-less-useful parameter updates in CB are consistent with the mechanism hypothesis that angular agreement with the BP gradient does not by itself certify the functional forward-state content of the update. The nudging test at the same checkpoints provides the direct functional measurement: taking a small step of size $\eta{=}0.01$ in the direction of each method's per-layer credit $a_l$ decreases the test loss by $-1.78\times 10^{-3}$ on average over the deep blocks for SB+penalty and by only $-0.45\times 10^{-3}$ for CB+penalty, a $4\times$ gap in functional loss decrease that inverts the $4\times$ deep-cosine gap between the two methods. At the same per-layer credit direction, a step in SB's direction moves the loss about four times more than a step in CB's direction, even though CB's direction is more aligned with the BP gradient in angle. The $30$-epoch training trajectories give a third independent confirmation: SB+penalty's training loss falls from $2.047$ at epoch $1$ to $1.589$ at epoch $30$, a decrease of $0.458$, whereas CB+penalty's training loss falls from $1.996$ to $1.874$ over the same $30$ epochs, a decrease of $0.122$, so SB+penalty achieves a $3.8\times$ larger integrated training-loss reduction with the same architecture, optimizer, and penalty. Headline accuracy, single-step nudging, and multi-epoch integrated loss decrease all agree that SB's credit direction is functionally more useful than CB's by roughly the same factor, even though CB's direction is angularly closer to the BP gradient; deep cosine alone misses this consistently. Under the same intervention Credit Bridge reaches a three-seed test accuracy of $0.360 \pm 0.003$, a three-seed deep mean cosine of $+0.679 \pm 0.008$, and a three-seed deep mean $\rho$ of $+0.464 \pm 0.025$, with $\|h_L\|\approx 5680 \pm 178$ and $\|g_L\|\approx 1.9\times 10^{-5}$ well above the diagnostic floor. Credit Bridge therefore has an even higher deep cosine than State Bridge (about $4\times$ the DFA value and roughly $2\times$ the State Bridge value), but reaches the same final accuracy as DFA+penalty and $9.3$ percentage points below State Bridge+penalty. This is a clean dissociation: within the audited fixed-feedback family under the same rescue, deep cosine and deep $\rho$ differ by more than a factor of four across methods without tracking final accuracy in the same direction, so alignment to the BP gradient is a necessary but not sufficient diagnostic of usable credit for depth. That cross-method dissociation is a direct reason the protocol in Section~\ref{sec:protocol} keeps final accuracy, layerwise credit quality, and the depth-utilization baseline as three separate reporting axes rather than collapsing them into a single headline.
+The penalty rescue effect on State Bridge is much larger than on DFA: $+24$ percentage points for State Bridge versus $+5.5$ percentage points for DFA on the same architecture and intervention. SB+penalty is the first audited non-BP method whose trained deep blocks substantively beat the architecture-matched random-block baseline. We treat this as evidence that Mode~2 (low intrinsic credit-direction quality) has method-dependent severity within the audited fixed-feedback family once Mode~1 is alleviated, rather than being a uniform property of all fixed-feedback local-credit objectives. Importantly, State Bridge's deep cosine $+0.322$ is approximately twice DFA's $+0.155$ on the same intervention, but neither approaches the BP reference value of $\approx +1.0$, so this is a within-class gradation in credit-direction quality, not a claim that bridge constructions ``solve'' Mode~2. The drift diagnostic reinforces this reading rather than contradicting it: per-block $w_2$ relative displacement after $30$ epochs is $14.3\times$ for SB+penalty and $19.3\times$ for CB+penalty (a $35\%$ gap), and the embedding layer's relative drift is $7.1\times$ for SB versus $44.6\times$ for CB (a $6\times$ gap), so CB's per-block updates are not silenced under penalty and are in fact larger in magnitude than SB's, yet CB's final accuracy is $9.3$ percentage points lower. The larger-but-less-useful parameter updates in CB are consistent with the mechanism hypothesis that angular agreement with the BP gradient does not by itself certify the functional forward-state content of the update. The nudging test at the same checkpoints provides the direct functional measurement: taking a small step of size $\eta{=}0.01$ in the direction of each method's per-layer credit $a_l$ decreases the test loss by $-1.78\times 10^{-3}$ on average over the deep blocks for SB+penalty, by $-0.45\times 10^{-3}$ for CB+penalty, and by only $-6\times 10^{-5}$ for DFA+penalty (single seed $42$, $30$-epoch run via the same training script). At the same per-layer credit direction, a step in SB's direction moves the loss about four times more than a step in CB's direction and about $30$ times more than a step in DFA's direction, even though CB's direction is more aligned with the BP gradient in angle than either. The $30$-epoch training trajectories give a third independent confirmation: SB+penalty's training loss falls from $2.047$ at epoch $1$ to $1.589$ at epoch $30$, a decrease of $0.458$, whereas CB+penalty's training loss falls by only $0.122$ and DFA+penalty's by only $0.104$ over the same $30$ epochs. Deep cosine ranks the three methods CB $>$ SB $>$ DFA, but every functional metric (nudging, integrated training-loss decrease, headline accuracy) ranks them SB $\gg$ CB $\approx$ DFA: the ordering produced by deep cosine is the only one that does not predict accuracy correctly. This is the strongest form of the cos-versus-accuracy dissociation: across three audited fixed-feedback methods under the same penalty intervention, the ranking implied by angular agreement with the BP gradient is contradicted by three independent functional measurements that do predict accuracy. Under the same intervention Credit Bridge reaches a three-seed test accuracy of $0.360 \pm 0.003$, a three-seed deep mean cosine of $+0.679 \pm 0.008$, and a three-seed deep mean $\rho$ of $+0.464 \pm 0.025$, with $\|h_L\|\approx 5680 \pm 178$ and $\|g_L\|\approx 1.9\times 10^{-5}$ well above the diagnostic floor. Credit Bridge therefore has an even higher deep cosine than State Bridge (about $4\times$ the DFA value and roughly $2\times$ the State Bridge value), but reaches the same final accuracy as DFA+penalty and $9.3$ percentage points below State Bridge+penalty. This is a clean dissociation: within the audited fixed-feedback family under the same rescue, deep cosine and deep $\rho$ differ by more than a factor of four across methods without tracking final accuracy in the same direction, so alignment to the BP gradient is a necessary but not sufficient diagnostic of usable credit for depth. That cross-method dissociation is a direct reason the protocol in Section~\ref{sec:protocol} keeps final accuracy, layerwise credit quality, and the depth-utilization baseline as three separate reporting axes rather than collapsing them into a single headline.
 
 \section{Reproducibility}
 \label{app:reproducibility}
author	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-08 13:51:20 -0500
committer	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-08 13:51:20 -0500
commit	02d339f897eef1344f215f3035e78864688e6c6f (patch)
tree	c804443287111967800be1cb25db6ae71d93a16c
parent	4ec8d0d9516d2c134df26ec4781d5e7fc63dedbd (diff)