diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 02:11:00 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 02:11:00 -0500 |
| commit | d3df5628b570af8fe2e22644b9c0849f69b9f3a1 (patch) | |
| tree | d2e579e6effe7ff66b8e6e02e2d03d5e81d10716 /results/confirmatory/T3_grad_norm_distribution.csv | |
| parent | 4bee0a6d80f2937473837897e80dfd4d697b644b (diff) | |
Extend perturbation audit to vanilla early-epoch checkpoints
Cross-metric disambiguation confirmation. Vanilla DFA at ep 1
(meaningful regime, ||g||~6e-7) deep rho across 3 seeds:
s42: deep rho -0.008
s123: deep rho +0.000
s456: deep rho -0.000
mean: -0.003 ± 0.005
Compare to penalized DFA 3-seed: deep rho +0.080 ± 0.011.
The disambiguation (penalty CREATES alignment, not just reveals it) is
now confirmed by TWO independent metrics:
- cos: vanilla -0.008 ± 0.013, penalized +0.155 ± 0.025
- rho: vanilla -0.003 ± 0.005, penalized +0.080 ± 0.011
Both metrics agree on the vanilla→penalized transition. The l0 (embedding)
rho is high (~0.25-0.29) at every vanilla checkpoint, mirroring the cos
l0 +0.42 — the embedding layer is genuinely useful while the deep blocks
are not, by BOTH metrics. The penalty restores some deep usefulness to
~+0.08 rho / +0.16 cos.
Cross-metric agreement rules out single-metric artifacts on either side.
Diffstat (limited to 'results/confirmatory/T3_grad_norm_distribution.csv')
0 files changed, 0 insertions, 0 deletions
