diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 12:53:00 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 12:53:00 -0500 |
| commit | 3919f5a7db4abb699af150107bb1a2c587c14cfd (patch) | |
| tree | 8e150567bde28ba316a077b829fa36e9137a60c0 /results/confirmatory/cifar_d512/bp_s42.json | |
| parent | c201cb31018b35bf88482f7dc768b8f7a057703b (diff) | |
Round 41 (Appendix L): add nudging-test functional evidence for cos-vs-acc dissociation
The nudging test values were already in the round 38 JSON under
diag['nudging']['0.01'] but never used. Extracted and added to Appendix L:
SB+penalty: deep nudge delta = -1.78e-3 (per-layer, eta=0.01)
CB+penalty: deep nudge delta = -0.45e-3 (per-layer, eta=0.01)
A single step of size eta=0.01 in each method's credit direction decreases
the test loss by 1.78e-3 (SB) vs 0.45e-3 (CB) — a 4x gap in functional
loss decrease that EXACTLY INVERTS the 4x deep-cosine gap between the
methods. This is the direct functional measurement for the
'angular agreement is not sufficient' claim.
Combined with the drift diagnostic (larger CB updates), the cos-vs-acc
mechanism hypothesis now has THREE independent pieces of support:
1. Test accuracy (headline: CB same as DFA, SB higher)
2. Parameter drift (CB larger updates than SB)
3. Nudging functional loss decrease (CB 4x smaller than SB)
Zero new compute — all from existing round 38 JSON data.
Main content still 9 pages exactly within E&D budget.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'results/confirmatory/cifar_d512/bp_s42.json')
0 files changed, 0 insertions, 0 deletions
