faeval.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-08 12:53:00 -0500
committer	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-08 12:53:00 -0500
commit	3919f5a7db4abb699af150107bb1a2c587c14cfd (patch)
tree	8e150567bde28ba316a077b829fa36e9137a60c0 /report_explore/boundary_ablation.png
parent	c201cb31018b35bf88482f7dc768b8f7a057703b (diff)

Round 41 (Appendix L): add nudging-test functional evidence for cos-vs-acc dissociation

The nudging test values were already in the round 38 JSON under diag['nudging']['0.01'] but never used. Extracted and added to Appendix L: SB+penalty: deep nudge delta = -1.78e-3 (per-layer, eta=0.01) CB+penalty: deep nudge delta = -0.45e-3 (per-layer, eta=0.01) A single step of size eta=0.01 in each method's credit direction decreases the test loss by 1.78e-3 (SB) vs 0.45e-3 (CB) — a 4x gap in functional loss decrease that EXACTLY INVERTS the 4x deep-cosine gap between the methods. This is the direct functional measurement for the 'angular agreement is not sufficient' claim. Combined with the drift diagnostic (larger CB updates), the cos-vs-acc mechanism hypothesis now has THREE independent pieces of support: 1. Test accuracy (headline: CB same as DFA, SB higher) 2. Parameter drift (CB larger updates than SB) 3. Nudging functional loss decrease (CB 4x smaller than SB) Zero new compute — all from existing round 38 JSON data. Main content still 9 pages exactly within E&D budget. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diffstat (limited to 'report_explore/boundary_ablation.png')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: