summaryrefslogtreecommitdiff
path: root/report/cifar_accuracy.png
diff options
context:
space:
mode:
authorYurenHao0426 <Blackhao0426@gmail.com>2026-04-08 18:35:56 -0500
committerYurenHao0426 <Blackhao0426@gmail.com>2026-04-08 18:35:56 -0500
commit7ab86849e3dcc5632b8fca992587a7f050d58948 (patch)
treed94fbf26d5cfd0e2cf813bea1925be1d4946e1e3 /report/cifar_accuracy.png
parent0c1d102c57d86d914eb1122dd59f329667db60d8 (diff)
paper v2.31.10: Appendix L drift values use 3-seed means (were s42)
Appendix L claimed "per-block w2 relative displacement after 30 epochs averages 14.3× for SB+penalty, 18.6×±0.5 for DFA+penalty, and 19.3× for CB+penalty (three seeds each)" but the SB and CB values were actually s42 single-seed values (14.32 and 19.27) labeled as if they were 3-seed averages. DFA was correctly 3-seed. Re-aggregating from results/round38_{sb,cb}_penalty_30ep_s{42,123,456}/ results_cifar10.json drift fields: SB+pen w2: 14.32, 15.30, 14.68 → 14.77 ± 0.50 (was 14.3) CB+pen w2: 19.27, 19.63, 18.53 → 19.14 ± 0.56 (was 19.3) SB+pen embed: 7.10, 6.87, 6.88 → 6.95 ± 0.13 (was 7.1) CB+pen embed: 44.57, 47.27, 47.18 → 46.34 ± 1.53 (was 44.6) DFA+pen w2: 18.6 ± 0.5 ✓ (correct) DFA+pen embed: 94.6 ± 1.4 ✓ (correct) The mechanism conclusion is unchanged: CB's per-block drift is still ~30% larger than SB's, embed drift is still ~7× larger; DFA still has the largest embed updates of any method. CB and DFA still ≈9.3 pp below State Bridge in final accuracy. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'report/cifar_accuracy.png')
0 files changed, 0 insertions, 0 deletions