diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 18:35:56 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 18:35:56 -0500 |
| commit | 7ab86849e3dcc5632b8fca992587a7f050d58948 (patch) | |
| tree | d94fbf26d5cfd0e2cf813bea1925be1d4946e1e3 /results/confirmatory/persample/dfa_s42.csv | |
| parent | 0c1d102c57d86d914eb1122dd59f329667db60d8 (diff) | |
paper v2.31.10: Appendix L drift values use 3-seed means (were s42)
Appendix L claimed "per-block w2 relative displacement after 30 epochs
averages 14.3× for SB+penalty, 18.6×±0.5 for DFA+penalty, and 19.3×
for CB+penalty (three seeds each)" but the SB and CB values were
actually s42 single-seed values (14.32 and 19.27) labeled as if they
were 3-seed averages. DFA was correctly 3-seed.
Re-aggregating from results/round38_{sb,cb}_penalty_30ep_s{42,123,456}/
results_cifar10.json drift fields:
SB+pen w2: 14.32, 15.30, 14.68 → 14.77 ± 0.50 (was 14.3)
CB+pen w2: 19.27, 19.63, 18.53 → 19.14 ± 0.56 (was 19.3)
SB+pen embed: 7.10, 6.87, 6.88 → 6.95 ± 0.13 (was 7.1)
CB+pen embed: 44.57, 47.27, 47.18 → 46.34 ± 1.53 (was 44.6)
DFA+pen w2: 18.6 ± 0.5 ✓ (correct)
DFA+pen embed: 94.6 ± 1.4 ✓ (correct)
The mechanism conclusion is unchanged: CB's per-block drift is still
~30% larger than SB's, embed drift is still ~7× larger; DFA still has
the largest embed updates of any method. CB and DFA still ≈9.3 pp
below State Bridge in final accuracy.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'results/confirmatory/persample/dfa_s42.csv')
0 files changed, 0 insertions, 0 deletions
