diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-26 08:45:34 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-26 08:45:34 -0500 |
| commit | 9751e97dd190b8667c337215dcb70e0cab8f92ff (patch) | |
| tree | 272e3bd974c58d0d65cc03bcb9855fb1595a6b22 /paper/figures/render_fig_cos_acc_dissociation.py | |
| parent | 5937af903fdcb473cb3dd39cd3d0a86c1dbe0a05 (diff) | |
Find setting where both FA and DFA fail: d=512 L=2 ResMLP
TASK COMPLETE: Found 3/10 seeds where BOTH FA and DFA fall below
the frozen-blocks baseline while reporting positive cosine and
nontrivial accuracy — proving that the standard evaluation pair
can simultaneously miss both FA and DFA on the same setting.
Setting: d=512 L=2 pre-LayerNorm ResMLP, CIFAR-10, 100 epochs
Frozen baseline (3-seed mean): 0.349
Qualifying seeds:
seed 1: DFA=0.298 (cos +0.206), FA=0.347 (cos +0.484)
seed 2: DFA=0.297 (cos +0.179), FA=0.346 (cos +0.472)
seed 5: DFA=0.296 (cos +0.194), FA=0.341 (cos +0.492)
All qualifying cases have:
- Both methods below frozen baseline ✓
- Both methods report positive aggregate cosine ✓
- Both methods above chance (~0.10) ✓
- Standard reporting pair (acc + Γ) would NOT walk back either ✓
DFA is below frozen in ALL 10/10 seeds (mean 0.300 ± 0.009).
FA is below frozen in 3/10 seeds (mean across all 10: 0.370 ± 0.026).
Also includes:
- Frozen baselines for d=512 at L=2,4,8,12 × 3 seeds (12 runs)
- resmlp_frozen_blocks_baseline.py patched with --num_blocks arg
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'paper/figures/render_fig_cos_acc_dissociation.py')
0 files changed, 0 insertions, 0 deletions
