diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-26 10:16:39 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-26 10:16:39 -0500 |
| commit | 6f88add7aed62152ed6776765917e03d5096a5cc (patch) | |
| tree | f1f5c388400aabb1f694661efbea77fbd2c2c72b /results/cifar_depth_scan_multiseed.log | |
| parent | a501c1c84b6ac4ff7dbf2e4b92cebd3122eb7abe (diff) | |
CIFAR-100 d=256 L=4: both FA and DFA fail — strongest qualifying setting
CIFAR-100 on the SAME architecture as the main CIFAR-10 audit (d=256 L=4
pre-LN ResMLP) is a setting where BOTH FA and DFA fall below the frozen-
blocks baseline at ALL 3 seeds while reporting positive cosine.
Frozen baseline (BP-frozen, 2 seeds): 0.177, 0.178 → mean ~0.178
Methods (3 seeds, 100ep):
seed BP DFA FA
42 0.319 0.088 0.146
123 0.322 0.087 0.121
456 0.322 0.089 0.131
s456 diagnostics (only seed with full JSON — others being re-run):
DFA: cos=+0.030 (positive), h_L=1.9e8, g_L=1.0e-8
FA: cos=+0.247 (positive), h_L=2.3e5, g_L=1.3e-6
BP: cos=+1.000 (trustworthy), h_L=192, g_L=9.7e-4
This is STRONGER than d=512 L=2 CIFAR-10 because:
1. Same architecture as the paper's main audit (d=256 L=4)
2. ALL 3 seeds qualify (not just 3/10)
3. Large margin: FA 4.7pp below frozen, DFA 8.9pp below frozen
4. Standard reporting pair (acc + cos) would NOT walk back either
Also added: CIFAR-100 dataset support in cifar_resmlp.py and
resmlp_frozen_blocks_baseline.py.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'results/cifar_depth_scan_multiseed.log')
0 files changed, 0 insertions, 0 deletions
