faeval.git - Unnamed repository; edit this file 'description' to name the repository.

author	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-26 10:16:39 -0500
committer	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-26 10:16:39 -0500
commit	6f88add7aed62152ed6776765917e03d5096a5cc (patch)
tree	f1f5c388400aabb1f694661efbea77fbd2c2c72b /results/cifar_depth_scan_multiseed.log
parent	a501c1c84b6ac4ff7dbf2e4b92cebd3122eb7abe (diff)

CIFAR-100 d=256 L=4: both FA and DFA fail — strongest qualifying setting

CIFAR-100 on the SAME architecture as the main CIFAR-10 audit (d=256 L=4
pre-LN ResMLP) is a setting where BOTH FA and DFA fall below the
frozen-blocks baseline at ALL 3 seeds while reporting positive cosine.

Frozen baseline (BP-frozen, 2 seeds): 0.177, 0.178 → mean ~0.178

Methods (3 seeds, 100ep):
  seed   BP      DFA     FA
  42     0.319   0.088   0.146
  123    0.322   0.087   0.121
  456    0.322   0.089   0.131

s456 diagnostics (only seed with full JSON; others being re-run):
  DFA: cos=+0.030 (positive),    h_L=1.9e8, g_L=1.0e-8
  FA:  cos=+0.247 (positive),    h_L=2.3e5, g_L=1.3e-6
  BP:  cos=+1.000 (trustworthy), h_L=192,   g_L=9.7e-4

This is STRONGER than d=512 L=2 CIFAR-10 because:
  1. Same architecture as the paper's main audit (d=256 L=4)
  2. ALL 3 seeds qualify (not just 3/10)
  3. Large margin: FA 4.7pp below frozen, DFA 8.9pp below frozen
  4. Standard reporting pair (acc + cos) would NOT walk back either

Also added: CIFAR-100 dataset support in cifar_resmlp.py and
resmlp_frozen_blocks_baseline.py.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
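
The "below frozen" claim and the quoted margins can be checked with a few lines of arithmetic. The sketch below is only an illustration, not code from this repository: the accuracies are copied from the commit message above, while the names `FROZEN_REF`, `ACC`, `margin_pp`, and `all_seeds_below` are hypothetical. It assumes the rounded frozen mean of 0.178 as the reference point. Under that assumption, the seed-456 row gives the 4.7pp (FA) and 8.9pp (DFA) margins quoted in the message.

```python
# Hypothetical check of the per-seed margins reported in the commit message.
# All numbers are copied from the message; the names are illustrative only.

FROZEN_REF = 0.178  # assumption: the rounded frozen-blocks mean (~0.178)

# Test accuracy per seed and method (3 seeds, 100 epochs).
ACC = {
    42:  {"BP": 0.319, "DFA": 0.088, "FA": 0.146},
    123: {"BP": 0.322, "DFA": 0.087, "FA": 0.121},
    456: {"BP": 0.322, "DFA": 0.089, "FA": 0.131},
}


def margin_pp(method: str, seed: int, ref: float = FROZEN_REF) -> float:
    """Percentage points by which `method` falls below the frozen reference."""
    return round((ref - ACC[seed][method]) * 100, 1)


def all_seeds_below(method: str, ref: float = FROZEN_REF) -> bool:
    """True if `method` is below the frozen reference at every seed."""
    return all(row[method] < ref for row in ACC.values())


if __name__ == "__main__":
    for method in ("DFA", "FA", "BP"):
        per_seed = {seed: margin_pp(method, seed) for seed in ACC}
        print(method, "below frozen at all seeds:", all_seeds_below(method))
        print(method, "margin below frozen (pp) per seed:", per_seed)
```

A negative margin (as for BP) means the method is above the frozen reference.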

Diffstat (limited to 'results/cifar_depth_scan_multiseed.log')

0 files changed, 0 insertions, 0 deletions

