faeval.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-26 11:03:29 -0500
committer	YurenHao0426 <Blackhao0426@gmail.com>	2026-04-26 11:03:29 -0500
commit	1c172f7d038eedd3d828d453e852c060072f52c8 (patch)
tree	3d7cf500dca3c0cb35f577b87c236559b58c3eef /results/cifar_depth_scan_multiseed
parent	6f88add7aed62152ed6776765917e03d5096a5cc (diff)

CIFAR-100 per-seed diagnostics complete — full qualifying table

CIFAR-100, d=256 L=4, 100ep, 3 seeds. Frozen baseline (BP-frozen) = 0.178. acc (±ddof=1) cos (±ddof=1) h_L g_L <frozen? BP 0.321 ± 0.002 +1.000 ~192 ~9.5e-4 no FA 0.133 ± 0.013 +0.234 ± 0.015 ~1e5-7e5 ~1e-6 YES (all 3) DFA 0.088 ± 0.001 +0.029 ± 0.001 ~2e8 ~9e-9 YES (all 3) Frozen 0.178 — — — baseline Both FA and DFA are below frozen at ALL 3 seeds with positive cosine. FA cos is +0.23 (clearly positive). DFA cos is +0.03 (small but positive). Both are well above chance (1% for 100 classes). BP is ~0.32, well above frozen (trustworthy control). This is the paper's strongest qualifying setting because it uses the SAME architecture (d=256 L=4) as the main CIFAR-10 audit — only the task difficulty changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diffstat (limited to 'results/cifar_depth_scan_multiseed')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: