summaryrefslogtreecommitdiff
path: root/results/confirmatory/persample/dfa_s123.csv
diff options
context:
space:
mode:
authorYurenHao0426 <Blackhao0426@gmail.com>2026-04-07 22:37:49 -0500
committerYurenHao0426 <Blackhao0426@gmail.com>2026-04-07 22:37:49 -0500
commit0c245f5683cceba448d20d9dfc2090adb3503f14 (patch)
tree0467408974b504888ae8cbb8551fbb206e3c2b53 /results/confirmatory/persample/dfa_s123.csv
parent111bab56e2d49c9fb1f3bfb9e55ea2028da4d008 (diff)
Add DFA direction-quality direct test (codex round 13 option c)
Trains both vanilla DFA (lam=0) and penalized DFA (lam=1e-2) from the same seed, then directly measures the per-layer cosine between DFA's local credit signal e_T @ B_l^T and the BP gradient at hidden layers. Uses the training Bs (not fresh ones, per the Bs-specificity finding from earlier). The penalized run is the key measurement: in that condition the BP grad is ~10^-7 (well above the eps=1e-8 floor), so a near-zero cosine here would be the direct evidence of the second failure mode (direction-quality ceiling) that codex round 13 hypothesized. Pre-registered prediction: penalized cos(DFA, BP) ~ 0.01-0.05 -> direction quality is the second, separable failure mode. Saves the penalized checkpoint so the diagnostic protocol can be re-applied to it (where (a) and (b) should pass, (d) should still fail).
Diffstat (limited to 'results/confirmatory/persample/dfa_s123.csv')
0 files changed, 0 insertions, 0 deletions