<feed xmlns='http://www.w3.org/2005/Atom'>
<title>faeval.git/experiments/null_calibration_penalized_cos.py, branch master</title>
<subtitle>Unnamed repository; edit this file 'description' to name the repository.
</subtitle>
<link rel='alternate' type='text/html' href='https://git.blackhao.com/faeval.git/'/>
<entry>
<title>Add null calibration script: training-Bs vs fresh-Bs cos on penalized DFA</title>
<updated>2026-04-08T06:15:08+00:00</updated>
<author>
<name>YurenHao0426</name>
<email>Blackhao0426@gmail.com</email>
</author>
<published>2026-04-08T06:15:08+00:00</published>
<link rel='alternate' type='text/html' href='https://git.blackhao.com/faeval.git/commit/?id=671d9823668197c21b2d35d08d15da0d5c3c4161'/>
<id>671d9823668197c21b2d35d08d15da0d5c3c4161</id>
<content type='text'>
Codex round 19's #1 critical control. Result on penalized DFA s42 (lam=1e-2, 30 ep):

  training-Bs deep-layer cos:  +0.1627
  fresh-Bs deep-layer cos:     +0.0022 ± 0.0220 (n=20 draws)

The +0.17 measurement is REAL signal, not artifact. The network specifically
adapted to its training-time Bs during the penalized run. Fresh Bs give
essentially zero cosine (within noise).

This validates the walk-back interpretation: in the rescued regime where
||g_l|| is meaningful, DFA's local credit signal shows partial alignment
with BP grad — and this alignment is specifically the network learning to
align with its specific Bs.

Round 19 caveat preserved: cannot yet distinguish whether the alignment
was always present in vanilla but hidden by measurement degeneracy, OR
whether it was created by the penalty intervention. The early-epoch
vanilla checkpoint sweep (round 19's other proposed control) would
disambiguate.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Codex round 19's #1 critical control. Result on penalized DFA s42 (lam=1e-2, 30 ep):

  training-Bs deep-layer cos:  +0.1627
  fresh-Bs deep-layer cos:     +0.0022 ± 0.0220 (n=20 draws)

The +0.17 measurement is REAL signal, not artifact. The network specifically
adapted to its training-time Bs during the penalized run. Fresh Bs give
essentially zero cosine (within noise).

This validates the walk-back interpretation: in the rescued regime where
||g_l|| is meaningful, DFA's local credit signal shows partial alignment
with BP grad — and this alignment is specifically the network learning to
align with its specific Bs.

Round 19 caveat preserved: cannot yet distinguish whether the alignment
was always present in vanilla but hidden by measurement degeneracy, OR
whether it was created by the penalty intervention. The early-epoch
vanilla checkpoint sweep (round 19's other proposed control) would
disambiguate.
</pre>
</div>
</content>
</entry>
</feed>
