<feed xmlns='http://www.w3.org/2005/Atom'>
<title>faeval.git/results/null_calibration_penalized_dfa.json, branch master</title>
<link rel='alternate' type='text/html' href='https://git.blackhao.com/faeval.git/'/>
<entry>
<title>Save null_calibration_penalized_dfa.json for §6 ¶2 audit</title>
<updated>2026-04-08T23:39:00+00:00</updated>
<author>
<name>YurenHao0426</name>
<email>Blackhao0426@gmail.com</email>
</author>
<published>2026-04-08T23:39:00+00:00</published>
<link rel='alternate' type='text/html' href='https://git.blackhao.com/faeval.git/commit/?id=05233a3d3854257483afb90fad6b517f30095977'/>
<id>05233a3d3854257483afb90fad6b517f30095977</id>
<content type='text'>
The §6 ¶2 fresh-B null control claim "deep cos +0.002 ± 0.022 (n=20
draws), per-layer stds 0.013-0.023" was verified against a fresh
re-run of experiments/null_calibration_penalized_cos.py:

  training-Bs deep cos:  +0.1627  (matches Appendix L row)
  fresh-Bs deep cos:     +0.0022 ± 0.0220 (per-layer std avg, n=20)
  per-layer stds:        [0.0125, 0.0221, 0.0162, 0.0229, 0.0228] (l0-l4)

The "0.013-0.023" range matches the per-layer std range exactly.
The "± 0.022" is the average per-layer std across deep layers (l1-l4).

null_calibration_penalized_dfa.json is saved as the auditable source.
The script (experiments/null_calibration_penalized_cos.py) can re-derive
these values from the saved checkpoint in
results/dfa_pen_short/dfa_pen_lam0.01_s42.pt.

Co-Authored-By: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
The §6 ¶2 fresh-B null control claim "deep cos +0.002 ± 0.022 (n=20
draws), per-layer stds 0.013-0.023" was verified against a fresh
re-run of experiments/null_calibration_penalized_cos.py:

  training-Bs deep cos:  +0.1627  (matches Appendix L row)
  fresh-Bs deep cos:     +0.0022 ± 0.0220 (per-layer std avg, n=20)
  per-layer stds:        [0.0125, 0.0221, 0.0162, 0.0229, 0.0228] (l0-l4)

The "0.013-0.023" range matches the per-layer std range exactly.
The "± 0.022" is the average per-layer std across deep layers (l1-l4).

null_calibration_penalized_dfa.json is saved as the auditable source.
The script (experiments/null_calibration_penalized_cos.py) can re-derive
these values from the saved checkpoint in
results/dfa_pen_short/dfa_pen_lam0.01_s42.pt.

Co-Authored-By: Claude Opus 4.6 (1M context) &lt;noreply@anthropic.com&gt;
</pre>
</div>
</content>
</entry>
</feed>
