diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 19:21:56 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 19:21:56 -0500 |
| commit | bd873fea53ec917a01618799eeb97f770081ba53 (patch) | |
| tree | 456cacdcda8f1a9206ef90c6caf11a4af16780ed /protocol | |
| parent | d0a1c91f4f6fc9a027136cea88c0edddd8c53e39 (diff) | |
protocol/protocol.py: sync (c) range docstring with v2.31.13 paper update
The cross_batch_direction_stability docstring claimed healthy values
"~0.05-0.18" and drift-dominated "~0.5-0.99" — these were the same
loose ranges that v2.31.13 corrected in the paper §6 ¶3.
Re-aggregated from results/protocol_audit/audit_table_s42_s123_s456.json
(K=10 batches of 128 samples):
Healthy 6 BP+EP values: range [-0.036, 0.120], median 0.093
Degen 9 DFA/SB/CB values: range [-0.005, 0.992], median 0.352
5/9 above 0.30 cutoff
3/9 above 0.50
Updated docstring to match the actual audit data and point at the
JSON source. Now the paper §6 ¶3 prose and the protocol.py docstring
agree exactly on the (c) calibration ranges.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'protocol')
| -rw-r--r-- | protocol/protocol.py | 10 |
1 files changed, 7 insertions, 3 deletions
diff --git a/protocol/protocol.py b/protocol/protocol.py index 4298756..185c4f2 100644 --- a/protocol/protocol.py +++ b/protocol/protocol.py @@ -166,9 +166,13 @@ def cross_batch_direction_stability( the same regardless of which samples are in the batch. That is the fingerprint of a **sample-invariant global drift** — i.e. the reference vector against which Γ is computed is NOT per-sample credit, it is a - constant artifact of the trained network's geometry. On healthy - BP-trained or EP-trained networks this value is small (~0.05-0.18); on - DFA/SB/CB pre-LN ResMLPs we see ~0.5-0.99. + constant artifact of the trained network's geometry. In our 5-method + CIFAR-10 audit at K=10 batches of 128 samples, healthy BP/EP networks + cluster near zero with all six values in [-0.04, +0.12]; drift-dominated + DFA/SB/CB pre-LN ResMLPs reach high tails up to +0.99 (5/9 of the nine + degen values exceed the 0.30 default cutoff). See + results/protocol_audit/audit_table_s42_s123_s456.json for the + per-method per-seed values. Args: model: same contract as the other diagnostics. |
