author     YurenHao0426 <Blackhao0426@gmail.com>  2026-04-08 19:21:56 -0500
committer  YurenHao0426 <Blackhao0426@gmail.com>  2026-04-08 19:21:56 -0500
commit     bd873fea53ec917a01618799eeb97f770081ba53
tree       456cacdcda8f1a9206ef90c6caf11a4af16780ed
parent     d0a1c91f4f6fc9a027136cea88c0edddd8c53e39
protocol/protocol.py: sync (c) range docstring with v2.31.13 paper update

The cross_batch_direction_stability docstring claimed healthy values
"~0.05-0.18" and drift-dominated "~0.5-0.99"; these were the same loose
ranges that v2.31.13 corrected in the paper §6 ¶3.

Re-aggregated from results/protocol_audit/audit_table_s42_s123_s456.json
(K=10 batches of 128 samples):

  Healthy  6 BP+EP values:      range [-0.036, 0.120], median 0.093
  Degen    9 DFA/SB/CB values:  range [-0.005, 0.992], median 0.352
           5/9 above 0.30 cutoff
           3/9 above 0.50

Updated docstring to match the actual audit data and point at the JSON
source. Now the paper §6 ¶3 prose and the protocol.py docstring agree
exactly on the (c) calibration ranges.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

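The re-aggregation described in the message (range, median, and the count of values above each cutoff) can be sketched roughly as follows. This is a minimal illustration, not the script used for the commit: the helper name `summarize` is hypothetical, the schema of the audit JSON is not shown here so no file is read, and the input list holds made-up placeholder numbers rather than audit values.

```python
from statistics import median


def summarize(values, cutoffs=(0.30, 0.50)):
    """Return range, median, and the number of values strictly above each cutoff.

    Hypothetical helper for illustration; not part of protocol.py.
    """
    vals = [float(v) for v in values]
    if not vals:
        raise ValueError("need at least one value")
    return {
        "n": len(vals),
        "min": min(vals),
        "max": max(vals),
        "median": median(vals),
        # Count values strictly greater than each cutoff.
        "above": {c: sum(v > c for v in vals) for c in cutoffs},
    }


# Placeholder inputs only; these are NOT the values in the audit JSON.
placeholder_values = [0.0, 0.1, 0.2, 0.4, 0.6]
stats = summarize(placeholder_values)
print(stats)
```

With the real per-method per-seed values loaded from the audit JSON, the same summary would be computed once for the healthy group and once for the degenerate group.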
-rw-r--r--  protocol/protocol.py  10
1 files changed, 7 insertions, 3 deletions
diff --git a/protocol/protocol.py b/protocol/protocol.py
index 4298756..185c4f2 100644
--- a/protocol/protocol.py
+++ b/protocol/protocol.py
@@ -166,9 +166,13 @@ def cross_batch_direction_stability(
the same regardless of which samples are in the batch. That is the
fingerprint of a **sample-invariant global drift** — i.e. the reference
vector against which Γ is computed is NOT per-sample credit, it is a
- constant artifact of the trained network's geometry. On healthy
- BP-trained or EP-trained networks this value is small (~0.05-0.18); on
- DFA/SB/CB pre-LN ResMLPs we see ~0.5-0.99.
+ constant artifact of the trained network's geometry. In our 5-method
+ CIFAR-10 audit at K=10 batches of 128 samples, healthy BP/EP networks
+ cluster near zero with all six values in [-0.04, +0.12]; drift-dominated
+ DFA/SB/CB pre-LN ResMLPs reach high tails up to +0.99 (5/9 of the nine
+ degen values exceed the 0.30 default cutoff). See
+ results/protocol_audit/audit_table_s42_s123_s456.json for the
+ per-method per-seed values.
Args:
model: same contract as the other diagnostics.