path: root/protocol/README.md
Age | Commit message | Author
2026-04-08 | protocol/README.md: sync (c) range with v2.31.13 paper update | YurenHao0426
Same fix as v2.31.13's paper §6 ¶3 and the protocol.py docstring sync: the README's "0.05-0.18 / 0.43-0.99" calibration ranges were the same loose values that v2.31.13 corrected. Updated to match the actual audit data: BP/EP in [-0.04, +0.12], degenerate up to +0.99 with 5/9 above the 0.30 cutoff.

Now the paper §6 ¶3, the protocol.py docstring, and protocol/README.md all agree on the (c) calibration ranges.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-07 | Add FA diagnostic protocol reference implementation | YurenHao0426
Codex round 15 #1 priority for the E&D-track paper:

- protocol/protocol.py: 4 diagnostics (residual norms, BP grad norms, cross-batch direction stability, and a frozen-baseline comparator)
- protocol/report.py: DiagnosticReport with per-diagnostic verdicts and pretty-printer
- protocol/smoke_test.py: validates that BP/DFA/EP checkpoints produce the expected verdicts (BP/EP trustworthy; DFA walked back via residual explosion + BP grad at floor)
- protocol/README.md: usage, audit cases, threshold rationale
- protocol/CHECKLIST.md: 6 evaluation pipeline pitfalls (norm(-1), cosine_similarity eps clamp, fp16 underflow, Bs reproducibility, aggregation, layer-0 dominance)
- protocol/REPORTING_TEMPLATE.md: per-method fillable form for FA papers
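The "cosine_similarity eps clamp" pitfall named for protocol/CHECKLIST.md can be illustrated with a minimal sketch. This is not the repository's code, and the checklist text itself is not shown in this log, so the reading here is an assumption: that the pitfall refers to a cosine similarity whose norms are clamped from below by a small `eps` (the form PyTorch documents for `cosine_similarity`, with a default `eps` of 1e-8). The function names `cosine_clamped` and `cosine_rescaled` are hypothetical, and the sketch uses only the standard library. When gradient norms fall below `eps`, the clamped version reports a similarity near zero even for identical vectors.

```python
import math


def cosine_clamped(x, y, eps=1e-8):
    """Cosine similarity with each norm clamped to at least eps.

    Hypothetical helper; it mirrors the clamping form assumed above.
    """
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (max(norm_x, eps) * max(norm_y, eps))


def cosine_rescaled(x, y):
    """Cosine similarity computed after rescaling each vector by its
    largest absolute entry, so tiny norms never meet a clamp.

    Hypothetical helper; returns NaN when either vector is all zeros.
    """
    scale_x = max(abs(a) for a in x)
    scale_y = max(abs(b) for b in y)
    if scale_x == 0.0 or scale_y == 0.0:
        return float("nan")
    xs = [a / scale_x for a in x]
    ys = [b / scale_y for b in y]
    dot = sum(a * b for a, b in zip(xs, ys))
    norm_x = math.sqrt(sum(a * a for a in xs))
    norm_y = math.sqrt(sum(b * b for b in ys))
    return dot / (norm_x * norm_y)


# A gradient-like vector whose norm (5e-10) sits below the 1e-8 clamp.
tiny = [3e-10, 4e-10]

clamped = cosine_clamped(tiny, tiny)    # far below 1 despite identical inputs
rescaled = cosine_rescaled(tiny, tiny)  # close to 1, as expected
print(clamped, rescaled)
```

Rescaling before normalising is only one possible workaround; an audit could equally report the raw norms next to the similarity so that a clamped value is not mistaken for genuine misalignment.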