diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 01:33:00 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 01:33:00 -0500 |
| commit | 2ca87f2bd4449b1d4ac715d8cf4fb5f20b7afdd8 (patch) | |
| tree | 34adf68554034c97e344f356d305e91df03c43f0 /results/confirmatory/clean_sparsity/synth_bp_s1024_a0.0_L4.json | |
| parent | 02252d942dbf449276059c49260ec0994c4f9a5d (diff) | |
PAPER_OUTLINE: §4 rewrite under 'two distinct failure modes' framing
After the round 19 disambiguation experiment confirmed hypothesis B
(penalty CREATES deep alignment, not just reveals it), the paper §4
needs to use the new framing:
Mode 1: measurement degeneracy via terminal LN gradient cancellation
Mode 2: low intrinsic credit-direction quality of random feedback
Both modes are direct-measured (mode 1 by diagnostic (b), mode 2 by
per-layer cos in the meaningful regime). The penalty partially
alleviates BOTH modes. Neither is fully fixed.
§4 rewrite includes:
- The two modes (4.1)
- Penalty causal validation with 3-seed cos (4.2)
- Disambiguation: vanilla early-epoch cos table proving hypothesis B (4.3)
- Why the residual gap is partial alignment (4.4)
- Why this framing is paper-cleaner than prior ones (4.5)
Walk-back chain extended to 7 entries, with 6 and 7 happening same-day
and converging on the final two-distinct-modes framing.
Diffstat (limited to 'results/confirmatory/clean_sparsity/synth_bp_s1024_a0.0_L4.json')
0 files changed, 0 insertions, 0 deletions
