summaryrefslogtreecommitdiff
path: root/experiments/vector_credit_audit.py
diff options
context:
space:
mode:
authorYurenHao0426 <Blackhao0426@gmail.com>2026-04-08 20:17:43 -0500
committerYurenHao0426 <Blackhao0426@gmail.com>2026-04-08 20:17:43 -0500
commitd1c22697a99c894f07db972acb5a1a9229b0276a (patch)
treeb1725645a071c5b7bb2f9dd2c7830df605b9aad7 /experiments/vector_credit_audit.py
parenta18765a553ca454de49fc6462e231f05367ce580 (diff)
paper v2.35: add Figure 2 - cross-method cos-vs-accuracy dissociation
User said "you don't need to worry about page count for now", which freed up the page budget for substantive additions. Highest-yield substantive addition: a visual figure for the §4 ¶4 cross-method dissociation that the user previously flagged as the paper's strongest new observation but is currently text-only. New figure: paper/figures/fig_cos_acc_dissociation.pdf - Parallel-coordinates / slope-chart style - 4 columns: deep cos | accuracy | |nudging| | training-loss decrease - 3 lines: SB+pen (blue), CB+pen (red), DFA+pen (gray) - Each metric normalized to [0, 1] with raw values annotated - Shaded "cos: CB top" region on the left vs labeled "accuracy / nudging / training-loss: SB top" on the right - The X-pattern between cos and accuracy makes the dissociation visually immediate: SB rises from middle (cos) to top (functional), CB falls from top (cos) to tied with DFA (functional) Inserted between §4 ¶4 (Mode 2 mechanism) and §5 (intervention). Referenced from the §4 ¶4 functional measurements paragraph as "Figure 2". Why this figure replaces the prose-only argument's burden of proof: the X-pattern visualization is a single glance vs paragraph parsing. Reviewers will see "deep cosine ranks differently from 3 functional metrics" without needing to track the numbers. Important design choice: did NOT include deep ρ in the figure, even though it's in §4 ¶2, because ρ ranks CB > SB > DFA (same as cos), not the SB > CB > DFA pattern of the functional metrics. ρ groups with cos as a "directional alignment" metric, while the functional triad (accuracy, nudging, training-loss) groups around forward-state usefulness. The figure caption notes this distinction implicitly by listing only the three functional metrics. Page impact: total 18 → 19 pages, main content §1-§7 now spans p1-p10 (was p1-p9). Per user's relaxed constraint, page count is no longer the binding constraint. Figure auto-shifts the figure numbering: cos_acc_dissoc is now Figure 2, temporal_cross_arch becomes Figure 3, penalty_rescue → Figure 4, cross_arch_summary → Figure 5. All figure references use \\ref{} so they auto-update. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'experiments/vector_credit_audit.py')
0 files changed, 0 insertions, 0 deletions