diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-07 22:45:41 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-07 22:45:41 -0500 |
| commit | 3a520b203f4f0c75b37b2d5c34d461718729ea02 (patch) | |
| tree | 76cf5cfc7f2874bc7016414f1a586dee453f50d8 /results/cnn_baseline/credit_bridge_s789.json | |
| parent | 44614df2f4382e567b986bc6dbe5b3091072461e (diff) | |
Audit table extension to 3 seeds (s42/s123/s456)
3 seeds × 5 methods × 4 diagnostics = 60 measurements. Key reproducibility
findings:
- BP: trustworthy on all 3 seeds (acc 0.61-0.62, h_L ~200, g_L ~3-4e-4)
- EP: trustworthy on all 3 seeds (acc 0.29-0.36, h_L 3-8e3, g_L ~1e-4)
- DFA, SB, CB: walked back on all 3 seeds × all 3 of (a)/(b)/(d)
Diagnostic (c) is bimodal across seeds — confirms the prior memory finding:
- DFA s42=0.047 (noise), s123=0.436 (drift), s456=-0.005 (noise)
- SB s42=0.992 (drift), s123=0.561 (drift), s456=0.035 (noise)
- CB s42=0.352 (drift), s123=0.250 (~edge), s456=0.518 (drift)
(c) catches different methods on different seeds. (a)/(b)/(d) catch all 3
failing methods on all 3 seeds — robust binary detection.
Diffstat (limited to 'results/cnn_baseline/credit_bridge_s789.json')
0 files changed, 0 insertions, 0 deletions
