author: YurenHao0426 <Blackhao0426@gmail.com> 2026-04-08 19:11:40 -0500
committer: YurenHao0426 <Blackhao0426@gmail.com> 2026-04-08 19:11:40 -0500
commit: d022688ea9fcfcb81f900751ee92e35597ef19b8
tree: 192e620a4dc4915d37c590fdc2621abeaa15c8c3 /results/bp_with_penalty_3seed_summary.json
parent: 6a057a379e58dc464f04e5208861699b01b5d477
paper v2.32: BP+penalty multi-seeded (was single-seed s42)
The §5 ¶3 BP+penalty value (0.530, +18.1 pp margin) was single-seed s42.
Ran s123 and s456 to multi-seed it, matching the BP-no-pen 3-seed control.

3-seed BP+pen 30ep results (lam=0.01, AdamW lr=1e-3 wd=0.01, cosine, batch 128):
  s42:  0.5303, +18.13 pp vs frozen
  s123: 0.5262, +17.72 pp
  s456: 0.5397, +19.07 pp
  3-seed mean: 0.5321 ± 0.0057, +18.31 pp

Updates:
- §5 ¶3: BP+pen "0.530 (single seed)" → "0.532 ± 0.006" (3-seed)
- §5 ¶3: BP penalty cost -5.5 pp → -5.3 pp
- §5 ¶3: BP+pen margin +18.1 → +18.3 pp
- §5 ¶3: BP-to-DFA gap 17.0 → 17.2 pp
- §4 ¶4: BP+pen +18.1 → +18.3 pp comparison
- Figure 3 panel C bar values: BP with_pen 0.530 → 0.532
- Figure 3 panel C title: BP-pen-cost -5.5pp → -5.3pp

The +18.3 pp 3-seed mean is essentially the same as the s42 single-seed
+18.13 pp, so the headline conclusion (BP+pen far above frozen baseline,
huge gap vs DFA+pen) is unchanged. This commit removes the last
single-seed value labeled as a key control.

New auditable file: results/bp_with_penalty_3seed_summary.json
Page layout preserved: 9 pages main, refs p10, 0 overfull boxes.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'results/bp_with_penalty_3seed_summary.json')
-rw-r--r-- results/bp_with_penalty_3seed_summary.json | 31
1 files changed, 31 insertions, 0 deletions
diff --git a/results/bp_with_penalty_3seed_summary.json b/results/bp_with_penalty_3seed_summary.json
new file mode 100644
index 0000000..9760faa
--- /dev/null
+++ b/results/bp_with_penalty_3seed_summary.json
@@ -0,0 +1,31 @@
+{
+ "description": "BP+pen 30ep multi-seed result (matched control for \u00a75 \u00b63)",
+ "config": {
+ "arch": "4-block d=256 pre-LN ResMLP",
+ "dataset": "CIFAR-10",
+ "epochs": 30,
+ "batch": 128,
+ "optimizer": "AdamW lr=1e-3 wd=0.01",
+ "schedule": "cosine",
+ "penalty_lam": 0.01
+ },
+ "per_seed_acc": {
+ "42": 0.5303,
+ "123": 0.5262,
+ "456": 0.5397
+ },
+ "mean_acc": 0.5321,
+ "std_ddof0": 0.0057,
+ "std_ddof1": 0.0069,
+ "frozen_baseline": 0.349,
+ "margin_pp_per_seed": {
+ "42": 18.13,
+ "123": 17.72,
+ "456": 19.07
+ },
+ "margin_pp_mean": 18.31,
+ "source_files": [
+ "results/bp_with_penalty/bp_pen_lam0.01_s{42,123,456}.json"
+ ],
+ "paper_section": "used in \u00a75 \u00b63 (was previously single-seed s42)"
+}
\ No newline at end of file
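
The derived fields in the summary file (mean, both standard deviations, and the percentage-point margins) follow from the three per-seed accuracies and the frozen baseline. A minimal sketch of that arithmetic is below, assuming the margin is `(acc - frozen_baseline) * 100` and that values are rounded to the precision stored in the file. The `recompute` helper is hypothetical and not part of the repository; the input values are copied from the JSON in this commit.

```python
import json
import statistics

# Values copied from results/bp_with_penalty_3seed_summary.json in this commit.
summary = json.loads("""
{
  "per_seed_acc": {"42": 0.5303, "123": 0.5262, "456": 0.5397},
  "mean_acc": 0.5321,
  "std_ddof0": 0.0057,
  "std_ddof1": 0.0069,
  "frozen_baseline": 0.349,
  "margin_pp_per_seed": {"42": 18.13, "123": 17.72, "456": 19.07},
  "margin_pp_mean": 18.31
}
""")


def recompute(s):
    """Hypothetical helper: rebuild the derived fields from the raw inputs."""
    accs = list(s["per_seed_acc"].values())
    frozen = s["frozen_baseline"]
    mean_acc = statistics.fmean(accs)
    return {
        "mean_acc": round(mean_acc, 4),
        # Population std (ddof=0) is the "± 0.0057" quoted in the commit message.
        "std_ddof0": round(statistics.pstdev(accs), 4),
        # Sample std (ddof=1), stored alongside it in the file.
        "std_ddof1": round(statistics.stdev(accs), 4),
        # Assumed definition: margin in percentage points over the frozen baseline.
        "margin_pp_per_seed": {
            seed: round((acc - frozen) * 100, 2)
            for seed, acc in s["per_seed_acc"].items()
        },
        "margin_pp_mean": round((mean_acc - frozen) * 100, 2),
    }


derived = recompute(summary)
for key, value in derived.items():
    print(key, value, "stored:", summary[key])
```

With only three seeds the two standard deviations differ noticeably (0.0057 vs 0.0069), which is presumably why the file records both.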