summaryrefslogtreecommitdiff
path: root/results/confirmatory
diff options
context:
space:
mode:
authorYurenHao0426 <Blackhao0426@gmail.com>2026-04-08 04:53:01 -0500
committerYurenHao0426 <Blackhao0426@gmail.com>2026-04-08 04:53:01 -0500
commit04178a5ef072c4fa3a3a028316cfe545c27fe744 (patch)
treef616aedfde535ef1bf67698ab7f2faab62231299 /results/confirmatory
parent1eb0c06b341b90fc5ebbe689154aab6c8b6830c0 (diff)
Round 27: fill in §2 Audit prose (4 paragraphs) via codex
Codex round 27 produced 4 substantive paragraphs for §2, replacing thin placeholders. Each paragraph follows round 23's prescription: P1: canonical setting (4-block d=256, AdamW, 100 ep, 3 seeds) + table/figure references P2: under field-standard reporting, all 5 methods look fine P3: EP internal comparison — same trustworthy measurement regime BUT EP depth contribution is also marginally negative (-3.3 pp vs frozen baseline). Honest about EP being trustworthy-measurement but neutral-depth-contribution (per round 27 prompt's caveat). P4: frozen-baseline comparison gives the walk-back: BP +26.6 pp, DFA -4.3 pp, SB -14.4 pp, CB -6.0 pp. Diagnostic split lines up with acc split. Compiles cleanly. Next: §3 Failure Mode 1 prose via round 28.
Diffstat (limited to 'results/confirmatory')
0 files changed, 0 insertions, 0 deletions