| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 18:19:52 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-04-08 18:19:52 -0500 |
| commit | c2228c43809fd1d634d98dfcecc1ee72881af639 (patch) | |
| tree | 5183e548f0079aedb01d34a275dd7d24b63fb32d /experiments/measure_direction_quality_existing_ckpt.py | |
| parent | 60947156c4e66d801d043b484ce8bda5314deab0 (diff) | |
paper v2.31.5: §3 ¶3 ep-4 g_L from 3-seed (was single-seed g_2)
The paper claimed "‖g_L‖ drops from 9.8×10^-4 at ep 0 to 6.7×10^-8 by ep 4
in the temporal replay across three seeds". The 9.8×10^-4 is correct: it is
the 3-seed mean of g_L at ep 0. The 6.7×10^-8, however, was the s42
single-seed g_2 value (6.73×10^-8) at ep 4, which is neither g_L nor a
3-seed mean.
The actual per-seed g_L values and their 3-seed means from results/snapshot_evolution_v2/:
  ep 0: 9.83, 9.74, 9.87 × 10^-4 → mean 9.81 × 10^-4 ≈ 9.8e-4 ✓
  ep 4: 6.82, 6.37, 4.12 × 10^-8 → mean 5.77 × 10^-8 ≈ 5.8e-8 (was 6.7e-8)
Updated to 5.8×10^-8 with the per-seed values shown for transparency.
The "fires within 11 epochs" actionable-early-stop conclusion is
unchanged — all three seeds are well below the 1e-7 floor by ep 4.
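
The arithmetic above can be re-checked with a minimal sketch. It hard-codes the per-seed values quoted in this message rather than reading results/snapshot_evolution_v2/ (whose file layout is not shown here), and the names `G_L_EP0`, `G_L_EP4`, and `FLOOR` are illustrative, not identifiers from the repository.

```python
from statistics import mean

# Per-seed ‖g_L‖ values copied from the commit message (assumption: hard-coded
# here for illustration; the real numbers live in results/snapshot_evolution_v2/).
G_L_EP0 = [9.83e-4, 9.74e-4, 9.87e-4]
G_L_EP4 = [6.82e-8, 6.37e-8, 4.12e-8]

# Early-stop floor named in the message.
FLOOR = 1e-7

# 3-seed means at each epoch.
ep0_mean = mean(G_L_EP0)
ep4_mean = mean(G_L_EP4)

# The early-stop conclusion holds only if every seed is below the floor at ep 4.
all_below_floor = all(v < FLOOR for v in G_L_EP4)

print(f"ep 0 mean: {ep0_mean:.1e}")
print(f"ep 4 mean: {ep4_mean:.1e}")
print(f"all seeds below floor at ep 4: {all_below_floor}")
```

Checking each seed against the floor, rather than only the mean, matches the claim in the message that all three seeds are below 1e-7 by ep 4.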
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'experiments/measure_direction_quality_existing_ckpt.py')
0 files changed, 0 insertions, 0 deletions
