diff options
| author | YurenHao0426 <Blackhao0426@gmail.com> | 2026-03-26 00:07:01 -0500 |
|---|---|---|
| committer | YurenHao0426 <Blackhao0426@gmail.com> | 2026-03-26 00:07:01 -0500 |
| commit | 05ccd23154d1e9d090178b9d4d5f2c821711e784 (patch) | |
| tree | 0ce74409f506df9f7f1717c13b30e79eb5a24f12 /NOTE.md | |
| parent | ccc6add69553893f6d3f9de4e2010ca8139ba1a6 (diff) | |
Add Phase 9B+9C: periodic refit fails, top-down curriculum neutral
Phase 9B (periodic refit K=5 R=1 alpha=0.75): 14.0% — Vec starts random,
periodic refits insufficient without offline pretraining.
Phase 9C (top-down curriculum): last1_vec=30.8%, last2_vec=31.1% vs DFA=31.2%.
Near-neutral. Cold-start problem persists even for single-block Vec.
Only Phase 9A's offline prefit + blend handoff (+1.5%) works.
The key ingredient is offline Vec training on frozen checkpoint features.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'NOTE.md')
| -rw-r--r-- | NOTE.md | 28 |
1 files changed, 27 insertions, 1 deletions
@@ -5,7 +5,7 @@ - **pilot**: Controlled iteration (commits 0b9ebb2, 7baf7ae) - **frozen**: Code at commit 0b9ebb2 for all reported results -## Status: PHASE 9A FULL — BLEND(0.75) OUTPERFORMS DFA BY +1.5% +## Status: PHASE 9 COMPLETE — OFFLINE PREFIT + BLEND IS THE KEY --- @@ -522,5 +522,31 @@ DFA stabilizes the trajectory while Vec adds learned corrections. Higher Vec fraction (0.75) outperforms lower (0.25, 0.5) at t0=5,10. Pure Vec handoff still fails at all t0. +### Phase 9B: Periodic Refit — FAILED + +K=5 (refit every 5 epochs), R=1 (1 epoch refit), alpha=0.75: **14.0%** (vs DFA 31.2%). +Vec starts random, periodic 1-epoch refits aren't enough. Without offline pretraining +(as in 9A), the cold-start problem remains unsolved. + +### Phase 9C: Top-Down Curriculum — NEUTRAL + +| Config | final acc | diff vs DFA | +|--------|-----------|-------------| +| DFA_only | 0.312 | — | +| last1_vec (blend 0.75) | 0.308 | -0.4% | +| last2_vec (blend 0.75) | 0.311 | -0.1% | + +Vec for last 1-2 blocks ≈ DFA. No improvement, no degradation. +Same cold-start problem: Vec can't learn useful credit online even for 1 block. + +### Phase 9 Overall Conclusion + +Only **offline prefit + blend handoff** (9A) works. Both online periodic refit (9B) +and top-down curriculum (9C) fail because they don't pre-train Vec on frozen features. + +The +1.5% gain from 9A's blend(0.75) at t0=5 is the project's best online result. + ### Experiment IDs (Phase 9) - `checkpointed_handoff/`: Phase 9A checkpointed handoff with branches +- `periodic_refit/`: Phase 9B periodic refit +- `topdown_curriculum/`: Phase 9C top-down curriculum |
