summaryrefslogtreecommitdiff
path: root/scripts/slurm_s2.sh
AgeCommit message (Collapse)Author
2026-02-10Add auto-resume checkpointing, S1/S2 configs, and experiment resultsYurenHao0426
- Auto-resume: find latest checkpoint in save_dir on startup - SIGUSR1 handler: save checkpoint before SLURM timeout - S1 config (constant tau=5, identity init verification) - S2 config (constant tau=2, gradient flow check) - Experiment results tracker with S0/S1 data - Speed estimates and experiment plan Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>