| Age | Commit message | Author |
|---|---|---|
| 25 hours | commit ttt record (#77) | Sam Acquaviva |
| 25 hours | Record: 10L Mixed Precision: val_bpb=1.2147 (10 layers + int6 middle layers) ... | Nan Liu |
| 26 hours | Int6 + MLP 3x + sliding window: val_bpb=1.1574 (#61) | Sam Larson |
| 27 hours | Record: Sliding Window + FP16 Embed + 10L + Muon WD + Overtone Init (val_bpb=... | notapplica |
| 27 hours | New SOTA attempt (#52) | spokane-way |
| 27 hours | Fix: score final partial window in sliding window eval (#124) | Matthew Li |
| 30 hours | Add record: Sliding Window Eval (stride=64), val_bpb=1.1925 (#50) | Matthew Li |
| 30 hours | SOTA attempt (val_bpb=1.2064) (#49) | spokane-way |
| 31 hours | fp16 tied embedding + lr/warmdown tuning — val_bpb 1.2197 (#42) | Renier Velazco |
| 2 days | Launch snapshot | Will DePue |
