| Age | Commit message (Expand) | Author |
|---|---|---|
| 11 hours | Add entropy annealing to escape greedy local minimum after warmup | haoyuren |
| 11 hours | Auto-calibrate collect_batch when not specified | haoyuren |
| 11 hours | Batched game collection for ~7x training speedup | haoyuren |
| 11 hours | Separate CPU collect / GPU train, add training CSV log | haoyuren |
| 11 hours | Fix SWAP inheritance, stalemate logic, add greedy warmup | haoyuren |
| 20 hours | Update rules: free draw/pass, remove Q in 2-player games | haoyuren |
| 21 hours | Add tqdm progress bar, fix Colab username | haoyuren |
| 21 hours | Initial commit: Blazing Eights RL agent | haoyuren |
