index
:
blazing8.git
main
Unnamed repository; edit this file 'description' to name the repository.
Ubuntu
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2026-02-22
Add 2-player PPO training log (500k episodes, 60.4% vs greedy)
HEAD
main
YurenHao0426
2026-02-22
Raise entropy floor to 0.02, increase eval games to 2000
haoyuren
2026-02-22
Change default eval_every from 10000 to 2500
haoyuren
2026-02-22
Use auto-calibrated collect_batch in Colab notebook
haoyuren
2026-02-22
Add training curve plots to Colab notebook
haoyuren
2026-02-22
Add entropy annealing to escape greedy local minimum after warmup
haoyuren
2026-02-22
Auto-calibrate collect_batch when not specified
haoyuren
2026-02-22
Fix total_mem → total_memory in Colab GPU check
haoyuren
2026-02-22
Fix invalid notebook cell schema (markdown with execution_count)
haoyuren
2026-02-22
Batched game collection for ~7x training speedup
haoyuren
2026-02-22
Update README and Colab notebook for current rules and features
haoyuren
2026-02-22
Separate CPU collect / GPU train, add training CSV log
haoyuren
2026-02-22
Fix SWAP inheritance, stalemate logic, add greedy warmup
haoyuren
2026-02-22
Improve versus UI: suit colors, AI highlighting, draw tell
haoyuren
2026-02-22
Update rules: free draw/pass, remove Q in 2-player games
haoyuren
2026-02-22
Add tqdm progress bar, fix Colab username
haoyuren
2026-02-22
Add Colab GPU training notebook
haoyuren
2026-02-22
Initial commit: Blazing Eights RL agent
haoyuren