summaryrefslogtreecommitdiff
path: root/train.py
AgeCommit message (Expand)Author
6 hoursRaise entropy floor to 0.02, increase eval games to 2000haoyuren
7 hoursChange default eval_every from 10000 to 2500haoyuren
7 hoursAdd entropy annealing to escape greedy local minimum after warmuphaoyuren
7 hoursAuto-calibrate collect_batch when not specifiedhaoyuren
7 hoursBatched game collection for ~7x training speeduphaoyuren
7 hoursSeparate CPU collect / GPU train, add training CSV loghaoyuren
7 hoursFix SWAP inheritance, stalemate logic, add greedy warmuphaoyuren
16 hoursUpdate rules: free draw/pass, remove Q in 2-player gameshaoyuren
17 hoursAdd tqdm progress bar, fix Colab usernamehaoyuren
17 hoursInitial commit: Blazing Eights RL agenthaoyuren