path: root/src/training/trainer.py
| Age | Commit message | Author |
|---|---|---|
| 15 hours | Fix NLL double-shift bug and head weight init | YurenHao0426 |
| 16 hours | Fix init state: add logit_bias so A≈1 at init (dense connectivity) | YurenHao0426 |
| 17 hours | Initial implementation: DAGFormer Phase 1 | YurenHao0426 |