summaryrefslogtreecommitdiff
path: root/src/training/trainer.py
AgeCommit message (Expand)Author
11 hoursFix NLL double-shift bug and head weight initYurenHao0426
12 hoursFix init state: add logit_bias so A≈1 at init (dense connectivity)YurenHao0426
12 hoursInitial implementation: DAGFormer Phase 1YurenHao0426