| Age | Commit message | Author | |
|---|---|---|---|
| 2026-04-01 | Add element-wise gradient concentration analysis (CPU, from checkpoints) | YurenHao0426 | |
| BP gradients are relatively uniform: top1%=7.1%, PR=0.327, eff_dim=0.632. DFA gradients are extremely concentrated: top1%=40.6%, PR=0.089, eff_dim=0.272. SB/CB are intermediate: top1%=17-21%, PR=0.14-0.17. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> | |||
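
The commit message reports three concentration statistics per gradient (top1%, PR, eff_dim) but does not define them. The sketch below shows one plausible way to compute such statistics on a flattened gradient with NumPy. It assumes that top1% is the share of total absolute gradient mass held by the largest 1% of elements, that PR is a participation ratio of squared magnitudes normalized by the element count, and that eff_dim is the exponential of the Shannon entropy of the normalized magnitudes divided by the element count. The function name `concentration_metrics` and all three definitions are hypothetical and may differ from what the repository actually implements.

```python
import numpy as np


def concentration_metrics(grad):
    """Element-wise concentration statistics for one gradient tensor.

    All three definitions are assumptions made for illustration; the
    commit message does not spell them out. Assumes a non-zero gradient.
    """
    g = np.abs(np.asarray(grad, dtype=np.float64).ravel())
    n = g.size
    total = g.sum()
    if total == 0.0:
        raise ValueError("gradient is all zeros; statistics are undefined")

    # Assumed "top1%": share of total |g| held by the largest 1% of elements.
    k = max(1, n // 100)
    top_k = np.partition(g, n - k)[n - k:]
    top1_share = top_k.sum() / total

    # Assumed "PR": participation ratio of squared magnitudes,
    # (sum e)^2 / (n * sum e^2), which is 1 for a uniform gradient
    # and 1/n when a single element carries everything.
    e = g ** 2
    pr = e.sum() ** 2 / (n * (e ** 2).sum())

    # Assumed "eff_dim": exp(Shannon entropy of |g| / sum|g|) / n.
    p = g / total
    nz = p[p > 0]
    eff_dim = np.exp(-(nz * np.log(nz)).sum()) / n

    return {"top1_share": top1_share, "pr": pr, "eff_dim": eff_dim}
```

Under these assumed definitions, a perfectly uniform gradient yields a top-1% share of about 0.01 with PR and eff_dim equal to 1, while a gradient concentrated in a single element yields a top-1% share of 1 with PR and eff_dim equal to 1/n. The ordering reported in the commit (BP least concentrated, DFA most, SB/CB in between) would correspond to moving from the first extreme toward the second.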
