| Age | Commit message (Collapse) | Author | |
|---|---|---|---|
| 2026-02-11 | Fix z_long definition to match code (zero-init + REINFORCE, not mean) | YurenHao0426 | |
| Paper incorrectly defined z_long as mean of item vectors. Code initializes z_long at zero and learns purely via REINFORCE. Also clarifies z_short reset-per-session behavior. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> | |||
