diff options
| author | YurenHao0426 <blackhao0426@gmail.com> | 2026-02-11 03:28:09 +0000 |
|---|---|---|
| committer | YurenHao0426 <blackhao0426@gmail.com> | 2026-02-11 03:28:09 +0000 |
| commit | dcc20b1f77702e5b45e2e6c08b0f243124c4676e (patch) | |
| tree | 28a2a2c7d98202f4e93de0cdbc7412c38c9fec65 /collaborativeagents/slurm/fullscale/run_rag_p0.sh | |
| parent | 6a917d3eda85e5725c2d5ad3bf5ec9bd30262198 (diff) | |
Fix z_long definition to match code (zero-init + REINFORCE, not mean)
Paper incorrectly defined z_long as mean of item vectors.
Code initializes z_long at zero and learns purely via REINFORCE.
Also clarifies z_short reset-per-session behavior.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Diffstat (limited to 'collaborativeagents/slurm/fullscale/run_rag_p0.sh')
0 files changed, 0 insertions, 0 deletions
