summaryrefslogtreecommitdiff
path: root/collaborativeagents/training/grpo
ModeNameSize
-rw-r--r--generate_grpo_data.py3103logplain
-rw-r--r--llama_grpo.py9296logplain