diff options
| author | YurenHao0426 <blackhao0426@gmail.com> | 2026-01-27 12:15:45 -0600 |
|---|---|---|
| committer | YurenHao0426 <blackhao0426@gmail.com> | 2026-01-27 12:15:45 -0600 |
| commit | 680513b7771a29f27cbbb3ffb009a69a913de6f9 (patch) | |
| tree | a0d60aef9ade1b2953b915f535b990c0de95e493 /scripts/test_reward_cmp_15667063.out | |
| parent | c06ec2f3b80f8968f09eb801b69237495b055ec1 (diff) | |
local reward model
Diffstat (limited to 'scripts/test_reward_cmp_15667063.out')
| -rw-r--r-- | scripts/test_reward_cmp_15667063.out | 16 |
1 files changed, 16 insertions, 0 deletions
diff --git a/scripts/test_reward_cmp_15667063.out b/scripts/test_reward_cmp_15667063.out new file mode 100644 index 0000000..1a38997 --- /dev/null +++ b/scripts/test_reward_cmp_15667063.out @@ -0,0 +1,16 @@ +=== Reward Model Comparison Test === +Local: Qwen2.5-1.5B-Instruct +API: GPT-5-nano + +================================================================================ +Reward Model Comparison: Qwen2.5-1.5B vs GPT-5-nano +================================================================================ + +Loading models/qwen2.5-1.5b-instruct... +Model loaded. +Running 12 test cases... + +--- Test 1/12: neg_constraint_restate - format preference --- +Expected: neg_constraint_restate + +=== Test Complete === |
