Diffstat (limited to 'scripts/test_reward_cmp_15667063.out')
| -rw-r--r-- | scripts/test_reward_cmp_15667063.out | 16 |
1 files changed, 16 insertions, 0 deletions
diff --git a/scripts/test_reward_cmp_15667063.out b/scripts/test_reward_cmp_15667063.out
new file mode 100644
index 0000000..1a38997
--- /dev/null
+++ b/scripts/test_reward_cmp_15667063.out
@@ -0,0 +1,16 @@
+=== Reward Model Comparison Test ===
+Local: Qwen2.5-1.5B-Instruct
+API: GPT-5-nano
+
+================================================================================
+Reward Model Comparison: Qwen2.5-1.5B vs GPT-5-nano
+================================================================================
+
+Loading models/qwen2.5-1.5b-instruct...
+Model loaded.
+Running 12 test cases...
+
+--- Test 1/12: neg_constraint_restate - format preference ---
+Expected: neg_constraint_restate
+
+=== Test Complete ===
