summaryrefslogtreecommitdiff
path: root/scripts/test_reward_cmp_15667024.out
blob: 1a38997fd4bbbfc7fdb8d8e34dd1207d2808907c (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
=== Reward Model Comparison Test ===
Local: Qwen2.5-1.5B-Instruct
API: GPT-5-nano

================================================================================
Reward Model Comparison: Qwen2.5-1.5B vs GPT-5-nano
================================================================================

Loading models/qwen2.5-1.5b-instruct...
Model loaded.
Running 12 test cases...

--- Test 1/12: neg_constraint_restate - format preference ---
Expected: neg_constraint_restate

=== Test Complete ===