blob: 1a38997fd4bbbfc7fdb8d8e34dd1207d2808907c (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
|
=== Reward Model Comparison Test ===
Local: Qwen2.5-1.5B-Instruct
API: GPT-5-nano
================================================================================
Reward Model Comparison: Qwen2.5-1.5B vs GPT-5-nano
================================================================================
Loading models/qwen2.5-1.5b-instruct...
Model loaded.
Running 12 test cases...
--- Test 1/12: neg_constraint_restate - format preference ---
Expected: neg_constraint_restate
=== Test Complete ===
|