summaryrefslogtreecommitdiff
path: root/scripts/test_reward_cmp_15667063.out
diff options
context:
space:
mode:
Diffstat (limited to 'scripts/test_reward_cmp_15667063.out')
-rw-r--r--scripts/test_reward_cmp_15667063.out16
1 files changed, 16 insertions, 0 deletions
diff --git a/scripts/test_reward_cmp_15667063.out b/scripts/test_reward_cmp_15667063.out
new file mode 100644
index 0000000..1a38997
--- /dev/null
+++ b/scripts/test_reward_cmp_15667063.out
@@ -0,0 +1,16 @@
+=== Reward Model Comparison Test ===
+Local: Qwen2.5-1.5B-Instruct
+API: GPT-5-nano
+
+================================================================================
+Reward Model Comparison: Qwen2.5-1.5B vs GPT-5-nano
+================================================================================
+
+Loading models/qwen2.5-1.5b-instruct...
+Model loaded.
+Running 12 test cases...
+
+--- Test 1/12: neg_constraint_restate - format preference ---
+Expected: neg_constraint_restate
+
+=== Test Complete ===