summaryrefslogtreecommitdiff
path: root/scripts/test_reward_cmp_15667063.out
diff options
context:
space:
mode:
authorYurenHao0426 <blackhao0426@gmail.com>2026-01-27 12:15:45 -0600
committerYurenHao0426 <blackhao0426@gmail.com>2026-01-27 12:15:45 -0600
commit680513b7771a29f27cbbb3ffb009a69a913de6f9 (patch)
treea0d60aef9ade1b2953b915f535b990c0de95e493 /scripts/test_reward_cmp_15667063.out
parentc06ec2f3b80f8968f09eb801b69237495b055ec1 (diff)
local reward model
Diffstat (limited to 'scripts/test_reward_cmp_15667063.out')
-rw-r--r--scripts/test_reward_cmp_15667063.out16
1 files changed, 16 insertions, 0 deletions
diff --git a/scripts/test_reward_cmp_15667063.out b/scripts/test_reward_cmp_15667063.out
new file mode 100644
index 0000000..1a38997
--- /dev/null
+++ b/scripts/test_reward_cmp_15667063.out
@@ -0,0 +1,16 @@
+=== Reward Model Comparison Test ===
+Local: Qwen2.5-1.5B-Instruct
+API: GPT-5-nano
+
+================================================================================
+Reward Model Comparison: Qwen2.5-1.5B vs GPT-5-nano
+================================================================================
+
+Loading models/qwen2.5-1.5b-instruct...
+Model loaded.
+Running 12 test cases...
+
+--- Test 1/12: neg_constraint_restate - format preference ---
+Expected: neg_constraint_restate
+
+=== Test Complete ===