summaryrefslogtreecommitdiff
path: root/research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md
diff options
context:
space:
mode:
authorYurenHao0426 <blackhao0426@gmail.com>2026-06-13 12:35:36 -0500
committerYurenHao0426 <blackhao0426@gmail.com>2026-06-13 12:35:36 -0500
commit66e0d8b9fd4d0f7a2231d689c055e26fdf1cf04a (patch)
treec29cba61124018755a19b02c9d33e3ad5f2e05cc /research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md
rrm workspace: TRM/HRM/SRM code, Maze dataset, dynamical-analysis pipelineHEADmain
Curated export for clone-and-run Maze training (2x A6000) + diagnostics. trm/hrm pretrain.py carry trajectory-augmentation code (backward-compatible). Heavy artifacts (checkpoints/wandb/npz) gitignored; see PROVENANCE.md. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Diffstat (limited to 'research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md')
-rw-r--r--research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md17
1 files changed, 17 insertions, 0 deletions
diff --git a/research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md b/research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md
new file mode 100644
index 0000000..8d90e02
--- /dev/null
+++ b/research/flossing/analysis_2x2/early_pairing_hrm26040_joint.md
@@ -0,0 +1,17 @@
+# Early-window pairing — hrm26040_joint
+- paired n=2048; final acc=0.5259; already-correct@step4=0.3447
+- of final-correct, fraction already correct@4: 0.6555
+- early-window lam1: final-correct med -0.1714, final-wrong med -0.1314
+
+## Forecasting FINAL outcome from the first 4 ACT steps
+- AUC(-lam1_early -> final correct) = 0.728
+- AUC(-drift@4 -> final correct) = 0.486
+- AUC(q_halt@4 -> final correct) = 0.908
+- reference: AUC(-lam1_full -> final correct) = 0.987
+
+## Restricted to examples NOT yet correct at step 4 (the decision-relevant set)
+- n=1342, of which eventually correct: 371 (0.276)
+- AUC(-lam1_early -> eventually correct) = 0.448
+- AUC(-drift@4 -> eventually correct) = 0.312
+- AUC(q_halt@4 -> eventually correct) = 0.734
+- early lam1 med: eventually-correct -0.1225 vs never-correct -0.1314 \ No newline at end of file