Diffstat (limited to 'results/threshold_sensitivity_output.txt')
-rw-r--r-- results/threshold_sensitivity_output.txt 105
1 files changed, 105 insertions, 0 deletions
diff --git a/results/threshold_sensitivity_output.txt b/results/threshold_sensitivity_output.txt
new file mode 100644
index 0000000..083176b
--- /dev/null
+++ b/results/threshold_sensitivity_output.txt
@@ -0,0 +1,105 @@
+========================================================================================
+DIAGNOSTIC (a) per-block growth: sensitivity over threshold
+========================================================================================
+method seed value >5× >10× >20× >50× >100× >500× >1000× >5000×
+bp 42 1.00e+00 ok ok ok ok ok ok ok ok
+dfa 42 2.04e+03 FIRE FIRE FIRE FIRE FIRE FIRE FIRE ok
+state_bridge 42 1.28e+04 FIRE FIRE FIRE FIRE FIRE FIRE FIRE FIRE
+credit_bridge 42 1.82e+03 FIRE FIRE FIRE FIRE FIRE FIRE FIRE ok
+ep 42 2.87e+00 ok ok ok ok ok ok ok ok
+bp 123 9.59e-01 ok ok ok ok ok ok ok ok
+dfa 123 9.78e+02 FIRE FIRE FIRE FIRE FIRE FIRE ok ok
+state_bridge 123 2.41e+04 FIRE FIRE FIRE FIRE FIRE FIRE FIRE FIRE
+credit_bridge 123 6.94e+02 FIRE FIRE FIRE FIRE FIRE FIRE ok ok
+ep 123 1.10e+01 FIRE FIRE ok ok ok ok ok ok
+bp 456 9.63e-01 ok ok ok ok ok ok ok ok
+dfa 456 2.55e+03 FIRE FIRE FIRE FIRE FIRE FIRE FIRE ok
+state_bridge 456 1.05e+04 FIRE FIRE FIRE FIRE FIRE FIRE FIRE FIRE
+credit_bridge 456 1.03e+03 FIRE FIRE FIRE FIRE FIRE FIRE FIRE ok
+ep 456 6.10e+00 FIRE ok ok ok ok ok ok ok
+
+Reading: BP/EP rows should be 'ok' at the default threshold (50). BP is 'ok'
+across the whole range; EP fires only at >5× and >10× (seeds 123 and 456).
+DFA/SB/CB rows should be 'FIRE' at the default threshold with margin on both sides.
+
+========================================================================================
+DIAGNOSTIC (b) g-norm floor: sensitivity over threshold
+========================================================================================
+method seed value <1e-09 <1e-08 <1e-07 <1e-06 <1e-05
+bp 42 3.70e-04 ok ok ok ok ok
+dfa 42 4.17e-09 ok FIRE FIRE FIRE FIRE
+state_bridge 42 1.84e-09 ok FIRE FIRE FIRE FIRE
+credit_bridge 42 9.01e-10 FIRE FIRE FIRE FIRE FIRE
+ep 42 1.64e-04 ok ok ok ok ok
+bp 123 3.09e-04 ok ok ok ok ok
+dfa 123 2.84e-09 ok FIRE FIRE FIRE FIRE
+state_bridge 123 2.24e-09 ok FIRE FIRE FIRE FIRE
+credit_bridge 123 4.18e-09 ok FIRE FIRE FIRE FIRE
+ep 123 1.02e-04 ok ok ok ok ok
+bp 456 4.02e-04 ok ok ok ok ok
+dfa 456 1.90e-09 ok FIRE FIRE FIRE FIRE
+state_bridge 456 2.40e-09 ok FIRE FIRE FIRE FIRE
+credit_bridge 456 2.38e-09 ok FIRE FIRE FIRE FIRE
+ep 456 1.16e-04 ok ok ok ok ok
+
+========================================================================================
+DIAGNOSTIC (c) stability ceiling: sensitivity over threshold
+========================================================================================
+method seed value >0.10 >0.20 >0.30 >0.50 >0.70 >0.90
+bp 42 0.099 ok ok ok ok ok ok
+dfa 42 0.047 ok ok ok ok ok ok
+state_bridge 42 0.992 FIRE FIRE FIRE FIRE FIRE FIRE
+credit_bridge 42 0.352 FIRE FIRE FIRE ok ok ok
+ep 42 -0.036 ok ok ok ok ok ok
+bp 123 0.087 ok ok ok ok ok ok
+dfa 123 0.436 FIRE FIRE FIRE ok ok ok
+state_bridge 123 0.561 FIRE FIRE FIRE FIRE ok ok
+credit_bridge 123 0.250 FIRE FIRE ok ok ok ok
+ep 123 0.120 FIRE ok ok ok ok ok
+bp 456 0.114 FIRE ok ok ok ok ok
+dfa 456 -0.005 ok ok ok ok ok ok
+state_bridge 456 0.035 ok ok ok ok ok ok
+credit_bridge 456 0.518 FIRE FIRE FIRE FIRE ok ok
+ep 456 -0.024 ok ok ok ok ok ok
+
+========================================================================================
+VERDICT ROBUSTNESS: at what threshold does each verdict CHANGE?
+========================================================================================
+method         seed  (a) value  (a) flip near  (b) value  (b) flip near
+bp               42   1.00e+00       1.00e+00   3.70e-04       3.70e-04
+dfa              42   2.04e+03       2.04e+03   4.17e-09       4.17e-09
+state_bridge     42   1.28e+04       1.28e+04   1.84e-09       1.84e-09
+credit_bridge    42   1.82e+03       1.82e+03   9.01e-10       9.01e-10
+ep               42   2.87e+00       2.87e+00   1.64e-04       1.64e-04
+bp              123   9.59e-01       9.59e-01   3.09e-04       3.09e-04
+dfa             123   9.78e+02       9.78e+02   2.84e-09       2.84e-09
+state_bridge    123   2.41e+04       2.41e+04   2.24e-09       2.24e-09
+credit_bridge   123   6.94e+02       6.94e+02   4.18e-09       4.18e-09
+ep              123   1.10e+01       1.10e+01   1.02e-04       1.02e-04
+bp              456   9.63e-01       9.63e-01   4.02e-04       4.02e-04
+dfa             456   2.55e+03       2.55e+03   1.90e-09       1.90e-09
+state_bridge    456   1.05e+04       1.05e+04   2.40e-09       2.40e-09
+credit_bridge   456   1.03e+03       1.03e+03   2.38e-09       2.38e-09
+ep              456   6.10e+00       6.10e+00   1.16e-04       1.16e-04
+
+Interpretation:
+ - Each 'flip near' entry equals the raw diagnostic value: the verdict flips
+ where the threshold crosses it. Default (a) threshold 50 fires on raw values
+ > 50; default (b) threshold 1e-7 fires on raw values < 1e-7. Compare:
+ (a) max BP per-block growth across 3 seeds: 1.00e+00
+ (a) max EP per-block growth across 3 seeds: 1.10e+01
+ (a) min DFA per-block growth across 3 seeds: 9.78e+02
+ (a) min SB per-block growth across 3 seeds: 1.05e+04
+ (a) min CB per-block growth across 3 seeds: 6.94e+02
+ -> separation gap: healthy max = 1.10e+01,
+ degenerate min = 6.94e+02,
+ gap factor = 63×
+
+ (b) min BP ‖g_L‖ across 3 seeds: 3.09e-04
+ (b) min EP ‖g_L‖ across 3 seeds: 1.02e-04
+ (b) max DFA ‖g_L‖ across 3 seeds: 4.17e-09
+ (b) max SB ‖g_L‖ across 3 seeds: 2.40e-09
+ (b) max CB ‖g_L‖ across 3 seeds: 4.18e-09
+ -> separation gap: healthy min = 1.02e-04,
+ degenerate max = 4.18e-09,
+ gap factor = 24338×
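
The separation gaps reported above can be re-derived from the tabulated values. The following is a minimal sketch, not the script that produced this output: the dictionary layout and the helper names `gap_above` and `gap_below` are hypothetical, and the inputs are the rounded three-digit values printed in the tables. Because of that rounding, the (b) factor it computes (about 2.44e+04) differs slightly from the printed 24338×, which was presumably computed from unrounded values.

```python
# Raw values copied from diagnostics (a) and (b) above, seeds 42, 123, 456.
growth = {
    "bp": [1.00e+00, 9.59e-01, 9.63e-01],
    "ep": [2.87e+00, 1.10e+01, 6.10e+00],
    "dfa": [2.04e+03, 9.78e+02, 2.55e+03],
    "state_bridge": [1.28e+04, 2.41e+04, 1.05e+04],
    "credit_bridge": [1.82e+03, 6.94e+02, 1.03e+03],
}
g_norm = {
    "bp": [3.70e-04, 3.09e-04, 4.02e-04],
    "ep": [1.64e-04, 1.02e-04, 1.16e-04],
    "dfa": [4.17e-09, 2.84e-09, 1.90e-09],
    "state_bridge": [1.84e-09, 2.24e-09, 2.40e-09],
    "credit_bridge": [9.01e-10, 4.18e-09, 2.38e-09],
}

HEALTHY = ("bp", "ep")
DEGENERATE = ("dfa", "state_bridge", "credit_bridge")


def gap_above(values):
    """Gap for a '>' diagnostic: degenerate runs sit above healthy runs."""
    healthy_max = max(v for m in HEALTHY for v in values[m])
    degenerate_min = min(v for m in DEGENERATE for v in values[m])
    return healthy_max, degenerate_min, degenerate_min / healthy_max


def gap_below(values):
    """Gap for a '<' diagnostic: degenerate runs sit below healthy runs."""
    healthy_min = min(v for m in HEALTHY for v in values[m])
    degenerate_max = max(v for m in DEGENERATE for v in values[m])
    return healthy_min, degenerate_max, healthy_min / degenerate_max


if __name__ == "__main__":
    h, d, f = gap_above(growth)
    print(f"(a) healthy max = {h:.2e}, degenerate min = {d:.2e}, gap = {f:.0f}x")
    h, d, f = gap_below(g_norm)
    print(f"(b) healthy min = {h:.2e}, degenerate max = {d:.2e}, gap = {f:.0f}x")
```

Any threshold strictly between the two ends of a gap separates the healthy and degenerate groups on these three seeds; the defaults named in the interpretation (50 and 1e-7) both fall inside their respective gaps.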