2025-12-25 10:52:43,142 - INFO - Loaded dataset: math-500 2025-12-25 10:52:43,385 - INFO - Loaded 100 profiles from ../data/complex_profiles_v2/profiles_100.jsonl 2025-12-25 10:52:43,386 - INFO - Running method: contextual `torch_dtype` is deprecated! Use `dtype` instead! Loading checkpoint shards: 0%| | 0/4 [00:00