2025-12-25 21:55:51,030 - INFO - Loaded dataset: math-500
2025-12-25 21:55:51,242 - INFO - Loaded 100 profiles from ../data/complex_profiles_v2/profiles_100.jsonl
2025-12-25 21:55:51,243 - INFO - Running method: vanilla
`torch_dtype` is deprecated! Use `dtype` instead!
Loading checkpoint shards: 0%| | 0/4 [00:00