1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
|
=== FA d=512 depth sweep REDO (separate dirs) ===
L=2 (Wed Apr 22 11:16:28 PM CDT 2026)
Using device: cuda:0
============================================================
Seed 42
============================================================
--- FA ---
[FA] Epoch 1: loss=2.0612, train=0.2476, test=0.3028
[FA] Epoch 10: loss=1.8290, train=0.3435, test=0.3705
[FA] Epoch 20: loss=1.8102, train=0.3489, test=0.3634
[FA] Epoch 30: loss=1.7963, train=0.3546, test=0.3398
[FA] Epoch 40: loss=1.7775, train=0.3605, test=0.3497
[FA] Epoch 50: loss=1.7610, train=0.3685, test=0.3288
[FA] Epoch 60: loss=1.7592, train=0.3704, test=0.3376
[FA] Epoch 70: loss=1.7588, train=0.3747, test=0.3421
[FA] Epoch 80: loss=1.7564, train=0.3751, test=0.3497
[FA] Epoch 90: loss=1.7543, train=0.3769, test=0.3472
[FA] Epoch 100: loss=1.7527, train=0.3768, test=0.3495
Final test acc: 0.3495
All results saved to results/fa_depth_scan_d512_L2/results_cifar10.json
L=4 (Wed Apr 22 11:22:52 PM CDT 2026)
Using device: cuda:0
============================================================
Seed 42
============================================================
--- FA ---
[FA] Epoch 1: loss=2.0301, train=0.2531, test=0.2917
[FA] Epoch 10: loss=1.8487, train=0.3366, test=0.3541
[FA] Epoch 20: loss=1.7864, train=0.3609, test=0.3908
[FA] Epoch 30: loss=1.7510, train=0.3724, test=0.3990
[FA] Epoch 40: loss=1.7387, train=0.3767, test=0.3946
[FA] Epoch 50: loss=1.7209, train=0.3875, test=0.4165
[FA] Epoch 60: loss=1.7052, train=0.3913, test=0.4173
[FA] Epoch 70: loss=1.6945, train=0.3963, test=0.4137
[FA] Epoch 80: loss=1.6868, train=0.4018, test=0.4219
[FA] Epoch 90: loss=1.6830, train=0.4009, test=0.4250
[FA] Epoch 100: loss=1.6781, train=0.4021, test=0.4244
Final test acc: 0.4244
All results saved to results/fa_depth_scan_d512_L4/results_cifar10.json
L=6 (Wed Apr 22 11:29:10 PM CDT 2026)
Using device: cuda:0
============================================================
Seed 42
============================================================
--- FA ---
[FA] Epoch 1: loss=2.0375, train=0.2474, test=0.2938
[FA] Epoch 10: loss=1.8616, train=0.3294, test=0.3541
[FA] Epoch 20: loss=1.8289, train=0.3459, test=0.3711
[FA] Epoch 30: loss=1.7992, train=0.3579, test=0.3857
[FA] Epoch 40: loss=1.7837, train=0.3631, test=0.3942
[FA] Epoch 50: loss=1.7699, train=0.3710, test=0.3921
[FA] Epoch 60: loss=1.7550, train=0.3741, test=0.3975
[FA] Epoch 70: loss=1.7439, train=0.3770, test=0.4058
[FA] Epoch 80: loss=1.7413, train=0.3796, test=0.4014
[FA] Epoch 90: loss=1.7382, train=0.3791, test=0.4008
[FA] Epoch 100: loss=1.7363, train=0.3785, test=0.4014
Final test acc: 0.4014
All results saved to results/fa_depth_scan_d512_L6/results_cifar10.json
L=8 (Wed Apr 22 11:36:14 PM CDT 2026)
Using device: cuda:0
============================================================
Seed 42
============================================================
--- FA ---
[FA] Epoch 1: loss=2.0431, train=0.2481, test=0.2960
[FA] Epoch 10: loss=1.8619, train=0.3303, test=0.3574
[FA] Epoch 20: loss=1.8163, train=0.3500, test=0.3617
[FA] Epoch 30: loss=1.7889, train=0.3612, test=0.3795
[FA] Epoch 40: loss=1.7651, train=0.3681, test=0.3955
[FA] Epoch 50: loss=1.7509, train=0.3738, test=0.4002
[FA] Epoch 60: loss=1.7385, train=0.3783, test=0.4060
[FA] Epoch 70: loss=1.7297, train=0.3819, test=0.4046
[FA] Epoch 80: loss=1.7255, train=0.3861, test=0.4064
[FA] Epoch 90: loss=1.7214, train=0.3872, test=0.4076
[FA] Epoch 100: loss=1.7181, train=0.3879, test=0.4094
Final test acc: 0.4094
All results saved to results/fa_depth_scan_d512_L8/results_cifar10.json
=== DEPTH SWEEP REDO DONE (Wed Apr 22 11:44:47 PM CDT 2026) ===
|