From 15845f0226fe5e1f64ec2ab6bd0253d59ae813ce Mon Sep 17 00:00:00 2001 From: YurenHao0426 Date: Wed, 8 Apr 2026 10:22:20 -0500 Subject: =?UTF-8?q?=C2=A73=20fix:=20correctly=20distinguish=20DFA/SB/CB=20?= =?UTF-8?q?local=20credit=20vectors?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Previous §3 ¶1 wrote the local loss as - as if it applied to DFA, SB, and CB all three. But that's only DFA's form. SB and CB use learned bridge networks to derive credit: - DFA: a_l = B_l^T e_T (fixed random projection) - State Bridge: a_l = gradient of CE(head(LN(G_psi(h_l, t_l, s))), y) where G_psi is a learned state predictor of h_L - Credit Bridge: a_l = gradient of learned value net V(h_l, t_l, s) The fix correctly writes the shared local loss form - and defines a_l for each method in-line. This also serves as the first definition of SB and CB in the paper (previously they were named in Table 1 without being defined). Main content still ends at p9 (just slightly before the bottom margin now); references span p9-p10 but are not counted against the 9-page content budget. Total 17 pages. Co-Authored-By: Claude Opus 4.6 (1M context) --- paper/main.pdf | Bin 480230 -> 481568 bytes 1 file changed, 0 insertions(+), 0 deletions(-) (limited to 'paper/main.pdf') diff --git a/paper/main.pdf b/paper/main.pdf index 8238583..817491d 100644 Binary files a/paper/main.pdf and b/paper/main.pdf differ -- cgit v1.2.3