From f3897c16bb913640c2b65d2e82addab307245034 Mon Sep 17 00:00:00 2001
From: Will DePue
Date: Thu, 19 Mar 2026 14:33:47 -0700
Subject: Update README.md

---
 README.md | 1 -
 1 file changed, 1 deletion(-)

diff --git a/README.md b/README.md
index 72763cc..98aa544 100644
--- a/README.md
+++ b/README.md
@@ -30,7 +30,6 @@ Happy training!
 
 | Run | Score | Author | Summary | Date | Info |
 |-----|------:|--------|---------|------|------|
-| Int6 quant | 1.1574 | samuellarson | Quantizing to int6 allows scaling to 21.8M params via 3x MLP | 2026-03-19 | [info](records/track_10min_16mb/2026-03-19_WarmdownQuantization/README.md) |
 | Muon WD + 10 layer | 1.1748 | notapplica | Includes prev. wins + Spectral embed init + resid mix | 2026-03-19 | [info](records/track_10min_16mb/2026-03-19_SlidingWindow_FP16Emb_10L_MuonWD_OvertoneInit/README.md) |
 | Sliding Window Eval | 1.1925 | Matthew Li | Sliding window evaluation at stride=64, increasing context for eval | 2026-03-19 | [info](records/track_10min_16mb/2026-03-19_SlidingWindowEval/README.md) |
 | 4k seq length| 1.2014 | Spokane Way | 4k seq length + better hypers | 2026-03-19 | [info](records/track_10min_16mb/2026-03-18_LongContextSeq2048/README.md) |
-- 
cgit v1.2.3