| author | Will DePue <williamd@openai.com> | 2026-03-19 14:06:53 -0700 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2026-03-19 14:06:53 -0700 |
| commit | 2d6e9e0e613de139f95dd6f191f91853bec13894 | |
| tree | ee86063778c05f04b6982a0df20067338ab03723 /README.md | |
| parent | b87b883785733e1b43e17fece9eb001bd0fd69dd | |
Update README.md
Diffstat (limited to 'README.md')
| -rw-r--r-- | README.md | 2 |
1 files changed, 1 insertions, 1 deletions
@@ -30,8 +30,8 @@ Happy training!
 
 | Run | Score | Author | Summary | Date | Info |
 |-----|------:|--------|---------|------|------|
+| Sliding Window Eval | 1.1925 | Matthew Li | Sliding window evaluation at stride=64, increasing context for eval | 2026-03-18 | [info](records/track_10min_16mb/2026-03-19_SlidingWindowEval/README.md) |
 | 4k seq length| 1.2014 | Spokane Way | 4k seq length + better hypers | 2026-03-18 | [info](records/track_10min_16mb/2026-03-18_LongContextSeq2048/README.md) |
-| Sliding Window Eval | 1.206 | Matthew Li | Sliding window evaluation at stride=64, increasing context for eval | 2026-03-18 | [info](records/track_10min_16mb/2026-03-19_SlidingWindowEval/README.md) |
 | 2048 seq length | 1.206 | Spokane Way | 2048 seq length (train + val) | 2026-03-18 | [info](records/track_10min_16mb/2026-03-18_LongContextSeq2048/README.md) |
 | fp16 Embed | 1.2197 | Renier Velazco | FP16 Tied Embedding + LR/Warmdown Tuning | 2026-03-18 | [info](records/track_10min_16mb/2026-03-18_FP16Embed_WD3600/README.md) |
 | Naive Baseline | 1.2244 | Baseline | 9layer 512dim 1024vocab TiedEmbeddings 4 KV heads | 2026-03-18 | [info](records/track_10min_16mb/2026-03-17_NaiveBaseline/README.md) |
