Merge pull request #9 from oof-baroomf/patch-1

Update README typo
author: Will DePue <williamd@openai.com> 2026-03-18 16:26:59 -0700
committer: GitHub <noreply@github.com> 2026-03-18 16:26:59 -0700
commit: 09c3e8edaa478068bcae05982b426026f1d3a023 (patch)
tree: 7e07884b89d9f1a9b359d49d1435f3359ad89a78
parent: 5472f29be414fe6b50189058c6ccc9aa3d73566d (diff)
parent: cbd940aed26c1a106a284a3ce6d26cacb81a7f72 (diff)
1 files changed, 1 insertions, 1 deletions
diff --git a/README.md b/README.md
index 2bb555b..de856a3 100644
--- a/README.md
+++ b/README.md
@@ -7,7 +7,7 @@
 
 This challenge is heavily inspired by the [NanoGPT Speedrunning](https://github.com/KellerJordan/modded-nanogpt) challenge, where participants compete to train a model that reaches 3.28 FineWeb validation loss as quickly as possible. We're excited to see how optimizing for a parameter-constrained setting pushes people toward unique architectures (test-time compute, aggressive parameter tying, depth recurrence, low-rank training, ...), compression schemes (low precision, QAT, bitnets, novel tokenizers, ...), and other creative submissions (test-time training, long context, megakernels ...). 
 
-If you're familiar with [neural scaling laws](https://arxiv.org/abs/2001.08361), you can consider this challenge a form of L(N) optimization, where the objective is to optimize the lowest loss given a fixed number of parameters (N) unconstrained by data, compute, steps, or architecture. Challenges like the [NanoGPT Speedrun](https://github.com/KellerJordan/modded-nanogpt), which optimizes for a form of L(T) (~lowest loss given constrained time) or the [NanoGPT Slowrun](https://github.com/qlabs-eng/slowrun), which optimizes for L(D) (lowest loss given constrained dataset size), can be thought of as equivalent challenges in this family.
+If you're familiar with [neural scaling laws](https://arxiv.org/abs/2001.08361), you can consider this challenge a form of L(N) optimization, where the objective is to optimize the lowest loss given a fixed number of parameters (N) unconstrained by data, compute, steps, or architecture. Challenges like the [NanoGPT Speedrun](https://github.com/KellerJordan/modded-nanogpt), which optimizes for a form of L(T) (~lowest time given constrained loss) or the [NanoGPT Slowrun](https://github.com/qlabs-eng/slowrun), which optimizes for L(D) (lowest loss given constrained dataset size), can be thought of as equivalent challenges in this family.
 
 Ideally, we'd allow for submissions to use arbitrary computational resources. But in order to make the challenge not inaccessibly expensive, we're limiting *leaderboard submissions* to 10 minutes on 8xH100s. However, we'd still love to see submissions that don't meet the compute limitation requirements in our 'Non-record Submissions' section: We're excited to see people push the infinite frontier of parameter limited performance as well.
author	Will DePue <williamd@openai.com>	2026-03-18 16:26:59 -0700
committer	GitHub <noreply@github.com>	2026-03-18 16:26:59 -0700
commit	09c3e8edaa478068bcae05982b426026f1d3a023 (patch)
tree	7e07884b89d9f1a9b359d49d1435f3359ad89a78
parent	5472f29be414fe6b50189058c6ccc9aa3d73566d (diff)
parent	cbd940aed26c1a106a284a3ce6d26cacb81a7f72 (diff)