summaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
authorOne <imone@tuta.io>2025-07-21 18:40:40 +0800
committerOne <imone@tuta.io>2025-07-21 18:40:40 +0800
commit171e2fcde636bcb7e6c0073a9983ed5252f04753 (patch)
treed7844d28ad5f289c25a046e58ec9d20216cfba44 /README.md
parentbd6222774edcec1608a6842d0b06a637a4acef59 (diff)
Update
Diffstat (limited to 'README.md')
-rw-r--r--README.md32
1 files changed, 27 insertions, 5 deletions
diff --git a/README.md b/README.md
index 87620eb..b039222 100644
--- a/README.md
+++ b/README.md
@@ -15,16 +15,16 @@ These results underscore HRM’s potential as a transformative advancement towar
Ensure PyTorch and CUDA are installed. The repo needs CUDA extensions to be built. If not present, run the following commands:
```bash
-# Install CUDA 12.4
-CUDA_URL=https://developer.download.nvidia.com/compute/cuda/12.4.1/local_installers/cuda_12.4.1_550.54.15_linux.run
+# Install CUDA 12.6
+CUDA_URL=https://developer.download.nvidia.com/compute/cuda/12.6.3/local_installers/cuda_12.6.3_560.35.05_linux.run
wget -q --show-progress --progress=bar:force:noscroll -O cuda_installer.run $CUDA_URL
sudo sh cuda_installer.run --silent --toolkit --override
-export CUDA_HOME=/usr/local/cuda-12.4
+export CUDA_HOME=/usr/local/cuda-12.6
-# Install PyTorch with CUDA 12.4
-PYTORCH_INDEX_URL=https://download.pytorch.org/whl/cu124
+# Install PyTorch with CUDA 12.6
+PYTORCH_INDEX_URL=https://download.pytorch.org/whl/cu126
pip3 install torch torchvision torchaudio --index-url $PYTORCH_INDEX_URL
@@ -32,6 +32,20 @@ pip3 install torch torchvision torchaudio --index-url $PYTORCH_INDEX_URL
pip3 install packaging ninja wheel setuptools setuptools-scm
```
+Then install FlashAttention. For Hopper GPUs, install FlashAttention 3
+
+```bash
+git clone git@github.com:Dao-AILab/flash-attention.git
+cd flash-attention/hopper
+python setup.py install
+```
+
+For Ampere or earlier GPUs, install FlashAttenion 2
+
+```bash
+pip3 install flash-attn
+```
+
## Install Python Dependencies šŸ
```bash
@@ -62,6 +76,14 @@ OMP_NUM_THREADS=8 python pretrain.py data_path=data/sudoku-extreme-1k-aug-1000 e
Runtime: ~10 hours on a RTX 4070 laptop GPU
+## Trained Checkpoints 🚧
+
+ - [ARC-AGI-2](https://huggingface.co/sapientinc/HRM-checkpoint-ARC-2)
+ - [Sudoku 9x9 Extreme (1000 examples)](https://huggingface.co/sapientinc/HRM-checkpoint-sudoku-extreme)
+ - [Maze 30x30 Hard (1000 examples)](https://huggingface.co/sapientinc/HRM-checkpoint-maze-30x30-hard)
+
+To use the checkpoints, see Evaluation section below.
+
## Full-scale Experiments šŸ”µ
Experiments below assume an 8-GPU setup.