Diffstat (limited to 'README.md')
-rw-r--r-- README.md 9
1 files changed, 9 insertions, 0 deletions
diff --git a/README.md b/README.md
index 6f859a6..d5179b0 100644
--- a/README.md
+++ b/README.md
@@ -2,12 +2,15 @@
<a href='https://arxiv.org/abs/2505.20282'><img src='https://img.shields.io/badge/arXiv-2505.20282-b31b1b.svg'></a> &nbsp;
+---
+
### Reproducing One-shot EM Training (SOTA)
```bash
accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64
```
+---
### Reproducing Multi-shot EM Training
@@ -15,6 +18,8 @@ accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64
accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64 --data_path "dataset/numina/numina_00.parquet"
```
+---
+
### Evaluation
```bash
@@ -22,6 +27,8 @@ cd Qwen2.5-Eval/evaluation
bash sh/eval_all_math.sh
```
+---
+
### Acknowledgements
Our dataset references and builds upon the following open-source contributions:
@@ -34,6 +41,8 @@ Our dataset references and builds upon the following open-source contributions:
We sincerely thank the authors and maintainers of these projects for their excellent contributions to the research community!
+---
+
### Citation
```
@misc{gao2025oneshotentropyminimization,