diff options
| author | zitian-gao <zitian.gao@outlook.com> | 2025-05-27 17:05:18 +0800 |
|---|---|---|
| committer | zitian-gao <zitian.gao@outlook.com> | 2025-05-27 17:05:18 +0800 |
| commit | a14795415124cdca2889fdead4607e293b8f2f16 (patch) | |
| tree | 8d0769923ae85d6ed83a282c98b0c22af83d40f9 /README.md | |
| parent | e361e1011bb6eb9303a5a0979bed84df887c71ba (diff) | |
update
Diffstat (limited to 'README.md')
| -rw-r--r-- | README.md | 9 |
1 files changed, 0 insertions, 9 deletions
@@ -2,15 +2,12 @@ <a href='https://arxiv.org/abs/2505.20282'><img src='https://img.shields.io/badge/arXiv-2505.20282-b31b1b.svg'></a> ---- - ### Reproducing One-shot EM Training (SOTA) ```bash accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64 ``` ---- ### Reproducing Multi-shot EM Training @@ -18,8 +15,6 @@ accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64 accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64 --data_path "dataset/numina/numina_00.parquet" ``` ---- - ### Evaluation ```bash @@ -27,8 +22,6 @@ cd Qwen2.5-Eval/evaluation bash sh/eval_all_math.sh ``` ---- - ### Acknowledgements Our dataset references and builds upon the following open-source contributions: @@ -41,8 +34,6 @@ Our dataset references and builds upon the following open-source contributions: We sincerely thank the authors and maintainers of these projects for their excellent contributions to the research community! ---- - ### Citation ``` @misc{gao2025oneshotentropyminimization, |
