From 56af282a6177c43da8a639fa59bca51c9b1412dc Mon Sep 17 00:00:00 2001 From: zitian-gao Date: Tue, 27 May 2025 17:03:03 +0800 Subject: update --- README.md | 60 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++- 1 file changed, 59 insertions(+), 1 deletion(-) (limited to 'README.md') diff --git a/README.md b/README.md index 7d3b0e6..3e1f8ea 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,61 @@ ## One-shot Entropy Minimization -   \ No newline at end of file +   + +### Overview + +This repository provides code and instructions to reproduce the results of the **One-shot Entropy Minimization (EM)** method presented in our paper. It includes both 1-shot and multi-shot training setups, as well as evaluation using the Qwen2.5-Math benchmark. + +--- + +### Reproducing One-shot EM Training (SOTA) + +```bash +accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64 +``` + +--- + +### Reproducing Multi-shot EM Training + +```bash +accelerate launch train.py --lr 2e-5 --temperature 0.5 --bsz 64 --data_path "dataset/numina/numina_00.parquet" +``` + +--- + +### Evaluation + +```bash +cd Qwen2.5-Eval/evaluation +bash sh/eval_all_math.sh +``` + +--- + +### Acknowledgements + +Our dataset references and builds upon the following open-source contributions: + +- [NuminaMath-CoT](https://huggingface.co/datasets/AI-MO/NuminaMath-CoT) +- [DeepScaler](https://github.com/agentica-project/deepscaler) +- [One-shot RLVR](https://github.com/ypwang61/One-Shot-RLVR/) – for data selection strategies +- [Qwen2.5-Eval](https://github.com/QwenLM/Qwen2.5-Math/) – for evaluation benchmarks + +We sincerely thank the authors and maintainers of these projects for their excellent contributions to the research community! + + +--- + +### Citation +``` +@misc{gao2025oneshotentropyminimization, + title={One-shot Entropy Minimization}, + author={Zitian Gao and Lynx Chen and Joey Zhou and Bryan Dai}, + year={2025}, + eprint={2505.20282}, + archivePrefix={arXiv}, + primaryClass={cs.CL}, + url={https://arxiv.org/abs/2505.20282}, +} +``` \ No newline at end of file -- cgit v1.2.3