Commit d2e8290 (verified) by virtuoussy · 1 parent: 729fe11

Update README.md

  1. README.md +5 -8
README.md CHANGED
@@ -89,13 +89,10 @@ bash train.sh {METHOD} {PRETRAIN_PATH} {DATA_PATH} {REWARD_API}
 ## Citation
 
 ```bibtex
-@misc{su2025expandingrlverifiablerewards,
-title={Expanding RL with Verifiable Rewards Across Diverse Domains},
-author={Yi Su and Dian Yu and Linfeng Song and Juntao Li and Haitao Mi and Zhaopeng Tu and Min Zhang and Dong Yu},
-year={2025},
-eprint={2503.23829},
-archivePrefix={arXiv},
-primaryClass={cs.CL},
-url={https://arxiv.org/abs/2503.23829},
+@article{su2025expanding,
+title={Expanding RL with Verifiable Rewards Across Diverse Domains},
+author={Su, Yi and Yu, Dian and Song, Linfeng and Li, Juntao and Mi, Haitao and Tu, Zhaopeng and Zhang, Min and Yu, Dong},
+journal={arXiv preprint arXiv:2503.23829},
+year={2025}
 }
 ```