Reinforcement Learning with Promising Tokens for Large Language Models
Published:
Recommended citation: Jing-Cheng Pang, Liang Lu, Xian Tang, Kun Jiang, Sijie Wu, Kai Zhang and Xubin Li. Reinforcement Learning with Promising Tokens for Large Language Models. CoRR abs/2602.03195, 2026.
