Reinforcement Learning with Promising Tokens for Large Language Models

Published:

Recommended citation: Jing-Cheng Pang, Liang Lu, Xian Tang, Kun Jiang, Sijie Wu, Kai Zhang and Xubin Li. Reinforcement Learning with Promising Tokens for Large Language Models. CoRR abs/2602.03195, 2026.

Direct Link