Reinforcement Learning with Promising Tokens for Large Language Models
Published:
Recommended citation: Jing-Cheng Pang, Liang Lu, Xian Tang, Kun Jiang, Sijie Wu, Kai Zhang and Xubin Li. Reinforcement Learning with Promising Tokens for Large Language Models. ICML 2026 Workshop on Foundations of Deep Generative Models.
