[NeurIPS] Regret Minimization Experience Replay in Off-Policy Reinforcement Learning
Published in NeurIPS 2021, 2021
Recommended citation: Xu-Hui Liu, Zhenghai Xue, Jing-Cheng Pang, Shengyi Jiang, Feng Xu and Yang Yu. Regret Minimization Experience Replay in Off-Policy Reinforcement Learning. In: NeurIPS, 2021.