[NeurIPS] Regret Minimization Experience Replay in Off-Policy Reinforcement Learning

Published in NeurIPS 2021, 2021

Recommended citation: Xu-Hui Liu, Zhenghai Xue, Jing-Cheng Pang, Shengyi Jiang, Feng Xu and Yang Yu. Regret Minimization Experience Replay in Off-Policy Reinforcement Learning. In: NeurIPS, 2021.

Direct Link