Self-Imitation Advantage Learning (SAIL)
https://arxiv.org/abs/2012.11989
Self-imitation learning is a Reinforcement Learning (RL) method that encourages actions whose returns were higher than expected, which helps in hard exploration and sparse reward problems. It was shown to improve the performance of on-policy actor-critic m...
1. Self-imitation reinforcement learning
Self-imitation learning..
2021.05.31
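The phrase "encourages actions whose returns were higher than expected" can be made concrete with a small sketch. This is not the SAIL algorithm from the paper, only an illustration of the general self-imitation idea, assuming the commonly used clipped advantage max(R - V, 0) as the per-step weight. The function names, the toy rewards, and the critic values below are hypothetical and chosen only for illustration.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Monte Carlo return R_t = r_t + gamma * R_{t+1} for one finished episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    # Walk the episode backwards so each step reuses the return of the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def self_imitation_weights(returns, value_estimates):
    """Clipped advantage max(R - V, 0).

    The weight is positive only where the observed return exceeded the
    value estimate, so only those transitions would be imitated.
    """
    diff = np.asarray(returns, dtype=np.float64) - np.asarray(value_estimates, dtype=np.float64)
    return np.maximum(diff, 0.0)


# Toy sparse-reward episode: the reward arrives only at the last step.
rewards = [0.0, 0.0, 1.0]
values = np.array([0.1, 0.9, 0.5])  # hypothetical critic outputs V(s_t)

returns = discounted_returns(rewards, gamma=0.5)
weights = self_imitation_weights(returns, values)
print(returns)
print(weights)
```

In this toy episode the second step gets zero weight because the critic overestimated its return, while the first and last steps did better than expected and would be reinforced.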