offline RL Papers - BytesArchive

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

root October 9, 2023

The article investigates the problem of Q-value estimation divergence in offline reinforcement learning (RL). The authors identify a fundamental pattern, ‘self-excitation’, as the primary cause of this divergence. They propose…
