H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps

The article discusses the challenges of using reinforcement learning (RL) in real-world tasks due to dynamics gaps between real and simulated environments. It introduces a new algorithm, H2O+, which aims to address these issues by combining offline and online learning methods. The algorithm is designed to be flexible, accommodating various choices of offline and online learning methods and accounting for the dynamics gaps. The authors demonstrate the superior performance and flexibility of H2O+ over other RL algorithms through simulation and real-world robotics experiments.

Publication date: 25 Sep 2023
Project Page: https://sites.google.com/view/h2oplusauthors/
Paper: https://arxiv.org/pdf/2309.12716

Post Views: 400

H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

Make the U in UDA Matter: Invariant Consistency Learning for Unsupervised Domain Adaptation

Multi-Label Noise Transition Matrix Estimation with Label Correlations: Theory and Algorithm

Leave a Reply Cancel reply

Please allow ads on our site