Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy

The paper discusses how causal knowledge can improve the interpretability of reinforcement learning (RL) agents’ decision-making. The authors propose an online framework that alternates between two phases: during exploration, the agent uses interventions to learn the causal structure of the environment; during exploitation, it uses the learned causal structure to guide its policy. The framework is evaluated in a simulated fault alarm environment, where it is reported to be effective and robust compared with other methods. The authors attribute the performance improvement to the cycle between causal-guided policy learning and causal structure learning.
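The alternation between intervention-based structure learning and structure-guided action selection can be illustrated with a toy loop. This is a minimal sketch under stated assumptions, not the authors' algorithm: the four-alarm graph `TRUE_EDGES`, the helper names, the epsilon-style switch between phases, and the "causal mask" that filters out alarms already known to be effects are all hypothetical choices made for illustration.

```python
import random

# Hypothetical ground truth for a toy alarm system (not from the paper):
# an edge u -> v means alarm u triggers alarm v.
TRUE_EDGES = {0: [1, 2], 1: [3], 2: [], 3: []}


def descendants(edges, node):
    """Return every node reachable from `node` by following edges."""
    seen, stack = set(), [node]
    while stack:
        for child in edges[stack.pop()]:
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def intervene(node):
    """Toy intervention: clearing `node` reveals which alarms it was driving."""
    return descendants(TRUE_EDGES, node)


def root_candidates(learned):
    """Causal mask: keep only alarms not known to be an effect of another alarm."""
    known_effects = set().union(*learned.values())
    return [n for n in learned if n not in known_effects]


def run(episodes=200, epsilon=0.3, seed=0):
    """Alternate between exploring (any alarm) and exploiting (masked alarms)."""
    rng = random.Random(seed)
    nodes = list(TRUE_EDGES)
    # learned[n] holds the alarms observed to be downstream of n so far.
    learned = {n: set() for n in nodes}
    for _ in range(episodes):
        if rng.random() < epsilon:
            # Exploration: intervene on an arbitrary alarm to probe the structure.
            target = rng.choice(nodes)
        else:
            # Exploitation: restrict the choice with the learned causal mask.
            target = rng.choice(root_candidates(learned))
        # Every action, in either phase, refines the learned structure.
        learned[target] |= intervene(target)
    return learned
```

In this toy setting, alarm 0 is never an effect of any other alarm, so it always survives the mask; once alarm 0 has been intervened on, alarms 1, 2, and 3 are all known effects and the mask narrows to alarm 0 alone. That narrowing is the sketch's stand-in for the learned structure steering the policy toward root causes.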

Publication date: 7 Feb 2024
arXiv abstract: https://arxiv.org/abs/2402.04869v1
Paper: https://arxiv.org/pdf/2402.04869
