Finite-Time Analysis Papers

Critic-Actor for Average Reward MDPs with Function Approximation: A Finite-Time Analysis

root February 5, 2024 0

This research presents the first critic-actor algorithm with function approximation and finite-time analysis, used in the long-run average reward setting. The algorithm is designed to solve reinforcement learning problems by…

Press ESC to close

Finite-Time Analysis

Please allow ads on our site