Increasing Transparency of Reinforcement Learning using Shielding for Human Preferences and Explanations
The study investigates whether incorporating human preferences in Reinforcement Learning (RL) can enhance the transparency of robot behaviours. A shielding mechanism is integrated into the RL algorithm to monitor the…
Continue reading