constrained reinforcement learning

Artificial Intelligence Machine Learning

Eureka: Human-Level Reward Design via Coding Large Language Models

root October 22, 2023 0

This article presents EUREKA, a human-level reward design algorithm powered by Large Language Models (LLMs). It employs LLMs’ zero-shot generation and code-writing abilities to perform evolutionary optimization over reward code….

Artificial Intelligence Machine Learning

Safe RLHF: Safe Reinforcement Learning from Human Feedback

root October 22, 2023 0

The researchers from Peking University have proposed a novel algorithm, Safe Reinforcement Learning from Human Feedback (Safe RLHF), aimed at enhancing the safety and performance of Large Language Models (LLMs)….

Machine Learning Robotics

Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning

root October 18, 2023 0

The paper, ‘Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning’ by Yunlong Song et al., presents a study comparing the effectiveness of Reinforcement Learning (RL) and Optimal…

Machine Learning

Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks

root October 17, 2023 0

The study focuses on off-policy evaluation (OPE) in reinforcement learning using human preference data. The authors explore the sample efficiency of OPE, establishing a statistical guarantee for it. They approach…

Artificial Intelligence Machine Learning

Confronting Reward Model Overoptimization with Constrained RLHF

root October 9, 2023 0

The paper focuses on the issue of overoptimization in large language models (LLMs) which are optimized to align with human preferences. The authors highlight that human preferences are multi-faceted and…

Previous Page 6 of 6

Press ESC to close

constrained reinforcement learning

Please allow ads on our site