Aligning Crowd Feedback via Distributional Preference Reward Modeling
This paper introduces the Distributional Preference Reward Model (DPRM) to align large language models with a diverse set of human preferences. The researchers used a beta distribution to characterize preferences,…
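As a rough illustration of the idea mentioned above, the sketch below shows one hypothetical way to characterize crowd preferences between two responses with a Beta distribution; this is an assumption-laden example, not the paper's implementation, and the names (preference_distribution, n_prefer_a, n_prefer_b) are invented for illustration.

```python
# Hypothetical sketch: summarize annotator votes between two responses as a
# Beta posterior over the probability that response A is preferred.
import torch
from torch.distributions import Beta


def preference_distribution(n_prefer_a: int, n_prefer_b: int, prior: float = 1.0) -> Beta:
    """Posterior Beta over P(A preferred), given annotator vote counts
    and a symmetric Beta(prior, prior) prior."""
    return Beta(torch.tensor(prior + n_prefer_a), torch.tensor(prior + n_prefer_b))


# Example: 7 of 10 annotators preferred response A.
dist = preference_distribution(7, 3)
soft_label = dist.mean        # expected preference probability, ~0.67
uncertainty = dist.variance   # shrinks as more annotator votes are collected
print(float(soft_label), float(uncertainty))
```

In a setup like this, the soft label could stand in for a hard 0/1 preference when training a reward model, so that disagreement among annotators is reflected in the training signal rather than discarded.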