The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization
The paper investigates the impact of preference agreement on the efficacy of Reinforcement Learning from Human Feedback (RLHF) in text summarization. The authors demonstrate that including a diverse range of…