Confronting Reward Model Overoptimization with Constrained RLHF
The paper addresses reward model overoptimization in large language models (LLMs) that are optimized to align with human preferences. The authors highlight that human preferences are multi-faceted and…