Generalizing Reward Modeling for Out-of-Distribution Preference Learning
Preference learning aims to align the generations of large language models (LLMs) with human preferences. Most prior work focuses on in-distribution preference learning, whereas this research addresses out-of-distribution (OOD) preference…