ML Anthology
Authors
Search
About
Houliston, Sam
2 publications
NeurIPSW
2024
Uncertainty-Penalized Direct Preference Optimization
Sam Houliston
,
Alizée Pace
,
Alexander Immer
,
Gunnar Ratsch
NeurIPSW
2024
Uncertainty-Penalized Direct Preference Optimization
Sam Houliston
,
Alizée Pace
,
Alexander Immer
,
Gunnar Ratsch