Houliston, Sam

2 publications

NeurIPSW 2024 Uncertainty-Penalized Direct Preference Optimization Sam Houliston, Alizée Pace, Alexander Immer, Gunnar Rätsch
NeurIPSW 2024 Uncertainty-Penalized Direct Preference Optimization Sam Houliston, Alizée Pace, Alexander Immer, Gunnar Rätsch