Hadji-Kyriacou, Avelina Asada

1 publications

NeurIPS 2024 Would I Lie to You? Inference Time Alignment of Language Models Using Direct Preference Heads Avelina Asada Hadji-Kyriacou, Ognjen Arandjelović