Adalgeirsson, Sigurdur Orn

1 publications

ICMLW 2023 Learning Optimal Advantage from Preferences and Mistaking It for Reward W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson, Serena Booth, Anca Dragan, Peter Stone, Scott Niekum