Lutz, Roman

3 publications

ICLR 2026 Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness Erfan Shayegani, Keegan Hines, Yue Dong, Nael Abu-Ghazaleh, Roman Lutz, Spencer Whitehead, Vidhisha Balachandran, Besmira Nushi, Vibhav Vineet

NeurIPSW 2024 Lessons from Red Teaming 100 Generative AI Products Blake Bullwinkel, Amanda J. Minnich, Shiven Chawla, Gary David Lopez Munoz, Martin Pouliot, Whitney Maxwell, Joris de Gruyter, Katherine Pratt, Saphir Qi, Nina Chikanov, Roman Lutz, Raja Sekhar Rao Dheekonda, Bolor-Erdene Jagdagdorj, Rich Lundeen, Sam Vaughan, Victoria Westerhoff, Pete Bryan, Ram Shankar Siva Kumar, Yonatan Zunger, Mark Russinovich

MLOSS 2023 Fairlearn: Assessing and Improving Fairness of AI Systems Hilde Weerts, Miroslav Dudík, Richard Edgar, Adrin Jalali, Roman Lutz, Michael Madaio