Rosati, Domenic

7 publications

NeurIPS 2025 Dependency Parsing Is More Parameter-Efficient with Normalization Paolo Gajo, Domenic Rosati, Hassan Sajjad, Alberto Barrón-Cedeño
TMLR 2025 Improving Consistency in Large Language Models Through Chain of Guidance Harsh Raj, Vipul Gupta, Domenic Rosati, Subhabrata Majumdar
TMLR 2025 Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities Zora Che, Stephen Casper, Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai, Yarin Gal, Furong Huang, Dylan Hadfield-Menell
ICML 2025 Resolving Lexical Bias in Model Editing Hammad Rizwan, Domenic Rosati, Ga Wu, Hassan Sajjad
NeurIPSW 2024 Model Manipulation Attacks Enable More Rigorous Evaluations of LLM Capabilities Zora Che, Stephen Casper, Anirudh Satheesh, Rohit Gandikota, Domenic Rosati, Stewart Slocum, Lev E McKinney, Zichu Wu, Zikui Cai, Bilal Chughtai, Daniel Filan, Furong Huang, Dylan Hadfield-Menell
NeurIPS 2024 Representation Noising: A Defence Mechanism Against Harmful Finetuning Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz
NeurIPSW 2022 Measuring Reliability of Large Language Models Through Semantic Consistency Harsh Raj, Domenic Rosati, Subhabrata Majumdar