ML Anthology
Authors
Search
About
Mehri, Shikib
3 publications
NeurIPSW
2024
Anchored Optimization and Contrastive Revisions: Addressing Reward Hacking in Alignment
Karel D'Oosterlinck
,
Winnie Xu
,
Chris Develder
,
Thomas Demeester
,
Amanpreet Singh
,
Christopher Potts
,
Douwe Kiela
,
Shikib Mehri
NeurIPSW
2023
Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language
Di Jin
,
Shikib Mehri
,
Devamanyu Hazarika
,
Aishwarya Padmakumar
,
Sungjin Lee
,
Yang Liu
,
Mahdi Namazifar
NeurIPS
2018
Middle-Out Decoding
Shikib Mehri
,
Leonid Sigal