Saleh, Mohammad

9 publications

ICLR 2025. Building Math Agents with Multi-Turn Iterative Preference Learning. Wei Xiong, Chengshuai Shi, Jiaming Shen, Aviv Rosenberg, Zhen Qin, Daniele Calandriello, Misha Khalman, Rishabh Joshi, Bilal Piot, Mohammad Saleh, Chi Jin, Tong Zhang, Tianqi Liu
ICLR 2025. RRM: Robust Reward Model Training Mitigates Reward Hacking. Tianqi Liu, Wei Xiong, Jie Ren, Lichang Chen, Junru Wu, Rishabh Joshi, Yang Gao, Jiaming Shen, Zhen Qin, Tianhe Yu, Daniel Sohn, Anastasia Makarova, Jeremiah Zhe Liu, Yuan Liu, Bilal Piot, Abe Ittycheriah, Aviral Kumar, Mohammad Saleh
ICLR 2024. Statistical Rejection Sampling Improves Preference Optimization. Tianqi Liu, Yao Zhao, Rishabh Joshi, Misha Khalman, Mohammad Saleh, Peter J Liu, Jialu Liu
ICLR 2023. Calibrating Sequence Likelihood Improves Conditional Language Generation. Yao Zhao, Mikhail Khalman, Rishabh Joshi, Shashi Narayan, Mohammad Saleh, Peter J Liu
ICLR 2023. Out-of-Distribution Detection and Selective Generation for Conditional Language Models. Jie Ren, Jiaming Luo, Yao Zhao, Kundan Krishna, Mohammad Saleh, Balaji Lakshminarayanan, Peter J Liu
NeurIPSW 2022. Improving the Robustness of Conditional Language Models by Detecting and Removing Input Noise. Kundan Krishna, Yao Zhao, Jie Ren, Balaji Lakshminarayanan, Jiaming Luo, Mohammad Saleh, Peter J Liu
NeurIPSW 2022. Out-of-Distribution Detection and Selective Generation for Conditional Language Models. Jie Ren, Jiaming Luo, Yao Zhao, Kundan Krishna, Mohammad Saleh, Balaji Lakshminarayanan, Peter J Liu
ICML 2020. PEGASUS: Pre-Training with Extracted Gap-Sentences for Abstractive Summarization. Jingqing Zhang, Yao Zhao, Mohammad Saleh, Peter Liu
ICLR 2018. Generating Wikipedia by Summarizing Long Sequences. Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer