He, Yiting

3 publications

ICLR 2025 Model Editing as a Robust and Denoised Variant of DPO: A Case Study on Toxicity Rheeya Uppaal, Apratim Dey, Yiting He, Yiqiao Zhong, Junjie Hu
ICML 2025 Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction Yiting He, Zhishuai Liu, Weixin Wang, Pan Xu
NeurIPSW 2024 Model Editing as a Robust and Denoised Variant of DPO: A Case Study on Toxicity Rheeya Uppaal, Apratim Dey, Yiting He, Yiqiao Zhong, Junjie Hu