Chai, Yekun

4 publications

ICLR 2025 MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions Yekun Chai, Haoran Sun, Huang Fang, Shuohuan Wang, Yu Sun, Hua Wu
ICML 2024 GiLOT: Interpreting Generative Language Models via Optimal Transport Xuhong Li, Jiamin Chen, Yekun Chai, Haoyi Xiong
ICLR 2024 Tool-Augmented Reward Modeling Lei Li, Yekun Chai, Shuohuan Wang, Yu Sun, Hao Tian, Ningyu Zhang, Hua Wu
NeurIPS 2023 $\mathcal{M}^4$: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods Across Metrics, Modalities and Models Xuhong Li, Mengnan Du, Jiamin Chen, Yekun Chai, Himabindu Lakkaraju, Haoyi Xiong