Pei, Aihua

1 publications

NeurIPSW 2024 Reinforcement Learning from Multi-Role Debates as Feedback for Bias Mitigation in LLMs Ruoxi Cheng, Hao-Xuan Ma, Shuirong Cao, Jiaqi Li, Aihua Pei, Zhiqiang Wang, Pengliang Ji, Haoyu Wang, Jiaqi Huo