Lou, Hantao

4 publications

ICML 2025 SAE-V: Interpreting Multimodal Models for Enhanced Alignment Hantao Lou, Changye Li, Jiaming Ji, Yaodong Yang
AAAI 2025 Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction Hantao Lou, Jiaming Ji, Kaile Wang, Yaodong Yang
NeurIPS 2024 Aligner: Efficient Alignment by Learning to Correct Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Tianyi Qiu, Yaodong Yang
NeurIPSW 2024 Language Models Resist Alignment Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Changye Li, Hantao Lou, Jiayi Zhou, Josef Dai, Yaodong Yang