ML Anthology
Authors
Search
About
Lou, Hantao
4 publications
ICML
2025
SAE-V: Interpreting Multimodal Models for Enhanced Alignment
Hantao Lou
,
Changye Li
,
Jiaming Ji
,
Yaodong Yang
AAAI
2025
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction
Hantao Lou
,
Jiaming Ji
,
Kaile Wang
,
Yaodong Yang
NeurIPS
2024
Aligner: Efficient Alignment by Learning to Correct
Jiaming Ji
,
Boyuan Chen
,
Hantao Lou
,
Donghai Hong
,
Borong Zhang
,
Xuehai Pan
,
Juntao Dai
,
Tianyi Qiu
,
Yaodong Yang
NeurIPSW
2024
Language Models Resist Alignment
Jiaming Ji
,
Kaile Wang
,
Tianyi Qiu
,
Boyuan Chen
,
Changye Li
,
Hantao Lou
,
Jiayi Zhou
,
Josef Dai
,
Yaodong Yang