ML Anthology
Gao, Anningzhe
5 publications
TMLR
2026
RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment
Yuhao Du, Zhuo Li, Pengyu Cheng, Zhihong Chen, Yuejiao Xie, Xiang Wan, Anningzhe Gao
AAAI
2025
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Chen Zhang, Dading Chong, Feng Jiang, Chengguang Tang, Anningzhe Gao, Guohua Tang, Haizhou Li
NeurIPS
2025
Intermediate Domain Alignment and Morphology Analogy for Patent-Product Image Retrieval
Haifan Gong, Xuanye Zhang, Ruifei Zhang, Yun Su, Zhuo Li, Yuhao Du, Anningzhe Gao, Xiang Wan, Haofeng Li
ICLRW
2025
Mitigating Short Board Effect via Dynamic Reward Balancing in Multi-Reward LLM Optimization
Nuo Chen, Yufei Gao, Yongnan Jin, Yan Hu, Anningzhe Gao, Lingyong Yan, Benyou Wang
TMLR
2025
Synthesizing Minority Samples for Long-Tailed Classification via Distribution Matching
Zhuo Li, He Zhao, Jinke Ren, Anningzhe Gao, Dandan Guo, Xiang Wan, Hongyuan Zha