Zheng, Zhixiao

1 publication

ICLR 2026. Cat-PO: Cross-Modal Adaptive Token-Rewards for Preference Optimization in Truthful Multimodal LLMs. Zhixiao Zheng, Zheren Fu, Zhiyuan Yao, Dongming Zhang, Zhendong Mao.