Yao, Jiarui

5 publications

ICLR 2026 GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving Ruida Wang, Jiarui Yao, Rui Pan, Shizhe Diao, Tong Zhang
ICLR 2026 Why Is Your Language Model a Poor Implicit Reward Model? Noam Razin, Yong Lin, Jiarui Yao, Sanjeev Arora
NeurIPS 2025 Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL Jiarui Yao, Yifan Hao, Hanning Zhang, Hanze Dong, Wei Xiong, Nan Jiang, Tong Zhang
NeurIPS 2024 Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models Yuancheng Xu, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, Furong Huang
ICLRW 2024 Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models Yuancheng Xu, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, Furong Huang