Zhao, Siyan

20 publications

ICLR 2026 Inpainting-Guided Policy Optimization for Diffusion Large Language Models Siyan Zhao, Mengchen Liu, Jing Huang, Miao Liu, Chenyu Wang, Bo Liu, Yuandong Tian, Guan Pang, Sean Bell, Aditya Grover, Feiyu Chen
ICLR 2026 SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models Chenyu Wang, Paria Rashidinejad, DiJia Su, Song Jiang, Sid Wang, Siyan Zhao, Cai Zhou, Shannon Zejiang Shen, Feiyu Chen, Tommi Jaakkola, Yuandong Tian, Bo Liu
NeurIPS 2025 D1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning Siyan Zhao, Devaansh Gupta, Qinqing Zheng, Aditya Grover
ICLR 2025 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao, Mingyi Hong, Yang Liu, Devamanyu Hazarika, Kaixiang Lin
NeurIPS 2025 MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants Hritik Bansal, Daniel Mingyi Israel, Siyan Zhao, Shufan Li, Tung Nguyen, Aditya Grover
AISTATS 2025 Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models Siyan Zhao, Daniel Mingyi Israel, Guy Van Broeck, Aditya Grover
ICMLW 2024 Fast and Memory-Efficient Multi-Sequence Generation via Structured Masking Daniel Mingyi Israel, Siyan Zhao, Guy Van den Broeck, Aditya Grover
ICLR 2024 Group Preference Optimization: Few-Shot Alignment of Large Language Models Siyan Zhao, John Dang, Aditya Grover
ICLRW 2024 Group Preference Optimization: Few-Shot Alignment of Large Language Models Siyan Zhao, John Dang, Aditya Grover
ICMLW 2024 Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models Siyan Zhao, Daniel Mingyi Israel, Guy Van den Broeck, Aditya Grover
NeurIPS 2024 Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
ICMLW 2024 Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
ICMLW 2024 Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
NeurIPSW 2024 Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
NeurIPSW 2024 Probing the Decision Boundaries of In-Context Learning in Large Language Models Download PDF Siyan Zhao, Tung Nguyen, Aditya Grover
NeurIPS 2023 Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models Siyan Zhao, Aditya Grover
ICMLW 2023 Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models Siyan Zhao, Aditya Grover
ICMLW 2023 Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models Siyan Zhao, Aditya Grover
NeurIPSW 2023 Group Preference Optimization: Few-Shot Alignment of Large Language Models Siyan Zhao, John Dang, Aditya Grover
NeurIPSW 2023 Group Preference Optimization: Few-Shot Alignment of Large Language Models Siyan Zhao, John Dang, Aditya Grover