Wang, Weixin

6 publications

ICLR 2026. Breaking the Total Variance Barrier: Sharp Sample Complexity for Linear Heteroscedastic Bandits with Fixed Action Set. Heyang Zhao, Tianyuan Jin, Weixin Wang, Vincent Y. F. Tan, Pan Xu, Quanquan Gu
ICLR 2026. Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment. Ruoxi Cheng, Hao-Xuan Ma, Weixin Wang, Ranjie Duan, Jiexi Liu, Xiaoshuang Jia, Simeng Qin, Xiaochun Cao, Yang Liu, Xiaojun Jia
UAI 2025. A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time. Yeqi Gao, Zhao Song, Weixin Wang, Junze Yin
CPAL 2025. Fast and Efficient Matching Algorithm with Deadline Instances. Zhao Song, Weixin Wang, Chenbo Yin, Junze Yin
ICML 2025. Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction. Yiting He, Zhishuai Liu, Weixin Wang, Pan Xu
NeurIPS 2024. Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning. Hao-Lun Hsu, Weixin Wang, Miroslav Pajic, Pan Xu