ML Anthology
Authors
Search
About
Wang, Weixin
6 publications
ICLR
2026
Breaking the Total Variance Barrier: Sharp Sample Complexity for Linear Heteroscedastic Bandits with Fixed Action Set
Heyang Zhao
,
Tianyuan Jin
,
Weixin Wang
,
Vincent Y. F. Tan
,
Pan Xu
,
Quanquan Gu
ICLR
2026
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
Ruoxi Cheng
,
Hao-Xuan Ma
,
Weixin Wang
,
Ranjie Duan
,
Jiexi Liu
,
Xiaoshuang Jia
,
Simeng Qin
,
Xiaochun Cao
,
Yang Liu
,
Xiaojun Jia
UAI
2025
A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time
Yeqi Gao
,
Zhao Song
,
Weixin Wang
,
Junze Yin
CPAL
2025
Fast and Efficient Matching Algorithm with Deadline Instances
Zhao Song
,
Weixin Wang
,
Chenbo Yin
,
Junze Yin
ICML
2025
Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction
Yiting He
,
Zhishuai Liu
,
Weixin Wang
,
Pan Xu
NeurIPS
2024
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
Hao-Lun Hsu
,
Weixin Wang
,
Miroslav Pajic
,
Pan Xu