ML Anthology
Authors
Search
About
Hao, Botao
22 publications
ICML
2024
Efficient Exploration for LLMs
Vikranth Dwaracherla
,
Seyed Mohammad Asghari
,
Botao Hao
,
Benjamin Van Roy
TMLR
2024
Sequential Best-Arm Identification with Application to P300 Speller
Xin Zhou
,
Botao Hao
,
Tor Lattimore
,
Jian Kang
,
Lexin Li
TMLR
2023
Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale
Botao Hao
,
Rahul Jain
,
Dengwang Tang
,
Zheng Wen
ICML
2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao
,
Rahul Jain
,
Tor Lattimore
,
Benjamin Van Roy
,
Zheng Wen
AISTATS
2022
Confident Least Square Value Iteration with Local Access to a Simulator
Botao Hao
,
Nevena Lazic
,
Dong Yin
,
Yasin Abbasi-Yadkori
,
Csaba Szepesvari
ICML
2022
Contextual Information-Directed Sampling
Botao Hao
,
Tor Lattimore
,
Chao Qin
ALT
2022
Efficient Local Planning with Linear Function Approximation
Dong Yin
,
Botao Hao
,
Yasin Abbasi-Yadkori
,
Nevena Lazić
,
Csaba Szepesvári
ICLR
2022
Interacting Contour Stochastic Gradient Langevin Dynamics
Wei Deng
,
Siqi Liang
,
Botao Hao
,
Guang Lin
,
Faming Liang
NeurIPS
2022
Regret Bounds for Information-Directed Reinforcement Learning
Botao Hao
,
Tor Lattimore
NeurIPS
2022
The Neural Testbed: Evaluating Joint Predictions
Ian Osband
,
Zheng Wen
,
Seyed Mohammad Asghari
,
Vikranth Dwaracherla
,
Xiuyuan Lu
,
Morteza Ibrahimi
,
Dieterich Lawson
,
Botao Hao
,
Brendan O'Donoghue
,
Benjamin Van Roy
AISTATS
2021
Adaptive Approximate Policy Iteration
Botao Hao
,
Nevena Lazic
,
Yasin Abbasi-Yadkori
,
Pooria Joulani
,
Csaba Szepesvari
AISTATS
2021
Online Sparse Reinforcement Learning
Botao Hao
,
Tor Lattimore
,
Csaba Szepesvari
,
Mengdi Wang
NeurIPS
2021
Bandit Phase Retrieval
Tor Lattimore
,
Botao Hao
ICML
2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao
,
Xiang Ji
,
Yaqi Duan
,
Hao Lu
,
Csaba Szepesvari
,
Mengdi Wang
NeurIPS
2021
Information Directed Sampling for Sparse Linear Bandits
Botao Hao
,
Tor Lattimore
,
Wei Deng
ICML
2021
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient
Botao Hao
,
Yaqi Duan
,
Tor Lattimore
,
Csaba Szepesvari
,
Mengdi Wang
JMLR
2021
Sparse Tensor Additive Regression
Botao Hao
,
Boxiang Wang
,
Pengyuan Wang
,
Jingfei Zhang
,
Jian Yang
,
Will Wei Sun
AISTATS
2020
Adaptive Exploration in Linear Contextual Bandit
Botao Hao
,
Tor Lattimore
,
Csaba Szepesvari
NeurIPS
2020
High-Dimensional Sparse Linear Bandits
Botao Hao
,
Tor Lattimore
,
Mengdi Wang
AISTATS
2020
Sparse and Low-Rank Tensor Estimation via Cubic Sketchings
Botao Hao
,
Anru R. Zhang
,
Guang Cheng
NeurIPS
2019
Bootstrapping Upper Confidence Bound
Botao Hao
,
Yasin Abbasi Yadkori
,
Zheng Wen
,
Guang Cheng
JMLR
2019
Nonparametric Bayesian Aggregation for Massive Data
Zuofeng Shang
,
Botao Hao
,
Guang Cheng