Verma, Arun

18 publications

ICLRW 2025 Active Human Feedback Collection via Neural Contextual Dueling Bandits Arun Verma, Xiaoqiang Lin, Zhongxiang Dai, Daniela Rus, Bryan Kian Hsiang Low
NeurIPS 2025 Incentivizing Time-Aware Fairness in Data Sharing Jiangwei Chen, Kieu Thao Nguyen Pham, Rachael Hwee Ling Sim, Arun Verma, Zhaoxuan Wu, Chuan-Sheng Foo, Bryan Kian Hsiang Low
ICLR 2025 Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, Bryan Kian Hsiang Low
ICLRW 2025 Understanding the Relationship Between Prompts and Response Uncertainty in Large Language Models Ze Yu Zhang, Arun Verma, Finale Doshi-Velez, Bryan Kian Hsiang Low
ICMLW 2024 Neural Dueling Bandits Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, Bryan Kian Hsiang Low
ICMLW 2024 Prompt Optimization with Human Feedback Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low
NeurIPS 2023 Exploiting Correlated Auxiliary Feedback in Parameterized Bandits Arun Verma, Zhongxiang Dai, Yao Shu, Bryan Kian Hsiang Low
AISTATS 2023 FAIR: Fair Collaborative Active Learning with Individual Rationality for Scientific Discovery Xinyi Xu, Zhaoxuan Wu, Arun Verma, Chuan Sheng Foo, Bryan Kian Hsiang Low
ICLR 2023 Federated Neural Bandits Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet
NeurIPS 2023 Quantum Bayesian Optimization Zhongxiang Dai, Gregory Kang Ruey Lau, Arun Verma, Yao Shu, Bryan Kian Hsiang Low, Patrick Jaillet
ICLR 2023 Risk-Aware Reinforcement Learning with Coherent Risk Measures and Non-Linear Function Approximation Thanh Lam, Arun Verma, Bryan Kian Hsiang Low, Patrick Jaillet
ICLR 2023 Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation Yao Shu, Zhongxiang Dai, Weicong Sng, Arun Verma, Patrick Jaillet, Bryan Kian Hsiang Low
ICML 2022 Bayesian Optimization Under Stochastic Delayed Feedback Arun Verma, Zhongxiang Dai, Bryan Kian Hsiang Low
NeurIPS 2021 Stochastic Multi-Armed Bandits with Control Variates Arun Verma, Manjesh Kumar Hanawal
NeurIPS 2020 Online Algorithm for Unsupervised Sequential Selection with Contextual Information Arun Verma, Manjesh Kumar Hanawal, Csaba Szepesvari, Venkatesh Saligrama
ACML 2020 Thompson Sampling for Unsupervised Sequential Selection Arun Verma, Manjesh K Hanawal, Nandyala Hemachandra
NeurIPS 2019 Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Arun Verma, Manjesh Hanawal, Arun Rajkumar, Raman Sankaran
AISTATS 2019 Online Algorithm for Unsupervised Sensor Selection Arun Verma, Manjesh Hanawal, Csaba Szepesvari, Venkatesh Saligrama