Huang, Longbo

51 publications

ICLR 2025 Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks Rui Hu, Yifan Zhang, Zhuoran Li, Longbo Huang
ICLR 2025 Efficient Online Pruning and Abstraction for Imperfect Information Extensive-Form Games Boning Li, Longbo Huang
ICML 2025 Finite-Time Analysis of Discrete-Time Stochastic Interpolants Yuhao Liu, Yu Chen, Rui Hu, Longbo Huang
TMLR 2025 Mixed Sparsity Training: Achieving 4$\times$ FLOP Reduction for Transformer Pretraining Pihe Hu, Shaolong Li, Xun Wang, Longbo Huang
ICLR 2025 uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs Yu Chen, Jiatai Huang, Yan Dai, Longbo Huang
ICLR 2024 A Quadratic Synchronization Rule for Distributed Deep Learning Xinran Gu, Kaifeng Lyu, Sanjeev Arora, Jingzhao Zhang, Longbo Huang
ICML 2024 Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation Yu Chen, Xiangcheng Zhang, Siwei Wang, Longbo Huang
ICLR 2024 Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback Yu Chen, Yihan Du, Pihe Hu, Siwei Wang, Desheng Wu, Longbo Huang
ICML 2024 Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation Tonghe Zhang, Yu Chen, Longbo Huang
ICML 2024 RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning Boning Li, Zhixuan Fang, Longbo Huang
NeurIPS 2024 Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training Pihe Hu, Shaolong Li, Zhuoran Li, Ling Pan, Longbo Huang
NeurIPSW 2023 A Quadratic Synchronization Rule for Distributed Deep Learning Xinran Gu, Kaifeng Lyu, Sanjeev Arora, Jingzhao Zhang, Longbo Huang
ICML 2023 Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jiatai Huang, Yan Dai, Longbo Huang
ICLR 2023 Collaborative Pure Exploration in Kernel Bandit Yihan Du, Wei Chen, Yuko Kuroki, Longbo Huang
ICLR 2023 Generative Augmented Flow Networks Ling Pan, Dinghuai Zhang, Aaron Courville, Longbo Huang, Yoshua Bengio
ICML 2023 Multi-Task Representation Learning for Pure Exploration in Linear Bandits Yihan Du, Longbo Huang, Wen Sun
TMLR 2023 Online Min-Max Problems with Non-Convexity and Non-Stationarity Yu Huang, Yuan Cheng, Yingbin Liang, Longbo Huang
ICLR 2023 Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path Yihan Du, Siwei Wang, Longbo Huang
NeurIPS 2023 Provably Safe Reinforcement Learning with Step-Wise Violation Constraints Nuoya Xiong, Yihan Du, Longbo Huang
ICLR 2023 RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch Yiqin Tan, Pihe Hu, Ling Pan, Jiatai Huang, Longbo Huang
AAAI 2023 RePreM: Representation Pre-Training with Masked Model for Reinforcement Learning Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang
UAI 2023 Stochastic Generative Flow Networks Ling Pan, Dinghuai Zhang, Moksh Jain, Longbo Huang, Yoshua Bengio
ICLR 2023 Towards Minimax Optimal Reward-Free Reinforcement Learning in Linear MDPs Pihe Hu, Yu Chen, Longbo Huang
ICLR 2023 Why (and When) Does Local SGD Generalize Better than SGD? Xinran Gu, Kaifeng Lyu, Longbo Huang, Sanjeev Arora
ICML 2022 Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits Jiatai Huang, Yan Dai, Longbo Huang
ICML 2022 Modality Competition: What Makes Joint Training of Multi-Modal Network Fail in Deep Learning? (Provably) Yu Huang, Junyang Lin, Chang Zhou, Hongxia Yang, Longbo Huang
ICML 2022 Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation Pihe Hu, Yu Chen, Longbo Huang
NeurIPSW 2022 Online Min-Max Optimization: Nonconvexity, Nonstationarity, and Dynamic Regret Yu Huang, Yuan Cheng, Yingbin Liang, Longbo Huang
ICML 2022 Plan Better amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification Ling Pan, Longbo Huang, Tengyu Ma, Huazhe Xu
NeurIPS 2022 Provable Generalization of Overparameterized Meta-Learning Trained with SGD Yu Huang, Yingbin Liang, Longbo Huang
NeurIPSW 2022 Why (and When) Does Local SGD Generalize Better than SGD? Xinran Gu, Kaifeng Lyu, Longbo Huang, Sanjeev Arora
AAAI 2021 A One-Size-Fits-All Solution to Conservative Bandit Problems Yihan Du, Siwei Wang, Longbo Huang
AAAI 2021 Adaptive Algorithms for Multi-Armed Bandit with Composite and Anonymous Feedback Siwei Wang, Haoyun Wang, Longbo Huang
NeurIPS 2021 Continuous Mean-Covariance Bandits Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang
AAAI 2021 Exploration by Maximizing Renyi Entropy for Reward-Free RL Framework Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li
NeurIPS 2021 Fast Federated Learning in the Presence of Arbitrary Device Unavailability Xinran Gu, Kaixuan Huang, Jingzhao Zhang, Longbo Huang
NeurIPS 2021 Multi-Agent Reinforcement Learning in Stochastic Networked Systems Yiheng Lin, Guannan Qu, Longbo Huang, Adam Wierman
NeurIPSW 2021 Plan Better amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification Ling Pan, Longbo Huang, Tengyu Ma, Huazhe Xu
NeurIPS 2021 Regularized SoftMax Deep Multi-Agent Q-Learning Ling Pan, Tabish Rashid, Bei Peng, Longbo Huang, Shimon Whiteson
NeurIPS 2021 The Best of Both Worlds: Stochastic and Adversarial Episodic MDPs with Unknown Transition Tiancheng Jin, Longbo Huang, Haipeng Luo
NeurIPS 2021 What Makes Multi-Modal Learning Better than Single (Provably) Yu Huang, Chenzhuang Du, Zihui Xue, Xuanyao Chen, Hang Zhao, Longbo Huang
ICML 2020 Combinatorial Pure Exploration for Dueling Bandit Wei Chen, Yihan Du, Longbo Huang, Haoyu Zhao
IJCAI 2020 Reinforcement Learning with Dynamic Boltzmann SoftMax Updates Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang
NeurIPS 2020 Restless-UCB, an Efficient and Low-Complexity Algorithm for Online Restless Bandits Siwei Wang, Longbo Huang, John C. S. Lui
NeurIPS 2020 SoftMax Deep Double Deterministic Policy Gradients Ling Pan, Qingpeng Cai, Longbo Huang
AAAI 2019 A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems Ling Pan, Qingpeng Cai, Zhixuan Fang, Pingzhong Tang, Longbo Huang
NeurIPS 2019 Double Quantization for Communication-Efficient Distributed Optimization Yue Yu, Jiaxiang Wu, Longbo Huang
IJCAI 2018 A Social Interaction Activity Based Time-Varying User Vectorization Method for Online Social Networks Tianyi Hao, Longbo Huang
IJCAI 2018 Beyond the Click-Through Rate: Web Link Selection with Multi-Level Feedback Kun Chen, Kechao Cai, Longbo Huang, John C. S. Lui
NeurIPS 2018 Multi-Armed Bandits with Compensation Siwei Wang, Longbo Huang
IJCAI 2017 Fast Stochastic Variance Reduced ADMM for Stochastic Composition Optimization Yue Yu, Longbo Huang