Hu, Pihe

6 publications

TMLR 2025 Mixed Sparsity Training: Achieving 4$\times$ FLOP Reduction for Transformer Pretraining Pihe Hu, Shaolong Li, Xun Wang, Longbo Huang
ICLR 2024 Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback Yu Chen, Yihan Du, Pihe Hu, Siwei Wang, Desheng Wu, Longbo Huang
NeurIPS 2024 Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training Pihe Hu, Shaolong Li, Zhuoran Li, Ling Pan, Longbo Huang
ICLR 2023 RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch Yiqin Tan, Pihe Hu, Ling Pan, Jiatai Huang, Longbo Huang
ICLR 2023 Towards Minimax Optimal Reward-Free Reinforcement Learning in Linear MDPs Pihe Hu, Yu Chen, Longbo Huang
ICML 2022 Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation Pihe Hu, Yu Chen, Longbo Huang