Huang, Kaixuan

23 publications

IJCAI 2025 Deep Reinforcement Learning for Efficient and Fair Allocation of Healthcare Resources Yikuan Li, Chengsheng Mao, Kaixuan Huang, Hanyin Wang, Zheng Yu, Mengdi Wang, Yuan Luo
ICML 2025 Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models Yukang Yang, Declan Iain Campbell, Kaixuan Huang, Mengdi Wang, Jonathan D. Cohen, Taylor Whittington Webb
ICLRW 2025 MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities Against Hard Perturbations Kaixuan Huang, Jiacheng Guo, Zihao Li, Xiang Ji, Jiawei Ge, Wenzhe Li, Yingqing Guo, Tianle Cai, Hui Yuan, Runzhe Wang, Yue Wu, Ming Yin, Shange Tang, Yangsibo Huang, Chi Jin, Xinyun Chen, Chiyuan Zhang, Mengdi Wang
ICML 2025 MATH-Perturb: Benchmarking LLMs’ Math Reasoning Abilities Against Hard Perturbations Kaixuan Huang, Jiacheng Guo, Zihao Li, Xiang Ji, Jiawei Ge, Wenzhe Li, Yingqing Guo, Tianle Cai, Hui Yuan, Runzhe Wang, Yue Wu, Ming Yin, Shange Tang, Yangsibo Huang, Chi Jin, Xinyun Chen, Chiyuan Zhang, Mengdi Wang
ICLR 2025 SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Tinghao Xie, Xiangyu Qi, Yi Zeng, Yangsibo Huang, Udari Madhushani Sehwag, Kaixuan Huang, Luxi He, Boyi Wei, Dacheng Li, Ying Sheng, Ruoxi Jia, Bo Li, Kai Li, Danqi Chen, Peter Henderson, Prateek Mittal
ICLRW 2025 Temporal Consistency for LLM Reasoning Process Error Identification Jiacheng Guo, Yue Wu, Jiahao Qiu, Kaixuan Huang, Xinzhe Juan, Ling Yang, Mengdi Wang
NeurIPS 2024 A Theoretical Perspective for Speculative Decoding Algorithm Ming Yin, Minshuo Chen, Kaixuan Huang, Mengdi Wang
ICML 2024 Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson
ICLRW 2024 Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson
NeurIPSW 2024 Embodied LLM Agents Learn to Cooperate in Organized Teams Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang
NeurIPSW 2024 Latent Diffusion Models for Controllable RNA Sequence Generation Kaixuan Huang, Yukang Yang, Kaidi Fu, Yanyi Chu, Le Cong, Mengdi Wang
ICMLW 2024 SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths Kaixuan Huang, Xudong Guo, Mengdi Wang
AAAI 2024 Visual Adversarial Examples Jailbreak Aligned Large Language Models Xiangyu Qi, Kaixuan Huang, Ashwinee Panda, Peter Henderson, Mengdi Wang, Prateek Mittal
ICLR 2023 Deep Reinforcement Learning for Cost-Effective Medical Diagnosis Zheng Yu, Yikuan Li, Joseph Chahn Kim, Kaixuan Huang, Yuan Luo, Mengdi Wang
NeurIPS 2023 Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang
ICMLW 2023 Scaling In-Context Demonstrations with Structured Attention Tianle Cai, Kaixuan Huang, Jason D. Lee, Mengdi Wang
ICML 2023 Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang
ICMLW 2023 Visual Adversarial Examples Jailbreak Aligned Large Language Models Xiangyu Qi, Kaixuan Huang, Ashwinee Panda, Mengdi Wang, Prateek Mittal
NeurIPS 2021 Fast Federated Learning in the Presence of Arbitrary Device Unavailability Xinran Gu, Kaixuan Huang, Jingzhao Zhang, Longbo Huang
NeurIPS 2021 Going Beyond Linear RL: Sample Efficient Neural Function Approximation Baihe Huang, Kaixuan Huang, Sham Kakade, Jason Lee, Qi Lei, Runzhe Wang, Jiaqi Yang
NeurIPS 2021 Optimal Gradient-Based Algorithms for Non-Concave Bandit Optimization Baihe Huang, Kaixuan Huang, Sham Kakade, Jason Lee, Qi Lei, Runzhe Wang, Jiaqi Yang
ICLR 2020 On the Convergence of FedAvg on Non-IID Data Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, Zhihua Zhang
NeurIPS 2020 Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- a Neural Tangent Kernel Perspective Kaixuan Huang, Yuqing Wang, Molei Tao, Tuo Zhao