Ping, Wei

34 publications

NeurIPS 2025 AceReason-Nemotron: Advancing Math and Code Reasoning Through Reinforcement Learning Yang Chen, Zhuolin Yang, Zihan Liu, Chankyu Lee, Peng Xu, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping
ICML 2025 Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Sreyan Ghosh, Zhifeng Kong, Sonal Kumar, S Sakshi, Jaehyeon Kim, Wei Ping, Rafael Valle, Dinesh Manocha, Bryan Catanzaro
ICLR 2025 ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Peng Xu, Wei Ping, Xianchao Wu, Chejian Xu, Zihan Liu, Mohammad Shoeybi, Bryan Catanzaro
ICLR 2025 Fugatto 1: Foundational Generative Audio Transformer Opus 1 Rafael Valle, Rohan Badlani, Zhifeng Kong, Sang-gil Lee, Arushi Goel, Sungwon Kim, Joao Felipe Santos, Shuqi Dai, Siddharth Gururani, Aya Aljafari, Alexander H. Liu, Kevin J. Shih, Ryan Prenger, Wei Ping, Chao-Han Huck Yang, Bryan Catanzaro
ICLR 2025 Mm-Embed: Universal Multimodal Retrieval with Multimodal LLMs Sheng-Chieh Lin, Chankyu Lee, Mohammad Shoeybi, Jimmy Lin, Bryan Catanzaro, Wei Ping
ICLR 2025 NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Chankyu Lee, Rajarshi Roy, Mengyao Xu, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping
ICML 2024 Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro
NeurIPS 2024 ChatQA: Surpassing GPT-4 on Conversational QA and RAG Zihan Liu, Wei Ping, Rajarshi Roy, Peng Xu, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro
ICML 2024 InstructRetro: Instruction Tuning Post Retrieval-Augmented Pretraining Boxin Wang, Wei Ping, Lawrence Mcafee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro
NeurIPS 2024 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Yue Yu, Wei Ping, Zihan Liu, Boxin Wang, Jiaxuan You, Chao Zhang, Mohammad Shoeybi, Bryan Catanzaro
ICLR 2024 Retrieval Meets Long Context Large Language Models Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro
CVPR 2024 VILA: On Pre-Training for Visual Language Models Ji Lin, Hongxu Yin, Wei Ping, Pavlo Molchanov, Mohammad Shoeybi, Song Han
ICLR 2023 BigVGAN: A Universal Neural Vocoder with Large-Scale Training Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon
ICLR 2023 Defending Against Adversarial Audio via Diffusion Model Shutong Wu, Jiongxiao Wang, Wei Ping, Weili Nie, Chaowei Xiao
CVPR 2023 FlowGrad: Controlling the Output of Generative ODEs with Gradients Xingchao Liu, Lemeng Wu, Shujian Zhang, Chengyue Gong, Wei Ping, Qiang Liu
NeurIPS 2022 Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models Boxin Wang, Wei Ping, Chaowei Xiao, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bo Li, Anima Anandkumar, Bryan Catanzaro
NeurIPS 2022 Factuality Enhanced Language Models for Open-Ended Text Generation Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Pascale N Fung, Mohammad Shoeybi, Bryan Catanzaro
ICLR 2021 DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro
NeurIPS 2021 Long-Short Transformer: Efficient Transformers for Language and Vision Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro
ICMLW 2021 On Fast Sampling of Diffusion Probabilistic Models Zhifeng Kong, Wei Ping
ICMLW 2021 RAD-TTS: Parallel Flow-Based TTS with Robust Alignment Learning and Diverse Synthesis Kevin J. Shih, Rafael Valle, Rohan Badlani, Adrian Lancucki, Wei Ping, Bryan Catanzaro
ICML 2020 Non-Autoregressive Neural Text-to-Speech Kainan Peng, Wei Ping, Zhao Song, Kexin Zhao
ICLR 2020 Parallel Neural Text-to-Speech Kainan Peng, Wei Ping, Zhao Song, Kexin Zhao
ICML 2020 WaveFlow: A Compact Flow-Based Model for Raw Audio Wei Ping, Kainan Peng, Kexin Zhao, Zhao Song
ICLR 2019 ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech Wei Ping, Kainan Peng, Jitong Chen
ICLR 2018 Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning Wei Ping, Kainan Peng, Andrew Gibiansky, Sercan O. Arik, Ajay Kannan, Sharan Narang, Jonathan Raiman, John Miller
NeurIPS 2018 Neural Voice Cloning with a Few Samples Sercan Arik, Jitong Chen, Kainan Peng, Wei Ping, Yanqi Zhou
AISTATS 2018 Topic Compositional Neural Language Model Wenlin Wang, Zhe Gan, Wenqi Wang, Dinghan Shen, Jiaji Huang, Wei Ping, Sanjeev Satheesh, Lawrence Carin
AISTATS 2017 Belief Propagation in Conditional RBMs for Structured Prediction Wei Ping, Alexander Ihler
NeurIPS 2017 Deep Voice 2: Multi-Speaker Neural Text-to-Speech Andrew Gibiansky, Sercan Arik, Gregory Diamos, John Miller, Kainan Peng, Wei Ping, Jonathan Raiman, Yanqi Zhou
NeurIPS 2016 Learning Infinite RBMs with Frank-Wolfe Wei Ping, Qiang Liu, Alex Ihler
NeurIPS 2015 Decomposition Bounds for Marginal MAP Wei Ping, Qiang Liu, Alex Ihler
ICML 2014 Marginal Structured SVM with Hidden Variables Wei Ping, Qiang Liu, Alex Ihler
AAAI 2010 Non-I.I.D. Multi-Instance Dimensionality Reduction by Learning a Maximum Bag Margin Subspace Wei Ping, Ye Xu, Kexin Ren, Chi-Hung Chi, Furao Shen