Yadav, Prateek

12 publications

TMLR 2025 A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning Prateek Yadav, Colin Raffel, Mohammed Muqeeth, Lucas Caccia, Haokun Liu, Tianlong Chen, Mohit Bansal, Leshem Choshen, Alessandro Sordoni
ICLR 2025 BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Terry Yue Zhuo, Vu Minh Chien, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, James Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman Jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, Zijian Wang, Binyuan Hui, Niklas Muennighoff, David Lo, Daniel Fried, Xiaoning Du, Harm de Vries, Leandro Von Werra
TMLR 2025 ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization Prateek Yadav, Leshem Choshen, Colin Raffel, Mohit Bansal
TMLR 2025 What Matters for Model Merging at Scale? Prateek Yadav, Tu Vu, Jonathan Lai, Alexandra Chronopoulou, Manaal Faruqui, Mohit Bansal, Tsendsuren Munkhdalai
ICLR 2024 $\mathbb{D}^2$ Pruning: Message Passing for Balancing Diversity & Difficulty in Data Pruning Adyasha Maharana, Prateek Yadav, Mohit Bansal
TMLR 2024 INSPIRE: Incorporating Diverse Feature Preferences in Recourse Prateek Yadav, Peter Hase, Mohit Bansal
ICLR 2024 Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy Pingzhi Li, Zhenyu Zhang, Prateek Yadav, Yi-Lin Sung, Yu Cheng, Mohit Bansal, Tianlong Chen
NeurIPS 2023 Self-Chained Image-Language Model for Video Localization and Question Answering Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal
NeurIPS 2023 TIES-Merging: Resolving Interference When Merging Models Prateek Yadav, Derek Tam, Leshem Choshen, Colin A Raffel, Mohit Bansal
AISTATS 2019 Confidence-Based Graph Convolutional Networks for Semi-Supervised Learning Shikhar Vashishth, Prateek Yadav, Manik Bhandari, Partha Talukdar
NeurIPS 2019 HyperGCN: A New Method for Training Graph Convolutional Networks on Hypergraphs Naganand Yadati, Madhav Nimishakavi, Prateek Yadav, Vikram Nitin, Anand Louis, Partha Talukdar
AISTATS 2019 Lovasz Convolutional Networks Prateek Yadav, Madhav Nimishakavi, Naganand Yadati, Shikhar Vashishth, Arun Rajkumar, Partha Talukdar