Mehta, Sachin

26 publications

ICML 2025 CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning Qingqing Cao, Mahyar Najibi, Sachin Mehta
ICLRW 2025 From Dense to Dynamic: Token-Difficulty Driven MoEfication of Pre-Trained LLMs Kumari Nishu, Sachin Mehta, Samira Abnar, Mehrdad Farajtabar, Maxwell Horton, Mahyar Najibi, Moin Nabi, Minsik Cho, Devang Naik
ICLRW 2025 KV Prediction for Improved Time to First Token Maxwell Horton, Qingqing Cao, Chenfan Sun, Yanzi Jin, Sachin Mehta, Mohammad Rastegari, Moin Nabi
ICLR 2025 SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators Rasoul Shafipour, David Harrison, Maxwell Horton, Jeffrey Marker, Houman Bedayat, Sachin Mehta, Mohammad Rastegari, Mahyar Najibi, Saman Naderiparizi
TMLR 2024 Bytes Are All You Need: Transformers Operating Directly on File Bytes Maxwell Horton, Sachin Mehta, Ali Farhadi, Mohammad Rastegari
TMLR 2024 CLIP Meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement Mohammadreza Salehi, Mehrdad Farajtabar, Maxwell Horton, Fartash Faghri, Hadi Pouransari, Raviteja Vemulapalli, Oncel Tuzel, Ali Farhadi, Mohammad Rastegari, Sachin Mehta
ICML 2024 Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-Specific Models Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri, Sachin Mehta, Mehrdad Farajtabar, Mohammad Rastegari, Oncel Tuzel
ICMLW 2024 LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Qichen Fu, Minsik Cho, Thomas Merth, Sachin Mehta, Mohammad Rastegari, Mahyar Najibi
ICMLW 2024 OpenELM: An Efficient Language Model Family with Open Training and Inference Framework Sachin Mehta, Mohammad Hossein Sekhavat, Qingqing Cao, Maxwell Horton, Yanzi Jin, Chenfan Sun, Seyed Iman Mirzadeh, Mahyar Najibi, Dmitry Belenko, Peter Zatloukal, Mohammad Rastegari
ICLR 2024 ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models Seyed Iman Mirzadeh, Keivan Alizadeh-Vahid, Sachin Mehta, Carlo C del Mundo, Oncel Tuzel, Golnoosh Samei, Mohammad Rastegari, Mehrdad Farajtabar
CVPRW 2024 SAM-CLIP: Merging Vision Foundation Models Towards Semantic and Spatial Understanding Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari
ICLR 2024 TiC-CLIP: Continual Training of CLIP Models Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari, Raviteja Vemulapalli, Sachin Mehta, Oncel Tuzel, Vaishaal Shankar, Fartash Faghri
NeurIPSW 2024 TiC-LM: A Multi-Year Benchmark for Continual Pretraining of Language Models Jeffrey Li, Mohammadreza Armandpour, Seyed Iman Mirzadeh, Sachin Mehta, Vaishaal Shankar, Raviteja Vemulapalli, Oncel Tuzel, Mehrdad Farajtabar, Hadi Pouransari, Fartash Faghri
NeurIPSW 2023 CLIP Meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement Mohammadreza Salehi, Mehrdad Farajtabar, Maxwell Horton, Fartash Faghri, Hadi Pouransari, Raviteja Vemulapalli, Oncel Tuzel, Ali Farhadi, Mohammad Rastegari, Sachin Mehta
ICCV 2023 Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement Fartash Faghri, Hadi Pouransari, Sachin Mehta, Mehrdad Farajtabar, Ali Farhadi, Mohammad Rastegari, Oncel Tuzel
NeurIPSW 2023 SAM-CLIP: Merging Vision Foundation Models Towards Semantic and Spatial Understanding Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari
TMLR 2023 Separable Self-Attention for Mobile Vision Transformers Sachin Mehta, Mohammad Rastegari
NeurIPSW 2023 TiC-CLIP: Continual Training of CLIP Models Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari, Raviteja Vemulapalli, Sachin Mehta, Oncel Tuzel, Vaishaal Shankar, Fartash Faghri
ICLR 2022 MobileViT: Light-Weight, General-Purpose, and Mobile-Friendly Vision Transformer Sachin Mehta, Mohammad Rastegari
ECCV 2022 SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks Chien-Yu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan, Maxwell Horton, Mohammad Rastegari
ICLR 2021 DeLighT: Deep and Light-Weight Transformer Sachin Mehta, Marjan Ghazvininejad, Srinivasan Iyer, Luke Zettlemoyer, Hannaneh Hajishirzi
ICLR 2020 DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling Sachin Mehta, Rik Koncel-Kedziorski, Mohammad Rastegari, Hannaneh Hajishirzi
WACV 2018 DeepSolarEye: Power Loss Prediction and Weakly Supervised Soiling Localization via Fully Convolutional Networks for Solar Panels Sachin Mehta, Amar P. Azad, Saneem A. Chemmengath, Vikas Raykar, Shivkumar Kalyanaraman
ECCV 2018 ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation Sachin Mehta, Mohammad Rastegari, Anat Caspi, Linda Shapiro, Hannaneh Hajishirzi
WACV 2018 Learning to Segment Breast Biopsy Whole Slide Images Sachin Mehta, Ezgi Mercan, Jamen Bartlett, Donald L. Weaver, Joann G. Elmore, Linda G. Shapiro
WACV 2016 Region Graph Based Method for Multi-Object Detection and Tracking Using Depth Cameras Sachin Mehta, Balakrishnan Prabhakaran