ML Anthology
Authors
Search
About
Shah, Harshay
12 publications
ICML
2025
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Samira Abnar
,
Harshay Shah
,
Dan Busbridge
,
Alaaeldin El-Nouby
,
Joshua M. Susskind
,
Vimal Thilak
ICLRW
2025
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Samira Abnar
,
Harshay Shah
,
Dan Busbridge
,
Alaaeldin El-Nouby
,
Joshua M. Susskind
,
Vimal Thilak
NeurIPS
2024
ContextCite: Attributing Model Generation to Context
Benjamin Cohen-Wang
,
Harshay Shah
,
Kristian Georgiev
,
Aleksander MÄ…dry
ICMLW
2024
ContextCite: Attributing Model Generation to Context
Benjamin Cohen-Wang
,
Harshay Shah
,
Kristian Georgiev
,
Aleksander Madry
ICMLW
2024
ContextCite: Attributing Model Generation to Context
Benjamin Cohen-Wang
,
Harshay Shah
,
Kristian Georgiev
,
Aleksander Madry
ICML
2024
Decomposing and Editing Predictions by Modeling Model Computation
Harshay Shah
,
Andrew Ilyas
,
Aleksander Madry
ICMLW
2024
Decomposing and Editing Predictions by Modeling Model Computation
Harshay Shah
,
Andrew Ilyas
,
Aleksander Madry
NeurIPSW
2024
Decomposing and Editing Predictions by Modeling Model Computation
Harshay Shah
,
Andrew Ilyas
,
Aleksander Madry
ICML
2023
ModelDiff: A Framework for Comparing Learning Algorithms
Harshay Shah
,
Sung Min Park
,
Andrew Ilyas
,
Aleksander Madry
NeurIPSW
2022
A Unified Framework for Comparing Learning Algorithms
Harshay Shah
,
Sung Min Park
,
Andrew Ilyas
,
Aleksander Madry
NeurIPS
2021
Do Input Gradients Highlight Discriminative Features?
Harshay Shah
,
Prateek Jain
,
Praneeth Netrapalli
NeurIPS
2020
The Pitfalls of Simplicity Bias in Neural Networks
Harshay Shah
,
Kaustav Tamuly
,
Aditi Raghunathan
,
Prateek Jain
,
Praneeth Netrapalli