Sajjad, Hassan

15 publications

ICLR 2025 Data-Centric Prediction Explanation via Kernelized Stein Discrepancy Mahtab Sarvmaili, Hassan Sajjad, Ga Wu
NeurIPS 2025 Dependency Parsing Is More Parameter-Efficient with Normalization Paolo Gajo, Domenic Rosati, Hassan Sajjad, Alberto Barrón-Cedeño
ICML 2025 Explaining the Role of Intrinsic Dimensionality in Adversarial Training Enes Altinisik, Safa Messaoud, Husrev Taha Sencar, Hassan Sajjad, Sanjay Chawla
ICML 2025 Resolving Lexical Bias in Model Editing Hammad Rizwan, Domenic Rosati, Ga Wu, Hassan Sajjad
NeurIPSW 2024 Latent Concept-Based Explanation of NLP Models Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad
NeurIPS 2024 Representation Noising: A Defence Mechanism Against Harmful Finetuning Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz
NeurIPS 2024 SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations Sri Harsha Dumpala, Aman Jaiswal, Chandramouli Sastry, Evangelos Milios, Sageev Oore, Hassan Sajjad
AAAI 2023 ConceptX: A Framework for Latent Concept Analysis Firoj Alam, Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Abdul Rafae Khan, Jia Xu
JMLR 2023 Discovering Salient Neurons in Deep NLP Models Nadir Durrani, Fahim Dalvi, Hassan Sajjad
NeurIPS 2023 Evaluating Neuron Interpretation Methods of NLP Models Yimin Fan, Fahim Dalvi, Nadir Durrani, Hassan Sajjad
ICLR 2023 Learning Uncertainty for Unknown Domains with Zero-Target-Assumption Yu Yu, Hassan Sajjad, Jia Xu
ICLR 2022 Discovering Latent Concepts Learned in BERT Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad
ICLR 2019 Identifying and Controlling Important Neurons in Neural Machine Translation Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James Glass
AAAI 2019 NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass
AAAI 2019 What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass