Jin, Helen

2 publications

NeurIPS 2025. Probabilistic Stability Guarantees for Feature Attributions. Helen Jin, Anton Xue, Weiqiu You, Surbhi Goel, Eric Wong
DMLR 2025. The FIX Benchmark: Extracting Features Interpretable to eXperts. Helen Jin, Shreya Havaldar, Chaehyeon Kim, Anton Xue, Weiqiu You, Helen Qu, Marco Gatti, Daniel A Hashimoto, Bhuvnesh Jain, Amin Madani, Masao Sako, Lyle Ungar, Eric Wong