Sun, Jiuding

3 publications

ICLR 2025 HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks Jiuding Sun, Jing Huang, Sidharth Baskaran, Karel D'Oosterlinck, Christopher Potts, Michael Sklar, Atticus Geiger
ICLR 2024 Evaluating the Zero-Shot Robustness of Instruction-Tuned Language Models Jiuding Sun, Chantal Shaib, Byron C Wallace
AAAI 2023 Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing Lunyiu Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai