ML Anthology
Sun, Jiuding
3 publications
ICLR 2025
HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks
Jiuding Sun, Jing Huang, Sidharth Baskaran, Karel D'Oosterlinck, Christopher Potts, Michael Sklar, Atticus Geiger
ICLR 2024
Evaluating the Zero-Shot Robustness of Instruction-Tuned Language Models
Jiuding Sun, Chantal Shaib, Byron C Wallace
AAAI 2023
Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing
Lunyiu Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai