Jia, Jinghan
19 publications
NeurIPS
2025
The Fragile Truth of Saliency: Improving LLM Input Attribution via Attention Bias Optimization
NeurIPS
2024
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models
NeurIPS
2024
UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion Models
NeurIPS
2024
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models