Huang, David

4 publications

ICML 2025. Improving LLM Safety Alignment with Dual-Objective Optimization. Xuandong Zhao, Will Cai, Tianneng Shi, David Huang, Licong Lin, Song Mei, Dawn Song
ICLR 2024. PubDef: Defending Against Transfer Attacks from Public Models. Chawin Sitawarin, Jaewon Chang, David Huang, Wesson Altoyan, David Wagner
NeurIPSW 2024. Stronger Universal and Transfer Attacks by Suppressing Refusals. David Huang, Avidan Shah, Alexandre Araujo, David Wagner, Chawin Sitawarin
NeurIPS 2023. Data-Driven Network Neuroscience: On Data Collection and Benchmark. Jiaxing Xu, Yunhan Yang, David Huang, Sophi Shilpa Gururajapathy, Yiping Ke, Miao Qiao, Alan Wang, Haribalan Kumar, Josh McGeown, Eryn Kwon