Wang, Boxin
15 publications
NeurIPS
2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
NeurIPS
2021
G-PATE: Scalable Differentially Private Data Generator via Private Aggregation of Teacher Discriminators
ICLR
2021
InfoBERT: Improving Robustness of Language Models from an Information Theoretic Perspective
ICML
2021
Uncovering the Connections Between Adversarial Transferability and Knowledge Transferability