Chi, Jianfeng

7 publications

ICLR 2025. Backtracking Improves Generation Safety. Yiming Zhang, Jianfeng Chi, Hailey Nguyen, Kartikeya Upasani, Daniel M. Bikel, Jason E Weston, Eric Michael Smith
ICLR 2025. Persistent Pre-Training Poisoning of LLMs. Yiming Zhang, Javier Rando, Ivan Evtimov, Jianfeng Chi, Eric Michael Smith, Nicholas Carlini, Florian Tramèr, Daphne Ippolito
NeurIPS 2025. Shape It up! Restoring LLM Safety During Finetuning. ShengYun Peng, Pin-Yu Chen, Jianfeng Chi, Seongmin Lee, Duen Horng Chau
ICLR 2024. FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods. Xiaotian Han, Jianfeng Chi, Yu Chen, Qifan Wang, Han Zhao, Na Zou, Xia Hu
AISTATS 2022. Towards Return Parity in Markov Decision Processes. Jianfeng Chi, Jian Shen, Xinyi Dai, Weinan Zhang, Yuan Tian, Han Zhao
ICML 2021. Understanding and Mitigating Accuracy Disparity in Regression. Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon, Han Zhao
NeurIPS 2020. Trade-Offs and Guarantees of Adversarial Representation Learning for Information Obfuscation. Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon