ML Anthology
Chi, Jianfeng
7 publications
ICLR 2025
Backtracking Improves Generation Safety
Yiming Zhang, Jianfeng Chi, Hailey Nguyen, Kartikeya Upasani, Daniel M. Bikel, Jason E Weston, Eric Michael Smith
ICLR 2025
Persistent Pre-Training Poisoning of LLMs
Yiming Zhang, Javier Rando, Ivan Evtimov, Jianfeng Chi, Eric Michael Smith, Nicholas Carlini, Florian Tramèr, Daphne Ippolito
NeurIPS 2025
Shape It up! Restoring LLM Safety During Finetuning
ShengYun Peng, Pin-Yu Chen, Jianfeng Chi, Seongmin Lee, Duen Horng Chau
ICLR 2024
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods
Xiaotian Han, Jianfeng Chi, Yu Chen, Qifan Wang, Han Zhao, Na Zou, Xia Hu
AISTATS 2022
Towards Return Parity in Markov Decision Processes
Jianfeng Chi, Jian Shen, Xinyi Dai, Weinan Zhang, Yuan Tian, Han Zhao
ICML 2021
Understanding and Mitigating Accuracy Disparity in Regression
Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon, Han Zhao
NeurIPS 2020
Trade-Offs and Guarantees of Adversarial Representation Learning for Information Obfuscation
Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon