Yi, Xiaoyuan
14 publications
NeurIPS
2025
Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models
NeurIPS
2025
ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation
ICML
2025
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
IJCAI
2023
KEST: Kernel Distance Based Efficient Self-Training for Improving Controllable Text Generation