Deng, Boyi

2 publications

ICLR 2026. SASFT: Sparse Autoencoder-Guided Supervised Finetuning to Mitigate Unexpected Code-Switching in LLMs. Boyi Deng, Yu Wan, Baosong Yang, Fei Huang, Wenjie Wang, Fuli Feng.
AAAI 2025. CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG. Boyi Deng, Wenjie Wang, Fengbin Zhu, Qifan Wang, Fuli Feng.