Chen, Qiguang
13 publications
ICLR
2026
Can LLMs Refuse Questions They Do Not Know? Measuring Knowledge-Aware Refusal in Factual Tasks
IJCAI
2025
Improving Consistency Identification in Task-Oriented Dialogue Through Multi-Agent Collaboration
ICML
2025
The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Analysis of Orthogonal Safety Directions