Yang, Ruixin

3 publications

ICLR 2026 Do Vision-Language Models Respect Contextual Integrity in Location Disclosure? Ruixin Yang, Ethan Mendes, Arthur Wang, James Hays, Sauvik Das, Wei Xu, Alan Ritter
ICLRW 2024 Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation Ruixin Yang, Dheeraj Rajagopal, Shirley Anugrah Hayati, Bin Hu, Dongyeop Kang
ICLR 2024 Training Socially Aligned Language Models on Simulated Social Interactions Ruibo Liu, Ruixin Yang, Chenyan Jia, Ge Zhang, Diyi Yang, Soroush Vosoughi