Xu, Guangxuan

5 publications

NeurIPS 2025 Rollout Roulette: A Probabilistic Inference Approach to Inference-Time Scaling of LLMs Using Particle-Based Monte Carlo Methods Isha Puri, Shivchander Sudalairaj, Guangxuan Xu, Abhishek Bhandwaldar, Kai Xu, Akash Srivastava
ICLR 2025 Unveiling the Secret Recipe: A Guide for Supervised Fine-Tuning Small LLMs Aldo Pareja, Nikhil Shivakumar Nayak, Hao Wang, Krishnateja Killamsetty, Shivchander Sudalairaj, Wenlong Zhao, Seungwook Han, Abhishek Bhandwaldar, Guangxuan Xu, Kai Xu, Ligong Han, Luke Inglis, Akash Srivastava
ICML 2024 BRAIn: Bayesian Reward-Conditioned Amortized Inference for Natural Language Generation from Feedback Gaurav Pandey, Yatin Nandwani, Tahira Naseem, Mayank Mishra, Guangxuan Xu, Dinesh Raghu, Sachindra Joshi, Asim Munawar, Ramón Fernandez Astudillo
ICLR 2022 Non-Parallel Text Style Transfer with Self-Parallel Supervision Ruibo Liu, Chongyang Gao, Chenyan Jia, Guangxuan Xu, Soroush Vosoughi
AAAI 2021 Mitigating Political Bias in Language Models Through Reinforced Calibration Ruibo Liu, Chenyan Jia, Jason Wei, Guangxuan Xu, Lili Wang, Soroush Vosoughi