Su, DiJia

4 publications

ICLR 2025 Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces DiJia Su, Sainbayar Sukhbaatar, Michael Rabbat, Yuandong Tian, Qinqing Zheng
ICML 2025 Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Dijia Su, Hanlin Zhu, Yingchen Xu, Jiantao Jiao, Yuandong Tian, Qinqing Zheng
ICLRW 2025 Training Large Language Models to Reason in a Continuous Latent Space Shibo Hao, Sainbayar Sukhbaatar, DiJia Su, Xian Li, Zhiting Hu, Jason E Weston, Yuandong Tian
ICML 2020 ConQUR: Mitigating Delusional Bias in Deep Q-Learning Dijia Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier