Shen, Yikang

40 publications

ICLR 2025 API Pack: A Massive Multi-Programming Language Dataset for API Call Generation Zhen Guo, Adriana Meza Soria, Wei Sun, Yikang Shen, Rameswar Panda
TMLR 2025 Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization Wei Liu, Anweshit Panda, Ujwal Pandey, Christopher Brissette, Yikang Shen, George Slota, Naigang Wang, Jie Chen, Yangyang Xu
ICLRW 2025 Diversity Measurement and Subset Selection for Instruction Tuning Datasets Peiqi Wang, Yikang Shen, Zhen Guo, Matthew Stallone, Yoon Kim, Polina Golland, Rameswar Panda
ICML 2025 LaMAGIC2: Advanced Circuit Formulations for Language Model-Based Analog Topology Generation Chen-Chia Chang, Wan-Hsuan Lin, Yikang Shen, Yiran Chen, Xin Zhang
NeurIPS 2025 PaTH Attention: Position Encoding via Accumulating Householder Transformations Songlin Yang, Yikang Shen, Kaiyue Wen, Shawn Tan, Mayank Mishra, Liliang Ren, Rameswar Panda, Yoon Kim
ICLR 2025 Scaling Stick-Breaking Attention: An Efficient Implementation and In-Depth Study Shawn Tan, Songlin Yang, Aaron Courville, Rameswar Panda, Yikang Shen
ICLR 2024 CoVLM: Composing Visual Entities and Relationships in Large Language Models via Communicative Decoding Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan
NeurIPS 2024 Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Zhiqing Sun, Longhui Yu, Yikang Shen, Weiyang Liu, Yiming Yang, Sean Welleck, Chuang Gan
ECCV 2024 FlexAttention for Efficient High-Resolution Vision-Language Models Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan
ICML 2024 Gated Linear Attention Transformers with Hardware-Efficient Training Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, Yoon Kim
NeurIPSW 2024 GraphText: Graph Reasoning in Text Space Jianan Zhao, Le Zhuo, Yikang Shen, Meng Qu, Kai Liu, Michael M. Bronstein, Zhaocheng Zhu, Jian Tang
ICML 2024 LaMAGIC: Language-Model-Based Topology Generation for Analog Integrated Circuits Chen-Chia Chang, Yikang Shen, Shaoze Fan, Jing Li, Shun Zhang, Ningyuan Cao, Yiran Chen, Xin Zhang
NeurIPS 2024 Parallelizing Linear Transformers with the Delta Rule over Sequence Length Songlin Yang, Bailin Wang, Yu Zhang, Yikang Shen, Yoon Kim
ICLR 2024 SALMON: Self-Alignment with Instructable Reward Models Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David Daniel Cox, Yiming Yang, Chuang Gan
NeurIPS 2024 Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training Wenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu
ICLR 2024 The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas
ICMLW 2024 The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas
AAAI 2024 Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Zhiqing Sun, Dan Gutfreund, Chuang Gan
NeurIPS 2023 Adaptive Online Replanning with Diffusion Models Siyuan Zhou, Yilun Du, Shun Zhang, Mengdi Xu, Yikang Shen, Wei Xiao, Dit-Yan Yeung, Chuang Gan
ICLR 2023 Hyper-Decision Transformer for Efficient Online Policy Adaptation Mengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan
CVPR 2023 Mod-SQuAD: Designing Mixtures of Experts as Modular Multi-Task Learners Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan
ICLR 2023 Planning with Large Language Models for Code Generation Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan
NeurIPS 2023 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
ICCV 2023 TextPSG: Panoptic Scene Graph Generation from Textual Descriptions Chengyang Zhao, Yikang Shen, Zhenfang Chen, Mingyu Ding, Chuang Gan
NeurIPSW 2023 The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas
ICLR 2023 Transformer-Patcher: One Mistake Worth One Neuron Zeyu Huang, Yikang Shen, Xiaofeng Zhang, Jie Zhou, Wenge Rong, Zhang Xiong
CVPR 2023 Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan
NeurIPSW 2022 Hyper-Decision Transformer for Efficient Online Policy Adaptation Mengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan
NeurIPSW 2022 Planning with Large Language Models for Code Generation Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan
ICML 2022 Prompting Decision Transformer for Few-Shot Policy Generalization Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua Tenenbaum, Chuang Gan
ICLR 2021 Learning Task Decomposition with Ordered Memory Policy Network Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron Courville, Joshua B. Tenenbaum, Chuang Gan
ICLR 2021 Long Range Arena : A Benchmark for Efficient Transformers Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, Donald Metzler
NeurIPS 2021 Self-Instantiated Recurrent Units with Dynamic Soft Recursion Aston Zhang, Yi Tay, Yikang Shen, Alvin Chan, Shuai Zhang
ICLR 2020 Metagross: Meta Gated Recursive Controller Units for Sequence Modeling Yi Tay, Yikang Shen, Alvin Chan, Yew Soon Ong
NeurIPS 2019 Ordered Memory Yikang Shen, Shawn Tan, Arian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron C. Courville
ICLR 2019 Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks Yikang Shen, Shawn Tan, Alessandro Sordoni, Aaron Courville
ICLR 2018 Neural Language Modeling by Jointly Learning Syntax and Lexicon Yikang Shen, Zhouhan Lin, Chin-wei Huang, Aaron Courville
IJCAI 2017 Exploration of Tree-Based Hierarchical SoftMax for Recurrent Language Models Nan Jiang, Wenge Rong, Min Gao, Yikang Shen, Zhang Xiong
AAAI 2017 Word Embedding Based Correlation Model for Question/Answer Matching Yikang Shen, Wenge Rong, Nan Jiang, Baolin Peng, Jie Tang, Zhang Xiong
AAAI 2015 Question/Answer Matching for CQA System via Combining Lexical and Sequential Information Yikang Shen, Wenge Rong, Zhiwei Sun, Yuanxin Ouyang, Zhang Xiong