Tang, Haotian

16 publications

ICLR 2025 Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models Junyu Chen, Han Cai, Junsong Chen, Enze Xie, Shang Yang, Haotian Tang, Muyang Li, Song Han
ICLR 2025 DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads Guangxuan Xiao, Jiaming Tang, Jingwei Zuo, Junxian Guo, Shang Yang, Haotian Tang, Yao Fu, Song Han
ICLR 2025 HART: Efficient Visual Generation with Hybrid Autoregressive Transformer Haotian Tang, Yecheng Wu, Shang Yang, Enze Xie, Junsong Chen, Junyu Chen, Zhuoyang Zhang, Han Cai, Yao Lu, Song Han
ICLR 2025 LongVILA: Scaling Long-Context Visual Language Models for Long Videos Yukang Chen, Fuzhao Xue, Dacheng Li, Qinghao Hu, Ligeng Zhu, Xiuyu Li, Yunhao Fang, Haotian Tang, Shang Yang, Zhijian Liu, Yihui He, Hongxu Yin, Pavlo Molchanov, Jan Kautz, Linxi Fan, Yuke Zhu, Yao Lu, Song Han
CVPR 2025 NVILA: Efficient Frontier Visual Language Models Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Haotian Tang, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Jinyi Hu, Sifei Liu, Ranjay Krishna, Pavlo Molchanov, Jan Kautz, Hongxu Yin, Song Han, Yao Lu
ICLR 2025 SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers Enze Xie, Junsong Chen, Junyu Chen, Han Cai, Haotian Tang, Yujun Lin, Zhekai Zhang, Muyang Li, Ligeng Zhu, Yao Lu, Song Han
ICLR 2025 VILA-U: A Unified Foundation Model Integrating Visual Understanding and Generation Yecheng Wu, Zhuoyang Zhang, Junyu Chen, Haotian Tang, Dacheng Li, Yunhao Fang, Ligeng Zhu, Enze Xie, Hongxu Yin, Li Yi, Song Han, Yao Lu
ICLR 2024 LongLoRA: Efficient Fine-Tuning of Long-Context Large Language Models Yukang Chen, Shengju Qian, Haotian Tang, Xin Lai, Zhijian Liu, Song Han, Jiaya Jia
CVPR 2024 MoST: Multi-Modality Scene Tokenization for Motion Prediction Norman Mu, Jingwei Ji, Zhenpei Yang, Nate Harada, Haotian Tang, Kan Chen, Charles R. Qi, Runzhou Ge, Kratarth Goel, Zoey Yang, Scott Ettinger, Rami Al-Rfou, Dragomir Anguelov, Yin Zhou
ECCV 2024 Sparse Refinement for Efficient High-Resolution Semantic Segmentation Zhijian Liu, Zhuoyang Zhang, Samir Khaki, Shang Yang, Haotian Tang, Chenfeng Xu, Kurt Keutzer, Song Han
CVPR 2023 FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer Zhijian Liu, Xinyu Yang, Haotian Tang, Shang Yang, Song Han
CVPR 2023 SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer Xuanyao Chen, Zhijian Liu, Haotian Tang, Li Yi, Hang Zhao, Song Han
CVPRW 2023 TorchSparse++: Efficient Point Cloud Engine Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han
ECCV 2020 Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution Haotian Tang, Zhijian Liu, Shengyu Zhao, Yujun Lin, Ji Lin, Hanrui Wang, Song Han
NeurIPS 2019 Point-Voxel CNN for Efficient 3D Deep Learning Zhijian Liu, Haotian Tang, Yujun Lin, Song Han
CVPRW 2019 Unsupervised Person Re-Identification with Iterative Self-Supervised Domain Adaptation Haotian Tang, Yiru Zhao, Hongtao Lu