Fang, Qingkai

4 publications

NeurIPS 2025 FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing Shoutao Guo, Shaolei Zhang, Qingkai Fang, Zhengrui Ma, Min Zhang, Yang Feng
ICLR 2025 LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Shaolei Zhang, Qingkai Fang, Zhe Yang, Yang Feng
ICLR 2025 Llama-Omni: Seamless Speech Interaction with Large Language Models Qingkai Fang, Shoutao Guo, Yan Zhou, Zhengrui Ma, Shaolei Zhang, Yang Feng
NeurIPS 2023 DASpeech: Directed Acyclic Transformer for Fast and High-Quality Speech-to-Speech Translation Qingkai Fang, Yan Zhou, Yang Feng