Kuo, Chia-Wen

9 publications

CVPRW 2025 Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model Lu Xu, Sijie Zhu, Chunyuan Li, Chia-Wen Kuo, Fan Chen, Xinyao Wang, Guang Chen, Dawei Du, Ye Yuan, Longyin Wen
ICCV 2025 D-Attn: Decomposed Attention for Large Vision-and-Language Model Chia-Wen Kuo, Sijie Zhu, Fan Chen, Xiaohui Shen, Longyin Wen
NeurIPS 2024 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Jiachen Li, Xinyao Wang, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Jitesh Jain, Humphrey Shi, Longyin Wen
CVPR 2023 HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning Chia-Wen Kuo, Zsolt Kira
WACV 2023 Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation Chia-Wen Kuo, Chih-Yao Ma, Judy Hoffman, Zsolt Kira
CVPR 2022 Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning Chia-Wen Kuo, Zsolt Kira
ICLR 2021 Unbiased Teacher for Semi-Supervised Object Detection Yen-Cheng Liu, Chih-Yao Ma, Zijian He, Chia-Wen Kuo, Kan Chen, Peizhao Zhang, Bichen Wu, Zsolt Kira, Peter Vajda
ECCV 2020 FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning Chia-Wen Kuo, Chih-Yao Ma, Jia-Bin Huang, Zsolt Kira
WACV 2019 Data-Efficient Graph Embedding Learning for PCB Component Detection Chia-Wen Kuo, Jacob Ashmore, David Huggins, Zsolt Kira