Han, Mingfei

12 publications

TMLR 2026 Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos Meng Cao, Haoran Tang, Haoze Zhao, Mingfei Han, Ruyang Liu, Qiang Sun, Xiaojun Chang, Ian Reid, Xiaodan Liang
NeurIPS 2025 PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly Liang Ma, Jiajun Wen, Min Lin, Rongtao Xu, Xiwen Liang, Bingqian Lin, Jun Ma, Yongxin Wang, Ziming Wei, Haokun Lin, Mingfei Han, Meng Cao, Bokui Chen, Ivan Laptev, Xiaodan Liang
CVPR 2025 RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation Mingfei Han, Liang Ma, Kamila Zhumakhanova, Ekaterina Radionova, Jingyi Zhang, Xiaojun Chang, Xiaodan Liang, Ivan Laptev
ICLR 2025 Shot2Story: A New Benchmark for Comprehensive Understanding of Multi-Shot Videos Mingfei Han, Linjie Yang, Xiaojun Chang, Lina Yao, Heng Wang
NeurIPS 2025 WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception Zhiheng Liu, Xueqing Deng, Shoufa Chen, Angtian Wang, Qiushan Guo, Mingfei Han, Zeyue Xue, Mengzhao Chen, Ping Luo, Linjie Yang
ECCV 2024 LongVLM: Efficient Long Video Understanding via Large Language Models Yuetian Weng, Mingfei Han, Haoyu He, Xiaojun Chang, Bohan Zhuang
CVPR 2024 Video Recognition in Portrait Mode Mingfei Han, Linjie Yang, Xiaojie Jin, Jiashi Feng, Xiaojun Chang, Heng Wang
ICCV 2023 HTML: Hybrid Temporal-Scale Multimodal Learning Framework for Referring Video Object Segmentation Mingfei Han, Yali Wang, Zhihui Li, Lina Yao, Xiaojun Chang, Yu Qiao
NeurIPS 2023 Mask Propagation for Efficient Video Semantic Segmentation Yuetian Weng, Mingfei Han, Haoyu He, Mingjie Li, Lina Yao, Xiaojun Chang, Bohan Zhuang
ECCV 2022 An Efficient Spatio-Temporal Pyramid Transformer for Action Detection Yuetian Weng, Zizheng Pan, Mingfei Han, Xiaojun Chang, Bohan Zhuang
CVPR 2022 Dual-AI: Dual-Path Actor Interaction Learning for Group Activity Recognition Mingfei Han, David Junhao Zhang, Yali Wang, Rui Yan, Lina Yao, Xiaojun Chang, Yu Qiao
ECCV 2020 Mining Inter-Video Proposal Relations for Video Object Detection Mingfei Han, Yali Wang, Xiaojun Chang, Yu Qiao