RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM
Abstract
Animation has gained significant interest in the recent film and TV industry. Despite the success of advanced video generation models like Sora, Kling, and CogVideoX in generating natural videos, they lack the same effectiveness in handling animation videos. Evaluating animation video generation is also a great challenge due to its unique artist styles, violating the laws of physics and exaggerated motions. In this paper, we present a comprehensive system, AniSora, designed for animation video generation, which includes a data processing pipeline, a controllable generation model, and an evaluation benchmark. Supported by the data processing pipeline with over 10M high-quality data, the generation model incorporates a spatiotemporal mask module to facilitate key animation production functions such as image-to-video generation, frame interpolation, and localized image-guided animation. We also collect an evaluation benchmark of 948 various animation videos, with specifically developed metrics for animation video generation. Our entire project is publicly available on https://github.com/bilibili/Index-anisora/tree/main
Cite
Text
Song et al. "RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/141Markdown
[Song et al. "RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/song2024ijcai-robofusion/) doi:10.24963/ijcai.2024/141BibTeX
@inproceedings{song2024ijcai-robofusion,
title = {{RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM}},
author = {Song, Ziying and Zhang, Guoxing and Liu, Lin and Yang, Lei and Xu, Shaoqing and Jia, Caiyan and Jia, Feiyang and Wang, Li},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {1272-1280},
doi = {10.24963/ijcai.2024/141},
url = {https://mlanthology.org/ijcai/2024/song2024ijcai-robofusion/}
}