Approach for Video Classification with Multi-Label on YouTube-8m Dataset
Abstract
Video traffic is increasing at a considerable rate due to the spread of personal media and advancements in media technology. Accordingly, there is a growing need for techniques to automatically classify moving images. This paper use NetVLAD and NetFV models and the Huber loss function for video classification problem and YouTube-8M dataset to verify the experiment. We tried various attempts according to the dataset and optimize hyperparameters, ultimately obtain a GAP score of 0.8668.
Cite
Text
Shin et al. "Approach for Video Classification with Multi-Label on YouTube-8m Dataset." European Conference on Computer Vision Workshops, 2018. doi:10.1007/978-3-030-11018-5_29Markdown
[Shin et al. "Approach for Video Classification with Multi-Label on YouTube-8m Dataset." European Conference on Computer Vision Workshops, 2018.](https://mlanthology.org/eccvw/2018/shin2018eccvw-approach/) doi:10.1007/978-3-030-11018-5_29BibTeX
@inproceedings{shin2018eccvw-approach,
title = {{Approach for Video Classification with Multi-Label on YouTube-8m Dataset}},
author = {Shin, Kwangsoo and Jeon, Junhyeong and Lee, Seungbin and Lim, Boyoung and Jeong, Minsoo and Nang, Jongho},
booktitle = {European Conference on Computer Vision Workshops},
year = {2018},
pages = {317-324},
doi = {10.1007/978-3-030-11018-5_29},
url = {https://mlanthology.org/eccvw/2018/shin2018eccvw-approach/}
}