DC-NAS: Divide-and-Conquer Neural Architecture Search for Multi-Modal Classification

Abstract

Neural architecture search-based multi-modal classification (NAS-MMC) methods can individually obtain the optimal classifier for different multi-modal data sets in an automatic manner. However, most existing NAS-MMC methods are dramatically time consuming due to the requirement for training and evaluating enormous models. In this paper, we propose an efficient evolutionary-based NAS-MMC method called divide-and-conquer neural architecture search (DC-NAS). Specifically, the evolved population is first divided into k+1 sub-populations, and then k sub-populations of them evolve on k small-scale data sets respectively that are obtained by splitting the entire data set using the k-fold stratified sampling technique; the remaining one evolves on the entire data set. To solve the sub-optimal fusion model problem caused by the training strategy of partial data, two kinds of sub-populations that are trained using partial data and entire data exchange the learned knowledge via two special knowledge bases. With the two techniques mentioned above, DC-NAS achieves the training time reduction and classification performance improvement. Experimental results show that DC-NAS achieves the state-of-the-art results in term of classification performance, training efficiency and the number of model parameters than the compared NAS-MMC methods on three popular multi-modal tasks including multi-label movie genre classification, action recognition with RGB and body joints and dynamic hand gesture recognition.

Cite

Text

Liang et al. "DC-NAS: Divide-and-Conquer Neural Architecture Search for Multi-Modal Classification." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I12.29281

Markdown

[Liang et al. "DC-NAS: Divide-and-Conquer Neural Architecture Search for Multi-Modal Classification." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liang2024aaai-dc/) doi:10.1609/AAAI.V38I12.29281

BibTeX

@inproceedings{liang2024aaai-dc,
  title     = {{DC-NAS: Divide-and-Conquer Neural Architecture Search for Multi-Modal Classification}},
  author    = {Liang, Xinyan and Fu, Pinhan and Guo, Qian and Zheng, Keyin and Qian, Yuhua},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {13754-13762},
  doi       = {10.1609/AAAI.V38I12.29281},
  url       = {https://mlanthology.org/aaai/2024/liang2024aaai-dc/}
}