Hear You Say You: An Efficient Framework for Marine Mammal Sounds' Classification

Abstract

Marine mammals and their ecosystem face significant threats from, for example, military active sonar and marine transportation. To mitigate this harm, early detection and classification of marine mammals are essential. While recent efforts have utilized spectrogram analysis and machine learning techniques, there remain challenges in their efficiency. Therefore, we propose a novel knowledge distillation framework, named XCFSMN, for this problem. We construct a teacher model that fuses the features extracted from an X-vector extractor, a DenseNet and Cross-Covariance attended compact Feed-Forward Sequential Memory Network (cFSMN). The teacher model transfers knowledge to a simpler cFSMN model through a temperature-cooling strategy for efficient learning. Compared to multiple convolutional neural network backbones and transformers, the proposed framework achieves state-of-the-art efficiency and performance. The improved model size is approximately 20 times smaller and the inference time can be 10 times shorter without affecting the model’s accuracy.

Cite

Text

Liu et al. "Hear You Say You: An Efficient Framework for Marine Mammal Sounds' Classification." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I20.30230

Markdown

[Liu et al. "Hear You Say You: An Efficient Framework for Marine Mammal Sounds' Classification." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liu2024aaai-hear/) doi:10.1609/AAAI.V38I20.30230

BibTeX

@inproceedings{liu2024aaai-hear,
  title     = {{Hear You Say You: An Efficient Framework for Marine Mammal Sounds' Classification}},
  author    = {Liu, Xiangrui and Liu, Xiaoou and Du, Shan and Cheng, Julian},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {22250-22257},
  doi       = {10.1609/AAAI.V38I20.30230},
  url       = {https://mlanthology.org/aaai/2024/liu2024aaai-hear/}
}