Monitoring Primitive Interactions During the Training of DNNs

Abstract

This paper focuses on the newly emerged research topic, i.e., whether the complex decision-making logic of a DNN can be mathematically summarized into a few simple logics. Beyond the explanation of a static DNN, in this paper, we hope to show that the seemingly complex learning dynamics of a DNN can be faithfully represented as the change of a few primitive interaction patterns encoded by the DNN. Therefore, we redefine the interaction of principal feature components in intermediate-layer features, which enables us to concisely summarize the highly complex dynamics of interactions throughout the learning of the DNN. The mathematical faithfulness of the new interaction is experimentally verified. From the perspective of learning efficiency, we find that the interactions naturally belong to five groups (reliable, withdrawn, forgotten, betraying, and fluctuating interactions), each representing a distinct type of dynamics of an interaction being learned and/or being forgotten. This provides deep insights into the learning process of a DNN.

Cite

Text

Ren et al. "Monitoring Primitive Interactions During the Training of DNNs." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I19.34223

Markdown

[Ren et al. "Monitoring Primitive Interactions During the Training of DNNs." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/ren2025aaai-monitoring/) doi:10.1609/AAAI.V39I19.34223

BibTeX

@inproceedings{ren2025aaai-monitoring,
  title     = {{Monitoring Primitive Interactions During the Training of DNNs}},
  author    = {Ren, Jie and Zheng, Xinhao and Liu, Jiyu and Lizarraga, Andrew and Wu, Ying Nian and Lin, Liang and Zhang, Quanshi},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {20183-20191},
  doi       = {10.1609/AAAI.V39I19.34223},
  url       = {https://mlanthology.org/aaai/2025/ren2025aaai-monitoring/}
}