Monitoring Primitive Interactions During the Training of DNNs
DOI:
https://doi.org/10.1609/aaai.v39i19.34223
Abstract
This paper addresses a recently emerged research question: whether the complex decision-making logic of a DNN can be mathematically summarized into a few simple logics. Going beyond the explanation of a static DNN, we aim to show that the seemingly complex learning dynamics of a DNN can be faithfully represented as changes in a few primitive interaction patterns encoded by the DNN. To this end, we redefine the interaction of principal feature components in intermediate-layer features, which enables us to concisely summarize the highly complex dynamics of interactions throughout the learning of the DNN. The mathematical faithfulness of the new interaction is verified experimentally. From the perspective of learning efficiency, we find that the interactions naturally fall into five groups (reliable, withdrawn, forgotten, betraying, and fluctuating interactions), each representing a distinct type of dynamics of an interaction being learned and/or forgotten. This provides deep insights into the learning process of a DNN.
Published
2025-04-11
How to Cite
Ren, J., Zheng, X., Liu, J., Lizarraga, A., Wu, Y. N., Lin, L., & Zhang, Q. (2025). Monitoring Primitive Interactions During the Training of DNNs. Proceedings of the AAAI Conference on Artificial Intelligence, 39(19), 20183–20191. https://doi.org/10.1609/aaai.v39i19.34223
Issue
Section
AAAI Technical Track on Machine Learning V