Monitoring Primitive Interactions During the Training of DNNs

Jie Ren; Xinhao Zheng; Jiyu Liu; Andrew Lizarraga; Ying Nian Wu; Liang Lin; Quanshi Zhang

doi:10.1609/aaai.v39i19.34223

Authors

Jie Ren Shanghai Jiao Tong University
Xinhao Zheng Shanghai Jiao Tong University
Jiyu Liu Shanghai Jiao Tong University Dartmouth College
Andrew Lizarraga University of California, Los Angeles
Ying Nian Wu University of California, Los Angeles
Liang Lin Sun Yat-Sen University
Quanshi Zhang Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v39i19.34223

Abstract

This paper focuses on the newly emerged research topic, i.e., whether the complex decision-making logic of a DNN can be mathematically summarized into a few simple logics. Beyond the explanation of a static DNN, in this paper, we hope to show that the seemingly complex learning dynamics of a DNN can be faithfully represented as the change of a few primitive interaction patterns encoded by the DNN. Therefore, we redefine the interaction of principal feature components in intermediate-layer features, which enables us to concisely summarize the highly complex dynamics of interactions throughout the learning of the DNN. The mathematical faithfulness of the new interaction is experimentally verified. From the perspective of learning efficiency, we find that the interactions naturally belong to five groups (reliable, withdrawn, forgotten, betraying, and fluctuating interactions), each representing a distinct type of dynamics of an interaction being learned and/or being forgotten. This provides deep insights into the learning process of a DNN.

Monitoring Primitive Interactions During the Training of DNNs

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information