The Analysis of Deep Neural Networks by Information Theory: From Explainability to Generalization

Shujian Yu

doi:10.1609/aaai.v37i13.26829

The Analysis of Deep Neural Networks by Information Theory: From Explainability to Generalization

Authors

Shujian Yu Department of Computer Science, Vrije Universiteit Amsterdam Department of Physics and Technology, UiT - The Arctic University of Norway

DOI:

https://doi.org/10.1609/aaai.v37i13.26829

Keywords:

New Faculty Highlights

Abstract

Despite their great success in many artificial intelligence tasks, deep neural networks (DNNs) still suffer from a few limitations, such as poor generalization behavior for out-of-distribution (OOD) data and the "black-box" nature. Information theory offers fresh insights to solve these challenges. In this short paper, we briefly review the recent developments in this area, and highlight our contributions.

Downloads

Published

2024-07-15

How to Cite

Yu, S. (2024). The Analysis of Deep Neural Networks by Information Theory: From Explainability to Generalization. Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 15462-15462. https://doi.org/10.1609/aaai.v37i13.26829

Download Citation

Issue

Vol. 37 No. 13: AAAI-23 Special Programs, IAAI-23, EAAI-23, Student Papers and Demonstrations

Section

New Faculty Highlights

The Analysis of Deep Neural Networks by Information Theory: From Explainability to Generalization

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription