Assumed Density Filtering Methods for Learning Bayesian Neural Networks

Soumya Ghosh; Francesco Delle Fave; Jonathan Yedidia

doi:10.1609/aaai.v30i1.10296

Authors

Soumya Ghosh Disney Research
Francesco Delle Fave Disney Research
Jonathan Yedidia Disney Research

DOI:

https://doi.org/10.1609/aaai.v30i1.10296

Abstract

Buoyed by the success of deep multilayer neural networks, there is renewed interest in scalable learning of Bayesian neural networks. Here, we study algorithms that utilize recent advances in Bayesian inference to efficiently learn distributions over network weights. In particular, we focus on recently proposed assumed density filtering based methods for learning Bayesian neural networks -- Expectation and Probabilistic backpropagation. Apart from scaling to large datasets, these techniques seamlessly deal with non-differentiable activation functions and provide parameter (learning rate, momentum) free learning. In this paper, we first rigorously compare the two algorithms and in the process develop several extensions, including a version of EBP for continuous regression problems and a PBP variant for binary classification. Next, we extend both algorithms to deal with multiclass classification and count regression problems. On a variety of diverse real world benchmarks, we find our extensions to be effective, achieving results competitive with the state-of-the-art.

Assumed Density Filtering Methods for Learning Bayesian Neural Networks

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information