Leaf-Smoothed Hierarchical Softmax for Ordinal Prediction

Wesley Tansey; Karl Pichotta; James Scott

doi:10.1609/aaai.v32i1.11754

Authors

Wesley Tansey, Columbia University
Karl Pichotta, The University of Texas at Austin
James Scott, The University of Texas at Austin

DOI:

https://doi.org/10.1609/aaai.v32i1.11754

Keywords:

density estimation, deep learning, neural networks

Abstract

We propose a new approach to conditional probability estimation for ordinal labels. First, we present a specialized hierarchical softmax variant, inspired by k-d trees, that leverages the inherent spatial structure of (potentially multivariate) ordinal labels. We then adapt ideas from signal processing on noisy graphs to develop a novel regularizer for such hierarchical softmax models. In a series of simulation studies, the tree structure and the regularizer each independently improve the sample efficiency of a deep learning model. Furthermore, combining the two techniques produces additive gains, and the resulting model does not suffer from the pathologies of other approaches in the literature. We validate our approach empirically on a suite of real-world datasets, in some cases reducing the error by nearly half compared with other popular methods. Our results demonstrate that the method is a powerful new modeling technique for conditional probability estimation of ordinal labels, especially in the low-to-mid sample size regimes often found in the biological and other physical sciences.
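
To make the idea of a tree-structured softmax over ordered labels concrete, here is a minimal sketch for the one-dimensional case. It is an illustration under stated assumptions, not the authors' implementation: the function name `leaf_probabilities`, the midpoint splitting rule, and the preorder indexing of node logits are all hypothetical choices made for this example, and the smoothing regularizer described in the abstract is not shown. In practice the node logits would presumably be produced by a neural network conditioned on the input features.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real-valued logit to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def leaf_probabilities(node_logits, num_bins):
    """Turn K-1 internal-node logits into a distribution over K ordered bins.

    Assumption for this sketch: the bins 0..K-1 are split recursively at the
    midpoint (a one-dimensional k-d-tree-style split), and internal nodes
    consume logits in preorder (node first, then left subtree, then right).
    """
    if num_bins < 1 or len(node_logits) != num_bins - 1:
        raise ValueError("need exactly num_bins - 1 node logits")

    probs = [0.0] * num_bins
    next_node = 0  # index of the next unused internal-node logit

    def descend(lo, hi, mass):
        # Distribute `mass` over the half-open bin range [lo, hi).
        nonlocal next_node
        if hi - lo == 1:
            probs[lo] = mass  # reached a leaf: one ordinal bin
            return
        p_left = sigmoid(node_logits[next_node])
        next_node += 1
        mid = (lo + hi) // 2
        descend(lo, mid, mass * p_left)
        descend(mid, hi, mass * (1.0 - p_left))

    descend(0, num_bins, 1.0)
    return probs


# Usage: eight ordered bins need seven internal-node logits.
uniform = leaf_probabilities([0.0] * 7, 8)
skewed = leaf_probabilities([math.log(3.0), 0.0, 0.0], 4)
```

Because each internal node splits its probability mass between two contiguous ranges of bins, the leaf probabilities always sum to one, and neighboring labels share most of their path through the tree. With all logits set to zero and a power-of-two number of bins, the result is the uniform distribution.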
