Fast Conditional Density Estimation for Quantitative Structure-Activity Relationships

Authors

  • Fabian Buchwald Technische Universität München
  • Tobias Girschick Technische Universität München
  • Eibe Frank University of Waikato
  • Stefan Kramer Technische Universität München

DOI:

https://doi.org/10.1609/aaai.v24i1.7494

Keywords:

chemoinformatics and QSAR, supervised learning, statistical analysis

Abstract

Many methods for quantitative structure-activity relationships (QSARs) deliver point estimates only, without quantifying the uncertainty inherent in the prediction. One way to quantify the uncertainy of a QSAR prediction is to predict the conditional density of the activity given the structure instead of a point estimate. If a conditional density estimate is available, it is easy to derive prediction intervals of activities. In this paper, we experimentally evaluate and compare three methods for conditional density estimation for their suitability in QSAR modeling. In contrast to traditional methods for conditional density estimation, they are based on generic machine learning schemes, more specifically, class probability estimators. Our experiments show that a kernel estimator based on class probability estimates from a random forest classifier is highly competitive with Gaussian process regression, while taking only a fraction of the time for training. Therefore, generic machine-learning based methods for conditional density estimation may be a good and fast option for quantifying uncertainty in QSAR modeling.

Downloads

Published

2010-07-04

How to Cite

Buchwald, F., Girschick, T., Frank, E., & Kramer, S. (2010). Fast Conditional Density Estimation for Quantitative Structure-Activity Relationships. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 1268-1273. https://doi.org/10.1609/aaai.v24i1.7494