Random Mixed Field Model for Mixed-Attribute Data Restoration

Authors

  • Qiang Li University of Technology Sydney
  • Wei Bian University of Technology Sydney
  • Richard Xu University of Technology Sydney
  • Jane You The Hong Kong Polytechnic University
  • Dacheng Tao University of Technology Sydney

DOI:

https://doi.org/10.1609/aaai.v30i1.10142

Abstract

Noisy and incomplete data restoration is a critical preprocessing step in developing effective learning algorithms, which targets to reduce the effect of noise and missing values in data. By utilizing attribute correlations and/or instance similarities, various techniques have been developed for data denoising and imputation tasks. However, current existing data restoration methods are either specifically designed for a particular task, or incapable of dealing with mixed-attribute data. In this paper, we develop a new probabilistic model to provide a general and principled method for restoring mixed-attribute data. The main contributions of this study are twofold: a) a unified generative model, utilizing a generic random mixed field (RMF) prior, is designed to exploit mixed-attribute correlations; and b) a structured mean-field variational approach is proposed to solve the challenging inference problem of simultaneous denoising and imputation. We evaluate our method by classification experiments on both synthetic data and real benchmark datasets. Experiments demonstrate, our approach can effectively improve the classification accuracy of noisy and incomplete data by comparing with other data restoration methods.

Downloads

Published

2016-02-21

How to Cite

Li, Q., Bian, W., Xu, R., You, J., & Tao, D. (2016). Random Mixed Field Model for Mixed-Attribute Data Restoration. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10142

Issue

Section

Technical Papers: Machine Learning Applications