Non-Linear Regression for Bag-of-Words Data via Gaussian Process Latent Variable Set Model

Authors

  • Yuya Yoshikawa Nara Institute of Science and Technology
  • Tomoharu Iwata NTT Corporation
  • Hiroshi Sawada NTT Corporation

DOI:

https://doi.org/10.1609/aaai.v29i1.9615

Abstract

Gaussian process (GP) regression is a widely used method for non-linear prediction.The performance of the GP regression depends on whether it can properly capture the covariance structure of target variables, which is represented by kernels between input data.However, when the input is represented as a set of features, e.g. bag-of-words, it is difficult to calculate desirable kernel values because the co-occurrence of different but relevant words cannot be reflected in the kernel calculation.To overcome this problem, we propose a Gaussian process latent variable set model (GP-LVSM), which is a non-linear regression model effective for bag-of-words data.With the GP-LVSM, a latent vector is associated with each word, and each document is represented as a distribution of the latent vectors for words appearing in the document. We efficiently represent the distributions by using the framework of kernel embeddings of distributions that can hold high-order moment information of distributions without need for explicit density estimation.By learning latent vectors so as to maximize the posterior probability, kernels that reflect relations between words are obtained, and also words are visualized in a low-dimensional space.In experiments using 25 item review datasets, we demonstrate the effectiveness of the GP-LVSM in prediction and visualization.

Downloads

Published

2015-02-21

How to Cite

Yoshikawa, Y., Iwata, T., & Sawada, H. (2015). Non-Linear Regression for Bag-of-Words Data via Gaussian Process Latent Variable Set Model. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9615

Issue

Section

Main Track: Novel Machine Learning Algorithms