Learning Structured Representation for Text Classification via Reinforcement Learning

Authors

  • Tianyang Zhang Tsinghua University
  • Minlie Huang Tsinghua University
  • Li Zhao Microsoft Research Asia

DOI:

https://doi.org/10.1609/aaai.v32i1.12047

Keywords:

Reinforcement Learning, Structure, Text Classification

Abstract

Representation learning is a fundamental problem in natural language processing. This paper studies how to learn a structured representation for text classification. Unlike most existing representation models that either use no structure or rely on pre-specified structures, we propose a reinforcement learning (RL) method to learn sentence representation by discovering optimized structures automatically. We demonstrate two attempts to build structured representation: Information Distilled LSTM (ID-LSTM) and Hierarchically Structured LSTM (HS-LSTM). ID-LSTM selects only important, task-relevant words, and HS-LSTM discovers phrase structures in a sentence. Structure discovery in the two representation models is formulated as a sequential decision problem: current decision of structure discovery affects following decisions, which can be addressed by policy gradient RL. Results show that our method can learn task-friendly representations by identifying important words or task-relevant structures without explicit structure annotations, and thus yields competitive performance.

Downloads

Published

2018-04-26

How to Cite

Zhang, T., Huang, M., & Zhao, L. (2018). Learning Structured Representation for Text Classification via Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12047