Learning Structured Representation for Text Classification via Reinforcement Learning

Tianyang Zhang; Minlie Huang; Li Zhao

doi:10.1609/aaai.v32i1.12047

Authors

Tianyang Zhang Tsinghua University
Minlie Huang Tsinghua University
Li Zhao Microsoft Research Asia

DOI:

https://doi.org/10.1609/aaai.v32i1.12047

Keywords:

Reinforcement Learning, Structure, Text Classification

Abstract

Representation learning is a fundamental problem in natural language processing. This paper studies how to learn a structured representation for text classification. Unlike most existing representation models that either use no structure or rely on pre-specified structures, we propose a reinforcement learning (RL) method to learn sentence representation by discovering optimized structures automatically. We demonstrate two attempts to build structured representation: Information Distilled LSTM (ID-LSTM) and Hierarchically Structured LSTM (HS-LSTM). ID-LSTM selects only important, task-relevant words, and HS-LSTM discovers phrase structures in a sentence. Structure discovery in the two representation models is formulated as a sequential decision problem: current decision of structure discovery affects following decisions, which can be addressed by policy gradient RL. Results show that our method can learn task-friendly representations by identifying important words or task-relevant structures without explicit structure annotations, and thus yields competitive performance.

Learning Structured Representation for Text Classification via Reinforcement Learning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription