Learning Greedy Policies for the Easy-First Framework

Jun Xie; Chao Ma; Janardhan Rao Doppa; Prashanth Mannem; Xiaoli Fern; Thomas G. Dietterich; Prasad Tadepalli

doi:10.1609/aaai.v29i1.9509

Authors

Jun Xie Oregon State University
Chao Ma Oregon State University
Janardhan Rao Doppa Washington State University
Prashanth Mannem Oregon State University
Xiaoli Fern Oregon State University
Thomas G. Dietterich Oregon State University
Prasad Tadepalli Oregon State University

DOI:

https://doi.org/10.1609/aaai.v29i1.9509

Keywords:

Structured Prediction, Learning for Search, Imitation Learning, Coreference Resolution

Abstract

Easy-first, a search-based structured prediction approach, has been applied to many NLP tasks including dependency parsing and coreference resolution. This approach employs a learned greedy policy (action scoring function) to make easy decisions first, which constrains the remaining decisions and makes them easier. We formulate greedy policy learning in the Easy-first approach as a novel non-convex optimization problem and solve it via an efficient Majorization Minimizatoin (MM) algorithm. Results on within-document coreference and cross-document joint entity and event coreference tasks demonstrate that the proposed approach achieves statistically significant performance improvement over existing training regimes for Easy-first and is less susceptible to overfitting.

Learning Greedy Policies for the Easy-First Framework

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information