Top-Down RST Parsing Utilizing Granularity Levels in Documents

Naoki Kobayashi; Tsutomu Hirao; Hidetaka Kamigaito; Manabu Okumura; Masaaki Nagata

doi:10.1609/aaai.v34i05.6321

Authors

Naoki Kobayashi Tokyo Institute of Technology
Tsutomu Hirao NTT Corporation
Hidetaka Kamigaito Tokyo Institute of Technology
Manabu Okumura Tokyo Institute of Technology
Masaaki Nagata Tokyo Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v34i05.6321

Abstract

Some downstream NLP tasks exploit discourse dependency trees converted from RST trees. To obtain better discourse dependency trees, we need to improve the accuracy of RST trees at the upper parts of the structures. Thus, we propose a novel neural top-down RST parsing method. Then, we exploit three levels of granularity in a document, paragraphs, sentences and Elementary Discourse Units (EDUs), to parse a document accurately and efficiently. The parsing is done in a top-down manner for each granularity level, by recursively splitting a larger text span into two smaller ones while predicting nuclearity and relation labels for the divided spans. The results on the RST-DT corpus show that our method achieved the state-of-the-art results, 87.0 unlabeled span score, 74.6 nuclearity labeled span score, and the comparable result with the state-of-the-art, 60.0 relation labeled span score. Furthermore, discourse dependency trees converted from our RST trees also achieved the state-of-the-art results, 64.9 unlabeled attachment score and 48.5 labeled attachment score.

Top-Down RST Parsing Utilizing Granularity Levels in Documents

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information