An Extraction and Representation Pipeline for Literary Characters

Funing Yang

doi:10.1609/aaai.v36i11.21709

An Extraction and Representation Pipeline for Literary Characters

Authors

Funing Yang Wellesley College

DOI:

https://doi.org/10.1609/aaai.v36i11.21709

Keywords:

Natural Language Processing, Narrative Understanding, Information Extraction, Information Retrival, Machine Learning

Abstract

Readers of novels need to identify and learn about the characters as they develop an understanding of the plot. The paper presents an end-to-end automated pipeline for literary character identification and ongoing work for extracting and comparing character representations for full-length English novels. The character identification pipeline involves a named entity recognition (NER) module with F1 score of 0.85, a coreference resolution module with F1 score of 0.76, and a disambiguation module using both heuristic and algorithmic approaches. Ongoing work compares event extraction as well as speech extraction pipelines for literary characters representations with case studies. The paper is the first to my knowledge that combines a modular pipeline for automated character identification, representation extraction and comparisons for full-length English novels.

Downloads

Published

2022-06-28

How to Cite

Yang, F. (2022). An Extraction and Representation Pipeline for Literary Characters. Proceedings of the AAAI Conference on Artificial Intelligence, 36(11), 13146–13147. https://doi.org/10.1609/aaai.v36i11.21709

Download Citation

Issue

Vol. 36 No. 11: IAAI-22, EAAI-22, AAAI-22 Special Programs and Special Track, Student Papers and Demonstrations

Section

AAAI Undergraduate Consortium