An Automatic Classification of Book Texts to User-Defined Tags

Authors

  • Sharon Givon Edinburgh University
  • Theresa Wilson Edinburgh University

Abstract

We describe work on automatically assigning labels to books using user-defined tags as the label set. Using supervised learning and exploring both binary and multiclass classification, we train and test classifiers on several sets of features, focusing on the size of the sets, part-of-speech classes and named entities. Results indicate that a binary classifier, trained and tested on a feature space that consists of a limited selection of parts of speech as well as all frequent named entities, achieves a classification precision of 81%, significantly outperforming a baseline which assigns the top-10 most popular tags to each book.

Downloads

Published

2021-09-25

How to Cite

Givon, S., & Wilson, T. (2021). An Automatic Classification of Book Texts to User-Defined Tags. Proceedings of the International AAAI Conference on Web and Social Media, 2(1), 186-187. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/18643