A Multi-Pass Sieve for Name Normalization

Authors

  • Jennifer D'Souza University of Texas at Dallas

DOI:

https://doi.org/10.1609/aaai.v29i1.9729

Keywords:

normalization, data mining, information retrieval

Abstract

We propose a simple multi-pass sieve framework that applies tiers of deterministic normalization modules one at a time from highest to lowest precision for the task of normalizing names. While a sieve based architecture has been shown effective in coreference resolution, it has not yet been applied to the normalization task. We find that even in this task, the approach retains its characteristic features of being simple, and highly modular. In addition, it also proves robust when evaluated on two different kinds of data: clinical notes and biomedical text, by demonstrating high accuracy in normalizing disorder names found in both datasets.

Downloads

Published

2015-03-04

How to Cite

D’Souza, J. (2015). A Multi-Pass Sieve for Name Normalization. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9729