Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation

Aman Goel; Craig Knoblock; Kristina Lerman

doi:10.1609/aaai.v25i1.8066

Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation

Authors

Aman Goel USC Information Sciences Institute
Craig Knoblock USC Information Sciences Institute
Kristina Lerman USC Information Sciences Institute

DOI:

https://doi.org/10.1609/aaai.v25i1.8066

Abstract

Automatic semantic annotation of structured data enables unsupervised integration of data from heterogeneous sources but is difficult to perform accurately due to the presence of many numeric fields and proper-noun fields that do not allow reference-based approaches and the absence of natural language text that prevents the use of language-based approaches. In addition, several of these semantic types have multiple heterogeneous representations, while sharing syntactic structure with other types. In this work, we propose a new approach to use conditional random fields (CRFs) to perform semantic annotation of structured data that takes advantage of the structure and labels of the tokens for higher accuracy of field labeling, while still allowing the use of exact inference techniques. We compare our approach with a linear-CRF based model that only labels fields and also with a regular-expression based approach.

Downloads

Published

2011-08-04

How to Cite

Goel, A., Knoblock, C., & Lerman, K. (2011). Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation. Proceedings of the AAAI Conference on Artificial Intelligence, 25(1), 1784–1785. https://doi.org/10.1609/aaai.v25i1.8066

Download Citation

Issue

Vol. 25 No. 1 (2011): Twenty-Fifth AAAI Conference on Artificial Intelligence

Section

Student Abstracts and Posters

Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information