Extracting Ontological Selectional Preferences for Non-Pertainym Adjectives from the Google Corpus

Authors

  • John Tanner University of Central Florida
  • Fernando Gomez University of Central Florida

DOI:

https://doi.org/10.1609/aaai.v24i1.7719

Abstract

While there has been much research into using selectional preferences for word sense disambiguation (WSD), much difficulty has been encountered. To facilitate study into this difficulty and aid in WSD in general, a database of the selectional preferences of non-pertainym prenomial adjectives extracted from the Google Web 1T 5-gram Corpus is proposed. A variety of methods for computing the preferences of each adjective over a set of noun categories from WordNet have been evaluated via simulated disambiguation of pseudohomonyms. The best method of these involves computing for each noun category the ratio of single-word common (i.e. not proper) noun lemma types which can co-occur with a given adjective to the number of single-word common noun lemmata whose estimated frequency is greater than a threshold based on the frequency of the adjective. The database produced by this procedure will be made available to the public.

Downloads

Published

2010-07-04

How to Cite

Tanner, J., & Gomez, F. (2010). Extracting Ontological Selectional Preferences for Non-Pertainym Adjectives from the Google Corpus. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 1033-1038. https://doi.org/10.1609/aaai.v24i1.7719

Issue

Section

AAAI Technical Track: Natural Language Processing