MAPS-KB: A Million-Scale Probabilistic Simile Knowledge Base

Authors

  • Qianyu He Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
  • Xintao Wang Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
  • Jiaqing Liang School of Data Science, Fudan University
  • Yanghua Xiao Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University Fudan-Aishu Cognitive Intelligence Joint Research Center, Shanghai, China

DOI:

https://doi.org/10.1609/aaai.v37i5.25787

Keywords:

KRR: Knowledge Acquisition, KRR: Knowledge Engineering, SNLP: Language Models

Abstract

The ability to understand and generate similes is an imperative step to realize human-level AI. However, there is still a considerable gap between machine intelligence and human cognition in similes, since deep models based on statistical distribution tend to favour high-frequency similes. Hence, a large-scale symbolic knowledge base of similes is required, as it contributes to the modeling of diverse yet unpopular similes while facilitating additional evaluation and reasoning. To bridge the gap, we propose a novel framework for large-scale simile knowledge base construction, as well as two probabilistic metrics which enable an improved understanding of simile phenomena in natural language. Overall, we construct MAPS-KB, a million-scale probabilistic simile knowledge base, covering 4.3 million triplets over 0.4 million terms from 70 GB corpora. We conduct sufficient experiments to justify the effectiveness and necessity of the methods of our framework. We also apply MAPS-KB on three downstream tasks to achieve state-of-the-art performance, further demonstrating the value of MAPS-KB. Resources of MAPS-KB are publicly available at https://github.com/Abbey4799/MAPS-KB.

Downloads

Published

2023-06-26

How to Cite

He, Q., Wang, X., Liang, J., & Xiao, Y. (2023). MAPS-KB: A Million-Scale Probabilistic Simile Knowledge Base. Proceedings of the AAAI Conference on Artificial Intelligence, 37(5), 6398-6406. https://doi.org/10.1609/aaai.v37i5.25787

Issue

Section

AAAI Technical Track on Knowledge Representation and Reasoning