Learning Pattern-Based Extractors from Natural Language and Knowledge Graphs: Applying Large Language Models to Wikipedia and Linked Open Data

Authors

  • Célian Ringwald Université Côte d’Azur, Inria, CNRS, I3S

DOI:

https://doi.org/10.1609/aaai.v38i21.30406

Keywords:

NLP: Information Extraction, NLP: Generation, DMKM: Knowledge Acquisition From The Web, NLP: Large Language Models, DMKM: Linked Open Data Knowledge Graphs & KB Completion

Abstract

Seq-to-seq transformer models have recently been successfully used for relation extraction, showing their flexibility, effectiveness, and scalability on that task. In this context, knowledge graphs aligned with Wikipedia such as DBpedia and Wikidata give us the opportunity to leverage existing texts and corresponding RDF graphs in order to extract, from these texts, the knowledge that is missing in the corresponding graphs and meanwhile improve their coverage. The goal of my thesis is to learn efficient extractors targeting specific RDF patterns and to do so by leveraging the latest language models and the dual base formed by Wikipedia on the one hand, and DBpedia and Wikidata on the other hand.

Downloads

Published

2024-03-24

How to Cite

Ringwald, C. (2024). Learning Pattern-Based Extractors from Natural Language and Knowledge Graphs: Applying Large Language Models to Wikipedia and Linked Open Data. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23411-23412. https://doi.org/10.1609/aaai.v38i21.30406