Probing Linguistic Information for Logical Inference in Pre-trained Language Models
Keywords: Speech & Natural Language Processing (SNLP)
Abstract
Progress in pre-trained language models has led to a surge of impressive results on downstream tasks for natural language understanding. Recent work on probing pre-trained language models has uncovered a wide range of linguistic properties encoded in their contextualized representations. However, it is unclear whether they encode the semantic knowledge that is crucial to symbolic inference methods. We propose a methodology for probing pre-trained language model representations for the knowledge that logical inference systems require but often lack. Our probing datasets cover key types of knowledge used by many symbolic inference systems. We find that (i) pre-trained language models do encode several types of knowledge needed for inference, but some types are not encoded, and (ii) language models can effectively learn the missing knowledge through fine-tuning. Overall, our findings provide insight into which aspects of knowledge for inference language models and their pre-training procedures capture. Moreover, we demonstrate language models' potential as semantic and background knowledge bases for supporting symbolic inference methods.
How to Cite
Chen, Z., & Gao, Q. (2022). Probing Linguistic Information for Logical Inference in Pre-trained Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 36(10), 10509-10517. https://doi.org/10.1609/aaai.v36i10.21294
AAAI Technical Track on Speech and Natural Language Processing