Graph Neural Prompting with Large Language Models

Authors

  • Yijun Tian, University of Notre Dame
  • Huan Song, Amazon
  • Zichen Wang, Amazon
  • Haozhu Wang, Amazon
  • Ziqing Hu, Amazon
  • Fang Wang, Amazon
  • Nitesh V. Chawla, University of Notre Dame
  • Panpan Xu, Amazon

DOI:

https://doi.org/10.1609/aaai.v38i17.29875

Keywords:

NLP: (Large) Language Models, ML: Graph-based Machine Learning, NLP: Question Answering, DMKM: Linked Open Data, Knowledge Graphs & KB Completion

Abstract

Large language models (LLMs) have shown remarkable generalization capability with exceptional performance in various language modeling tasks. However, they still exhibit inherent limitations in precisely capturing and returning grounded knowledge. While existing work has explored utilizing knowledge graphs (KGs) to enhance language modeling via joint training and customized model architectures, applying this to LLMs is problematic owing to their large number of parameters and high computational cost. Therefore, how to enhance pre-trained LLMs using grounded knowledge, e.g., retrieval-augmented generation, remains an open question. In this work, we propose Graph Neural Prompting (GNP), a novel plug-and-play method to assist pre-trained LLMs in learning beneficial knowledge from KGs. GNP encompasses various designs, including a standard graph neural network encoder, a cross-modality pooling module, a domain projector, and a self-supervised link prediction objective. Extensive experiments on multiple datasets demonstrate the superiority of GNP on both commonsense and biomedical reasoning tasks across different LLM sizes and settings. Code is available at https://github.com/meettyj/GNP.
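
The abstract names four modules (a GNN encoder, a cross-modality pooling module, a domain projector, and a self-supervised link prediction objective) without showing how they connect. The minimal PyTorch sketch below illustrates one plausible way such a pipeline could fit together: a subgraph retrieved from the KG is encoded, pooled against the question's token embeddings, projected into the LLM's embedding space, and prepended as a soft prompt to a frozen LLM. Every class name, dimension, and the single-layer GNN stand-in here is an assumption for illustration, not the authors' implementation; see the linked repository for the official code.

    # Minimal, illustrative sketch of the GNP pipeline described in the
    # abstract. All names, dimensions, and the one-layer GNN stand-in are
    # assumptions for illustration, NOT the authors' code; see
    # https://github.com/meettyj/GNP for the official implementation.
    import torch
    import torch.nn as nn

    class GraphNeuralPrompt(nn.Module):
        """Encode a retrieved KG subgraph into a soft prompt embedding
        that is prepended to a frozen LLM's input embeddings."""

        def __init__(self, node_dim: int, llm_dim: int):
            super().__init__()
            # GNN encoder stand-in: a single linear layer applied after one
            # round of neighborhood aggregation (adjacency multiplication).
            self.gnn = nn.Linear(node_dim, llm_dim)
            # Cross-modality pooling: question tokens attend over nodes.
            self.pool = nn.MultiheadAttention(llm_dim, num_heads=4,
                                              batch_first=True)
            # Domain projector: map pooled graph features into the LLM space.
            self.projector = nn.Sequential(
                nn.Linear(llm_dim, llm_dim), nn.GELU(),
                nn.Linear(llm_dim, llm_dim)
            )

        def forward(self, node_feats, adj, text_embeds):
            # (num_nodes, llm_dim); add a batch dim for attention.
            h = torch.relu(self.gnn(adj @ node_feats)).unsqueeze(0)
            pooled, _ = self.pool(text_embeds, h, h)  # (1, seq_len, llm_dim)
            # Collapse to one graph prompt token; the paper's self-supervised
            # link prediction objective is omitted from this sketch.
            return self.projector(pooled.mean(dim=1, keepdim=True))

    # Toy usage with random tensors standing in for a real KG and LLM.
    node_feats = torch.randn(10, 64)        # 10-node retrieved subgraph
    adj = torch.eye(10)                     # toy adjacency (self-loops only)
    text_embeds = torch.randn(1, 16, 256)   # frozen LLM's input embeddings
    gnp = GraphNeuralPrompt(node_dim=64, llm_dim=256)
    prompt = gnp(node_feats, adj, text_embeds)
    llm_inputs = torch.cat([prompt, text_embeds], dim=1)
    print(llm_inputs.shape)                 # torch.Size([1, 17, 256])

Because only the prompting modules are trained while the LLM stays frozen, this plug-and-play design avoids the parameter and compute cost of jointly training the LLM with the KG, which is the motivation stated in the abstract.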

Published

2024-03-24

How to Cite

Tian, Y., Song, H., Wang, Z., Wang, H., Hu, Z., Wang, F., Chawla, N. V., & Xu, P. (2024). Graph Neural Prompting with Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 19080-19088. https://doi.org/10.1609/aaai.v38i17.29875

Issue

Vol. 38 No. 17 (2024)

Section

AAAI Technical Track on Natural Language Processing II