HLMEA: Unsupervised Entity Alignment Based on Hybrid Language Models

Authors

  • Xiongnan Jin Shenzhen University
  • Zhilin Wang Alibaba Group
  • Jinpeng Chen Beijing University of Post and Telecommunication Xiangjiang Laboratory
  • Liu Yang Central South University
  • Byungkook Oh Konkuk University
  • Seung-won Hwang Seoul National University
  • Jianqiang Li Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v39i11.33294

Abstract

Entity alignment (EA) is crucial for integrating knowledge graphs (KGs) constructed from diverse sources. Conventional unsupervised EA approaches attempt to eliminate human intervention but often suffer from accuracy limitations. With the rise of large language models (LLMs), leveraging their capabilities for EA presents a promising direction. However, it introduces new challenges: formulating the LLM-based EA problem and extracting the background knowledge in LLMs to realize EA without human intervention. This paper proposes HLMEA, a novel hybrid language model-based unsupervised EA method. HLMEA formulates the EA task into a filtering and single-choice problem and synergistically integrates small language models (SLMs) and LLMs. Specifically, SLMs filter candidate entities based on textual representations generated from KG triples. Then, LLMs refine this selection to identify the most semantically aligned entities. An iterative self-training mechanism allows SLMs to distill knowledge from LLM outputs, enhancing the EA ability of hybrid language models in subsequent rounds cooperatively. We also conducted extensive experiments on benchmark datasets to evaluate HLMEA's performance. The results demonstrate that HLMEA significantly outperforms unsupervised and even supervised EA baselines, proving its potential for scalable and effective EA across large KGs. The code and data are available at \url{https://github.com/xnjin-ai/HLMEA}.

Downloads

Published

2025-04-11

How to Cite

Jin, X., Wang, Z., Chen, J., Yang, L., Oh, B., Hwang, S.- won, & Li, J. (2025). HLMEA: Unsupervised Entity Alignment Based on Hybrid Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(11), 11888-11896. https://doi.org/10.1609/aaai.v39i11.33294

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management I