HLMEA: Unsupervised Entity Alignment Based on Hybrid Language Models

Xiongnan Jin; Zhilin Wang; Jinpeng Chen; Liu Yang; Byungkook Oh; Seung-won Hwang; Jianqiang Li

doi:10.1609/aaai.v39i11.33294

Authors

Xiongnan Jin Shenzhen University
Zhilin Wang Alibaba Group
Jinpeng Chen Beijing University of Post and Telecommunication Xiangjiang Laboratory
Liu Yang Central South University
Byungkook Oh Konkuk University
Seung-won Hwang Seoul National University
Jianqiang Li Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v39i11.33294

Abstract

Entity alignment (EA) is crucial for integrating knowledge graphs (KGs) constructed from diverse sources. Conventional unsupervised EA approaches attempt to eliminate human intervention but often suffer from accuracy limitations. With the rise of large language models (LLMs), leveraging their capabilities for EA presents a promising direction. However, it introduces new challenges: formulating the LLM-based EA problem and extracting the background knowledge in LLMs to realize EA without human intervention. This paper proposes HLMEA, a novel hybrid language model-based unsupervised EA method. HLMEA formulates the EA task into a filtering and single-choice problem and synergistically integrates small language models (SLMs) and LLMs. Specifically, SLMs filter candidate entities based on textual representations generated from KG triples. Then, LLMs refine this selection to identify the most semantically aligned entities. An iterative self-training mechanism allows SLMs to distill knowledge from LLM outputs, enhancing the EA ability of hybrid language models in subsequent rounds cooperatively. We also conducted extensive experiments on benchmark datasets to evaluate HLMEA's performance. The results demonstrate that HLMEA significantly outperforms unsupervised and even supervised EA baselines, proving its potential for scalable and effective EA across large KGs. The code and data are available at \url{https://github.com/xnjin-ai/HLMEA}.

HLMEA: Unsupervised Entity Alignment Based on Hybrid Language Models

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information