Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Authors

  • Mingyu Derek Ma, University of California, Los Angeles
  • Xiaoxuan Wang, University of California, Los Angeles
  • Yijia Xiao, University of California, Los Angeles
  • Anthony Cuturrufo, University of California, Los Angeles
  • Vijay S Nori, Optum AI
  • Eran Halperin, University of California, Los Angeles; Optum AI
  • Wei Wang, University of California, Los Angeles

DOI:

https://doi.org/10.1609/aaai.v39i23.34660

Abstract

Clinical diagnosis prediction models, when provided with a patient's medical history, aim to detect potential diseases early, facilitating timely intervention and improving prognostic outcomes. However, the inherent scarcity of patient data and the large disease candidate space often pose challenges in developing satisfactory models for this intricate task. Exploration of Large Language Models (LLMs) for encapsulating clinical decision processes has been limited. We introduce MERA, a clinical diagnosis prediction model that bridges pretraining natural language knowledge with medical practice. We apply hierarchical contrastive learning on a disease candidate ranking list to alleviate the large decision space issue. With concept memorization through fine-tuning, we bridge natural language clinical knowledge with medical codes. Experimental results on the MIMIC-III and MIMIC-IV datasets show that MERA achieves state-of-the-art diagnosis prediction performance and dramatically elevates the diagnosis prediction capabilities of generative LMs.

Published

2025-04-11

How to Cite

Ma, M. D., Wang, X., Xiao, Y., Cuturrufo, A., Nori, V. S., Halperin, E., & Wang, W. (2025). Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction. Proceedings of the AAAI Conference on Artificial Intelligence, 39(23), 24786–24794. https://doi.org/10.1609/aaai.v39i23.34660

Issue

Vol. 39 No. 23 (2025)

Section

AAAI Technical Track on Natural Language Processing II