Explainable Melanoma Diagnosis with Contrastive Learning and LLM-based Report Generation

Authors

  • Junwen Zheng Nanyang Technological University, Singapore
  • Xinran Xu Nanyang Technological University, Singapore
  • Li Rong Wang Nanyang Technological University, Singapore Centre for Frontier AI Research, A*STAR, Singapore
  • Chang Cai Nanyang Technological University, Singapore
  • Lucinda Siyun Tan National Skin Centre, National Healthcare Group, Singapore
  • Dingyuan Wang National Skin Centre, National Healthcare Group, Singapore
  • Hong Liang Tey Nanyang Technological University, Singapore National Skin Centre, National Healthcare Group, Singapore
  • Xiuyi Fan Nanyang Technological University, Singapore

DOI:

https://doi.org/10.1609/aaai.v40i44.41154

Abstract

Deep learning has demonstrated expert-level performance in melanoma classification, positioning it as a powerful tool in clinical dermatology. However, model opacity and the lack of interpretability remain critical barriers to clinical adoption, as clinicians often struggle to trust the decision-making processes of black-box models. To address this gap, we present a Cross-modal Explainable Framework for Melanoma (CEFM) that leverages contrastive learning as the core mechanism for achieving interpretability. Specifically, CEFM maps clinical criteria for melanoma diagnosis—namely Asymmetry, Border, and Color (ABC)—into the Vision Transformer embedding space using dual projection heads, thereby aligning clinical semantics with visual features. The aligned representations are subsequently translated into structured textual explanations via natural language generation, creating a transparent link between raw image data and clinical interpretation. Experiments on public datasets demonstrate 92.79% accuracy and an AUC of 0.961, along with significant improvements across multiple interpretability metrics. Qualitative analyses further show that the spatial arrangement of the learned embeddings aligns with clinicians’ application of the ABC rule, effectively bridging the gap between high-performance classification and clinical trust.

Published

2026-03-14

How to Cite

Zheng, J., Xu, X., Wang, L. R., Cai, C., Tan, L. S., Wang, D., … Fan, X. (2026). Explainable Melanoma Diagnosis with Contrastive Learning and LLM-based Report Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(44), 38156–38163. https://doi.org/10.1609/aaai.v40i44.41154

Issue

Section

AAAI Special Track on AI Alignment