ST-LLM: Spatial Transcriptomics Embedding with Large Language Models
DOI:
https://doi.org/10.1609/aaai.v40i40.40713Abstract
Spatial transcriptomics provides unprecedented opportunities to analyze gene patterns while preserving spatial tissue architecture. However, traditional deep learning methods for spatial transcriptomics analysis face significant challenges in multi-modal data integration, spatial dependency modeling, and biological knowledge incorporation, while existing large language models lack explicit spatial modeling capabilities for transcriptomic data. So we first present a Spatial Transcriptomics Embedding with Large Language Models (ST-LLM), a novel simple and effective approach that transforms intricate spatial graph structures into structured textual representations suitable for large language models (LLMs). ST-LLM dynamically constructs graph adjacency construction using reinforcement learning paradigms to adaptively optimize spatial relationships, converts the resulting graphs into hierarchical textual descriptions with spatial context, and leverages pre-trained semantic understanding to generate high-dimensional spatial-aware representations. Comprehensive experiments on 14 datasets demonstrate that ST-LLM achieves comparable or better performance than traditional model. ST-LLM shows that LLMs embeddings provide a new simple and effective path to encoding spatial transcriptomics biological knowledge.Downloads
Published
2026-03-14
How to Cite
Xu, Z., Wan, X., Li, L., Feng, S., Zhang, Y., Zhang, F., & Hu, B. (2026). ST-LLM: Spatial Transcriptomics Embedding with Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(40), 34178–34186. https://doi.org/10.1609/aaai.v40i40.40713
Issue
Section
AAAI Technical Track on Natural Language Processing V