SpotDiff: Spatial Gene Expression Imputation Diffusion with Single-Cell RNA Sequencing Data Integration

Authors

  • Tianyi Chen City University of Hong Kong
  • Yunfei Zhang South China University of Technology
  • Lianxin Xie South China University of Technology
  • Wenjun Shen Shantou University Medical College
  • Si Wu South China University of Technology
  • Hau-San Wong City University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v39i15.33740

Abstract

The advent of Spatial Transcriptomics (ST) has revolutionized understanding of tissue architecture by creating high-resolution maps of gene expression patterns. However, the low capture rate of ST leads to significant sparsity. The aim of imputation is to recover biological signals by imputing the dropouts in ST data to approximate the true expression values. In this paper, we introduce a Spatial Gene Expression Imputation Diffusion model to facilitate ST data imputation, and our model is referred to as SpotDiff. Specifically, we incorporate a spot-gene prompt learning module to capture the association between spots and genes. Further, SpotDiff integrates single-cell RNA sequencing data to impute gene expression at each spot. The proposed approach is able to reduce the uncertainty in the imputation process, since the aggregation of multiple single-cell measurements yield a stable representation of the corresponding spot expression profile. Extensive experiments have been performed to demonstrate that SpotDiff outperforms existing imputation methods across multiple benchmarks in terms of yielding more accurate and biologically relevant gene expression profiles, particularly in highly sparse scenarios.

Published

2025-04-11

How to Cite

Chen, T., Zhang, Y., Xie, L., Shen, W., Wu, S., & Wong, H.-S. (2025). SpotDiff: Spatial Gene Expression Imputation Diffusion with Single-Cell RNA Sequencing Data Integration. Proceedings of the AAAI Conference on Artificial Intelligence, 39(15), 15848-15856. https://doi.org/10.1609/aaai.v39i15.33740

Issue

Section

AAAI Technical Track on Machine Learning I