Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

SPADE: Spatial Transcriptomics and Pathology Alignment Using a Mixture of Data Experts for an Expressive Latent Space

Created by
  • Haebom

Author

Ekaterina Redekop, Mara Pleasure, Zichen Wang, Kimberly Flores, Anthony Sisk, William Speier, Corey W. Arnold

Outline

The rapid growth of digital pathology and advances in self-supervised deep learning have enabled the development of foundational models for various diseases. While multimodal approaches integrating diverse data sources have emerged, a critical gap remains in comprehensively integrating whole-slide imaging (WSI) and spatial transcriptomics (ST). This is essential for capturing significant molecular heterogeneity beyond standard hematoxylin & eosin (H&E) staining. In this paper, we introduce SPADE, a foundational model that integrates morphological and molecular information to drive image representation learning in a single latent space. SPADE leverages a mixed-data expert technique, where experts are generated through two-step image feature spatial clustering, to learn representations of co-registered WSI patches and gene expression profiles using contrastive learning. Pretrained on the comprehensive HEST-1k dataset, SPADE is evaluated on 20 downstream tasks, demonstrating significantly better small-shot performance than baseline models, highlighting the benefits of integrating morphological and molecular information in a single latent space.

Takeaways, Limitations

Takeaways:
Development of SPADE, a new foundational model for integrating WSI and ST data.
Improving image representation learning by leveraging mixed data expert techniques.
Demonstrating excellent small-shot performance of SPADE pre-trained on the HEST-1k dataset.
Emphasizes the importance of integrating morphological and molecular information.
Limitations:
The specific Limitations is not directly mentioned in the paper.
Additional research and validation on other data sets are needed.
Further evaluation of the model's generalization ability is needed.
👍