Beyond Labels: A Self-Supervised Framework with Masked Autoencoders and Random Cropping for Breast Cancer Sub-type Classification

Authors

  • Marco Dossena University of Piemonte Orientale
  • Christopher Irwin University of Piemonte Orientale
  • Annalisa Chiocchetti University of Piemonte Orientale
  • Luigi Portinale University of Piemonte Orientale

DOI:

https://doi.org/10.1609/aaaiss.v5i1.35546

Abstract

This work addresses the problem of breast cancer sub-type classification using histopathological image analysis. We utilize masked autoencoders (MAEs) based on Visual Transformer (ViT) to learn, through Self-Supervised Learning, embeddings tailored to computer vision tasks in this domain. Such embeddings capture informative representations of histopathological data, facilitating feature learning without extensive labeled datasets. During pre-training, we investigate employing a random crop technique to generate a large dataset from whole-slide images automatically. Additionally, we assess the performance of linear probes for multi-class classification tasks of cancer sub-types using the representations learned by the MAE. Our approach aims to achieve strong performance on downstream tasks by leveraging the complementary strengths of ViTs and autoencoders. We evaluate our model's performance on the BRACS and BACH datasets and compare it with existing benchmarks.

Downloads

Published

2025-05-28

How to Cite

Dossena, M., Irwin, C., Chiocchetti, A., & Portinale, L. (2025). Beyond Labels: A Self-Supervised Framework with Masked Autoencoders and Random Cropping for Breast Cancer Sub-type Classification. Proceedings of the AAAI Symposium Series, 5(1), 10–17. https://doi.org/10.1609/aaaiss.v5i1.35546

Issue

Section

AI for Health Symposium: Leveraging Artificial Intelligence to Revolutionize Healthcare (Full Papers)