Semi-Supervised Knowledge Transfer Across Multi-Omic Single-Cell Data

Abstract

Knowledge transfer between multi-omic single-cell data aims to effectively transfer cell types from scRNA-seq data to unannotated scATAC-seq data. Several approaches aim to reduce the heterogeneity of multi-omic data while maintaining the discriminability of cell types with extensive annotated data. However, in reality, the cost of collecting both a large amount of labeled scRNA-seq data and scATAC-seq data is expensive. Therefore, this paper explores a practical yet underexplored problem of knowledge transfer across multi-omic single-cell data under cell type scarcity. To address this problem, we propose a semi-supervised knowledge transfer framework named Dual label scArcity elimiNation with Cross-omic multi-samplE Mixup (DANCE). To overcome the label scarcity in scRNA-seq data, we generate pseudo-labels based on optimal transport and merge them into the labeled scRNA-seq data. Moreover, we adopt a divide-and-conquer strategy which divides the scATAC-seq data into source-like and target-specific data. For source-like samples, we employ consistency regularization with random perturbations while for target-specific samples, we select a few candidate labels and progressively eliminate incorrect cell types from the label set for additional supervision. Next, we generate virtual scRNA-seq samples with multi-sample Mixup based on the class-wise similarity to reduce cell heterogeneity. Extensive experiments on many benchmark datasets suggest the superiority of our DANCE over a series of state-of-the-art methods.

Cite

Text

Zhang et al. "Semi-Supervised Knowledge Transfer Across Multi-Omic Single-Cell Data." Neural Information Processing Systems, 2024. doi:10.52202/079017-1292

Markdown

[Zhang et al. "Semi-Supervised Knowledge Transfer Across Multi-Omic Single-Cell Data." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/zhang2024neurips-semisupervised/) doi:10.52202/079017-1292

BibTeX

@inproceedings{zhang2024neurips-semisupervised,
  title     = {{Semi-Supervised Knowledge Transfer Across Multi-Omic Single-Cell Data}},
  author    = {Zhang, Fan and Liu, Tianyu and Chen, Zihao and Peng, Xiaojiang and Chen, Chong and Hua, Xian-Sheng and Luo, Xiao and Zhao, Hongyu},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-1292},
  url       = {https://mlanthology.org/neurips/2024/zhang2024neurips-semisupervised/}
}