A Simple and Comprehensive Benchmark for Single-Cell Transcriptomics

Abstract

Single-cell transcriptomics describes complex molecular features at the individual cell level, serving various roles in biological research, such as enhancing gene expression and predicting drug responses. Due to transcriptomic data structurally resembling sequential data, many researchers have trained numerous transformers on extensive transcriptomic datasets. However, they have consistently neglected to explore the intrinsic properties of the data and the appropriateness of their chosen model architecture. In this paper, we carefully investigate the nature of transcriptomics, identifying three overlooked problems: 1) long-tailed data problem, 2) model selection problem, and 3) evaluation problem. Consequently, by applying the weighted sampling strategy, we address the long-tailed data problem and achieve consistent improvement across all settings. By adapting different model structures to transcriptomic data, we discover that transformers are not the only option. By developing three downstream tasks and fair evaluation metrics, we establish a simple and comprehensive benchmark to validate the effectiveness of models for transcriptomics. Through extensive experiments, we clarify the misunderstandings in the traditional methods and provide competitive baselines, thereby paving the way for future research in this field.

Cite

Text

Qi et al. "A Simple and Comprehensive Benchmark for Single-Cell Transcriptomics." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I1.32049

Markdown

[Qi et al. "A Simple and Comprehensive Benchmark for Single-Cell Transcriptomics." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/qi2025aaai-simple/) doi:10.1609/AAAI.V39I1.32049

BibTeX

@inproceedings{qi2025aaai-simple,
  title     = {{A Simple and Comprehensive Benchmark for Single-Cell Transcriptomics}},
  author    = {Qi, Jiaxin and Cui, Yan and Guo, Kailei and Zhang, Xiaomin and Huang, Jianqiang and Xie, Gaogang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {676-684},
  doi       = {10.1609/AAAI.V39I1.32049},
  url       = {https://mlanthology.org/aaai/2025/qi2025aaai-simple/}
}