In this paper, we present Celler, a state-of-the-art generative dictionary learning model for efficient annotation of single-cell data related to human diseases. Celler utilizes the Gaussian Inflation (GInf) loss function and a Hard Data Mining (HDM) strategy to enhance learning of rare categories and reduce the risk of overfitting to common categories. Furthermore, we build Celler-75, a large-scale single-cell dataset containing 40 million cells across 80 human tissues and 75 specific diseases, providing crucial support for exploring the potential of single-cell technology. The source code is available on GitHub.