3. Results
3.1 Identification of Differentially Expressed Genes in Pancreatic Cancer and Ribosome Biogenesis-Related Genes Identification and Enrichment Analysis
In the TCGA and GTEx cohorts, a total of 4,625 differentially expressed genes (DEGs) were identified between PAAD samples and normal pancreatic tissues, meeting the criteria of p-value < 0.05 and absolute log₂ fold change > 1.5. Among these, 3,701 genes were upregulated, and 924 genes were downregulated (Fig. 1).
3.2 Identification of Highly Correlated Gene Modules in TCGA-PAAD and Ribosome Biogenesis-Related Genes Identification and Enrichment Analysis
To further uncover key genes associated with tumors, Weighted Gene Co-expression Network Analysis (WGCNA) was employed to identify gene modules highly correlated with tumors in the TCGA-PAAD dataset. Initially, nine outlier samples were identified and excluded from the analysis through sample clustering (Fig. 2a). Subsequently, a soft-thresholding power of 5 was set (Fig. 2a), and a gene dendrogram was constructed to partition the genes into ten modules: red, purple, pink, black, yellow, magenta, green, turquoise, brown, and blue (Fig. 2b). The heatmap illustrating the correlations between different modules is also presented. The results demonstrated that tumor occurrence was significantly positively correlated with the turquoise module (correlation coefficient = 0.98, p-value = 6e-247) and the blue module (correlation coefficient = 0.88, p-value = 5e-115) (Fig. 2d). Additionally, the gene significance (GS, indicating the correlation between genes and clinical features) and module membership (MM, indicating the correlation between genes and modules) within the turquoise and blue modules were highly correlated, suggesting a strong association of genes in these modules with tumors (Fig. 2c). Genes in the yellow and green modules also exhibited strong correlations with tumors, whereas genes in other modules showed weaker associations (Figures S1). Finally, 2,875 upregulated genes (up CARGs) were extracted from the turquoise and blue modules and intersected with 2,259 ribosome biogenesis-related genes obtained from the GeneCards database using the VennDiagram package in R. This intersection analysis resulted in 60 key ribosome biogenesis-related genes (Fig. 3a). Principal Component Analysis (PCA) further validated that the expression of these 60 CARGs effectively distinguished tumor samples from non-tumor samples (Figures S2a). Additionally, a heatmap illustrated the differential expression of these 60 CARGs between tumor and non-tumor tissues (Figs. 3b).
Gene Ontology (GO) enrichment analysis was performed on these 60 CARGs to explore potential molecular mechanisms associated with PAAD progression. The results indicated that these DEGs were primarily enriched in chromosomal regions, nuclear division, and mitosis-related processes, suggesting that ribosome biogenesis contributes to the synthesis of new proteins that may play roles in cell cycle control and cell division. In terms of cellular components, enrichment analysis revealed that these genes were closely associated with structures such as the spindle, blood microparticles, and the cell cortex, further indicating that the enrichment of ribosome biogenesis in these cellular components may be related to the enhanced proliferation and migratory capabilities of pancreatic cancer. Additionally, the enriched molecular functions were closely related to DNA binding, bending, and other related activities (Fig. 3c). These results collectively suggest that ribosome biogenesis may play a crucial role in PAAD development and progression by regulating the structure and function of genetic material, as well as extracellular signal transduction. Overall, these findings indicate that ribosome biogenesis may influence PAAD development and progression through the regulation of genetic material structure and gene expression.
3.3 Construction and Validation of Prognostic Model for Ribosome Biogenesis-Related Genes
Using multiple machine learning methods, we constructed a risk prediction model based on ribosome biogenesis-related genes from the TCGA database and validated it using GEO datasets. Initially, univariate Cox regression analysis of the 60 identified CARGs revealed 28 prognostically significant genes (Fig. 3d). We then constructed models using both individual and combined machine learning algorithms and calculated the concordance index (C-index) for each model. The average C-index across three datasets was computed to evaluate the predictive performance of all models. The optimal model combination was found to be a combination of Lasso (Least Absolute Shrinkage and Selection Operator) and RSF (Random Survival Forest), and the C-index was 0.65 (Fig. 4a). This model was trained using 1,000 decision trees, with a minimum node sample size of 5 and the random selection of 3 variables at each split. Ultimately, nine genes were identified as significantly impacting survival outcomes: CKB, ECT2, HMGA2, TPX2, ERBB3, SLC2A1, KRT13, PRSS3, and CRABP2 (Figures S3).
Based on this algorithm, risk scores for each sample were calculated, and the median risk score of 40.41 was used as the threshold to classify PAAD patients into low-risk (n = 89) and high-risk (n = 88) groups. Scatter plots demonstrated a significant decline in overall survival (OS) with increasing individual risk scores (Fig. 4b). Kaplan-Meier survival curves and log-rank tests further confirmed that OS was significantly lower in the high-risk group compared to the low-risk group (p < 0.0001; Fig. 4b).
To evaluate the performance of the prognostic model, time-dependent ROC curve analysis was conducted to assess the sensitivity and specificity of the model. The area under the curve (AUC) for predicting 2-year, 3-year, and 5-year survival were 0.976, 0.987, and 0.938, respectively, indicating high accuracy in predicting OS (Fig. 4c). Subsequently, the feasibility of using ribosome biogenesis features to predict PAAD patient prognosis was validated using three GSE datasets and one ICGC dataset. Kaplan-Meier survival curves showed that patients in the low-risk group had better OS than those in the high-risk group (Figs. 5a, 5c, 5e, and 5g). Robust AUC values for the 2-year, 3-year, and 5-year survival predictions were also observed in both the training and validation cohorts (Figs. 5b, 5d, 5f, and 5h). Clinical correlation analysis revealed significant associations between risk scores and follow-up tumor status, radiation therapy, and residual tumors (Figs. 6a-d).In summary, the ribosome biogenesis-related prognostic features constructed using multiple machine learning methods demonstrated excellent performance in predicting survival outcomes for PAAD.
3.4 Ribosome Biogenesis-Related Prognostic Features as Independent Prognostic Factors in PAAD
To determine independent prognostic factors for PAAD patients, we included ribosome biogenesis-related risk scores and various clinical characteristics (including age, sex, histological grade, T stage, M stage, N stage, history of malignancy, follow-up tumor status, tumor location, DCC histological type, history of radiotherapy, residual tumor, and chronic pancreatitis) in univariate and multivariate Cox regression analyses. The univariate Cox regression results indicated that age (HR = 1.028, 95% CI: 1.007–1.049), histological stage (HR = 2.350, 95% CI: 1.078–5.126),N stage (HR = 2.0961, 95% CI: 1.249–3.519), follow-up tumor status (HR = 2.491, 95% CI: 1.077–5.762), radiation therapy (HR = 0.453, 95% CI: 0.257–0.798), DCC histological type (HR = 1.770, 95% CI: 1.077–2.908), and residual tumor (HR = 1.770, 95% CI: 1.097–2.857) were all significant prognostic factors in the TCGA cohort (Fig. 7a). In the multivariate Cox regression analysis, the risk score remained a significant independent prognostic factor closely associated with prognosis (Fig. 7b).
Figure 7.Univariate (a) and multivariate (b) regression analysis of the risk score in the TCGA-PAAD cohort; Cox proportional hazards regression.
Seven significant clinical variables, along with the risk score, were incorporated into the model. Based on the stepwise Cox regression model, a nomogram was constructed to predict 2-year, 3-year, and 5-year OS. The parameters included in the nomogram were risk score, age, N stage, histological grade, follow-up tumor status, history of radiotherapy, DCC histological type, and residual tumor. The results emphasized the predictive capability of ribosome biogenesis-related risk scores in PAAD prognosis (Fig. 8a). The AUC values for predicting 2-year, 3-year, and 5-year OS were 0.986, 0.990, and 0.949, respectively (Fig. 8b). Subsequently, calibration curves were plotted to assess the performance of the nomogram model. The calibration curves closely aligned with the 45-degree diagonal, indicating a high consistency between the predicted survival probabilities and the actual observed outcomes in the TCGA-PAAD cohort (Fig. 8c).
3.5 Functional Analysis of Ribosome Biogenesis-Related Gene Set and Mutation Analysis
To investigate the biological behaviors associated with different Ribosome Biogenesis risk score groups, we employed the GSVA package in R for functional annotation of TCGA-PAAD samples. The heatmap of GSVA-KEGG results revealed that taurine and hypotaurine metabolism-related pathways were significantly enriched in the high-risk group. Additionally, pathways related to the cell cycle and DNA replication were also markedly enriched in the high-risk group. Furthermore, the p53 pathway exhibited higher activity in the high-risk group. In contrast, protein degradation pathways were significantly enriched in the low-risk group (Fig. 9c).
Simultaneously, GSVA-HALLMARK enrichment analysis demonstrated that multiple pathways associated with cancer progression and cell proliferation were significantly enriched in the high-risk group. These pathways include MYC target-related pathways, DNA repair, G2/M checkpoint-related pathways, mTORC1 signaling, hypoxia, and apoptosis-related pathways. Notably, cancer-related pathways such as the p53 pathway, Notch signaling pathway, and TGF-β signaling were more active in the high-risk group compared to the low-risk group (Figs. 9a–b), which was generally consistent with the GSVA-KEGG results. Gene Set Enrichment Analysis (GSEA) further indicated significant enrichment of pathways related to the regulation of cell cycle checkpoints, mitosis, and keratinocyte envelope formation (Fig. 9e).
To further explore the differential gene expression between the CARGs and the risk scores in PAAD patients, we identified DEGs (DEGs 2) between the high-risk and low-risk groups and performed GO-KEGG enrichment analysis using the “org.Hs.eg.db” and “clusterProfiler” packages. The results revealed that these DEGs were enriched in various cellular components, including neuronal cell bodies, synaptic membranes, transmembrane transporter complexes, and ion channel complexes related to neuronal cell membrane structures and channel complexes. In biological processes, these genes were primarily enriched in processes such as the regulation of chemical synaptic transmission, membrane potential regulation, and potassium ion transport. In terms of molecular functions, significant enrichment was observed in potassium and metal ion transmembrane transporter activities, voltage-gated channel activities, and ion channel activities (Fig. 9d). Additionally, KEGG pathway analysis indicated enrichment in neuroactive ligand-receptor interactions and dopaminergic synapse-related pathways (Fig. 9f), suggesting that neuroregulation and metabolic abnormalities may play crucial roles in the initiation and progression of PAAD.
We also analyzed the genomic mutation characteristics associated with the CARGs gene set, including mutation counts and mutation spectra. Utilizing the maftools package, we visualized the mutation patterns of 168 patients from the TCGA-PAAD cohort. The results indicated that the overall mutation rate was higher in the high-risk group compared to the low-risk group, suggesting greater genomic instability in the high-risk group. TP53 and KRAS were the most frequently mutated genes in the high-risk group, with mutation rates significantly higher than those in the low-risk group (TP53: 70% vs. 51%, KRAS: 80% vs. 50%) (Figs. 10a–b). These mutations are closely associated with cancer progression and poor prognosis across various cancers, indicating their potential critical roles in disease progression among high-risk patients. Additionally, CDKN2A and SMAD4 exhibited relatively high mutation frequencies in both groups, with slightly higher rates in the high-risk group, further supporting their importance in tumor suppression and signal transduction. Furthermore, TTN also displayed moderately high mutation frequencies in both groups, warranting further investigation.
Regarding tumor mutation burden (TMB), the proportion of high-TMB samples was higher in the low-risk group compared to the high-risk group, and this difference was statistically significant based on the chi-square test (Figs. 10c–d, p = 0.0032). This suggests that the high-risk group may be associated with poorer responses to immunotherapy.
3.6 Correlation Analysis of Immune Infiltration and Tumor Microenvironment
To explore the relationships among Ribosome Biogenesis, immune infiltration, and the tumor microenvironment, we first conducted single-sample Gene Set Enrichment Analysis (ssGSEA) to compare immune cell infiltration levels between the high-risk and low-risk groups. The results indicated that in the high-risk group, the infiltration levels of CD4⁺ T cells and Type II helper T cells were significantly higher than those in the low-risk group (P < 0.05). Conversely, the low-risk group exhibited significantly higher infiltration levels of activated B cells and eosinophils (P < 0.01), as well as higher infiltration levels of monocytes and macrophages (P < 0.05) (Fig. 11a).
In terms of immune functions, the high-risk group showed elevated levels of antigen-presenting cell co-inhibition, immunosuppressive inflammation, and Type I interferon responses compared to the low-risk group (P < 0.05) (Fig. 11b). Using the ESTIMATE algorithm, we examined the relationship between Ribosome Biogenesis and the tumor microenvironment in 177 PAAD patients from TCGA. ESTIMATE analysis revealed that the high-risk group had higher tumor purity compared to the low-risk group (Fig. 11f), while the ESTIMATE score and immune score were lower in the high-risk group, with no significant difference in the stromal score between the two groups (Fig. 11e).
In immune checkpoint analysis, CD44, LGALS9, CD160, IDO2, and TNFSF9 were significantly more highly expressed in the high-risk group (Fig. 11c), suggesting that immune checkpoint inhibitors (ICIs) targeting these molecules may offer potential therapeutic benefits. The Tumor Immune Dysfunction and Exclusion (TIDE) algorithm, a computational tool for predicting responses to immune checkpoint blockade (ICB), was employed to assess the potential clinical responses to ICB in different risk groups. The results indicated that the relationship between patient immune treatment responses and their risk scores did not reach statistical significance, suggesting that patients in the high- and low-risk groups may not differ significantly in their responses to conventional ICB treatments (Fig. 11f).
Subsequently, using the CIBERSORT algorithm, we performed deconvolution analysis of the tumor immune microenvironment and assessed the correlation between infiltration levels and risk scores. The results demonstrated that activated mast cells and activated B cells were negatively correlated with risk scores, while M1 macrophages and activated dendritic cells were positively correlated with risk scores, with all correlations being statistically significant (P < 0.05) (Figs. 12a–c). Spearman correlation analysis revealed that among the nine CARGs, the expression levels of ECT2 were strongly correlated with the infiltration levels of Type II helper T cells, ERBB3 and PRSS3 with the infiltration levels of Type 17 helper T cells, SLC2A1 with CD56⁺dim natural killer (NK) cells, and TPX2 with activated CD4⁺ T cells (Fig. 11d).
3.7 Drug Sensitivity Prediction and CARGs Risk Score
We utilized cell line data from the Cancer Genome Project (CGP) database to predict the IC50 of commonly used chemotherapy drugs for different risk groups of PAAD patients. Gemcitabine is a first-line treatment for pancreatic cancer and can be used alone or in combination for PAAD chemotherapy [15]. According to our study, patients in the low-risk group exhibited significantly higher sensitivity to gemcitabine compared to those in the high-risk group (P < 0.001), with a correlation coefficient of 0.4 between risk scores and gemcitabine IC50 (Fig. 13a). This suggests that the median inhibitory concentration (IC50) for gemcitabine is significantly lower in the low-risk group, indicating that low-risk patients may be more sensitive to gemcitabine, while high-risk patients may be more prone to developing resistance.
Additionally, the IC50 values for other first-line chemotherapy drugs, such as platinum-based drugs (oxaliplatin, cisplatin) and topoisomerase inhibitors (irinotecan), were also significantly lower in the low-risk group (P < 0.001, Figs. 13b–d). Notably, high-risk patients exhibited significantly lower IC50 values for paclitaxel, erlotinib, Laptinib, and Acetalax compared to the low-risk group (Figs. 13e–f and Figures S4a–b), suggesting that these drugs may offer potential therapeutic benefits for patients in the high-risk category.
3.8 Analysis of Gene Methylation Levels and miRNA Expression
We conducted differential analysis of gene methylation levels and miRNA expression between the high-risk and low-risk groups. The results indicated that there were no significant differences in the methylation levels of corresponding genes between the high-risk and low-risk groups (Figures S5a). Furthermore, there was no apparent correlation between the methylation levels of the nine key prognostic CARGs and their mRNA expression levels (Figures S5b). This suggests that these genes may not be significantly associated with methylation levels.
Differential analysis of miRNA expression between the high-risk and low-risk groups revealed that miR-203a-3p, hsa-miR-196b-5p, and hsa-miR-205-5p were highly expressed in the high-risk group (Figure S5c). However, the correlation between the mRNA expression levels of the nine genes and miRNA expression levels did not reach a correlation coefficient of 0.7 (Figure S5d).
3.9 Validation of Ribosome Biogenesis-Related Genes at the Single-Cell Level and Protein Expression Analysis
To further elucidate the specific roles of CARGs in PAAD progression, we performed single-cell RNA sequencing (scRNA-seq) analysis on 10 samples (GSE155698) to study the expression profiles of LLPS-related genes in the pancreatic malignant tumor microenvironment. After quality control filtering, we obtained 24,094 cells from 10 primary pancreatic tumors. These cells were subsequently merged, clustered, and annotated (Fig. 14a). Based on cell type-specific marker genes, the cells were classified into T cells, macrophages, endothelial cells, pancreatic ductal cells, NK cells, fibroblasts, B cells, and pancreatic cancer stem cells (Fig. 14b).
Next, we plotted the expression profiles of nine ribosome biogenesis-related genes, including ECT2, CKB, HMGA2, TPX2, ERBB3, SLC2A1, KRT13, PRSS3, and CRABP2. As shown in Fig. 14c, CKB, SLC2A1, ERBB3, and CRABP2 were predominantly highly expressed in pancreatic ductal cells, while PRSS3 was highly expressed in both pancreatic acinar and ductal cells. The other genes exhibited only minimal expression (Fig. 14c). To further quantify the overall expression levels of these genes across different cell types, we calculated a module score based on the nine genes. This module score represented the collective expression level of these nine genes within individual cells. The results indicated higher expression levels in ductal epithelial cells and acinar cells (Fig. 14d). These findings suggest that ribosome biogenesis may play critical roles in these cell types.
Immunohistochemistry (IHC) data were downloaded from the Human Protein Atlas (HPA, http://www.proteinatlas.org) to evaluate the expression levels of eight genes (CKB, HMGA2, TPX2, ERBB3, SLC2A1, KRT13, PRSS3, and CRABP2) in pancreatic cancer tissues. IHC analysis demonstrated varying degrees of protein expression, ranging from low to high, in representative PAAD samples (Figs. 15–16).
3.10 Cell Communication
The intercellular signaling network diagram (Fig. 17a) reveals complex signal transduction relationships among various cell types, including macrophages, endothelial cells, pancreatic ductal cells, pancreatic acinar cells, and fibroblasts. In this network, node size represents different cell types, while the thickness of the connections indicates the strength of the signaling interactions.Heatmaps illustrating signaling patterns across cell types (Figs. 17b and 17c) demonstrate that high-scoring ductal epithelial cells exhibit strong outgoing signals in pathways such as MIF, MK, and PDGF. Conversely, macrophages show robust incoming signals in pathways including MIF, SPP1, and CSF. These findings highlight the diversity of intercellular communication.Collectively, these results elucidate the complexity of the intercellular communication network and the specificity and intensity of signaling pathways between distinct cell types. The heatmap depicting communication probability and strength between different cell types (Fig. 17d) indicates that high-scoring ductal epithelial cells engage in strong ligand-receptor interactions with B cells and macrophages. This visualization underscores the key communications among various cell types within the tumor microenvironment through diverse signaling pathways.Furthermore, we conducted an analysis of the MIF signaling pathway network (Figs. 17e and 17f), where nodes represent different cell types and the thickness of the connections reflects the intensity of MIF signal transduction. This analysis emphasizes the complex and critical role of MIF signaling in mediating intercellular communication.
4. Discussion
Ribosome biogenesis is the highly precise process of constructing ribosomal particles, intricately linked to cellular protein synthesis, proliferation, differentiation, and apoptosis. This process entails the coordinated synthesis of four ribosomal RNAs (rRNAs), eighty ribosomal proteins (RPs), and seventy small nucleolar RNAs (snoRNAs) [16,17]. Within the nucleolus of eukaryotic cells, proteins associated with rRNA, RPs, and snoRNAs collaborate with RNA polymerases I and III to facilitate the generation and maturation of ribosomal precursors, specifically the pre-60S and pre-40S subunits. Following their export to the cytoplasm, ribosomes undergo final assembly and maturation [18].
In tumors, various oncogenic signaling pathways enhance ribosome biogenesis and protein synthesis, thereby promoting tumor growth and progression. The oncogene Myc serves as a pivotal regulator in cancer progression by directly activating rRNA synthesis, thereby increasing ribosome biogenesis rates. Nucleolin (Ncl) further promotes rRNA transcription and pre-rRNA processing, aiding oncogenesis. Additionally, Ncl interacts with telomerase reverse transcriptase (TERT) in the nucleoplasm, facilitating its localization to the nucleolus, preventing cancer cell senescence, and sustaining proliferation. Emerging evidence indicates that targeting ribosome biogenesis is a promising therapeutic strategy for cancer patients [19–23]. In pancreatic cancer, mutations in oncogenic RAS and various cancer signaling pathways are potentially linked to ribosome biogenesis [11–13]. Abnormalities in ribosomal proteins and RNA-binding proteins, such as heterogeneous nuclear ribonucleoproteins (hnRNPs) and RRP9, have been shown to promote cancer cell proliferation and chemotherapy resistance [13,24]. Furthermore, studies have demonstrated that inhibitors of ribosome biogenesis, like BMH-21, which targets RNA polymerase I to inhibit rRNA synthesis, exhibit significant anticancer effects in mouse models [25]. Consequently, ribosome biogenesis has increasingly become a central focus in pancreatic cancer research.
In our study, we employed multiple machine learning methods to identify nine ribosome biogenesis-related genes that are highly associated with prognosis: ECT2, CKB, HMGA2, TPX2, ERBB3, SLC2A1, KRT13, PRSS3, and CRABP2.
ECT2 is a RhoGTPase activator essential for cytokinesis. ECT2 has been identified as an oncogene when ectopically expressed in NIH/3T3 fibroblasts, with cytoplasmic and nuclear ECT2 exhibiting distinct oncogenic mechanisms [26]. Studies have shown that in cancers such as lung and ovarian cancer, cytoplasmic ECT2 interacts with the PKCi-Par6 complex and, through PKCi-mediated phosphorylation, drives the RAC1-MEK-ERK signaling axis essential for transformational growth in tumor [27–29]. In colorectal and lung cancers, nuclear ECT2 promotes rDNA transcription and ribosome biogenesis, which may be particularly important for more invasive and highly proliferative tumors [30–31]. In pancreatic cancer, research has demonstrated that YAP1 regulates the expression of ECT2, inducing malignant transformation. Nuclear ECT2 is upregulated during the transition from premalignant lesions to pancreatic ductal adenocarcinoma (PDAC), and high levels of YAP1 or ECT2 expression are associated with poorer overall survival in PDAC patients. ECT2 plays a crucial role in the growth and metastasis of pancreatic cancer [32].
CKB (Creatine Kinase B-type) primarily functions in cellular energy metabolism. In particular tumors, CKB may support rapid proliferation and the protein synthesis demands of ribosome biogenesis by maintaining high levels of energy metabolism [33]. Due to CKB’s primary role in cellular energy metabolism, its specificity in tumors is relatively limited. Currently, no studies have directly linked CKB to pancreatic cancer. However, its high expression may be associated with the high-energy metabolic characteristics and metabolic reprogramming of tumors, suggesting a need for further investigation.
High mobility group (HMG) proteins are a class of non-histone, chromatin-associated molecules involved in maintaining and regulating DNA functions, including replication, recombination, transcription, and DNA repair. High mobility group protein A2 (HMGA2) primarily regulates gene expression by binding to AT-rich regions of DNA and interacting with RNA-binding proteins, thereby influencing the ribosome biogenesis process [39–41]. Notably, HMGA2 is re-expressed in nearly all human malignancies and promotes tumorigenesis through various mechanisms [42]. In pancreatic cancer, HMGA2 has been extensively studied. Research indicates that HMGA2 is involved in maintaining epithelial-mesenchymal transition (EMT) in human pancreatic cancer cells and serves as a prognostic marker for metastatic cancer cell states in primary PDAC. However, HMGA2 does not significantly impact chemotherapy resistance. Various studies suggest that HMGA2 is a potential therapeutic target for pancreatic cancer treatment [43–44].
TPX2 is a critical regulator of cell division, primarily involved in spindle assembly and maintaining genomic stability. It regulates ribosome biogenesis during mitosis by controlling the cell cycle [45]. Overexpression of TPX2 in many malignancies, coupled with its essential role in mitotic regulation, suggests that TPX2 is a potential oncogenic driver [46]. In pancreatic cancer, studies have demonstrated that TPX2 expression is significantly elevated compared to normal tissues. Besides, targeting TPX2 with small interfering RNA effectively inhibits the proliferation of pancreatic cancer cells, and TPX2 knockout enhances the sensitivity of these cells to paclitaxel treatment. Thus, TPX2 emerges as a promising therapeutic target for pancreatic cancer [47].
ERBB3, a member of the epidermal growth factor receptor (EGFR) family and the tyrosine kinase receptor family, primarily functions through heterodimerization with ERBB2. This dimerization activates downstream signaling pathways, such as PI3K/AKT and RAS/RAF/MEK, promoting cell growth, proliferation, and survival [48]. In various cancers, ERBB3 has been found to localize to the nucleolus, interacting with nucleolar components to participate in cell survival and proliferation by regulating transcription factors involved in ribosome biogenesis [49–51]. In pancreatic cancer, dysregulated ErbB signaling plays a pivotal role in tumorigenesis. Studies have shown that NRG-1 derived from cancer-associated fibroblasts promotes PDAC tumorigenesis through the ErbB3-AKT signaling pathway, identifying the NRG-1/ErbB3 axis as an attractive molecular target to disrupt tumor-stroma epithelial interactions within the PDAC microenvironment [52]. Additionally, valproic acid treatment may induce apoptosis in pancreatic cancer cells by downregulating EGFR, ERBB2, and ERBB3 through microRNAs targeting the ErbB family members [53]. The ErbB signaling pathway has thus become a hotspot for targeted therapy in pancreatic cancer.
SLC2A1, also known as GLUT1, is a key gene involved in cellular glucose metabolism [54]. Overexpression of SLC2A1 further enhances glycolysis and cell proliferation in various cancers, promoting the protein synthesis required for ribosome biogenesis through energy metabolism [55]. In pancreatic cancer, studies have demonstrated that SLC2A1 promotes metabolic reprogramming and chemoresistance in PDAC through the GLUT1/ALDOB/G6PD axis [56]. Additionally, SLC2A1 is regulated by upstream factors such as FOXD1, which directly promotes SLC2A1 transcription and enhances GLUT1 expression by regulating aerobic glycolysis, ultimately promoting pancreatic cancer cell proliferation, invasion, and metastasis [57].
KRT13 (Keratin 13) is an intermediate filament protein predominantly expressed in epithelial cells. In cancer, the KRT13/PG/DSP complex interacts directly to alter PG expression and nuclear translocation, thereby regulating downstream c-Myc signaling. c-Myc enhances ribosome biogenesis and function by regulating multiple biological processes, supporting the high demand for protein synthesis in rapidly proliferating cancer cells, and increasing epithelial-to-mesenchymal transition and stem cell-like phenotypes [58–59]. In pancreatic cancer, KRT13 is upregulated in pancreatic cancer stem cell-like cells and is associated with radiotherapy resistance [60].
A
PRSS3 (Protease, Serine 3) is a member of the small serine protease family in the pancreas. Research indicates that PRSS3 plays a critical role in cancer progression, particularly in promoting tumor cell invasion and metastasis [61–62]. Specifically, in pancreatic cancer, PRSS3 is overexpressed in metastatic cell lines. It upregulates VEGF expression through the PAR1-mediated ERK pathway to promote angiogenesis. As a serine protease, PRSS3 enhances extracellular matrix (ECM) degradation, altering the tumor microenvironment and thereby promoting tumor progression and metastasis in vivo. PRSS3 plays an essential role in the progression, metastasis, and prognosis of human pancreatic cancer, making it a viable target for therapeutic intervention [63–64].
CRABP2 (Cellular Retinoic Acid Binding Protein 2) is a crucial intracellular retinoic acid transport protein that transports retinoic acid (RA) to the nucleus and interacts with its receptor complex, retinoic acid receptor (RAR). CRABP2 regulates cell proliferation, apoptosis, and metastasis by transporting RA to the nucleus and interacting with RAR [65–67]. High expression of CRABP2 is associated with increased invasiveness and poor prognosis in various cancers. Studies have shown that CRABP2 interacts with the integrin/FAK/ERK signaling pathway, enhancing cellular responses to growth factors, increasing cellular metabolic activity, and promoting ribosome biogenesis, thereby leading to tumor invasiveness and metastasis [68]. However, its role in pancreatic cancer remains unexplored, presenting a potential avenue for future research.
Consistent with the reports mentioned above, our results indicate that the features of these nine genes are closely related to the clinical characteristics of malignant tumors and chemotherapy resistance. Furthermore, this nine-gene signature risk model can independently predict overall survival outcomes beyond known clinical and pathological risk factors.
Our Analysis Centered on Multiple Machine Learning Algorithms to Differentiate High and Low-Risk Groups Based on Risk Scores and Identify Potential Drugs Significantly Affecting Disease Prognosis. High-risk PAAD patients appear to exhibit resistance to chemotherapy drugs that interfere with DNA synthesis, such as gemcitabine, oxaliplatin, and cisplatin, as well as topoisomerase I inhibitors like irinotecan. Mechanisms underlying resistance to DNA synthesis inhibitors generally include enhanced DNA repair capabilities and altered cellular metabolic pathways. Enhanced DNA repair can lead to the restoration of DNA damage induced by topoisomerase I inhibitors, thereby diminishing the cytotoxic effects of irinotecan [69–71]. These mechanisms may be potentially linked to our finding that high-risk groups are enriched in DNA repair pathways.
Additionally, we discovered that paclitaxel may offer benefits to high-risk patients. Paclitaxel inhibits cell mitosis by stabilizing microtubule structures, preventing cancer cells from completing the division cycle [72]. The nanoparticle form of paclitaxel is often used in combination with gemcitabine as a first-line chemotherapy regimen for pancreatic cancer and metastatic pancreatic cancer [73]. This aligns with our enrichment results, where high-risk group patients are more enriched in pathways regulating cell cycle checkpoints and mitosis, potentially serving as potential targets for paclitaxel.
Erlotinib and Acetalax are both EGFR tyrosine kinase inhibitors (TKIs). Erlotinib, in combination with gemcitabine, remains a first-line chemotherapy option for advanced pancreatic cancer. However, compared to the FOLFIRINOX regimen, it has limitations and is suitable for specific patient populations who cannot tolerate multiple chemotherapy regimens [74]. In our cohort, high-risk group patients may benefit from erlotinib. Laptinib and Acetalax have primarily been studied for their potential roles in HER2-positive and triple-negative breast cancers, respectively [75–76]. Their effects on pancreatic cancer require further experimental and theoretical support.
Based on our study cohort's enrichment analysis results, the high-risk group is primarily enriched in MYC target-related pathways, DNA repair, G2/M checkpoint-related pathways, mTORC1 signaling, hypoxia, apoptosis-related pathways, the p53 pathway, Notch signaling pathway, and TGF-β signaling-related Hallmark pathways.
MYC is a crucial transcription factor closely associated with cell growth and proliferation. Multiple biological processes regulated by c-MYC enhance ribosome generation and function [58,77]. mTORC1 is a key regulator of cellular metabolism, protein synthesis, and cell proliferation. It enhances metabolic functions by regulating glycolysis, lipid synthesis, and ribosome generation, and supports rapid tumor cell proliferation through protein synthesis [78]. The Notch signaling pathway is believed to regulate tumor development and the tumor microenvironment [79]. Other pathways are closely linked to processes related to cell proliferation and cancer progression. GSEA enrichment analysis also revealed a strong association with cancer cell proliferation pathways related to cell cycle and mitosis. GO-KEGG analysis showed significant enrichment in neuroactive ligand-receptor interactions, transmembrane receptors, ion channels, neurotransmitter-regulated processes, and pathways, indicating a complex interplay between high metabolic activity in tumor cells and neural signaling networks and regulation.
A
In the cytokine network of pancreatic cancer, immature CD4⁺ T cells can be activated in various cytokine environments [80]. Studies have shown that in pancreatic ductal adenocarcinoma, tumor-associated macrophages (TAMs) rely on IL-10 through the NLRP3 signaling pathway to induce CD4⁺ T cell differentiation into Th2, Th17, and regulatory T cells (Tregs), while inhibiting Th1 polarization and the activation of CD8⁺ cytotoxic T cells [81]. Additionally, cytokines such as TNF-α and IL-1β released by tumor cells can induce cancer-associated fibroblasts (CAFs) to secrete thymic stromal lymphopoietin (TSLP), activating resident dendritic cells (DCs). These DCs release Th2 chemokines (CCL2, CCL17), recruiting Th2 cells to the tumor region. Subsequently, Th2 cells secrete cytokines that promote M2 polarization, mediating immune evasion [82].
These findings are consistent with our study, where high-risk patients in our model cohort exhibited higher infiltration levels of CD4⁺ T cells and Type II helper T cells. In the low-risk group, increased infiltration levels of activated B cells, eosinophils, monocytes, and macrophages are associated with enhanced immune responses and surveillance, as well as pro-inflammatory functions. The M1 subtype of macrophages possesses antitumor activities [83], suggesting that the low-risk group may predominantly harbor M1 macrophages. However, our study also found a positive correlation between M1 macrophages and risk scores, although insufficient to indicate significantly higher infiltration levels in the high-risk group. This slight increase may be related to enhanced metabolism leading to compensatory antitumor activity, warranting further investigation. These results provide a basis for further exploring the potential roles of immune cells in tumor prognosis and therapy.
Tumor immunotherapy, particularly immune checkpoint blockade (ICB) and cell therapy, remains a persistent area of interest in cancer treatment. However, its clinical efficacy is often limited by the immunosuppressive tumor microenvironment. In the TIDE analysis, we found that the differences in responses to ICB treatment between high-risk and low-risk groups did not reach statistical significance. This may reflect the complex immune microenvironment of pancreatic cancer, making it challenging for a single risk score to comprehensively predict responses to ICB treatment. Additionally, limitations of bioinformatics analyses, such as insufficient sample sizes, data heterogeneity, and lack of experimental validation, may also impact the significance of the results. Therefore, future studies should integrate larger-scale clinical data and experimental validations to deeply explore the potential associations between ribosome biogenesis risk scores and ICB treatment responses. Furthermore, our study revealed that multiple immune checkpoint genes, including CD44, LGALS9, CD160, IDO2, and TNFSF9, were significantly more highly expressed in high-risk patients. This suggests that immune checkpoint inhibitors (ICIs) targeting these molecules may offer potential therapeutic benefits. However, ICB treatment prediction indicated that risk scores did not significantly differ in their association with ICB treatment responses, possibly due to the inherently "cold tumor" characteristics of pancreatic cancer, which render it less responsive to immunotherapy [84]. Immunotherapy for pancreatic malignancies still requires further exploration.
In our analysis of somatic mutations, we found that TP53 and KRAS had significantly higher mutation frequencies in the high-risk group compared to the low-risk group. Both subgroups exhibited high mutation rates of CDKN2A and SMAD4, with slightly higher rates in the high-risk group. Mutations in KRAS and TP53 have been extensively studied and consistently shown to have profound impacts on the survival of pancreatic cancer patients. KRAS is an oncogene whose mutations lead to the continuous activation of the RAS/RAF/MEK/ERK signaling pathway, promoting cell proliferation, survival, and migration [85]. TP53 is a tumor suppressor gene whose mutations typically result in cell cycle dysregulation and reduced DNA damage repair capabilities, thereby facilitating tumor development [86]. Mutations in CDKN2A often lead to aberrant cell cycle regulation, while deletions or mutations in SMAD4 are closely associated with metastatic pancreatic cancer [87]. TTN, a structural protein in striated muscle, has the longest exon length in the genome [88]. The predictive significance of TTN mutations may be primarily driven by statistical mechanisms, with limited support in existing literature, and its specific functional mechanisms in certain cancers remain unclear [89]. Future experimental studies are needed to validate how TTN mutations affect the structural stability of pancreatic cancer cells and the specific functions of signaling pathways, thereby clarifying their clinical significance. These findings underscore the critical role of key gene mutations in the progression of pancreatic cancer and suggest that therapeutic strategies targeting these gene mutations may help improve patient prognosis and develop personalized treatment plans.
Although some miRNAs have been identified to play roles in pancreatic cancer cell proliferation, invasion, and metastasis, their study remains limited. In our study cohort, we identified that hsa-miR-203a-3p, hsa-miR-196b-5p, and hsa-miR-205-5p were highly expressed in the high-risk group based on the ribosome biogenesis risk score model, whereas these miRNAs were not highly expressed in the low-risk group.
hsa-miR-203a-3p plays a critical role in various cancers, typically acting as a tumor-suppressive miRNA. In nasopharyngeal carcinoma, miR-203a-3p inhibits tumor growth and metastasis by targeting LASP1 [91]. Its high expression in breast cancer and non-small cell lung cancer is also associated with better prognosis [92,93]. However, our study found that miR-203a-3p is highly expressed in the high-risk group of pancreatic cancer, suggesting it may have dual functions in different tumor types. Existing literature indicates that miR-203a-3p acts as a tumor suppressor in some cancers, while in others, it may promote tumor progression by enhancing epithelial-mesenchymal transition (EMT) and targeting PDE4D to promote colorectal cancer cell proliferation, colony formation, and migration and invasion [94,95]. This dual functionality may be related to its regulation of different target genes and signaling pathways in various biological environments. Therefore, future studies should further investigate the specific mechanisms of miR-203a-3p in pancreatic cancer to elucidate its potential therapeutic value.
miR-196b-5p in pancreatic cancer can regulate the expression of the ING5 gene, inhibiting apoptosis and promoting cancer cell proliferation [96]. It also enhances the invasiveness and anti-apoptotic capabilities of cancer cells in lung small cell carcinoma and colorectal cancer [97,98].
hsa-miR-205-5p exhibits dual roles in different tumors, acting both as a tumor suppressor and a tumor promoter. Its specific function depends on the type of cancer and the tissue microenvironment. Currently, there are few studies confirming miR-205 as a biomarker for pancreatic cancer. Some studies suggest it has ambiguous and pro-tumorigenic roles [99].
Additionally, we analyzed the single-cell sequencing data of the nine genes included in our model and observed differential distribution of cells expressing these molecules in pancreatic cancer. Notably, these cells were primarily distributed in ductal epithelial cells, acinar cells, and a subset of cancer stem cells. Combined with the impact of the risk model on immune infiltration, we speculate that the "cold tumor" characteristics of pancreatic cancer cells may contribute to the formation of an immunosuppressive microenvironment, ultimately resulting in an "immune desert" phenotype. Moreover, our cell communication analysis revealed intricate interactions among cell populations. Ductal epithelial cells associated with hub genes may have complex connections with the MIF signaling pathway, warranting further investigation.
Despite identifying ribosome biogenesis-related risk score groups and prognostic genes in pancreatic ductal adenocarcinoma (PAAD) for the first time, our study has several limitations. First, the number of cohorts and sequencing data used was limited. Second, clinical and pathological feature information of patients was insufficient, necessitating more practical and valuable factors to predict 1- to 5-year survival rates. Lastly, there was a lack of basic experiments to validate the expression of these prognostic genes in pancreatic cancer cell lines, which requires further investigation.
In summary, we identified key prognostic genes related to ribosome biogenesis and constructed a prognostic risk model for PAAD. The results revealed nine prognostic genes that are highly associated with tumorigenesis and immune cell infiltration. This prognostic risk model demonstrated robust and effective performance in predicting overall survival outcomes for PAAD patients. Overall, our findings hold significant implications for in-depth studies of the molecular pathways and mechanisms underlying PAAD, as well as for the development of targeted therapeutic and prognostic strategies for pancreatic cancer.
Limitations
Although this study integrated multiple public databases and employed a series of bioinformatics and machine learning analyses, there are still several limitations. First, all the data used were derived from publicly available databases, potentially leading to issues with data completeness and heterogeneity. Secondly, this study primarily relies on bioinformatics analyses, and further in vitro and in vivo experimental validation of the molecular biological mechanisms of these prognostic genes in pancreatic cancer cell lines is still needed. Finally, given the significant heterogeneity of pancreatic cancer, further validation in multiple centers with diverse patient populations is required to assess the generalizability and clinical applicability of these findings.