Zinc Finger Protein-Based Prognostic Model in Lung Adenocarcinoma Highlights FGD3 as a Potential Marker for Lorlatinib Resistance
JiayueSun1
XiaoyiHuang2,3
DinglongXue1
JiazhuangLi1
YaruHuang1
YueYang1
QingweiMeng1✉Email
1
A
Department of Medical OncologyHarbin Medical University Cancer HospitalNo. 150, Haping Road150040HarbinHeilongjiangP.R. China
2Biotherapy CenterHarbin Medical University Cancer Hospital150040HarbinHeilongjiang, HeilongjiangP.R. China
3NHC Key Laboratory of Cell TransplantationHarbin Medical University150040HarbinHeilongjiang, HeilongjiangP.R. China
Jiayue Sun1, Xiaoyi Huang2,3, Dinglong Xue1, Jiazhuang Li1, Yaru Huang1, Yue Yang1, Qingwei Meng1*
*Correspondence:
Qingwei Meng
mengqw@hrbmu.edu.cn
1Department of Medical Oncology, Harbin Medical University Cancer Hospital, No. 150, Haping Road, Harbin 150040, Heilongjiang, P.R. China
2Biotherapy Center, Harbin Medical University Cancer Hospital, Harbin, Heilongjiang, 150040, Heilongjiang, P.R. China
3NHC Key Laboratory of Cell Transplantation, Harbin Medical University, Harbin, Heilongjiang, 150040, Heilongjiang, P.R. China
Abstract
Background
Lung adenocarcinoma is the most common type of lung cancer and a major cause of cancer death. Zinc finger proteins are involved in tumor progression and may serve as diagnostic and therapeutic targets.
Methods
RNA-seq and clinical data were obtained from the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Prognosis-associated ZNF genes were identified through univariate Cox regression, least absolute shrinkage and selection operator (LASSO) regression, and multivariate Cox regression analyses. An eight-gene prognostic model was established and evaluated using Kaplan–Meier survival curves and time-dependent receiver operating characteristic (ROC) analyses, with external validation in the GSE50081 and GSE26939 datasets. A nomogram integrating independent prognostic factors was constructed and assessed by calibration plots. Immune infiltration, mutation profiles, and drug response were analyzed. Finally, we focused on a key molecule FGD3, examining its expression in LUAD cells and tissues, including lorlatinib-resistant models.
Results
The signature consisted of TRIM6, TRIM29, CTCFL, FGD3, GATA4, CASZ1, TRAF2, and ZNF322. T stage, N stage, and the risk score emerged as independent predictors of overall survival. The nomogram showed robust predictive value. High-risk patients exhibited an immune “desert” phenotype, higher somatic mutation rates, and elevated tumor mutational burden. Immunotherapy prediction and drug sensitivity analysis suggested that the model could potentially predict responses to immunotherapy, chemotherapy, and targeted therapy. FGD3 expression was downregulated in LUAD tissues and lorlatinib-resistant cells, decreasing with prolonged lorlatinib exposure. Restoration of FGD3 suppressed H3122LR proliferation and partially reversed resistance.
Conclusion
We developed and validated a novel ZNF-based prognostic model for LUAD that may serve as a valuable tool for risk stratification and guiding personalized treatment strategies.
Keywords
Zinc finger protein family
Lung adenocarcinoma
Prognostic risk model
FGD3
Lorlatinib resistance
Immune infiltration
Drug sensitivity
A
1 Introduction
A
Lung cancer was the most commonly diagnosed cancer in 2022, with approximately 2.5 million new cases, accounting for 12.4% of all cancer cases worldwide. Similarly, it ranks first in both incidence and mortality among cancers in China [1]. Non-small cell lung cancer (NSCLC) accounts for 80–85% of newly diagnosed cases, with lung adenocarcinoma (LUAD) being the most prevalent histological subtype [2]. Although most patients initially respond to standard chemotherapy, relapse often occurs rapidly [3]. LUAD exhibits high heterogeneity driven by complex genetic alterations, such as EGFR, KRAS, and ALK mutations, which serve as major targets for molecular therapy [4]. For patients without detectable driver mutations, immune checkpoint inhibitors (ICIs) have emerged as a promising therapeutic option. Despite these therapeutic advances, the prognosis for advanced LUAD remains poor, partly due to the lack of reliable prognostic markers to guide personalized treatment strategies. Currently, various biological markers have been used to predict the survival outcomes and treatment responses of patients with LUAD. For example, TP53 mutations, PD-L1 expression levels, immune microenvironment characteristics, and tumor metabolism-related molecules all play important roles in prognostic assessment [5]. However, the predictive capability of these individual markers is limited by the intrinsic heterogeneity of the disease [6], highlighting the urgent need for more accurate prognostic models.
Zinc finger proteins (ZNFs) represent the largest transcription factor family in the human genome [7]. These proteins contain one or more zinc ions coordinated with specific amino acids, forming diverse zinc-finger domains [8]. This structural diversity enables ZNFs to bind DNA, RNA, and proteins, thereby participating in numerous biological processes, including transcriptional regulation, DNA repair, protein degradation, signal transduction, cell proliferation, adhesion, migration, and cytoskeletal organization [9]. Emerging evidence suggests that ZNFs can function as either oncogenes or tumor suppressors, depending on the cellular context [7]. For instance, ZNF131 has been reported to promote non-small cell lung cancer (NSCLC) cell proliferation, invasion, and stemness [10]. Similarly, the zinc finger protein ZNF687 accelerates LUAD cell proliferation and tumor progression [11]. Conversely, ZNF652 acts as a tumor suppressor by downregulating cyclin D3 in lung cancer cells [12]. Additionally, CBX4, another zinc finger protein, promotes LUAD cell proliferation while concurrently inhibiting metastasis [13].
A
Despite these findings, the comprehensive role of ZNFs in LUAD remains incompletely understood, and their utility as prognostic biomarkers or therapeutic targets has yet to be fully elucidated. It is hypothesized that ZNFs may regulate LUAD progression through distinct oncogenic or tumor-suppressive mechanisms, and that modulating their expression could provide novel therapeutic opportunities. Therefore, an in-depth investigation of ZNFs in LUAD is urgently needed. In this pilot study, transcriptomic and clinical data for LUAD were obtained from the TCGA database. ZNFs were retrieved from the UniProt database, and differentially expressed ZNFs (DE-ZNFs) were identified. TCGA data were used as the training set to construct a prognostic model, which was subsequently validated using an independent GEO test cohort. Finally, the model’s prognostic performance and its potential to identify candidate anti-tumor agents were evaluated, offering new insights into individualized treatment strategies for LUAD patients.
2 Methods and materials
2.1 Data acquisition
RNA sequencing data and corresponding clinical information for LUAD were obtained from The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/) using the TCGAbiolinks package [14], comprising 513 LUAD and 58 normal lung tissue samples. External validation cohorts were derived from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo), specifically from GSE50081 (127 LUAD samples) [15] and GSE26939 (116 LUAD samples)[16]. Additionally, two GEO datasets containing paired LUAD tumor and adjacent normal tissues, GSE32863 (58 paired samples)[17] and GSE75037(83 paired samples)[18], were used for external validation of FGD3 expression differences. The GEO datasets were normalized using the limma package [19]. Annotation GPL570, GPL9053 and GPL6884 were used for ID conversion. Genes with missing values were excluded, and for those with one-to-many mapping relationships, the mean expression value was retained. Genes belonging to the zinc finger protein family (Supplementary Table S1) were retrieved from the Universal Protein Resource (UniProt) database (https://www.uniprot.org/).
2.2 Identification of differentially expressed ZNFs genes in LUAD
Differential expression analysis of the TCGA count data from LUAD tumors and adjacent normal tissue samples was performed using the limma package in R software (version 4.4.0) [19]. Genes met the criteria of log2|fold change| values > 1 and adjusted p-values < 0.05 were selected. By intersecting these genes with the previously obtained ZNF gene set, differentially expressed ZNFs (DE-ZNFs) were identified. Heatmaps were generated using the pheatmap package, and volcano plots were constructed with the ggplot2 package.
2.3 Functional enrichment analysis
Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Set Enrichment Analysis (GSEA) were performed using the clusterProfiler package[20]. A p-value < 0.05 was set as the threshold for statistical significance. The results of the enrichment analyses were visualized using the ggplot2 and ggnewscale packages.
2.4 Risk prognostic model based on DE-ZNFs
Using TCGA as the training set, univariate Cox regression analysis was performed on DE-ZNFs with the survival package, using overall survival (OS) as the evaluation criterion. Genes significantly associated with OS (p < 0.05) were retained. Subsequently, least absolute shrinkage and selection operator (LASSO) regression was conducted using the glmnet and survival packages, and the results were visualized with the ggplot2 and forestplot packages. To avoid overfitting, multivariate Cox regression analysis was then applied to identify prognostic genes. The risk score for the model was calculated as follows
, where β represents the regression coefficient, E denotes the expression level of each prognostic gene, and i is the number of genes. LUAD patients were stratified into high- and low-risk groups based on the median risk score. Kaplan–Meier survival curves were generated, and log-rank tests were conducted to evaluate survival differences between groups. The model’s predictive performance was assessed using receiver operating characteristic (ROC) curve analysis, with the area under the curve (AUC) representing its discriminatory ability. To further validate the predictive power of the model, its performance was evaluated in two independent datasets, GSE50081 [15] and GSE26939 [16].
2.5 Immune cell infiltration analysis
CIBERSORT analysis was performed using the IOBR package to compare the relative abundances of 22 immune cell types between the high- and low-risk groups[21]. Single-sample gene set enrichment analysis (ssGSEA) was performed with the GSVA package to evaluate differences in 28 immune cell types[22]. Additionally, xCell analysis was conducted using the xCell package to quantify the abundances of 64 immune cell types[23]. Correlation heatmaps between the eight model genes and the 22 CIBERSORT-derived immune cell types were generated using the corrplot and ggcorrplot packages. The estimate package was applied to calculate stromal scores, immune scores, estimate scores (sum of stromal and immune scores), and tumor purity. Pearson’s product–moment correlation was used to assess associations between the risk score and stromal score, immune score, estimate score, and tumor purity.
2.6 Prediction of immunotherapy response
LUAD data were downloaded from the TIDE database (http://tide.dfci.harvard.edu/) for tumor immune dysfunction and exclusion (TIDE) analysis[24]. A higher TIDE score indicates a poorer response to immunotherapy. The immunophenoscore (IPS) for each patient was obtained from the Cancer Immunome Atlas (https://tcia.at/) and compared between different risk groups. Unlike the TIDE score, a higher IPS is positively associated with a better predicted response to immunotherapies targeting PD-L1 and CTLA-4[25]. Additionally, immunotherapy data from the urothelial carcinoma IMvigor210 dataset were used to predict responses to immunotherapy in each risk group[26].
2.7 Tumor mutation burden and chromosomal location of model genes
The tumor mutation burden (TMB) data for the TCGA cohort were downloaded from the cBioPortal database (https://www.cbioportal.org/) and calculated using the Maftools package[27]. Waterfall plots were generated to visualize the top 15 genes with the highest mutation frequency in the high- and low-risk groups. The ggpubr package was used to create violin and bar plots, and chi-square tests were applied for group comparisons. The rtracklayer, RCircos, and magrittr packages were employed to plot copy number variation (CNV) landscapes, enabling visualization of changes in DE-ZNFs model genes across human chromosomes.
2.8 Differential gene enrichment analysis between high- and low-risk groups
Differentially expressed genes (DEGs) between the two groups were identified using the DESeq2 package with the Wald test. Genes with an absolute log₂ (fold change) > 1 and p < 0.05 were selected. GO, KEGG, and GSEA of the DEGs were performed using the clusterProfiler package. Gene set variation analysis (GSVA) was conducted using the GSVA package[22].
2.9 Prediction of antitumor drug sensitivity
Drug sensitivity data were obtained from two major pharmacogenomic databases: the Genomics of Drug Sensitivity in Cancer (GDSC) and the Cancer Therapeutics Response Portal (CTRP), which contain large-scale screening results of numerous anti-cancer drugs across hundreds of cell lines. GDSC1, GDSC2 and CTRP2 drug response data were obtained from the GDSC website (https://www.cancerrxgene.org/) and the CTRP website (http://portals.broadinstitute.org/ctrp.v2.1/). The oncoPredict package was applied to predict the sensitivity of high- and low-risk LUAD patients, as defined by the prognostic model, to commonly used chemotherapeutic agents and molecularly targeted drugs[28]. The half-maximal inhibitory concentration (IC50) represents the drug concentration required to inhibit 50% of tumor cell viability. A lower IC50 value indicates greater drug sensitivity. IC50 values were expressed in micromolar (µM) units, and the natural logarithm of the IC50 was used for downstream drug response analyses.
2.10 Nomogram construction
To evaluate the association between clinical characteristics and risk grouping, univariate and multivariate Cox regression analyses were performed to determine whether the clinical variables and the risk score were independent prognostic factors for LUAD patients. Clinical data for LUAD patients in the TCGA cohort were obtained from the UCSC Xena platform (https://xenabrowser.net/datapages/). Fisher’s exact test was applied to compare clinical characteristics between high- and low-risk groups using the tableone package. A nomogram was then constructed to enhance the prognostic accuracy and predictive power of the model[29]. The risk score, T stage, and N stage were identified as independent prognostic variables to estimate the probability of overall survival (OS) at 1, 2, and 3 years. Calibration curves were generated to assess the agreement between predicted and observed outcomes, thereby evaluating the nomogram’s predictive performance.
2.11 Validation of prognostic gene expression
The expression levels of the prognostic genes were validated using the TCGA LUAD dataset. Bar plots were generated with the ggpubr and ggsci packages, and group comparisons were conducted using the Wilcoxon test. In addition, immunohistochemical (IHC) data for the prognostic genes in LUAD tissues were obtained from the Human Protein Atlas (HPA) (https://www.proteinatlas.org/). Receiver operating characteristic (ROC) curves for the prognostic model genes were constructed using the pROC package to evaluate their discriminatory performance.
2.12 Cell culture and clinical specimen collection
LUAD cell lines (H3122, H2228, H1650, H1993, and PC9) were purchased from the Cell Bank of the Chinese Academy of Sciences (Shanghai, China). Human bronchial epithelial (HBE) cells were kindly provided by the Heilongjiang Cancer Institute (Harbin, China). All cell lines were authenticated by short tandem repeat (STR) analysis prior to experimentation. H3122, H1650, H2228, and PC9 cells were cultured in RPMI 1640 medium (HyClone, USA) supplemented with 10% fetal bovine serum (Pan-Biotech, Germany) and 1% penicillin–streptomycin (HyClone, USA). H1993 cells were maintained in DMEM medium (HyClone, USA). All cells were incubated at 37°C in a humidified atmosphere containing 5% CO₂. A total of 12 paired fresh-frozen LUAD tumor tissues and adjacent normal tissues were collected from patients at Harbin Medical University Cancer Hospital between 2024 and 2025 for protein extraction. Additionally, 14 paired paraffin-embedded LUAD and adjacent normal tissue sections were obtained from the same institution between 2020 and 2022 for immunohistochemistry. None of the patients had received preoperative therapy.
A
2.13 Western blot analysis
A
Cells and tissues were harvested and lysed on ice for 30 min using lysis buffer (Boster, Wuhan, China; #AR0102-100) supplemented with 1% protease inhibitor cocktail (Boster, Wuhan, China #AR1192). The lysates were centrifuged at 12,000 rpm for 15 min at 4°C, and the supernatants were collected. Protein concentration was determined using a protein quantification kit (Beyotime, Shanghai, China; #P0012S). Equal amounts of protein (25 µg) were loaded per lane, separated by SDS–PAGE, and transferred to PVDF membranes. The membranes were blocked with 5% nonfat milk overnight at 4°C and incubated with primary antibodies for 24 h at 4°C. The following day, membranes were washed and incubated with HRP-conjugated secondary antibodies for 50 min at room temperature. Immunoreactivity was detected using an ECL kit (Thermo Fisher Scientific, Waltham, MA, USA). The primary antibodies were as follows: rabbit anti-FGD3 (Proteintech, Wuhan, China; #20347-1-AP; 1:1000) and mouse anti-GAPDH (Abcam, Cambridge, UK; #ab8245; 1:50,000).
2.14 IC50 assay
Lorlatinib was purchased from MedChemExpress (Monmouth Junction, NJ, USA; #HY-15728). A total of 5,000 cells were seeded into each well of 96-well plates and cultured in 100 µl RPMI 1640 medium supplemented with 10% fetal bovine serum (FBS) for 12 hours. After cell adhesion, the culture medium was replaced with 100 µl of complete medium containing varying concentrations of lorlatinib. Following 48 hours of drug treatment, the medium was removed, and 100 µl of fresh medium containing 10 µl of CCK-8 reagent (Seven, Shanghai, China; #SC11901) was added to each well. The plate was incubated for 1 hour at 37°C, and absorbance at 450 nm was measured using a multifunctional microplate reader. Dose-response curves were generated based on the absorbance values at each drug concentration, and the IC50 values for each cell group were calculated.
2.15 Lorlatinib-resistant cells
Parental H3122 cells were initially exposed to a low concentration of lorlatinib (0.04 µM) until they achieved full proliferation. The cells were then passaged and treated with a doubled concentration of lorlatinib. This process was repeated incrementally until the lorlatinib concentration reached 1 µM. The resulting cells exhibited an IC₅₀ value more than 50 times higher than that of the parental cells. This resistant cell line was designated as H3122 lorlatinib-resistant cells (H3122LR). H3122LR cells were subsequently expanded and continuously maintained in medium containing the same lorlatinib concentration during each passage.
2.16 Immunohistochemistry
Immunohistochemistry was performed using the high sensitive and rapid immunohistochemical kit (pH 9.0) (Elabscience, Wuhan, China; #E-IR-R220). Briefly, paraffin-embedded sections were baked at 65°C for 60 minutes, then placed in antigen retrieval and deparaffinization solution and microwaved at low power for 30 minutes, followed by natural cooling to room temperature. After washing, sections were incubated with peroxidase blocking buffer for 15 minutes, washed again, and incubated overnight at 4°C with primary antibody against FGD3 (Proteintech, Wuhan, China; #20347-1-AP; 1:100). On the following day, sections were incubated with secondary antibody at room temperature for 30 minutes, followed by DAB staining for 2 minutes and hematoxylin counterstaining for 30 seconds. The sections were then dehydrated through graded ethanol series, cleared with xylene, mounted with neutral resin, and air-dried. FGD3 immunostaining was independently evaluated by two investigators blinded to patient information. The percentage of positive cells was scored as 0 (0%), 1 (1–25%), 2 (26–50%), 3 (51–75%), and 4 (76–100%). Staining intensity was scored as 0 (negative), 1 (light yellow), 2 (dark yellow), and 3 (brown). The final immunoreactive score (IRS) was calculated by summing the percentage and intensity scores, ranging from 0 to 7.
2.17 Plasmid transfection and FGD3 overexpression
FGD3 was overexpressed in H3122LR cells either transiently or stably. For transient overexpression, cells at ~ 70% confluence were transfected with an FGD3 plasmid or empty vector using Lipofectamine 3000 (Invitrogen, USA). After 6 h, the medium was replaced, and cells were cultured for 24–48 h before downstream assays. For stable overexpression, cells were transfected as above and selected with puromycin until resistant colonies were established. Transfection efficiency and FGD3 expression were confirmed by Western blot. Plasmid sequences are provided in Supplementary Table S2.
2.18 Colony formation assay
H3122LR cells transfected with an FGD3 overexpression plasmid or empty vector were seeded into 6-well plates and cultured in complete medium. After 10 days, colonies were fixed with 4% paraformaldehyde, stained with crystal violet, and counted. Experiments were performed in triplicate.
2.19 CCK-8 assay
Cell viability was assessed using the Cell Counting Kit-8 (CCK-8, Sevenbio, Beijing, China) according to the manufacturer’s instructions. Cells were seeded in 96-well plates and cultured for 1–4 days. CCK-8 solution was then added to each well at a 1:10 dilution and incubated at 37°C for 1 h. Absorbance at 450 nm was measured using a microplate reader (Multiskan FC, Thermo, China).
2.20 Wound-healing assay
H3122LR cells transfected with an FGD3 overexpression plasmid or empty vector were seeded in 6-well plates and cultured for 24 h. A scratch was made through the center of each well using a 10 µL pipette tip, and serum-free medium was added. Wound closure was quantified as the percentage of wound area reduction over the healing period. All experiments were performed in triplicate.
2.21 Cell migration and invasion assays
For the transwell assay, 105 cells were seeded in the upper chamber (8-µm pore size, Corning, USA) in serum-free medium, either uncoated (migration) or coated with Matrigel (BD Biosciences, invasion). The lower chamber contained 600 µL of medium with 10% FBS. After 24 h (migration) or 48 h (invasion), cells on the lower surface of the membrane were fixed with 4% paraformaldehyde and stained with crystal violet. Cell numbers were quantified as the average number of cells per field of view.
2.22 Statistical analysis
For variables with normal distribution between two groups, unpaired Student’s t-tests were used to assess statistical significance. When data were not normally distributed, Mann–Whitney U tests were applied. For categorical clinical variables with small sample sizes, Fisher’s exact test was used to evaluate group differences. Pearson’s or Spearman’s correlation analyses were performed to examine linear relationships between variables. Kaplan–Meier survival analysis was conducted to compare overall survival between high- and low-risk groups, with differences assessed by the log-rank test. Receiver operating characteristic (ROC) curves and area under the curve (AUC) values were used to evaluate the predictive performance of the risk score. All analyses were performed using R software (version 4.4.0) and GraphPad Prism (version 9.5). Data are presented as means, with statistical significance defined as p < 0.05.
3 Results
3.1 Acquisition of DE-ZNFs in LUAD and functional enrichment analysis
The workflow of this study was illustrated in Fig. 1. A total of 19,934 RNA-seq genes and clinical information for 513 LUAD patients and 58 control samples were downloaded. Among them, 1,834 ZNFs genes were obtained. After differential analysis, 270 DE-ZNFs were identified, including 83 downregulated and 187 upregulated genes (Fig. 2A, B). To evaluate the potential functions of DE-ZNFs in LUAD, GO enrichment analysis was performed. Sixteen biological process (BP) pathways, 5 cellular component (CC) pathways, and 23 molecular function (MF) pathways were identified, with the top five displayed. DE-ZNFs were mainly involved in the protein polyubiquitination process (Fig. 2C, D). KEGG enrichment analysis revealed a total of 111 enriched pathways, including key biological processes such as ubiquitin-mediated proteolysis, lysine degradation, endocytosis, and chromatin remodeling (Fig. 2E). Notable signaling pathways identified by GSEA and KEGG enrichment included TNFA signaling via NF-kappa B, KRAS signaling, and the E2F transcription factor pathway (Fig. 2E, F).
Fig. 1
Flowchart showing the steps of this study
Click here to Correct
Fig. 2
Differentially expressed ZNFs and functional analysis. A, B Heat map and volcano plot of ZNF related differentially expressed genes, showing 187 upregulated and 83 downregulated ZNF genes. C Top 5 GO terms, BP, CC, and MF enrichment results of DE-ZNFs. D The cnetplot showed the BP enrichment results. E Top 10 enriched pathways involving DE-ZNFs based on KEGG enrichment. F Pathways involving DE-ZNFs according to GSEA analysis
Click here to Correct
3.2 Development and validation of the prognostic model for LUAD
Univariate Cox regression analysis identified 63 DE-ZNFs significantly associated with OS in the TCGA training set (Fig. 3A). After excluding samples with missing values and OS = 0, 500 cancer samples remained. LASSO regression was then applied to further select 19 DE-ZNFs (Fig. 3B, C). Finally, multivariate Cox regression was used to establish the predictive model and identify prognostic-related genes: TRIM6, TRIM29, CTCFL, FGD3, GATA4, CASZ1, TRAF2, and ZNF322 (Fig. 3D). The risk score was calculated as follows
LUAD samples were then divided into high- and low-risk groups based on the median risk score. Risk curves and scatter plots illustrated that the high-risk group exhibited higher risk scores and mortality rates compared to the low-risk group (Fig. 3E, F). The heatmap showed that TRIM29, GATA4, TRIM6, CTCFL, and TRAF2 were highly expressed in the high-risk group, whereas FGD3, CASZ1, and ZNF322 were highly expressed in the low-risk group (Fig. 3G). Kaplan–Meier analysis demonstrated that patients in the high-risk group had significantly poorer OS than those in the low-risk group (Fig. 4A). Time-dependent ROC analysis yielded AUC values of 0.756, 0.676, and 0.683 for 1-, 2-, and 3-year survival, respectively (Fig. 4B). Two independent GEO validation datasets also confirmed shorter OS in the high-risk groups (Fig. 4C, D). For the GSE50081 validation dataset, 1-, 2-, and 3-year AUC values were 0.714, 0.735, and 0.758, respectively; for GSE26939, the corresponding AUCs were 0.721, 0.792, and 0.792 (Fig. 4E, F).
Fig. 3
Development of the prognostic model based on OS-relevant DE-ZNFs. A Selection of 63 ZNFs genes by univariate Cox regression analysis. B, C 19 ZNFs genes underwent parameter selection via LASSO regression. D Multivariate Cox regression analysis derived forest plot showing that 8 ZNFs genes were as the most prominent parameters to construct the best prognostic model. Forest plot and hazard ratio of each of the genes in the model. E Distribution of the patients’ risk score. F Patients’ survival time along with their risk scores. G The heatplot showing expressions of the 8 model genes in the high and low-risk groups
Click here to Correct
Fig. 4
Validation of the prognostic model on internal and external LUAD cohorts. A, B OS survival curves and ROC curves for low- and high-risk subgroups in LUAD-TCGA cohort. C-F OS survival curves and ROC curves in GSE50081 and GSE26939 cohort
Click here to Correct
3.3 The prognostic model serves as a dependable indicator for tumor immunity.
According to the results from CIBERSORT, ssGSEA, and xCell analyses, the low-risk group exhibited a higher proportion of immune cell infiltration (Fig. 5), suggesting stronger tumor immunity compared with the high-risk LUAD patients. The prognostic-related gene FGD3 showed positive correlations with the infiltration of memory B cells, CD8 + T cells, and M1 macrophages, while ZNF322 and CASZ1 were associated with increased resting CD4 memory T cells (Fig. 6A). Patients in the high-risk group had lower immune scores, lower stromal scores, and higher tumor purity based on the ESTIMATE algorithm (Fig. 6B). The heatmap demonstrated that FGD3 was significantly positively correlated with immune and stromal scores, but negatively correlated with tumor purity. In contrast, CASZ1 and TRAF2 exhibited opposite correlation patterns compared to FGD3 (Fig. 6C). This prognostic model represents an innovative and effective biomarker for selecting anti-tumor immunotherapies. The TIDE score indicated that the high-risk group exhibited poorer immunotherapy efficacy (Fig. 6D). Validation using the IMvigor210 dataset confirmed that patients in the high-risk group had worse survival outcomes (Fig. 6E). To explore the relationship between risk score and immune checkpoint blockade (ICB) biomarkers, we found that expression levels of PD-L1, PD-L2, and CTLA4 were lower in the high-risk group, suggesting a reduced response to therapies targeting these molecules (Fig. 6F). CD80 and CD86, key ligands for T cell activation, were also downregulated in tumor cells from the high-risk group, which may contribute to immune escape (Fig. 6F). Furthermore, based on the IPS score, the low-risk group might derive greater benefit from anti-CTLA4 therapy (Fig. 6G). This comprehensive analysis elucidates the complex interactions between zinc finger proteins and immune therapy, confirming the prognostic model related to ZNFs as a valuable tool for guiding immunotherapy in LUAD.
Fig. 5
Tumor-immune microenvironment analysis of the high- and low-risk groups. A Difference between 22 tumor-infiltrating immune cells by cibersort. The blue bar indicates the low-risk group and the red bar the high-risk group. B Relative infiltration levels of 28 immune cell subsets between the low- and high-risk groups in the TCGA-LUAD cohort. C XCell Boxplot illustrating the relative abundance of infiltrating immune cell types in patients belonging to high and low-risk groups. Statistical significance: *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, ns: not significant
Click here to Correct
Fig. 6
Predictive value of ZNF prognostic model for immunotherapy. A Difference between 22 tumor-infiltrating immune cells and 8 model genes. B Correlation analysis between risk score and estimate results based on the estimate algorithm. C Correlation heatmap between estimate results and 8 model genes. D TIDE score between the high- and low-risk group. E Immunotherapy response between high- risk and low-risk group in IMvigor210 dataset. F Expression of ICB genes in the high- and low-risk group. G Comparison of IPS between different risk groups in LUAD
Click here to Correct
3.4 Risk score performance in TMB, enrichment and drug selection
To investigate the molecular consequences of genetic differences between the high- and low-risk groups and to better understand the biological processes associated with poor survival in the high-risk group, we analyzed the genomic variability of the ZNFs model within the TCGA LUAD cohort. Oncoprint visualization revealed that somatic mutation frequency was higher in high-risk patients (95.92%) compared to the low-risk group (88.21%). The top 15 mutated genes were identified, with missense mutations and multi-hit mutations being the most common types. Notably, mutation rates of TP53, TTN, MUC16, and KRAS exceeded expectations and were significantly elevated in the high-risk group relative to the low-risk group (Fig. 7A, B). Consistently, tumor mutation burden (TMB) was significantly higher in the high-risk group (Fig. 7C, D). However, stratification of patients based on the median TMB showed that those with higher TMB had longer overall survival (Figure S1). The chromosomal locations of the 8 prognosis-related ZNFs genes are displayed (Fig. 7E). GO functional enrichment analysis of DE-ZNFs between risk groups indicated predominant localization in the collagen-containing extracellular matrix, cornified envelope, and axon terminus. Molecular functions enriched in these genes included hormone activity, bile acid binding, and heparin binding (Fig. 8A). After ID conversion, 311 downregulated and 554 upregulated genes were retained for further analysis. KEGG pathway analysis revealed significant differences between risk groups in pathways such as chemical carcinogenesis–DNA adducts, neuroactive ligand–receptor interaction, steroid hormone biosynthesis, and drug metabolism via cytochrome P450 (Fig. 8B). GSEA identified 229 significantly enriched pathways (p < 0.05), with the top 10 pathways predominantly enriched in the high-risk group. These pathways were associated with cell division, cell cycle regulation, DNA replication, transcription and translation, chromosome maintenance, DNA methylation, and keratinization (Fig. 8C), potentially explaining the shorter overall survival observed in the high-risk group. GSVA enrichment analysis further demonstrated that poor prognosis in the high-risk group was associated with activation of the MYC signaling pathway, glycolysis, mTOR signaling, DNA repair, E2F transcription factors, unfolded protein response, and oxidative phosphorylation (Fig. 8D). These findings suggest potential therapeutic targets for LUAD patients involving ZNFs. To identify potential chemotherapeutic and targeted agents for high-risk LUAD patients, we performed drug sensitivity prediction using available pharmacogenomic datasets. The high-risk group exhibited potential resistance to carboplatin (Fig. 9A), gefitinib (Fig. 9B), and crizotinib (Fig. 9C), whereas increased sensitivity was predicted for paclitaxel (Fig. 9D), docetaxel (Fig. 9E), erlotinib (Fig. 9F), vinorelbine (Fig. 9G), and trametinib (Fig. 9H).
Fig. 7
Mutation frequency measures of prognostic groups. A Oncoprint visualization of the mutation atlas in high-risk groups across LUAD samples from TCGA cohort. B Oncoprint visualization of the mutation atlas in low-risk groups across LUAD samples from TCGA cohort. C, D The violin plot and the proportional bar chart illustrated that the TMB was higher in the high-risk group. E The location of CNV alterations of DE-ZNFs model genes on chromosomes
Click here to Correct
Fig. 8
Functional analysis of DE-ZNFs genes between high- and low-risk groups. A Top 5 GO terms, BP, CC, and MF enrichment results of DE-ZNFs between high- and low-risk groups. B Top 10 enriched KEGG pathways of DE-ZNFs genes between high- and low-risk groups. C GSEA pathways of DE-ZNFs genes between high- and low-risk groups. D GSVA enrichment of DE-ZNFs genes between high- and low-risk groups
Click here to Correct
Fig. 9
Prediction of anti-tumor drugs sensitivity using the ZNF predictive model. A-H The patients in high-risk group were less sensitive to Carboplatin (A), Gefitinib (B), and Crizotinib (C), while the low-risk patients probably benefitted more from Paclitaxel (D), Docetaxel (E), Erlotinib (F), Vinorelbine (G), and Trametinib (H). Statistical significance: *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, ns: not significant
Click here to Correct
3.5 Nomogram construction
Clinical information for the high- and low-risk groups was downloaded, and inter-group comparisons were performed using Fisher’s exact test, as summarized in Table 1. The high-risk group exhibited a higher proportion of male patients, advanced T stage, increased lymph node metastasis, and higher overall stage. Univariate and multivariate Cox regression analyses were conducted on clinicopathological factors—including gender, age, stage, TNM classification, and risk score—to identify independent prognostic factors for LUAD. Variables with p-values less than 0.05 in both analyses were deemed independent risk factors; accordingly, T stage, N stage, and risk score were selected (Fig. 10A, B). These independent prognostic factors were then integrated to construct a nomogram, facilitating the prediction of 1-, 2-, and 3-year overall survival (OS) probabilities in the TCGA-LUAD cohort. Due to the limited sample size in the T4 and N3 categories, singular fit issues arose during Cox regression modeling; hence, T3 and T4 were combined, as were N2 and N3 groups (Fig. 10C). Calibration plots for 1-, 2-, and 3-year OS demonstrated good agreement between predicted and observed survival outcomes (Fig. 10D).
Table 1
Baseline clinical information table for high- and low-risk groups
 
Overall (n = 500)
High (n = 250)
Low (n = 250)
P value
Gender (%)
    
Female
271 (54.2)
114 (45.6)
157 (62.8)
< 0.001
Male
229 (45.8)
136 (54.4)
93 (37.2)
 
Age(mean)
   
0.016
 
65.24 (10.07)
64.14 (10.87)
66.33 (9.10)
 
T (%)
   
< 0.001
T1
167 (33.6)
65 (26.2)
102 (41.0)
 
T2
269 (54.1)
135 (54.4)
134 (53.8)
 
T3
43 (8.7)
34 (13.7)
9 (3.6)
 
T4
18 (3.6)
14 (5.6)
4 (1.6)
 
N (%)
   
0.007
N0
323 (66.1)
147 (59.3)
176 (73.0)
 
N1
95 (19.4)
55 (22.2)
40 (16.6)
 
N2
69 (14.1)
44 (17.7)
25 (10.4)
 
N3
2 (0.4)
2 (0.8)
0 (0.0)
 
M (%)
   
0.375
M0
334 (93.3)
170 (91.9)
164 (94.8)
 
M1
24 (6.7)
15 (8.1)
9 (5.2)
 
Stage (%)
   
< 0.001
Stage I
269 (54.6)
107 (43.5)
162 (65.6)
 
Stage II
119 (24.1)
69 (28.0)
50 (20.2)
 
Stage III
80 (16.2)
54 (22.0)
26 (10.5)
 
Stage IV
25 (5.1)
16 (6.5)
9 (3.6)
 
Fig. 10
Evaluation of prognostic signature on predicting the OS in LUAD patients. A Univariate Cox independent prognostic analysis of stage, age, gender, grade, race, T stage, M stage, N stage, and risk score. B Multivariate Cox independent prognostic analysis of stage, age, gender, grade, race, T stage, M stage, N stage, and risk score. C The nomogram based on T stage, N stage and risk score. D Calibration curves of the nomogram and observed 1-, 2-, and 3-year survival of LUAD patients. The dashed line represents the ideal performance, and the actual performance of the signature is represented by the black lines
Click here to Correct
3.6 Expression and survival analysis of eight prognostic genes
Our study validated the expression differences of the eight ZNF prognostic model genes between patients and normal samples in the TCGA-LUAD database. We validated the differential expression of the eight ZNF prognostic model genes between LUAD patients and normal samples in the TCGA-LUAD dataset. The results showed that TRIM6, GATA4, TRAF2, and ZNF322 were significantly upregulated in LUAD patients, whereas TRIM29, FGD3, and CASZ1 were downregulated (Fig. 11). Protein expression levels of these eight prognostic genes in LUAD and normal tissues were further analyzed using data from the Human Protein Atlas (HPA) database. The protein expression patterns of GATA4, TRAF2, and ZNF322 were consistent with their RNA expression trends, showing elevated levels in LUAD patients. Similarly, FGD3 and CASZ1 exhibited lower protein expression in patients, consistent with RNA data. No significant differences were observed between tumor and normal tissues for CTCFL at either the protein or RNA level. Notably, TRIM6 protein was expressed at low levels in both tumor and normal tissues, and TRIM29 protein expression showed an inverse pattern relative to its RNA expression (Fig. 12). Kaplan–Meier survival analysis demonstrated that higher expression of TRIM6, TRIM29, and TRAF2, along with lower expression of FGD3, CASZ1, and ZNF322, were associated with poorer overall survival in LUAD patients. Expression levels of the remaining two genes did not show significant prognostic relevance (Fig. S2).
Fig. 11
Validation of expressions of risk model genes in TCGA dataset. Statistical significance: *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, ns: not significant
Click here to Correct
Fig. 12
Expression levels of the risk related model genes were evaluated in HPA database
Click here to Correct
3.7 Loss of FGD3 Contributes to LUAD Cell Proliferation and Drug Resistance
Among the eight prognostic-related genes, FGD3 exhibited the strongest correlation with ESTIMATE scores and a relatively high correlation coefficient within the prognostic model. Consequently, we prioritized FGD3 for downstream validation. Paired RNA-seq data from TCGA showed significantly lower FGD3 expression in LUAD tissues compared to normal samples (Fig. S3A), and consistent results were observed in two paired GEO datasets (Fig. S5). Protein expression of FGD3 was further validated in LUAD cell lines and normal bronchial epithelial (HBE) cells. Except for PC9, FGD3 protein levels in H3122, H2228, H1650, and H1993 cells were significantly lower than those in HBE cells (Fig. 13A). A similar expression pattern was observed in matched tumor and adjacent normal tissues from LUAD patients (Fig. 13B). The immunohistochemistry results showed that compared to LUAD tissues, FGD3 expression was significantly higher in normal tissues. Cellular localization revealed that FGD3 was expressed in both the cytoplasm and nucleus, with a greater expression in the cytoplasm. FGD3 was more highly expressed in the epithelial cells of the trachea and bronchi (Fig. 14). GO, KEGG, and GSEA enrichment analyses revealed that genes differentially expressed with FGD3 were mainly enriched in cilium movement and cell adhesion pathways in LUAD (Fig. S3B–D). Drug sensitivity prediction indicated that the low FGD3 expression group was less sensitive to the first-generation ALK inhibitor crizotinib (Fig. 13C). In lorlatinib-resistant H3122 cells (H3122LR) (Fig. S4), FGD3 expression was decreased compared to parental cells (Fig. 13A). Moreover, FGD3 expression showed a dose- and time-dependent decline in ALK-positive cell lines H3122 and H2228 with prolonged lorlatinib treatment and increasing drug concentration (Fig. 13D). The FGD3 plasmid was transfected into H3122LR cells, and transfection efficiency was confirmed by Western blot (Fig. 15A). Colony formation assays showed that FGD3 overexpression markedly reduced the number of colonies formed by H3122LR cells (Fig. 15B). Consistently, cell viability was significantly decreased upon FGD3 overexpression (Fig. 15C). Furthermore, transwell assays demonstrated that upregulation of FGD3 substantially impaired both migration and invasion capabilities (Fig. 15D). Wound-healing assays further indicated that FGD3 overexpression attenuated the cells’ wound closure ability (Fig. 15E). FGD3 overexpression can partially reverse lorlatinib resistance in H3122LR cells (Fig. 15F).
Fig. 13
FGD3 is down-regulated in LUAD and possibly involved in drug sensitivity. A The expression of FGD3 in LUAD cell lines and normal bronchial epithelial cells, revealed by western blot. B The protein level of FGD3 in both cancerous and adjacent tissues of LUAD patients. T, tumor; N, paired para-cancerous normal tissues. C Drug sensitivity prediction suggested that the low FGD3 expression group was less sensitive to crizotinib. Significance: *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, ns: not significant. D The changes of FGD3 under different treatment time and concentration of lorlatinib in ALK-positive cell lines H3122 and H2228
Click here to Correct
Fig. 14
The immunohistochemistry results showed that compared to LUAD tissues, FGD3 expression was significantly higher in normal tissues. Cellular localization revealed that FGD3 was expressed in both the cytoplasm and nucleus, with a greater expression in the cytoplasm. FGD3 was more highly expressed in the epithelial cells of the trachea and bronchi. A IHC images of FGD3 in LUAD and adjacent tissues (showing 4 cases). B The IRS of FGD3 in both cancerous and adjacent tissues of LUAD patients (n = 14). Statistical significance: *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, ns: not significant
Click here to Correct
Fig. 15
Functional validation of FGD3 in H3122LR cells. A Expression of FGD3 protein in H3122LR cells after transfection with p-FGD3. B Colony formation of H3122LR cells after p-FGD3 transfection. C Cell viability of H3122LR cells after p-FGD3 transfection. D Migration and invasion of H3122LR cells after p-FGD3 transfection. E Wound-healing assay of H3122LR cells after p-FGD3 transfection. F IC50 of lorlatinib in H3122LR cells after p-FGD3 transfection. Statistical significance: *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, ns: not significant
Click here to Correct
4 Discussion
ZNFs constitute the largest family of transcription factors in mammalian cells. They are involved in the initiation, progression, and metastasis of malignant tumors, as well as in drug resistance, by regulating transcriptional and translational processes [30], highlighting their potential as novel therapeutic targets. ZNFs have been described as a double-edged sword, exhibiting both oncogenic and tumor-suppressive functions, and serving as reliable prognostic biomarkers [31]. Numerous clinicopathological datasets, prognostic data, and ZNF-based cancer prognostic models have been developed for various cancers, including pancreatic cancer, soft tissue sarcoma, colon adenocarcinoma, esophageal cancer, breast cancer, and osteosarcoma [32–38]. Many prognostic models are currently utilized in clinical practice; for example, the Sokal, Euro, EUTOS, and ELTS scoring systems are employed to evaluate prognosis in chronic myeloid leukemia (CML) [39]. However, despite LUAD being the most prevalent malignant tumor in China [40], no prognostic model based on zinc finger proteins has yet been established for this disease.
In this study, we identified 270 DE-ZNFs between LUAD and normal tissues. Through univariate Cox regression analysis, LASSO and Multiple Cox regression analysis we developed a prognostic model containing eight core genes for OS, which demonstrated the ability to predict patient prognosis effectively. This model was rigorously validated by GEO data through K-M survival plots and time ROC curves. Subsequently, we extended our investigation to assess immunotherapy responses between the high- and low-risk groups. Immune infiltration analysis revealed that the high-risk group exhibited an immune “desert” phenotype, characterized by relatively low levels of immune cells such as activated B cells and natural killer cells within the tumor macroenvironment. Dendritic cells, key players in antigen presentation and immune activation [41, 42], were also relatively scarce. Furthermore, CD4 + memory T cells, which play a critical role in tumor immune responses by maintaining long-term immune memory and enhancing anti-tumor activity [43], were diminished in the high-risk group. The deficiency of CD4 + memory T cells may reflect tumor immune escape mechanisms that suppress immune surveillance. Consistently, ESTIMATE analysis, TIDE scores, and IMvigor210 data confirmed that the high-risk group had lower immune scores, higher tumor purity, and poorer responses to immunotherapy. Moreover, expression levels of PD-L1, PD-L2, and CTLA-4 were reduced in the high-risk group, potentially impairing T cell recognition and indicating a poor response to immune checkpoint inhibitors [44–46]. IPS score predictions aligned with these findings, suggesting that the high-risk group may be less responsive to anti-CTLA-4 therapy. Additionally, CD80 and CD86, essential co-stimulatory molecules on dendritic cells and B cells that facilitate T cell activation [47], were expressed at lower levels in the high-risk group. This suggests impaired antigen-presenting cell function and insufficient immune activation, which may contribute to immune escape.
To better understand the molecular characteristics between different groups, TMB analysis was carried out. The progressive accumulation of somatic mutations throughout life may contribute to carcinogenesis [48]. The overall mutation rate was higher in the high-risk group compared to the low-risk group (95.92% vs. 88.21%), suggesting greater tumor heterogeneity in the high-risk group.
TP53, a classic tumor suppressor which is crucial for a variety of cellular processes, including DNA repair, cell cycle arrest, and apoptosis [49], is the most commonly mutated in human cancers, including NSCLC. Co-mutations of TP53 and other driver genes further contribute to the poor prognosis of lung cancer [50]. Besides, KRAS’s a proto-oncogene, KRAS mutations are particularly prevalent in 30% LUAD [51]. TTN is also frequently mutated in a variety of cancers, overexpression of TTN-AS1 correlates with poor prognosis in lung cancer [52]. MUC16 mutation is reported to decrease the immune response in LUAD [53]. Our results showed higher mutation frequencies of TP53 (55% vs. 48%), TTN (56% vs. 41%), KRAS (33% vs. 28%), and MUC16 (46% vs. 38%) in the high-risk group compared to the low-risk group, which may partly explain the poorer prognosis of the high-risk patients. Previous studies reported higher TMB results in more neo-antigens, increasing chances for T-cell recognition, and clinically correlates with better immune checkpoint inhibitors outcomes [27]. However, our findings showed that the high-risk group had higher TMB but exhibited poorer response to immunotherapy. Additionally, there was no significant difference in overall survival between patients with high and low TMB. This aligns with literature suggesting that TMB alone is an imperfect biomarker for immunotherapy response, and composite predictors incorporating other factors such as MHC and T-cell receptor repertoire are needed [6].
We observed that differentially expressed genes in LUAD were enriched in pathways related to cellular metabolism, proliferation, genomic stability, and regulation of gene expression. These aberrant molecular processes likely cooperate to accelerate tumorigenesis and progression in the high-risk group. GSVA analysis revealed enrichment of MYC and mTOR signaling pathways in high-risk LUAD, suggesting their potential as therapeutic targets. Furthermore, enrichment of DNA damage repair, E2F transcription factor, and unfolded protein response pathways indicated that tumor cells in the high-risk group may exhibit enhanced DNA repair capacity, sustained proliferative signaling [54] and disrupted endoplasmic reticulum proteostasis [55], thereby contributing to increased treatment resistance and tumor survival. Additionally, the risk assessment model could serve as a reliable tool to guide clinical treatment selection. For instance, patients in the high-risk group may be resistant to carboplatin, gefitinib, and crizotinib but more sensitive to paclitaxel, docetaxel, erlotinib, vinorelbine, and trametinib. Moreover, we found that the risk score significantly correlated with clinicopathological parameters such as T stage and N stage, which supported the development of a predictive nomogram with high accuracy for overall survival.
Of the eight genes comprising our prognostic signature, TRIM6, GATA4, TRAF2, and ZNF322 were highly expressed in LUAD samples, whereas TRIM29, FGD3, and CASZ1 showed lower expression. Similar protein-level expression patterns were confirmed using the HPA database. Kaplan–Meier survival analysis suggested that TRAF2, TRIM29, and TRIM6 may promote carcinogenesis in LUAD, while CASZ1, FGD3, and ZNF322 likely act as tumor suppressors. As reported in previous literature, TRAF2 is an oncogenic protein involved in TNF-mediated apoptosis in NSCLC via NFκB [56]. Accumulating evidence suggests that TRIM29 is also a transcription factor involved in NSCLC carcinogenesisvia and confirmed to be highly expressed in anlotinib-resistant NSCLC cells [57]. Previous studies have proposed that TRIM6 acts as an E3-ubiquitin ligase educes ferroptosis and chemosensitivity in lung cancer [58]. Our data on ZNF322 contrast with previous reports: while prior studies describe ZNF322 as an oncogenic transcription factor promoting neo-angiogenesis and tumorigenesis in lung cancer [59–62], our findings suggest a different role. The role of CASZ1 varies across different types of tumors, and its role in lung cancer is also controversial. According to TCGA data, CASZ1 is expressed at low levels in LUAD samples. However, a study has reported that CASZ1 promotes migration, invasion, and metastasis in lung cancer [63]. GATA4 has been shown to function as an essential tumor suppressor in lung cancer both in vitro and in vivo [64]. FGD3 regulates the actin cytoskeleton and cell morphology and inhibits cancer cell migration, correlating with favorable prognosis in breast and pancreatic cancers [65, 66]. Here, we validated for the first time that FGD3 protein expression is significantly reduced in LUAD tissues and cell lines compared to normal controls. Functional assays further demonstrated that FGD3 overexpression can partially reverse lorlatinib resistance in H3122LR cells, indicating a potential role of FGD3 in modulating drug sensitivity. GO enrichment analysis suggested that FGD3 is involved in cilium movement, which may explain its higher expression in normal epithelial cells of the trachea and bronchi. KEGG pathway analysis revealed that FGD3 is associated with cell adhesion-related pathways in LUAD, consistent with the phenotypic changes observed during lorlatinib resistance, such as increased membrane protrusions and cytoskeletal remodeling. Previous studies have also shown that FGD3 facilitates actin cytoskeleton remodeling and filopodia formation. Collectively, these findings suggest that the downregulation of FGD3 contributes to the acquisition of lorlatinib resistance, and restoring FGD3 expression may partially counteract this resistant phenotype.
In summary, we constructed a robust prognostic model for LUAD based on eight ZNF genes, which not only provides a novel survival predictor but also offers insights into the potential biological roles of ZNF family members in tumor progression and therapy response. The signature may serve as a clinically useful biomarker for guiding individualized treatment decisions. Moreover, we are the first to characterize FGD3 in LUAD, revealing its downregulation in tumor tissues and lorlatinib-resistant cells. Functional assays indicate that FGD3 overexpression can partially reverse drug resistance, suggesting a potential role in modulating cytoskeletal dynamics and cell adhesion, consistent with our enrichment analyses. These findings highlight FGD3 as a potential predictive biomarker and therapeutic target in LUAD. Nevertheless, our study has limitations. The prognostic signature was validated only in public cohorts, and its generalizability should be confirmed in prospective clinical samples. In addition, further investigation is required to clarify whether FGD3 can serve as an independent prognostic predictor and to elucidate its precise molecular mechanisms in LUAD progression.
5 Conclusion
This study presents a novel ZNF-based prognostic model for LUAD and underscores FGD3 as a promising biomarker and therapeutic target to counteract drug resistance.
Abbreviations
The following abbreviations are used in this manuscript:
NSCLC
Non-small cell lung cancer
SCLC
Small cell lung cancer
LUAD
Lung adenocarcinoma
ICI
Immune checkpoint inhibitor
ZNF
Zinc finger protein
DE-ZNF
Differentially expressed ZNF
TCGA
The cancer genome atlas
GEO
Gene expression omnibus
GO
Gene ontology
KEGG
Kyoto encyclopedia of genes and genomes
GSEA
Gene set enrichment analysis
OS
Overall survival
LASSO
Least absolute shrinkage and selection operator
ssGSEA
Single-sample gene set enrichment analysis
TIDE
Tumor immune dysfunction and exclusion
IPS
Immunophenoscore
TMB
Tumor mutation burden
CNV
Copy number variation
GSVA
Gene set variation analysis
GDSC
Genomics of drug sensitivity in cancer
CTRP
Cancer therapeutics response portal
IC50
Half maximal inhibitory concentration
HPA
Human protein atlas
STR
Short-tandem repeat
H3122LR
H3122 Lorlatinib-resistant cells
BP
Biological process
CC
Cellular component
MF
Molecular function
Supplementary Information
A
Acknowledgement
We gratefully acknowledge the Cancer Genome Atlas (TCGA) Database and the Gene Expression Omnibus (GEO) for providing extensive data. The flowchart graphic was generated using smart.servier.com.
A
Author Contribution
Conceptualization, Jiayue Sun, Xiaoyi Huang, Dinglong Xue, Jiazhuang Li, Yaru Huang, Yue Yang and Qingwei Meng; Data curation, Jiayue Sun, Xiaoyi Huang; Formal analysis, Jiayue Sun and Yue Yang; Funding acquisition, Qingwei Meng; Methodology, Jiayue Sun, Dinglong Xue, and Jiazhuang Li; Project administration, Jiayue Sun and Yaru Huang; Resources, Jiayue Sun; Software, Jiayue Sun; Supervision, Xiaoyi Huang and Qingwei Meng; Validation, Yue Yang; Writing – original draft,Jiayue Sun, Xiaoyi Huang, Dinglong Xue, Jiazhuang Li, Yaru Huang, Yue Yang; Writing – review & editing, Jiayue Sun, Xiaoyi Huang, Dinglong Xue, Jiazhuang Li, Yaru Huang, Yue Yang and Qingwei Meng. All authors have read and agreed to the published version of the manuscript.
A
Funding
This research was supported by the National Natural Science Foundation of China (Grant No. 81672931), the Climbing Program of Harbin Medical University Cancer Hospital (PDYS2024-01), and the Key Research and Development Program of Heilongjiang Province (2022ZX06C11).
A
A
A
A
Data Availability
All datasets analyzed in this study are publicly available in online repositories. RNA sequencing data and corresponding clinical information for LUAD were obtained from TCGA (https://portal.gdc.cancer.gov/) and the UCSC Xena platform (https://xenabrowser.net/datapages/). External validation cohorts were retrieved from GEO (https://www.ncbi.nlm.nih.gov/geo) under accession numbers GSE50081, GSE26939, GSE32863, and GSE75037. Zinc finger protein family genes were obtained from UniProt (https://www.uniprot.org/), and the complete list is provided in Supplementary Table S1.For immune-related analyses, LUAD data were downloaded from the TIDE database (http://tide.dfci.harvard.edu/) for TIDE analysis, and IPS for each patient were obtained from TCIA (https://tcia.at/). TMB data for the TCGA cohort were acquired from cBioPortal (https://www.cbioportal.org/). Immunohistochemical data for prognostic genes in LUAD tissues were retrieved from the HPA (https://www.proteinatlas.org/).All other data generated or analyzed during this study are available from the corresponding author upon reasonable request.
All datasets analyzed in this study are publicly available in online repositories. RNA sequencing data and corresponding clinical information for LUAD were obtained from TCGA (https://portal.gdc.cancer.gov/) and the UCSC Xena platform (https://xenabrowser.net/datapages/). External validation cohorts were retrieved from GEO (https://www.ncbi.nlm.nih.gov/geo) under accession numbers GSE50081, GSE26939, GSE32863, and GSE75037. Zinc finger protein family genes were obtained from UniProt (https://www.uniprot.org/), and the complete list is provided in Supplementary Table S1.
For immune-related analyses, LUAD data were downloaded from the TIDE database (http://tide.dfci.harvard.edu/) for TIDE analysis, and IPS for each patient were obtained from TCIA (https://tcia.at/). TMB data for the TCGA cohort were acquired from cBioPortal (https://www.cbioportal.org/). Immunohistochemical data for prognostic genes in LUAD tissues were retrieved from the HPA (https://www.proteinatlas.org/).
All other data generated or analyzed during this study are available from the corresponding author upon reasonable request.
Declarations
Ethics approval and consent to participate
A
All clinical tissue and paraffin-embedded specimens were obtained from Harbin Medical University Cancer Hospital. The pathological type of LUAD was confirmed by both intraoperative frozen-section analysis and postoperative histopathological examination. For each patient, paired tumor and adjacent normal tissues were collected. The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of Harbin Medical University Cancer Hospital, which granted a waiver of informed consent (approval No. KY2021-15; April 23, 2021).
Consent for publication
Not applicable.
Competing interests
All the authors declare no competing interests.
Electronic Supplementary Material
Below is the link to the electronic supplementary material
Click here to Correct
Supplementary Material 1
Click here to Correct
Supplementary Material 2
Click here to Correct
Supplementary Material 3
Click here to Correct
Supplementary Material 4
Click here to Correct
Supplementary Material 5
References
1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al 2024; Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 74:229–263
2. Lahiri A, Maji A, Potdar PD, Singh N, Parikh P, Bisht B, et al 2023; Lung cancer immunotherapy: progress, pitfalls, and promises. Mol Cancer 22:40
3. Herbst RS, Morgensztern D, Boshoff C 2018; The biology and management of non-small cell lung cancer. Nature 553:446–454
4. Cooper AJ, Sequist LV, Lin JJ 2022; Third-generation EGFR and ALK inhibitors: mechanisms of resistance and management. Nat Rev Clin Oncol 19:499–514
5. Guo H, Zhang J, Qin C, Yan H, Liu T, Hu H, et al 2022; Biomarker-targeted therapies in non–small cell lung cancer: current status and perspectives. Cells 11:3200
6. Jardim DL, Goodman A, Gagliato D de M, Kurzrock R 2020; The challenges of tumor mutational burden as an immunotherapy biomarker. Cancer Cell 39:154
7. Jen J, Wang Y-C 2016; Zinc finger proteins in cancer progression. J Biomed Sci 23:53
8. Kamaliyan Z, Clarke TL 2024; Zinc finger proteins: guardians of genome stability. Front Cell Dev Biol 12:1448789
9. Cassandri M, Smirnov A, Novelli F, Pitolli C, Agostini M, Malewicz M, et al 2017; Zinc-finger proteins in health and disease. Cell Death Discov 3:17071
10. Fan M, Liu Q, Ma X, Jiang Y, Wang Y, Jia S, et al 2024; ZNF131-BACH1 transcriptionally accelerates RAD51-dependent homologous recombination repair and therapy resistance of non-small-lung cancer cells by preventing degradation from CUL3. Theranostics 14:7241–7264
11. Li M, Liu Z, Hou Z, Wang X, Shi H, Li Y, et al 2023; Oncogenic zinc finger protein ZNF687 accelerates lung adenocarcinoma progression by activating the PI3K/AKT pathway. Thorac Cancer 14:1223–1238
12. Xie C, Zhou X, Wu J, Chen W, Ren D, Zhong C, et al 2024; ZNF652 exerts a tumor suppressor role in lung cancer by transcriptionally downregulating cyclin D3. Cell Death Dis 15:792
13. Zhao R, Guo Y, Zhang L, Huang Z, Li X, Lan B, et al 2024; CBX4 plays a bidirectional role in transcriptional regulation and lung adenocarcinoma progression. Cell Death Dis 15:378
14. Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, et al 2016; TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res 44:e71
15. Der SD, Sykes J, Pintilie M, Zhu C-Q, Strumpf D, Liu N, et al 2014; Validation of a histology-independent prognostic gene signature for early-stage non-small-cell lung cancer. J Thorac Oncol 9:59–64
16. Wilkerson MD, Yin X, Walter V, Zhao N, Cabanski CR, Hayward MC, et al 2012; Differential pathogenesis of lung adenocarcinoma subtypes involving sequence mutations, copy number, chromosomal instability, and methylation. PLoS One 7:e36530
17. Selamat SA, Chung BS, Girard L, Zhang W, Zhang Y, Campan M, et al 2012; Genome-scale analysis of DNA methylation in lung adenocarcinoma and integration with mRNA expression. Genome Res 22:1197–1211
18. Girard L, Rodriguez-Canales J, Behrens C, Thompson DM, Botros IW, Tang H, et al 2016; An expression signature as an aid to the histologic classification of non-small cell lung cancer. Clin Cancer Res 22:4880–4889
19. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al 2015; limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43:e47
20. Xu S, Hu E, Cai Y, Xie Z, Luo X, Zhan L, et al 2024; Using clusterProfiler to characterize multiomics data. Nat Protoc 19:3292–3320
21. Chen B, Khodadoust MS, Liu CL, Newman AM, Alizadeh AA 2018; Profiling tumor infiltrating immune cells with CIBERSORT. Methods Mol Biol 1711:243–259
22. Hänzelmann S, Castelo R, Guinney J 2013; GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14:7
23. Aran D 2020; Cell-type enrichment analysis of bulk transcriptomes using xCell. Methods Mol Biol 2120:263–276
24. Jiang P, Gu S, Pan D, Fu J, Sahu A, Hu X, et al 2018; Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat Med 24:1550–1558
25. Charoentong P, Finotello F, Angelova M, Mayer C, Efremova M, Rieder D, et al 2017; Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade. Cell Rep 18:248–262
26. Balar AV, Galsky MD, Rosenberg JE, Powles T, Petrylak DP, Bellmunt J, et al 2017; Atezolizumab as first-line treatment in cisplatin-ineligible patients with locally advanced and metastatic urothelial carcinoma: a single-arm, multicentre, phase 2 trial. Lancet 389:67–76
27. Addeo A, Friedlaender A, Banna GL, Weiss GJ 2021; TMB or not TMB as a biomarker: that is the question. Crit Rev Oncol Hematol 163:103374
28. Maeser D, Gruener RF, Huang RS 2021; oncoPredict: an R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Brief Bioinform 22: bbab260
29. Iasonos A, Schrag D, Raj GV, Panageas KS 2008; How to build and interpret a nomogram for cancer prognosis. J Clin Oncol 26:1364–1370
30. Zhao J, Wen D, Zhang S, Jiang H, Di X 2023; The role of zinc finger proteins in malignant tumors. FASEB J 37: e23157
31. Bao Y, Zhang H, Han Z, Guo Y, Yang W 2022; Zinc fingers and homeobox family in cancer: a double-edged sword. Int J Mol Sci 23:11167
32. Sun X, Zheng D, Guo W 2022; Comprehensive analysis of a zinc finger protein gene–based signature with regard to prognosis and tumor immune microenvironment in osteosarcoma. Front Genet 13:835014
33. Xu F, Sun J, Gu X, Zhou Q 2024; An innovative prognostic auxiliary for colon adenocarcinoma based on zinc finger protein genes. Transl Cancer Res 13:1623–1641
34. Li J, Zhou Q, Zhang C, Zhu H, Yao J, Zhang M 2023; Development and validation of novel prognostic models for zinc finger protein–related genes in soft tissue sarcoma. Aging (Albany NY). https://doi.org/10.18632/aging.204682
35. Zhu L, Tu D, Li R, Li L, Zhang W, Jin W, et al 2023; The diagnostic significance of the ZNF gene family in pancreatic cancer: a bioinformatics and experimental study. Front Genet 14:1089023
36. Ye M, Li L, Liu D, Wang Q, Zhang Y, Zhang J 2021; Identification and validation of a novel zinc finger protein–related gene-based prognostic model for breast cancer. PeerJ 9: e12276
37. Hong K, Yang Q, Yin H, Wei N, Wang W, Yu B 2023; Comprehensive analysis of ZNF family genes in prognosis, immunity, and treatment of esophageal cancer. BMC Cancer 23:301
38. Zhang J, Zhang C, Cao P, Zheng X, Yu B, Cao H, et al 2021; A zinc finger protein gene signature enables bladder cancer treatment stratification. Aging (Albany NY) 13:13023–13038
39. Pfirrmann M, Clark RE, Prejzner W, Lauseker M, Baccarani M, Saussele S, et al 2020; The EUTOS long-term survival (ELTS) score is superior to the Sokal score for predicting survival in chronic myeloid leukemia. Leukemia 34:2138–2149
40. Howlader N, Forjaz G, Mooradian MJ, Meza R, Kong CY, Cronin KA, et al 2020; The effect of advances in lung cancer treatment on population mortality. N Engl J Med 383:640–649
41. Yin X, Chen S, Eisenbarth SC 2021; Dendritic cell regulation of T helper cells. Annu Rev Immunol 39:759–790
42. Wang Y, Xiang Y, Xin VW, Wang X-W, Peng X-C, Liu X-Q, et al 2020; Dendritic cell biology and its role in tumor immunotherapy. J Hematol Oncol 13:107
43. Künzli M, Masopust D 2023; CD4 + T cell memory. Nat Immunol 24:903–914
44. Yamaguchi H, Hsu J-M, Yang W-H, Hung M-C 2022; Mechanisms regulating PD-L1 expression in cancers and opportunities for novel small-molecule therapeutics. Nat Rev Clin Oncol 19:287–305
45. Yi M, Zheng X, Niu M, Zhu S, Ge H, Wu K 2022; Combination strategies with PD-1/PD-L1 blockade: current advances and future directions. Mol Cancer 21:28
46. Sharma P, Goswami S, Raychaudhuri D, Siddiqui BA, Singh P, Nagarajan A, et al 2023; Immune checkpoint therapy: current perspectives and future directions. Cell 186:1652–1669
47. Chen R, Ganesan A, Okoye I, Arutyunova E, Elahi S, Lemieux MJ, et al 2020; Targeting B7-1 in immunotherapy. Med Res Rev 40:654–682
48. Martincorena I, Campbell PJ 2015; Somatic mutation in cancer and normal cells. Science 349:1483–1489
49. Tornesello ML 2025; TP53 mutations in cancer: molecular features and therapeutic opportunities (review). Int J Mol Med 55:7
50. Skoulidis F, Heymach JV 2019; Co-occurring genomic alterations in non-small cell lung cancer biology and therapy. Nat Rev Cancer 19:495–509
51. Li Z, Zhuang X, Pan C-H, Yan Y, Thummalapalli R, Hallin J, et al 2024; Alveolar differentiation drives resistance to KRAS inhibition in lung adenocarcinoma. Cancer Discov 14:308–325
52. Zheng Q-X, Wang J, Gu X-Y, Huang C-H, Chen C, Hong M, et al 2021; TTN-AS1 as a potential diagnostic and prognostic biomarker for multiple cancers. Biomed Pharmacother 135:111169
53. H L, T X, H D, Y W, C S, Y Z, et al 2023; Development and validation of a MUC16 mutation-associated immune prognostic model for lung adenocarcinoma. Aging (Albany NY). https://doi.org/10.18632/aging.204814
54. Gao Y, Qiao X, Liu Z, Zhang W 2024; The role of E2F2 in cancer progression and its value as a therapeutic target. Front Immunol 15:1397303
55. Hetz C, Zhang K, Kaufman RJ 2020; Mechanism, regulation and functions of the unfolded protein response. Nat Rev Mol Cell Biol 21:421–438
56. Deen AJ, Adinolfi S, Härkönen J, Patinen T, Liu X, Laitinen T, et al 2024; Oncogenic KEAP1 mutations activate TRAF2-NFκB signaling to prevent apoptosis in lung cancer cells. Redox Biol 69:103031
57. Wu M, Jin M-M, Cao X-H, Zhao L, Li Y-H 2024; Silencing TRIM29 sensitizes non-small cell lung cancer cells to anlotinib by promoting apoptosis via binding RAD50. Curr Cancer Drug Targets 24:445–454.
58. Zhang Y, Dong P, Liu N, Yang J-Y, Wang H-M, Geng Q 2024; TRIM6 reduces ferroptosis and chemosensitivity by targeting SLC1A5 in lung cancer. Oxid Med Cell Longev 2023; 2023:9808100.
59. Cheung CHY, Hsu C-L, Lin T-Y, Chen W-T, Wang Y-C, Huang H-C, et al 2020; ZNF322A-mediated protein phosphorylation induces autophagosome formation through modulation of IRS1-AKT glucose uptake and HSP-elicited UPR in lung cancer. J Biomed Sci 27:75
60. Lin CC, Kuo IY, Wu LT, Kuan WH, Liao SY, Jen J, Yang YE, Tang CW, Chen YR, Wang YC. Dysregulated Kras/YY1/ZNF322A/Shh transcriptional axis enhances neo-angiogenesis to promote lung cancer progression. Theranostics. 2020;10(22):10001–10015.
61. Jen J. Oncogenic zinc finger protein ZNF322A promotes stem cell-like properties in lung cancer through transcriptional suppression of c-Myc expression. Cell Death Differ. 2019;26(7):1283–98.
62. Liao S-Y, Kuo I-Y, Chen Y-T, Liao P-C, Liu Y-F, Wu H-Y, et al 2019; AKT-mediated phosphorylation enhances protein stability and transcription activity of ZNF322A to promote lung cancer progression. Oncogene 38:6723–6736
63. Taheri Baghmisheh S, Wu Y-Y, Wu J-E, Hsu K-F, Chen Y-L, Hong T-M 2023; CASZ1 promotes migration, invasion, and metastasis of lung cancer cells by controlling expression of ITGAV. Am J Cancer Res 13:176–189
64. Gao L, Hu Y, Tian Y, Fan Z, Wang K, Li H, et al 2019; Lung cancer deficient in the tumor suppressor GATA4 is sensitive to TGFBR1 inhibition. Nat Commun 10:1665
65. Susini T, Renda I 2020; FGD3 gene as a new prognostic factor in breast cancer. Anticancer Res 40:3645–3649
66. Guo F. 2022; FGD3 binds with HSF4 to suppress p65 expression and inhibit pancreatic cancer progression. Oncogene 41(5):838–851.
Total words in MS: 8242
Total words in Title: 17
Total words in Abstract: 267
Total Keyword count: 7
Total Images in MS: 15
Total Tables in MS: 2
Total Reference count: 66