Clinicopathological Model for Predicting Endometrial Cancer and Atypical Hyperplasia in Women Aged > 40 Years: Development and Evaluation in a Single-Institution Retrospective Cohort
MengfanSong1,2
ZhenHuang2
ZhilinGuo2,3
A
YudongWang1,2,4✉
A
FureiJin2,3✉
1Department of Obstetrics and Gynecology, School of MedicineInternational Peace Maternity and Child Health Hospital, Shanghai Jiao Tong University200030ShanghaiChina
2Shanghai Key Laboratory of Embryo Original Diseases200030ShanghaiChina
3Department of Traditional Chinese Medicine Gynecology, School of MedicineInternational Peace Maternity and Child Health Hospital, Shanghai Jiao Tong University200030ShanghaiChina
4Department of Gynecologic Oncology, School of MedicineInternational Peace Maternity and Child Health Hospital, Shanghai Jiao Tong University200030ShanghaiChina
Mengfan Song1, 2†, Zhen Huang 2†, Zhilin Guo2, 3, Yudong Wang1, 2, 4*, Furei Jin2, 3*
1 Department of Obstetrics and Gynecology, International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200030, China
2 Shanghai Key Laboratory of Embryo Original Diseases, Shanghai, 200030, China
3 Department of Traditional Chinese Medicine Gynecology, International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200030, China
4 Department of Gynecologic Oncology, International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200030, China.
Corresponding author: Furei Jin, Yudong Wang
Abstract
Background:
Endometrial cancer poses a significant global health burden with rising mortality. Current diagnostics for women ≥ 40 with abnormal uterine bleeding or imaging abnormalities detect malignancy in < 10% of biopsies, subjecting over 90% to unnecessary invasive procedures. Existing prediction models have suboptimal accuracy.To develop and validate a clinically practical nomogram incorporating the novel biomarker cumulative menstrual years, quantifying estrogen exposure, for predicting atypical endometrial hyperplasia or endometrial cancer risk.
Methods:
This retrospective cohort study included 1,490 women (aged > 40 years) who underwent ≥ 2 endometrial biopsies at the International Peace Maternity and Child Health Hospital between 2014–2023. Univariable and multivariable logistic regression were used to identify potential independent predictors of atypical endometrial hyperplasia or endometrial carcinoma ( AEH/EC ). A nomogram prediction model was developed using significant predictors, with its performance internally validated through AUC analysis (discrimination) and decision curve analysis (clinical utility).
Results:
Independt Risk factors were postmenopausal bleeding ≥ 5 years postmenopause (OR = 14.55, 95% CI: 7.67–27.04), cumulative menstrual years > 40 years (OR = 7.28, 95% CI: 2.50–24.01), menstrual irregularity (OR = 3.93, 95% CI: 1.74–7.99), abnormal endometrial thickness (OR = 2.92, 95% CI: 1.70–5.27), and diabetes mellitus (paradoxical OR = 0.40, 95% CI: 0.24–0.66). The nomogram demonstrated robust performance (training AUC = 0.82; validation AUC = 0.83), excellent calibration (slope = 1.000), and clinical utility across thresholds (10–50%). Risk stratification thresholds: low (< 40 points), medium (40–70 points), high (> 70 points).
Conclusion:
A
This cumulative menstrual years integrated nomogram provides a practical, high-performance tool for dynamic AEH/EC risk stratification using routine parameters, while maintaining high sensitivity, particularly in resource-limited settings. The paradoxical protective association of diabetes (OR = 0.40) requires cautious interpretation owing to incomplete BMI adjustment (dichotomized at 23 kg/m² without obesity stratification); prospective validation with granular metabolic profiling is warranted.
Keywords:
Endometrial Neoplasms
Endometrial Hyperplasia
Predictive Value of Tests
Perimenopause
Menstrual Cycle
Nomograms
A
A
INTRODUCTION
Endometrial cancer (EC) is the fourth most prevalent female malignancy globally and the fifth leading cause of cancer-related death in women(14). Alarmingly, it represents one of the few malignancies with persistently rising mortality over the past four decades(3). Significant risk factors include prolonged estrogen exposure (e.g., early menarche, late menopause), metabolic disorders (obesity, diabetes), and genetic syndromes(1, 5). Prognosis varies drastically by stage: localized disease (FIGO I–II) has 80–90% 5-year survival, while advanced stages (FIGO III–IV) drop below 20%, with 20–33% of patients presenting at advanced stages(6, 7). Given this escalating burden and stark prognostic disparity, improving early detection is urgent.
Current diagnosis relies on invasive biopsy for women ≥ 40 years with abnormal uterine bleeding (AUB) or imaging abnormalities(5, 810). However, < 10% of these biopsies in symptomatic women confirm malignancy(5, 10, 11), subjecting > 90% of women to unnecessary invasive procedures with significant physical and psychological morbidity(5). Population screening remains unfeasible due to lacking cost-effective tools(12).
Despite the availability of various prediction models (e.g., QCancer(13)), critical limitations persist: risk assessment tools based on routine clinical indicators exhibit suboptimal discriminatory accuracy (AUC 0.64–0.77)(1419), while advanced models reliant on specialized variables (e.g., molecular markers, MRI)(2022)face implementation challenges in primary care settings due to limited availability of these variables, resulting in a lack of high-precision models suitable for basic healthcare facilities; furthermore, existing models, predominantly constructed from cross-sectional studies, employ static risk stratification methods that fail to effectively capture the dynamic progression from endometrial hyperplasia to carcinoma or assess dynamic risk(15); moreover, the applicability of current models across heterogeneous patient populations is limited, particularly by insufficient data from Chinese/Asian populations, compounded by epidemiological variations in endometrial cancer across different ethnic groups, thereby restricting their generalizability within China(16, 17, 23). Collectively, this highlights the absence of readily implementable dynamic risk assessment models tailored for Chinese populations in primary care.
To bridge this critical gap in early detection and overcome the persistent limitations of current models—suboptimal accuracy with routine variables, impracticality of advanced variables in primary care, static risk assessment, and inadequate generalizability to Chinese populations—we leverage cumulative menstrual years (CMY), a novel, highly predictive biomarker readily available in basic healthcare settings that quantifies cumulative estrogen exposure and exhibits a strong dose-dependent association with AEH/EC risk (e.g., CMY ≥ 40 years confers a 5-fold increase) (24, 25). We developed a clinically interpretable dynamic risk stratification nomogram incorporating this fundamental patient metric alongside other primary care-accessible parameters (postmenopausal bleeding, cycle irregularity, endometrial thickness, vascularity). This tool aims to provide a practical solution for Chinese primary care by enabling longitudinal risk profiling, significantly reducing unnecessary biopsies (> 90%) while maintaining high sensitivity (≥ 80%) for detecting AEH/EC, thereby addressing the unmet need for accurate, implementable risk assessment.
MATERIALS AND METHODS
Study Design and Population
A
We conducted a retrospective cohort study at the International Peace Maternity and Child Health Hospital, selecting patients who undergone endometrial biopsy between January 2014 and December 2023.The cohort comprised 2,568 female patients aged 40–78 years. After applying eligibility criteria, 1,490 participants were included. The flowchart of participants is shown in Fig. 1. Comprehensive demographic, clinical, and ultrasonographic data were systematically extracted from electronic medical records (EMR-IPMCH v3.2), supplemented by structured telephone interviews when necessary. All participants received transvaginal ultrasonography; diagnostic curettage, hysteroscopy, or surgery was performed based on clinical indications.
Fig. 1
Participant selection flowchart illustrating inclusion/exclusion criteria.
Click here to Correct
Eligibility Criteria
Participants were included if they: (a) were aged 40 years or older at the time of initial endometrial biopsy; (b) had ≥ 2 histologically confirmed endometrial pathology; (c) maintained a minimum 12-month interval between first and last endometrial pathology; and (d) demonstrated non-AEH/EC (non-atypical endometrial hyperplasia and without EC) on initial pathology. Exclusion criteria comprised: (a) atypical endometrial hyperplasia (AEH) or EC on initial pathology; (b) prior cancer diagnosis; or (c) > 20% missing key variables. Outcome groups were defined by the last pathology: non-AEH/EC or AEH/EC (endometrial carcinoma or AEH).
Data Collection
Participants were identified from hospital surgical database registries and day-care procedure records, encompassing both inpatient and outpatient cases. Clinical, demographic, and ultrasound data were collected directly from discharge summaries via linkage to the hospital's EMR, with supplementary information acquired through structured telephone interviews when necessary. The collected information included age, occupation, marital status, ethnicity, parity, gravidity, age at first delivery, body mass index (BMI) categorized as underweight and normal (< 23kg/m²), overweight and obesity (≥ 23.0 kg/m²)(26), age at menarche, current use of hormonal medications (estrogen, progesterone, oral contraceptives), medical history (hypertension, diabetes, cancers ), and gynecological history (uterine fibroids, adenomyosis, endometriosis). Surgical approach, duration of follow-up, and postmenopausal status (defined as ≥ 1 year since last menses) were recorded. Age at menopause was categorized as no menopause, < 55 years, or ≥ 55 years(27), (28) with mode of menopause documented as natural or iatrogenic. Menstrual history included age at menarche (< 12 years / ≥12 years (27), (28) ), menstrual regularity (regular/irregular), duration of menstrual life(29, 30), (≤ 30 years; >30 to ≤ 40 years; >40 years ), history of abnormal uterine bleeding (present/absent) and postmenopausal bleeding (none; within 5 years of menopause; >5 years after menopause). Endometrial thickness (ET) was classified as "abnormal" if ≥ 13 mm (premenopausal)(31) or ≥ 5 mm (postmenopausal) (32).
Ultrasound and Pathological Evaluation
Transvaginal ultrasound was performed using Samsung WS80A or GE Voluson E8/E10 systems (3–12 MHz probes) following International Endometrial Tumor Analysis criteria(33). Two independent sonographers (> 5 years' experience) conducted blinded assessments, with discrepancies resolved by a senior investigator. Pathological diagnoses were rendered by board-certified gynecological pathologists using WHO 2014 classification(34), with the last endometrial pathology defining the final outcome.
Table 2
Multivariable Predictors of AEH/EC
Variable
Level
OR
95%CI
p-value
Postmenopausal uterine bleeding Onset Interval
No
1
Ref
 
 
< 5years
2.72
1.15–5.68
< 0.05
 
>= 5years
14.55
7.67–27.04
< 0.05
Endometrial thickness
Normal
1
Ref
 
 
Abnormal
2.92
1.7–5.27
< 0.05
Cumulative menstrual years
< 30years
1
Ref
 
 
30-40years
2.32
1.01–6.72
0.076
 
> 40years
7.28
2.5-24.01
< 0.05
Diabetes Mellitus
No
1
Ref
 
 
Yes
0.4
0.24–0.66
< 0.05
Menstrual regularity
Regular
1
Ref
 
 
Irregular
3.93
1.74–7.99
< 0.05
Statistical significance was set at p < 0.05.
1. Nomogram Development
A clinical prediction nomogram integrating these nine predictors was developed (Fig. 2). The tool quantifies risk through a point system (0–350 points), with total scores converting to AEH/EC probabilities (0.1–0.99). Key predictors contributing maximal points included menstrual irregularity, prolonged menstruation (> 40 years), and late postmenopausal bleeding. Risk stratification was defined as: low-risk (< 40 points), medium-risk (40–70 points), and high-risk (> 70 points). For example, a postmenopausal patient with menstrual irregularity (100 points), > 40 years menstruation (67.5 points), and abnormal endometrial thickness (27.5 points) accumulates 195 points, corresponding to 90% AEH/EC risk.
Table 1
Cohort Characteristics Stratified by Endometrial Pathology
 
level
Non-AEH/EC group
AEH/EC Group
p
No.
 
1424
66
 
Follow-up period (median [IQR])
 
3.00 [ 2.00, 5.00]
4.00 [ 3.00, 7.00]
< 0.05
Endpoint age (median [IQR])
 
50.00 [ 46.00, 54.00]
55.50 [ 50.00, 65.00]
< 0.05
Occupation ( % )
Office worker
964 ( 67.7)
25 ( 37.9)
< 0.05
 
Retired
266 ( 18.7)
28 ( 42.4)
 
 
Self-employed
85 ( 6.0)
5 ( 7.6)
 
 
Factory worker
12 ( 0.8)
0 ( 0.0)
 
 
Technical W
Worker
34 ( 2.4)
2 ( 3.0)
 
 
Farmer
1 ( 0.1)
1 ( 1.5)
 
 
Other / Unemployed
62 ( 4.4)
5 ( 7.6)
 
Marital( % )
Married
1396 ( 98.0)
65 ( 98.5)
0.63
 
Single
15 ( 1.1)
0 ( 0.0)
 
 
Divorced
13 ( 0.9)
1 ( 1.5)
 
Ethnicity( % )
Han Chinese
1415 ( 99.4)
66 ( 100.0)
0.99
 
Other ethnicity
9 ( 0.6)
0 ( 0.0)
 
Menstrual regularity ( % )
Regular
1369 ( 96.1)
57 ( 86.4)
< 0.05
 
Irregular
55 ( 3.9)
9 ( 13.6)
 
Menopausal status( % )
Premenopausal
917 ( 64.4)
27 ( 40.9)
< 0.05
 
Menopause
507 ( 35.6)
39 ( 59.1)
 
Age at menarche ( % )
>= 12 years
1356 ( 95.2)
62 ( 93.9)
0.85
 
< 12 years
68 ( 4.8)
4 ( 6.1)
 
Age at menopause ( % )
< 55years
1358 ( 95.4)
56 ( 84.8)
< 0.05
 
>= 55years
66 ( 4.6)
10 ( 15.2)
 
Cumulative menstrual years( % )
< 30years
251 ( 17.6)
5 ( 7.6)
< 0.05
 
30-40years
1104 ( 77.5)
51 ( 77.3)
 
 
> 40years
69 ( 4.8)
10 ( 15.2)
 
Age at first delivery( % )
Nulliparous
100 ( 7.0)
8 ( 12.1)
0.13
 
normal
1292 ( 90.7)
55 ( 83.3)
 
 
Advanced maternal age
32( 2.2)
3 ( 4.5)
 
Gravidity ( % )
Never
69 ( 4.8)
7 ( 10.6)
0.07
 
Ever
1355 ( 95.2)
59 ( 89.4)
 
Parity( % )
Nulliparity
100 ( 7.0)
8 ( 12.1)
0.18
 
Multiparity
1324 ( 93.0)
58 ( 87.9)
 
BMI( % )
Underweight&Normal weight
657( 46.1)
21 ( 31.8)
0.031
 
Overweight &Obesity
767( 53.9)
45( 68.3)
 
Hypertension ( % )
No
1151 ( 80.8)
45 ( 68.2)
< 0.05
 
Yes
273 ( 19.2)
21 ( 31.8)
 
Diabetes Mellitus ( % )
No
1384 ( 97.2)
60 ( 90.9)
< 0.05
 
Yes
40 ( 2.8)
6 ( 9.1)
 
Gynecological history( % )
No
521 ( 36.6)
31 ( 47.0)
0.12
 
Yes
903 ( 63.4)
35 ( 53.0)
 
Indications for estrogen/progestin therapy
    
Estrogen( % )
No
1267 ( 89.0)
62 ( 93.9)
0.29
 
Yes
157 ( 11.0)
4 ( 6.1)
 
Progestin ( % )
No
388 ( 27.2)
32 ( 48.5)
< 0.05
 
Yes
1036 ( 72.8)
34 ( 51.5)
 
Oral contraceptives ( % )
No
1272 ( 89.3)
63 ( 95.5)
0.17
 
Yes
152 ( 10.7)
3 ( 4.5)
 
Abnormal uterine bleeding ( % )
No
1093 ( 76.8)
50 ( 75.8)
0.97
 
Yes
331 ( 23.2)
16 ( 24.2)
 
Postmenopausal uterine bleeding Onset Interval( % )
No
1284 ( 90.2)
39 ( 59.1)
< 0.05
 
< 5years
97 ( 6.8)
8 ( 12.1)
 
 
>=5years
43 ( 3.0)
19 ( 28.8)
 
Ultrasound imaging
    
Endometrial thickness ( % )
Normal
717 ( 50.4)
17 ( 25.8)
< 0.05
 
Abnormal
707 ( 49.6)
49 ( 74.2)
 
Endometrial mass/lesion( % )
No
659 ( 46.3)
29 ( 43.9)
0.81
 
Yes
765 ( 53.7)
37 ( 56.1)
 
Endometrial and mass blood flow ( % )
No
929 ( 65.2)
37 ( 56.1)
0.16
 
Yes
495 ( 34.8)
29 ( 43.9)
 
Endometrial echo ( % )
Regular and homogeneous endometrial echo
398 ( 27.9)
14 ( 21.2)
0.29
 
Irregular and heterogeneous endometrial echo
1026 ( 72.1)
52 ( 78.8)
 
Pathological Sampling Methods ( % )
Hysteroscopic curettage
1337 ( 93.9)
54 ( 83.1)
< 0.05
 
Fractional curettage
87 ( 6.1)
11 ( 16.9)
 
Non-normally distributed continuous variables are described using the median(IQR).
Values are presented as number (%).
Continuous variables use the Wilcoxon rank-sum test, and categorical variables use the Chi-square test.
Statistical significance was set at p < 0.05.
Gynecological history: Uterine fibroids, Adenomyosis, Endometriosis
Postmenopausal uterine bleeding Onset Interval: Interval between menopause onset and lastest postmenopausal uterine bleeding
Other ethnicity: Korean Chinese, Hui Chinese, Manchu Chinese, Mongol Chinese
Abbreviations
AEH/EC, Atypical endometrial hyperplasia or Endometrial carcinoma
AEH, Atypical endometrial hyperplasia
AUB, abnormal uterine bleeding
AUC, area under the ROC curve
BMI, body mass index
CI, Confidence Interval
CMY, cumulative menstrual years
EC, Endometrial carcinoma
EMR-IPMCH, Electronic Medical Record-International Peace Maternity and Child Health Hospital
ET, Endometrial thickness
IQR, interquartile range
OR, Odds Ratio
PMB, postmenopausal bleeding (defined as vaginal bleeding ≥ 12 months after cessation of menses)
PMB Onset Interval, Age at PMB diagnosis - Age at menopause
ROC, receiver operating characteristic
SD, standard deviation
WHO, World Health Organization
Ethical Approval
A
This study was reviewed and approved by the Ethics Committee of the International Peace Maternity and Child Health Hospital (Approval No. GKLW-A-2024-107-01).
A
Informed consent was waived for this retrospective analysis, with all patient identifiers removed prior to data processing.
Statistical Analysis
All analyses were performed using R software (version 4.2.2). Continuous variables are expressed as mean ± standard deviation (SD) for normally distributed data or median with interquartile range (IQR) for non-normally distributed data, while categorical variables are reported as frequencies and percentages. Normality was evaluated using Shapiro-Wilk tests. Group comparisons between non-AEH/EC and AEH/EC cohorts utilized independent samples t-tests for normally distributed continuous variables, Wilcoxon rank-sum tests for non-normally distributed continuous variables, and chi-square or Fisher’s exact tests for categorical variables.
Prior to modeling, categorical variables (e.g., occupation, hypertension status) were converted to factors, and continuous variables (e.g., age) to numeric format. Missing data (< 20% missingness) were handled via multiple imputation with chained equations (MICE package; m = 5 imputations, seed = 6666), with pooled estimates derived from the imputed datasets. Univariate screening identified candidate predictors (P < 0.05) using chi-square tests for categorical variables and t-tests/Wilcoxon tests for continuous variables. Class imbalance was addressed by synthetic oversampling (ROSE package). A multivariable logistic regression model was constructed through backward elimination of significant univariate predictors. The cohort was stratified into training (70%) and validation (30%) sets using the createDataPartition() function.
Model performance was assessed through three primary metrics: discrimination quantified by the area under the receiver operating characteristic curve (AUC) with 95% confidence intervals, calibration evaluated via Hosmer-Lemeshow goodness-of-fit tests and calibration plots, and clinical utility measured by decision curve analysis to estimate net benefit across threshold probabilities.
RESULTS
2.
Cohort Characteristics
This retrospective cohort study enrolled 1,490 women treated at the International Peace Maternity and Child Health Hospital (2014–2023), comprising 1,424 non-AEH/EC cases and 66 AEH/EC cases. Statistically significant intergroup differences (P < 0.05) were observed in key parameters (Table 1). The AEH/EC group exhibited an older endpoint age (P < 0.05) and higher prevalence of postmenopausal status (59.1% vs. 35.6%; P < 0.05). Notably, 28.8% of AEH/EC patients experienced postmenopausal bleeding ≥ 5 years after menopause versus 3.0% in non-AEH/EC controls (P < 0.05). Abnormal endometrial thickness was more frequent in the AEH/EC group (74.2% vs. 49.6%; P < 0.05), while progestin therapy usage was lower (51.5% vs. 72.8%; P < 0.05). Additional differences included higher rates of menstrual irregularity (13.6% vs. 3.9%; P < 0.05) and prolonged menstruation duration > 40 years (15.2% vs. 4.8%; P < 0.05) in the AEH/EC cohort (Table 1).
Additionally, the median follow-up was 3.0 years (IQR 2.0–5.0), with 92% of participants completing ≥ 2 biopsies during this period. The AEH/EC group had a higher proportion of retirement as occupation (42.4% vs. 18.7%; P < 0.05), increased menopause at ≥ 55 years (15.2% vs. 4.6%; P < 0.05), reduced menstruation duration < 30 years (7.6% vs. 17.6%; P < 0.05), greater overweight/obesity prevalence (68.3% vs. 53.9%; P = 0.031), higher hypertension (31.8% vs. 19.2%; P < 0.05) and diabetes mellitus (9.1% vs. 2.8%; P < 0.05), and more frequent use of fractional curettage for pathological sampling (16.9% vs. 6.1%; P < 0.05). No significant differences were observed in marital status, ethnicity, age at menarche, gynecological history (uterine fibroids/adenomyosis/endometriosis), or most ultrasound parameters (endometrial mass/lesion, blood flow, echo pattern).
3.
Variable Selection Results
Multivariable logistic regression identified five independent predictors of AEH/EC (Table 2).
Postmenopausal uterine bleeding occurring ≥ 5 years after menopause demonstrated the strongest association (OR = 14.55, 95% CI: 7.67–27.04; P < 0.05), followed by menstruation duration > 40 years (OR = 7.28, 95% CI: 2.50–24.01; P < 0.05). Menstrual irregularity significantly increased risk (OR = 3.93, 95% CI: 1.74–7.99; P < 0.05), as did abnormal endometrial thickness (OR = 2.92, 95% CI: 1.70–5.27; P < 0.05). Diabetes mellitus was associated with reduced risk (OR = 0.40, 95% CI: 0.24–0.66; P < 0.05). Postmenopausal bleeding within 5 years of menopause also conferred elevated risk (OR = 2.72, 95% CI: 1.15–5.68; P < 0.05), while menstruation duration of 30–40 years showed non-significant association (OR = 2.32, 95% CI: 1.01–6.72; P = 0.076) compared to < 30 years.
A
Fig. 2
Nomogram for endometrial cancer/atypical hyperplasia risk prediction.
Points assigned per predictor (top) yield total points (middle) corresponding to risk probability (bottom). Risk stratification: low (< 40), medium (40–70), high (> 70).
Model Validation
The nomogram demonstrated robust discrimination and calibration performance during validation. ROC analysis yielded AUCs of 0.82 (95% CI: 0.80–0.85) in the training cohort and 0.83 (95% CI: 0.73–0.93) in the validation cohort (both P < 0.05; Fig. 3a). Calibration plots showed excellent agreement between predicted and observed outcomes, with a calibration slope of 1.000 and intercept of − 0.177 (Fig. 3b). The model achieved a Brier score of 0.172, indicating low overall prediction error. Decision curve analysis further confirmed the model's clinical utility across threshold probabilities of 10% to 50%, particularly showing significantly greater standardized net benefit compared to the "intervene-all" or "intervene-none" strategies within the critical 20%-40% high-risk threshold range(Fig. 3c).
Click here to Correct
A
Fig. 3a
ROC curves: training (navy; AUC = 0.82), validation (red; AUC = 0.83);
Click here to Correct
A
Fig. 3b
Calibration: logistic fit (solid), nonparametric (dashed) vs. ideal (grey); Brier = 0.172. (c) Decision curve: model
Click here to Correct
A
Fig. 3c
Decision curve: model (thick solid) vs. 'All' (dashed) and 'None' (thin solid) strategies; net benefit superior at 10–50% thresholds (optimal 20–40%).
DISCUSSION
This study developed a nomogram demonstrating significant advantages in predicting AEH/EC risk among women who are over 40 years old and undergoing endometrial biopsy. Key findings identified Cumulative Menstrual Years (CMY > 40 years) as a strong independent risk factor (increasing risk by 7.28-fold) and confirmed the critical predictive value of Postmenopausal Bleeding (PMB) timing, particularly PMB occurring ≥ 5 years postmenopause (OR = 14.55). By menstrual irregularity, integrating CMY, BMI status, parity, diabetes status, AUB, stratified PMB, endometrial thickness, and duration of progesterone administration, the model achieved exceptional predictive performance (AUC 0.82–0.83), significantly outperforming existing models based on conventional indicators.
Regarding risk factors, the key findings of this study show both consensus and differences with previous research. Points of consensus include: CMY > 40 years, as a quantifiable indicator of cumulative estrogen exposure, significantly increasing AEH/EC risk (OR = 7.28) aligns with numerous epidemiological studies (24, 25); the strongest risk associated with late PMB (≥ 5 years) confirmed by stratified analysis (OR = 14.55) is consistent with clinical guidelines and the majority of academic viewpoints(35); significantly increased risk associated with thickened endometrium (≥ 13mm premenopause, ≥5mm postmenopause; OR = 2.92) also matches classical research and guideline recommendations(31, 32); and menstrual irregularity (OR = 3.93) as an independent risk factor likely reflects anovulation or endocrine dysfunction(36).
The primary difference lies in this study’s observed inverse association between diabetes and endometrial cancer risk with an odds ratio of 0.40, contrasting with most studies reporting positive associations. (37, 38) This paradox may stem from insufficient control of BMI confounding, as adjustment occurred only via dichotomization at the Asian standard of 23 kg/m², lacking fine stratification or continuous variable analysis; whereas literature identifies BMI as a key shared risk factor and potent confounder—evidenced by Luo et al(39).and Lucenteforte et al(40). showing attenuated associations after BMI adjustment. Other factors include surveillance bias, where increased gynecological examinations in diabetics may enable earlier precancerous intervention; metformin’s potential influence despite the WHI finding no protective effect with a hazard ratio of 1.00, while lab studies suggest antitumor mechanisms; and population heterogeneity, where differing Asian BMI thresholds of 23 kg/m² versus Western standards of ≥ 25 kg/m² may introduce bias. Thus, the true diabetes-endometrial cancer relationship requires validation through larger prospective studies such as WHI-style extensions; rigorous confounder control including BMI modeled as a continuous variable or finely stratified into six tiers, with waist-hip ratio inclusion; obesity-diabetes interaction assessment; and stratification by diabetes duration and treatment per the WHI’s time-dependent analysis showing residual risk in new-onset diabetes.
Compared to models reported in previous literature, the nomogram constructed in this study possesses significant advantages and unique value. The primary advantage is a markedly improved prediction accuracy (AUC 0.82–0.83), significantly better than prior models relying on conventional indicators (age, BMI, bleeding history; AUC 0.64–0.77). This is mainly attributed to the innovative integration of CMY as a quantifiable indicator of estrogen exposure and the critical stratification of PMB timing (< 5 years vs ≥ 5 years). Secondly, this model addresses the limitation of "dynamic risk assessment" in existing models. Existing models often use static stratification based on cross-sectional data, whereas this model, based on a retrospective cohort (≥ 2 biopsies, median follow-up 3 years), inherently allows its predictors (such as continuously accumulating CMY or newly occurring late PMB) to be suitable for dynamically monitoring changes in individual risk over time (e.g., CMY extension, new onset of late PMB). This enables assessment of risk evolution from benign/hyperplastic conditions to AEH/EC, thereby achieving more precise timing for follow-up and intervention. Thirdly, the model boasts high clinical practicality and applicability in primary care settings. All included variables (CMY, PMB timing, ET, menstrual regularity, diabetes status) can be easily obtained at the primary care level through history taking, basic physical examination, and transvaginal ultrasound. This overcomes the major obstacle faced by models relying on advanced imaging (e.g., MRI texture analysis) or expensive molecular markers, which are difficult to implement in resource-limited areas. Its intuitive risk score and stratification (low/medium/high risk) facilitate rapid clinical decision-making.
The limitations of this study include its single-center retrospective design with inherent selection bias from including only a biopsy cohort; a limited number of AEH/EC cases (n = 66), affecting precision despite synthetic oversampling; a highly homogeneous cohort (99.4% Han Chinese), reducing generalizability; incomplete adjustment for BMI (a key confounder in the diabetes-EC association); and lack of data on genetic syndromes, molecular markers, and diabetes-specific variables (such as type, duration, and treatment). Future directions include: external validation in multi-ethnic, geographically diverse cohorts; prospective evaluation of biopsy reduction rates and impact on early detection; model refinement through incorporating BMI as a continuous variable or finer strata (e.g., WHO Asian BMI categories); development of a digital calculator for point-of-care use; and larger studies to clarify diabetes’ independent role through rigorous confounder adjustment. The model is developed based on data from a Chinese population (99.4% Han Chinese cohort) and employs BMI cutoffs suitable for Asian populations. This partially addresses the issue of insufficient generalizability of existing mainstream models (mostly built on European and American population data) to the Chinese population, providing a practical tool for primary healthcare in China that is more aligned with local epidemiological characteristics. Its clear potential for clinical benefit lies in effectively identifying low-risk individuals (score < 40 points), potentially substantially reducing (> 90%) unnecessary invasive endometrial biopsies and their associated physical and psychological burdens and economic costs, while maintaining a high detection rate (sensitivity ≥ 80%) for AEH/EC. This ability to reduce unnecessary biopsies and optimize resource utilization in primary care and resource-limited settings lacking advanced diagnostic equipment is one of its most significant value propositions.
Conclusion
This study developed and validated a practical nomogram that significantly improves prediction of AEH/EC risk in women undergoing endometrial biopsy. Its superior performance (AUC 0.82–0.83) stems from integrating two key innovations: cumulative menstrual years (CMY > 40 years; OR = 7.28) as a novel measure of estrogen exposure, and stratified PMB timing (≥ 5 years postmenopause; OR = 14.55) as a critical high-risk indicator. Relying solely on accessible clinical/ultrasound parameters, the model enables dynamic risk monitoring and effectively identifies low-risk women (score < 40 points), while maintaining high sensitivity. Optimized for the Han Chinese population, this tool offers substantial clinical utility for resource-limited settings.Future work must: (1) validate the model in multi-ethnic cohorts, (2) refine BMI adjustment using WHO Asian classifications, and (3) prospectively quantify biopsy reduction rates and cost savings. Development of an open-access digital calculator is prioritized for point-of-care use.
Click here to Correct
Ethics approval and consent to participate
This study was reviewed and approved by the Ethics Committee of the International Peace Maternity and Child Health Hospital (Approval No. GKLW-A-2024-107-01).
A
The ethics committee of the International Peace Maternity and Child Health Hospital waived the requirement for informed consent due to the retrospective nature of the study and all data being anonymized.
A
All methods were carried out in accordance with relevant guidelines and regulations (e.g., Helsinki Declaration).
Consent for publication
Not applicable
A
Data Availability
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Competing Interests
No potential conflict of interest relevant to this article was reported.
A
Funding
This work was supported by Decision Consulting Project of 2025 China Hospital Development Institute, Shanghai Jiao Tong University (CHDI-2025-Z-39), the Collaborative Guidance Project of Traditional Chinese and Western Medicine in General Hospitals of Shanghai Municipal Health Commission (ZXXT-202315), the major difficult diseases of Chinese and Western clinical cooperation construction project of the National Health Commission of the People's Republic of China (ZXXTQJ-2024), the Fundamental Research Funds for the Central Universities (YG2025QNA31)and Three-Year Action Plan for Improving Clinical Research Capability of International Peace Maternity and Child Health Hospital (IPMCH2024CR02).
A
Author Contribution
S.M. and H.Z. contributed equally to this work as co-first authors, with S.M. listed first. J.F. and W.Y. are co-corresponding authors, with J.F. as the lead. S.M., H.Z. and J.F. conceived and designed the study. S.M., G.Z., W.Y. and J.F. acquired funding. S.M., H.Z. and W.Y. developed the methodology. S.M., H.Z. and G.Z. curated the data. S.M. and H.Z. wrote the original draft. All authors reviewed, edited, and approved the final manuscript.
A
Acknowledgement
We extend our gratitude to the Gynecology Department for their meticulous documentation of patients' daily data, which was fundamental to ensuring the completeness of the data collection. We would also like to thank the Information Department for their invaluable support in the data acquisition process.
Electronic Supplementary Material
Below is the link to the electronic supplementary material
Click here to Correct
Supplementary Material 1
REFERENCES
1.
Gu B, Shang X, Yan M, Li X, Wang W, Wang Q, Zhang C. Variations in incidence and mortality rates of endometrial cancer at the global, regional, and national levels, 1990–2019. Gynecol Oncol. 2021;161(2):573–80.
2.
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209–49.
3.
Siegel RL, Kratzer TB, Giaquinto AN, Sung H, Jemal A. Cancer statistics, 2025. CA: A Cancer. J Clin. 2025;75(1):10–45.
4.
Baker-Rand H, Kitson SJ. Recent Advances in Endometrial Cancer Prevention, Early Diagnosis and Treatment. Cancers. 2024;16(5).
5.
Gynecologists ACoOa. ACOG Practice Bulletin 149: Endometrial Cancer. Obstet Gynecol. 2015;125(4):1006–26.
6.
Siegel RL, Miller KD, Wagle NS, Jemal A. Cancer statistics, 2023. CA Cancer J Clin. 2023;73(1):17–48.
7.
Van Nyen T, Moiola CP, Colas E, Annibali D, Amant F. Modeling Endometrial Cancer: Past, Present, and Future. Int J Mol Sci. 2018;19(8).
8.
Siegel RL, Miller KD, Wagle NS, Jemal A, Cancer statistics. 2023. CA: A Cancer Journal for Clinicians. 2023;73(1):17–48.
9.
Gynecologists ACoOa. ACOG Committee Opinion No. 734: The Role of Transvaginal Ultrasonography in Evaluating the Endometrium of Women With Postmenopausal Bleeding. Obstetrics & Gynecology. 2018;131(5):e124-e9.
10.
Smith-Bindman R, Kerlikowske K, Feldstein VA, Subak L, Scheidler J, Segal M, et al. Endovaginal Ultrasound to Exclude Endometrial Cancer and Other Endometrial Abnormalities. JAMA. 1998;280(17):1510–7.
11.
Clarke MA, Long BJ, Del Mar Morillo A, Arbyn M, Bakkum-Gamez JN, Wentzensen N. Association of Endometrial Cancer Risk With Postmenopausal Bleeding in Women. JAMA Intern Med. 2018;178(9).
12.
Kitson SJ, Khan U, Crosbie EJ. Lay and general practitioner attitudes towards endometrial cancer prevention: a cross-sectional study. Fam Pract. 2024;41(6):949–55.
13.
Forder BH, Ardasheva A, Atha K, Nentwich H, Abhari R, Kartsonaki C. Models for predicting risk of endometrial cancer: a systematic review. Diagn Progn Res. 2025;9(1):3.
14.
Franco EL, Pfeiffer RM, Park Y, Kreimer AR, Lacey JV, Pee D et al. Risk Prediction for Breast, Endometrial, and Ovarian Cancer in White Women Aged 50 y or Older: Derivation and Validation from Population-Based Cohort Studies. PLoS Med. 2013;10(7).
15.
Shin W, Yang SJ, Park SY, Kang S, Lee DO, Lim MC, Seo SS. A predictive model based on site-specific risk factors of recurrence regions in endometrial cancer patients. BMC Cancer. 2022;22(1):1111.
16.
Shi J, Kraft P, Rosner BA, Benavente Y, Black A, Brinton LA, et al. Risk prediction models for endometrial cancer: development and validation in an international consortium. J Natl Cancer Inst. 2023;115(5):552–9.
17.
Pfeiffer RM, Park Y, Kreimer AR, Lacey JV Jr., Pee D, Greenlee RT, et al. Risk prediction for breast, endometrial, and ovarian cancer in white women aged 50 y or older: derivation and validation from population-based cohort studies. PLoS Med. 2013;10(7):e1001492.
18.
Fortner RT, Husing A, Kuhn T, Konar M, Overvad K, Tjonneland A, et al. Endometrial cancer risk prediction including serum-based biomarkers: results from the EPIC cohort. Int J Cancer. 2017;140(6):1317–23.
19.
Husing A, Dossus L, Ferrari P, Tjonneland A, Hansen L, Fagherazzi G, et al. An epidemiological model for prediction of endometrial cancer risk in Europe. Eur J Epidemiol. 2016;31(1):51–60.
20.
Li X, Dessi M, Marcus D, Russell J, Aboagye EO, Ellis LB et al. Prediction of Deep Myometrial Infiltration, Clinical Risk Category, Histological Type, and Lymphovascular Space Invasion in Women with Endometrial Cancer Based on Clinical and T2-Weighted MRI Radiomic Features. Cancers (Basel). 2023;15(8).
21.
Lefebvre TL, Ueno Y, Dohan A, Chatterjee A, Vallieres M, Winter-Reinhold E, et al. Development and Validation of Multiparametric MRI-based Radiomics Models for Preoperative Risk Stratification of Endometrial Cancer. Radiology. 2022;305(2):375–86.
22.
Chen T, Yang Y, Huang Z, Pan F, Xiao Z, Gong K, et al. Prognostic risk modeling of endometrial cancer using programmed cell death-related genes: a comprehensive machine learning approach. Discov Oncol. 2025;16(1):280.
23.
Ye Y, Yingmei W, Yindan W, Wenxin M, Wenyan T, Fengxia X. Incidence and mortality of endometrial cancer in China, data from China Cancer Registry Annual Report, 2004–2017. Zhonghua Yi Xue Za Zhi. 2024;104(10):721-8.
24.
Gavrilyuk O, Braaten T, Weiderpass E, Licaj I, Lund E. Lifetime number of years of menstruation as a risk index for postmenopausal endometrial cancer in the Norwegian Women and Cancer Study. Acta Obstet Gynecol Scand. 2018;97(10):1168–77.
25.
Cramer DW. Incessant ovulation: a review of its importance in predicting cancer risk. Front Oncol. 2023;13:1240309.
26.
Consultation WHOE. Appropriate body-mass index for Asian populations and its implications for policy and intervention strategies. Lancet. 2004;363(9403):157–63.
27.
Dossus L, Allen N, Kaaks R, Bakken K, Lund E, Tjonneland A, et al. Reproductive risk factors and endometrial cancer: the European Prospective Investigation into Cancer and Nutrition. Int J Cancer. 2010;127(2):442–51.
28.
Colombo N, Creutzberg C, Amant F, Bosse T, Gonzalez-Martin A, Ledermann J et al. ESMO-ESGO-ESTRO Consensus Conference on Endometrial Cancer. International Journal of Gynecological Cancer. 2016;26(1):2–30.
29.
Lin Q, Zhang J, Liu X, Zheng Q, Lin D, Pan M. Association between Healthy Eating Index-2015 total and component food scores with reproductive lifespan among postmenopausal women: a population-based study from NHANES 2005–2016. BMC Public Health. 2024;24(1):2631.
30.
Ou R, Wei Q, Hou Y, Zhang L, Liu K, Lin J et al. Reproductive Lifespan and Motor Progression of Parkinson’s Disease. J Clin Med. 2022;11(20).
31.
Aytekin O, Karagöz Ç, Göktaş E, Tokalıoğlu AA, Tiryaki Güner G, Özkaya Uçar Y, et al. Preoperative predictors of concurrent endometrial carcinoma in patients with endometrial intraepithelial neoplasia: the role of HALP score and other inflammatory markers. J Turkish-German Gynecol Association. 2025;26(1):34–40.
32.
Rebecca Smith-Bindman MKK, Vickie MD, Feldstein A, Leslee Subak MD, Juergen Scheidler MD. PhD; Richard Brand, PhD; Deborah Grady, MD. Review Endovaginal Ultrasound to Exclude Endometrial Cancer and Other Endometrial Abnormalities. JAMA. 1998;280:17.
33.
Leone FP, Timmerman D, Bourne T, Valentin L, Epstein E, Goldstein SR, et al. Terms, definitions and measurements to describe the sonographic features of the endometrium and intrauterine lesions: a consensus opinion from the International Endometrial Tumor Analysis (IETA) group. Ultrasound Obstet Gynecol. 2010;35(1):103–12.
34.
Emons G, Beckmann M, Schmidt D, Mallmann P. New WHO Classification of Endometrial Hyperplasias. Geburtshilfe Frauenheilkd. 2015;75(02):135–6.
35.
Bengtsen MB, Veres K, Nørgaard M. First-time postmenopausal bleeding as a clinical marker of long-term cancer risk: A Danish Nationwide Cohort Study. Br J Cancer. 2019;122(3):445–51.
36.
Yanli Y, Mei WT, Cong L, Schatten H. Analysis of Characteristics of Endometrial Carcinoma in Peri- and Postmenopausal Women with Abnormal Uterine Bleeding. Biomed Res Int. 2024;2024:1–7.
37.
Saed L, Varse F, Baradaran HR, Moradi Y, Khateri S, Friberg E et al. The effect of diabetes on the risk of endometrial Cancer: an updated a systematic review and meta-analysis. BMC Cancer. 2019;19(1).
38.
Wang M, Yang Y, Liao Z. Diabetes and cancer: Epidemiological and biological links. World J Diabetes. 2020;11(6):227–38.
39.
Luo J, Beresford S, Chen C, Chlebowski R, Garcia L, Kuller L, et al. Association between diabetes, diabetes treatment and risk of developing endometrial cancer. Br J Cancer. 2014;111(7):1432–9.
40.
Lucenteforte E, Bosetti C, Talamini R, Montella M, Zucchetto A, Pelucchi C, et al. Diabetes and endometrial cancer: effect modification by body weight, physical activity and hypertension. Br J Cancer. 2007;97(7):995–8.
Total words in MS: 3985
Total words in Title: 21
Total words in Abstract: 267
Total Keyword count: 6
Total Images in MS: 5
Total Tables in MS: 2
Total Reference count: 40