A
A
DongjuLi1,4Emaillidongju@shmu.edu.cn
JiaqiLiu2,4Emailliujiaqi@fudan.edu.cn
HuimingXu3,4Emailhuimingxu@sina.com
ShulanMa1Emailslma@fudan.edu.cn
1Experimental Teaching Center of Basic Medical Science, School of Basic Medical SciencesFudan UniversityShanghaiChina
2Student Affairs Office, School of Basic Medical SciencesFudan UniversityShanghaiChina
3Nanjing West Road Community Health Service CenterShanghaiChina
4165 Cheng-du North RoadP.O. Box 117200032ShanghaiChina
Dongju Li1†; Jiaqi Liu2†; Huiming Xu3†; Shulan Ma1
1. Experimental Teaching Center of Basic Medical Science, School of Basic Medical Sciences, Fudan University, Shanghai, China.
2. Student Affairs Office, School of Basic Medical Sciences, Fudan University, Shanghai, China.
3. Nanjing West Road Community Health Service Center, Shanghai, China.
D. Li: P.O. Box 117. 138 Yi-Xue-Yuan Road, Shanghai, 200032, China; e-mail: lidongju@shmu.edu.cn
J. Liu: P.O. Box 275. 138 Yi-Xue-Yuan Road, Shanghai, 200032, China; e-mail: liujiaqi@fudan.edu.cn
H. Xu: 165 Cheng-du North Road, Shanghai, 20040, China; e-mail: huimingxu@sina.com
D. Li & J. Liu & H. Xu contributed equally to this work.
Correspondence should be addressed to: Shulan Ma. PhD, P.O. Box 117. 138 Yi-Xue-Yuan Road, Shanghai, 200032, China; e-mail: slma@fudan.edu.cn
Longitudinal Trajectories of Empathy among Chinese Medical Students: A Five-Year Prospective Study Using a Dual-Scale Assessment
Abstract
Background
Empathy is a core competence in medical practice that contributes to patient outcomes and physician well-being. Although widely studied, the developmental trajectory of empathy during medical education remains unclear, particularly in China. To date, no long-term longitudinal studies have examined empathy development among Chinese medical students.
Methods
We conducted a five-year prospective longitudinal study among medical students at Fudan University. Both the Jefferson Scale of Empathy-Student version (JSE-S) and the Interpersonal Reactivity Index (IRI) were administered annually from 2019–2023. Mixed-effects models were used to analyze changes in empathy and their predictors, with fixed effects including academic year, age, career aspiration, and entry year.
Results
A total of 104 students provided 266 valid responses across multiple waves. The JSE-S scores significantly decreased over time, with fifth-year scores markedly lower than first-year scores (B = -11.57, p < .001). Older students and those with stronger medical career aspirations reported higher JSE-S scores. IRI scores positively predicted JSE-S scores (B = 0.40, p < .001) but remained stable across academic years, suggesting that trait empathy was unaffected by medical training.
Conclusions
Clinical empathy among Chinese medical students significantly decreased over five years, whereas trait empathy remained stable. These findings highlight the distinct developmental trajectories of clinical and trait empathy and emphasize the value of longitudinal dual-scale assessment. Educational interventions are needed to sustain empathy as a professional competence throughout medical training.
Keywords:
empathy
Chinese medical students
longitudinal study
Jefferson Scale of Empathy
Interpersonal Reactivity Index
A
A
Background
Empathy is commonly defined as the capacity to perceive, understand, and share the emotions or experiences of others, and it plays a critical role in medical practice. In medical contexts, this construct is conceptualized as clinical empathy, encompassing not only emotional resonance and a cognitive understanding of patients’ experiences but also emotion regulation to maintain clinical judgment and the ability to convey understanding through caring behaviors. The Jefferson Scale of Empathy (JSE) was specifically developed to assess clinical empathy in healthcare settings, particularly its student version (JSE-S) for medical students [1, 2]. In contrast, the Interpersonal Reactivity Index (IRI) evaluates trait empathy, reflecting stable empathic tendencies in general social contexts [3, 4]. Employing both instruments provides a comprehensive evaluation, capturing context-sensitive clinical capacities alongside more enduring dispositional empathy.
Clinical empathy is a core component of professional competence in medicine, contributing not only to increased patient satisfaction, treatment adherence, and clinical outcomes [5, 6] but also to reduced physician burnout, increased job satisfaction, and improved mental and physical well-being [7–9]. The development of empathy among medical students may be influenced by demographic factors (e.g., age, gender, career aspirations, parental education) and educational factors (e.g., curriculum design, clinical exposure, role modeling) [10, 11]. Additionally, professional identity formation (PIF) has been proposed as a mechanism that reinforces the recognition of empathy’s importance and facilitates its internalization as a stable professional behavior [1, 12, 13].
Although the critical importance of empathy is widely recognized, its developmental trajectory throughout medical education remains unclear. Systematic reviews synthesized approximately 60 empirical studies conducted between 1998 and 2019 across North America, Europe, and Asia. Taken together, the findings are inconsistent: 27 studies reported a decline in empathy during medical education, 20 reported no significant change, and 13 reported an increase. In contrast, among the 13 longitudinal studies, the results were more convergent, with 10 showing decreases, 2 remaining stable, and only 1 short-term study (1 year) reporting an increase [11, 14–16]. This pattern suggests that longitudinal designs provide greater reliability in capturing developmental trends. Therefore, combining longitudinal tracking with dual-scale assessments allows for a more nuanced and robust evaluation of empathy development.
In China, despite hosting the largest population of medical students worldwide, long-term longitudinal studies examining empathy development remain scarce. To address this gap, the present study employed a longitudinal design, following the same cohort of students enrolled in the five-year clinical medicine program over the course of five years. Both the JSE-S and the IRI were administered with repeated measures analysis to depict empathy trajectories and explore potential demographic and educational predictors across the entire medical education cycle. The findings are anticipated to enrich empirical evidence on empathy development in the Chinese context and provide theoretical and practical insights for optimizing empathy cultivation in medical education.
Methods
Participants
A
A five-year prospective longitudinal study involving clinical medicine students enrolled in a five-year undergraduate program at Fudan University was conducted. The initial data collection yielded 383 questionnaires. To ensure anonymity, identifiers were coded using the first three and last four digits of participants’ mobile phone numbers (e.g., 189****9292). After 11 questionnaires with incomplete identifiers were excluded, 372 valid responses remained.
For longitudinal analyses, only students who completed two or more survey waves were included (n = 104), resulting in a total of 266 valid questionnaires. Students who responded only once (n = 106) were excluded from the longitudinal modeling. The final analytic sample thus represented 49.5% of the participants with at least two valid responses (104/210) and accounted for 69.6% of all collected questionnaires (266/383).
Owing to staggered entry into the study, participants’ first responses occurred in different academic years (ranging from the first year to the fourth year).
A
For each participant, the first available response was defined as the baseline, and subsequent responses were aligned accordingly for longitudinal analysis.
Instruments
The self-administered questionnaire consisted of three sections:
• Section A collected demographic information, including gender, age, and other background variables.
• Section B included the Chinese version of the Jefferson Scale of Empathy–Student version (JSE-S), comprising 20 items rated on a 7-point Likert scale (1 = strongly disagree to 7 = strongly agree). Half of the items were positively worded, and half were negatively worded. The total scores ranged from 20 to 140, with higher scores reflecting greater levels of clinical empathy. The JSE-S has demonstrated satisfactory psychometric reliability in previous studies [17, 18].
• Section C included the Chinese version of the Interpersonal Reactivity Index (IRI), which assesses general empathy. It contains 22 items rated on a 5-point Likert scale (1 = strongly disagree to 5 = strongly agree), with 5 items negatively worded. The total scores ranged from 22 to 110, with higher scores reflecting stronger dispositional empathy.
Procedures
We recruited class advisors—faculty members at Chinese universities responsible for both academic guidance and student affairs—as data collectors. After standardized training to ensure procedural consistency, collectors distributed the online questionnaire via class WeChat groups and invited students to participate voluntarily during the same period each academic year. The first wave of data collection took place in October 2019, and the final wave was completed in October 2023.
A
An explanatory statement was provided at the beginning of the questionnaire, informing students that participation was voluntary and anonymous and that data would be reported in aggregate for research purposes only. Completion of the questionnaire was regarded as implied informed consent.
A
The study protocol was approved by the Research Ethics Committee of the School of Basic Medical Sciences, Fudan University.
Statistical Analyses
All analyses were conducted via IBM SPSS Statistics version 20.0. Descriptive statistics were performed on baseline data (i.e., each participant’s first available response), and the internal consistency of the JES-S and IRI was assessed via Cronbach’s α.
To evaluate changes in empathy over time and its associated factors, a linear mixed-effects model was constructed with the JSE-S total score as the dependent variable. Fixed effects included categorical variables—academic year, age group, career aspiration, and entry year (i.e., the year in which a student first joined the study)—as well as the continuous covariates of the total IRI score. The entry year was modeled as a categorical factor to adjust for potential cohort-specific differences due to staggered enrollment, especially considering structural disruptions such as the COVID-19 pandemic.
Participant ID was treated as a random effect to account for repeated measurements, with an autoregressive covariance structure [AR(1)] assumed. Type III sums of squares were used for hypothesis testing (α = 0.05).
A parallel model was constructed with the IRI total score as the dependent variable to compare predictors of trait empathy, using the same fixed and random effects structure.
Results
1. Descriptive Statistics
Baseline data, including data from 104 medical students, were obtained from the first measurement. Most participants were female (63.46%) and aged ≤ 19 years (82.69%). The majority were in their first (59.62%) or second (33.65%) year of study. A high proportion (88.46%) reported a willingness to pursue a medical career, and 45.19% held student leadership roles. Additionally, 58.65% were from one-child families. The participants' hometowns spanned cities, towns, and rural areas (Table 1).
Table 1
Sample Characteristics at Baseline Measurement (N = 104)
Variable | Category | n | % |
|---|
Gender | Male | 38 | 36.54 |
| | Female | 66 | 63.46 |
Age group | ≤ 19 years | 86 | 82.69 |
| | 20–22 years | 18 | 17.31 |
School year | 1st year | 62 | 59.62 |
| | 2nd year | 35 | 33.65 |
| | 3rd year | 4 | 3.85 |
| | 4th year | 3 | 2.88 |
Student leadership role | Yes | 47 | 45.19 |
| | No | 57 | 54.81 |
Only child status | Yes | 61 | 58.65 |
| | No | 43 | 41.35 |
Hometown location | Municipality | 19 | 18.27 |
| | Provincial capital | 16 | 15.38 |
| | Small/medium city | 30 | 28.85 |
| | Town | 23 | 22.12 |
| | Rural area | 16 | 15.4 |
Career aspiration: willingness to become a doctor | Willing | 92 | 88.46 |
| | Unwilling | 1 | 0.96 |
| | Undecided | 11 | 10.58 |
[Insert Table 1 near here.]
Both the JSE-S and the IRI demonstrated good internal consistency, with Cronbach’s α values of 0.80 and 0.78 at baseline and 0.82 and 0.79 across all valid responses, respectively.
Across the full dataset, the JSE-S total scores ranged from 72–134 (theoretical range: 20–140), with a mean of 106.84 (SD = 11.89). The IRI scores ranged from 52 to 106, with a mean of 78.32 (SD = 8.80). The skewness and kurtosis values were − 0.12 and − 0.20 for JSE-S and − 0.03 and − 0.26 for IRI, respectively. All corresponding Z values were within ± 1.96, indicating approximate normal distributions for both scales. No outliers were detected.
2. Model Fit and Fixed Effects
A linear mixed-effects model was constructed with the JSE-S total score as the dependent variable. When an AR(1) covariance structure and maximum likelihood estimation were used, the model fit was good: −2 restricted log-likelihood = 1923.91; AIC = 1929.91; and BIC = 1940.51.
The covariance parameter estimates indicated an AR (1) diagonal variance of 81.21 (SE = 14.08), an autoregressive coefficient (rho) of 0.20 (SE = 0.17), and a subject-level random intercept variance of 25.70 (SE = 13.98).
Type III tests of fixed effects revealed that academic year (F = 4.68, p = .001), age group (F = 5.71, p = .004), career aspiration (F = 4.31, p = .014), and the IRI score (F = 25.77, p < .001) were significant predictors of JES-S. Entry year was modeled as a categorical factor and did not have a significant main effect (F = 1.54, p = .207) (see Table 2).
Table 2
Tests of Fixed Effects in the Mixed Linear Model Predicting Empathy (JSE-S)
Predictor | df (num/den) | F | p |
|---|
Academic year | 4 / 170.60 | 4.68 | .001 |
Age group | 2 / 229.70 | 5.71 | .004 |
Career aspiration | 2 / 237.68 | 4.31 | .014 |
Entry year | 3 / 112.72 | 1.54 | .207 |
IRI score | 1 / 234.05 | 25.77 | < 0.001 |
| Note. Fixed effects were tested using Type III F-tests with numerator (num) and denominator (den) degrees of freedom. |
[Insert Table 2 near here.]
3. Negative impact of academic year on JSE-S scores
Controlling for IRI, age group, career aspiration, and entry year, academic year had a significant main effect on JES-S scores (F (4, 170.60) = 4.68, p = .001). The empathy scores were highest in the first year and showed a general declining trend across subsequent academic years. Compared with those of first-year students, JES-S scores were significantly lower in the third year (B = − 6.29, p = .019, 95% CI [− 11.55, − 1.04]) and in the fifth year (B = − 11.57, p < .001, 95% CI [− 17.69, − 5.56]) (see Table 3). Bonferroni-adjusted pairwise comparisons confirmed significant differences between the 5th and 1st years (p = .002) and between the 5th and 2nd years (p = .001). Differences between the 5th and 4th years (p = .075) and between the 3rd and 2nd years (p = .088) showed marginal significance (see Fig. 1).
Table 3
Fixed Effects Estimates from the Mixed Linear Model Predicting Empathy (JES), Adjusted for IRI
Parameter | Estimate | SE | df | t | p | 95% (CI) |
|---|
Intercept | 80.70 | 8.84 | 177.98 | 9.13 | < 0.001 | [63.25, 98.14] |
IRI | 0.40 | 0.08 | 234.05 | 5.01 | < 0.001 | [0.24, 0.55] |
Academic year |
First year (ref.) | – | – | – | – | – | – |
Second year | -0.59 | 1.55 | 141.68 | -0.39 | .706 | [-3.66, 2.49] |
Third year | -6.29 | 2.67 | 224.89 | -2.36 | .019 | [-11.55, -1.04] |
Fourth year | -5.03 | 3.11 | 219.72 | -1.62 | .107 | [-11.16, 1.10] |
Fifth year | -11.57 | 3.05 | 227.30 | -3.79 | < 0.001 | [-17.58, -5.56] |
Age group |
≤ 19 years (ref.) | – | – | – | – | – | – |
20–22 years | 5.65 | 2.17 | 236.88 | 2.60 | .010 | [1.36, 9.93] |
23–25 years | 12.27 | 3.70 | 250.99 | 3.21 | .001 | [4.97, 19.56] |
Career aspirations after graduation |
Willing to be a doctor(ref.) | – | – | – | – | – | – |
Unwilling to be a doctor | -2.75 | 4.48 | 232.57 | -0.62 | .539 | [-11.57, 6.07] |
Undecided | -5.67 | 1.93 | 242.80 | -2.94 | .004 | [-9.47, -1.87] |
| Note. Estimates are unstandardized regression coefficients (B) from the mixed linear model using maximum likelihood estimation with an AR(1) covariance structure. Reference groups are indicated as “ref.”. SE = standard error; df = denominator degrees of freedom; CI = confidence interval. |
| Results |
[Insert Table 3 near here.]
[Insert Fig. 1 near here.]
4. Age-related increases in JSE-S scores
Older students reported significantly higher empathy scores (F (2, 229.71) = 5.71, p = .004). Compared with students aged ≤ 19 years, those aged 20–22 years scored higher (B = 5.65, p = .010, 95% CI [1.36, 9.93]), and those aged 23–25 years scored even higher (B = 12.27, p = .001, 95% CI [4.97, 19.56]) (see Table 3).
Bonferroni-adjusted comparisons revealed that both the 23–25 and 20–22 groups scored higher than ≤ 19 (p = .003 and p = .030, respectively). The 23–25 group also scored higher than the 20–22 group did, with marginal significance (p = .068) (see Fig. 1).
5. Effects of Career Aspiration on JSE-S scores
Career aspiration significantly influenced JES-S scores (F (2, 237.68) = 4.31, p = .014). Students who expressed a willingness to become doctors scored significantly higher than those who were uncertain did (B = − 5.67, p = .004, 95% CI [− 9.47, − 1.87]) (see Table 3).
6. Positive Predictive Effect of IRI on JES-S
Pearson correlation analysis revealed a moderate positive association between the JSE-S score and the IRI (r = .371, p < .001). In mixed linear models, IRI scores significantly predicted JSE-S (B = 0.40, p < 0.001) even after adjusting for age, grade, career aspiration, and entry year. However, the IRI scores remained stable across academic years (F (4, 171.34) = 0.57, p = .685), with no significant associations for age group, career aspiration, or entry year in the adjusted models. This stability contrasts with the significant longitudinal decline in JES-S scores (see Fig. 2).
[Insert Fig. 2 near here.]
Discussion
To our knowledge, this study is the first longitudinal investigation in China examining the trajectory of empathy among undergraduate medical students. Our findings revealed a general downward trend in context-specific clinical empathy, as measured by the JSE-S, over the five years of medical education. In contrast, scores on the IRI, which reflect stable empathic traits, remained relatively constant throughout the study period. Additionally, older students were found to score higher on the JSE-S, suggesting that age has a positive association with clinical empathy. Career aspiration was also a significant predictor of empathy: students who were undecided about pursuing a medical career had significantly lower JSE-S scores than those who clearly expressed their willingness to become doctors.
In China, the five-year clinical medicine program typically follows a "4 + 1" training model, consisting of four years of coursework followed by one year of clinical internship. At Fudan University, a comprehensive university [19], the medical school curriculum is structured as follows: The first year mainly consists of general education and foundational science courses with limited medical content. The second year focuses on basic medical sciences. The first semester of the third year continues with basic medical sciences (accounting for three-quarters of weekly contact hours), whereas the second semester marks a full transition to clinical coursework and incorporates bedside teaching. The fourth year continues with clinical coursework and clerkships, and the fifth year consists of full-time clinical internships with rotations through multiple hospital departments. Given the limited clinical exposure of students in the early years, this study employed both the JSE-S, which assesses context-specific empathy in healthcare settings, and the IRI, a generic empathy scale, to provide a comprehensive evaluation of medical students’ empathy. The use of dual instruments helped offset the limitations of using a single scale and minimized the interpretive bias caused by differing measurement emphases.
A key finding of this study is the divergent trajectories of the two empathy measures: JSE-S scores showed an overall declining trend across academic years, reaching their lowest point in the fifth year during the internship, whereas IRI scores remained stable throughout the same period. This discrepancy may be explained by the fundamental differences in the measurement focus of the two scales [20]. The IRI captures trait empathy, a relatively stable personality disposition shaped by long-term life experiences, whereas the JSE-S assesses context specific, clinical empathy, particularly the cognitive and behavioral aspects of physician–patient interactions, which are subject to situational modulation. Our findings suggest that while dispositional empathy remained intact, students’ capacity for empathic expression in clinical contexts diminished over time. This underscores that empathy is not merely a personal trait but also a context-sensitive construct shaped by educational environments, psychological factors, and curricular structures.
This distinction is also supported in the literature. Studies using the IRI have often reported stable [21–23] or only minimally varying empathy levels [24] across medical education and low sensitivity to educational interventions [25]. In contrast, JSE scores are more responsive to contextual factors such as curriculum design, patient expectations, and cultural factors [26], exhibiting greater malleability in response to educational interventions [27]. For example, educational interventions aimed at enhancing empathy often yield significant improvements in JSE scores but produce negligible [28] or no changes [27] in IRI scores.
Our findings are consistent with those of prior studies in China that documented a decline rather than an increase in empathy among medical students [29, 30]. Similar trends have been observed in other countries, including the United States and Iran [31, 32]. This decline aligns with our previous cross-sectional study of medical students in an eight-year program, which revealed a significant decrease in empathy during the internship phase. Several factors may contribute to this decline. First, upon transitioning to clinical rotations, students are immersed in a high-intensity, high-workload clinical environment that may compel them to prioritize technical tasks, reducing opportunities for physician‒patient communication. This results in a task-oriented approach overshadowing humanistic care, thereby suppressing opportunities for empathic engagement [17, 33]. Second, students may resort to psychological self-protection strategies such as “dehumanizing patients” to shield themselves from the emotional burden associated with pain, suffering, and death, consequently attenuating empathic expression [14]. Third, structural limitations in medical education may also contribute: current admission criteria emphasize standardized academic examination scores with limited attention to humanistic attributes. Once enrolled, the curriculum often prioritizes biomedical knowledge and technical competencies over the cultivation of emotional competence. Upon entering clinical settings, a hospital culture dominated by professional detachment, technological orientation, and efficiency-first principles may further hinder the development of empathy [33].
In contrast, trait empathy, as measured by the IRI, a dispositional and personality-based construct, appears more resistant to contextual factors such as professional role expectations and curricular structures, showing high cross-situational stability throughout medical education. Our data further revealed a significant positive correlation between the IRI and JSE scores: each one-point increase in the IRI was associated with an average increase of 0.4 points in the JSE. This suggests that although trait empathy remains stable, it continues to play a facilitating role in students’ context-specific empathic expression in clinical settings.
We also found that age was a significant positive predictor of JSE scores: older students scored significantly higher than younger students did, whereas IRI scores remained stable across age groups. This pattern may be interpreted via Richter’s model of "multidirectional age differences in empathy" [34], which divides empathy into three core dimensions: cognitive empathy (the ability to perceive another's emotions accurately), affective empathy (the capacity to share others’ emotions), and sympathy (concern for others' predicaments and compassionate expressions). According to this theory, sympathy tends to increase steadily with age and is relatively resistant to contextual factors, whereas cognitive and affective empathy are more sensitive to situational influences. Given that many JSE items closely align with the sympathy dimension conceptualized by Richter—emphasizing physicians’ understanding, compassion, and concern for patients—which may explain the age-related increase in JSE scores observed in our study. In contrast, the IRI captures dispositional cognitive and affective empathy without being anchored to specific contexts. Richter emphasized that age-related differences in these two dimensions are more likely to emerge in emotionally salient or context-specific situations and are often absent in neutral or decontextualized settings. This may help explain why the IRI scores, which assess empathy in a generalized, nonsituational manner, remained stable across age groups in our study.
Importantly, although age had a positive effect on JSE scores, the overall trend still showed a decline across academic years. This finding indicates that the development of empathy is shaped not only by age-related factors but also by contextual influences. As students progress through medical school, increasing academic demands, pressures related to postgraduate advancement and career preparation, and increasing clinical challenges may collectively erode their capacity for context-specific empathic expression in clinical settings. In other words, the potentially beneficial effects of age on empathy may be attenuated or even overridden by the modulatory influences of the educational and clinical environment, thereby contributing to the observed downward trend in JSE scores. From this perspective, the relationship between age and empathy should be understood as a contextually modulated, multidirectional pattern whose manifestations are jointly shaped by factors such as the type of measurement tool, the empathy dimension assessed, and the educational environment. This may help explain the inconsistent findings regarding age-related empathy reported in the literature [29, 35, 36].
Our previous cross-sectional studies revealed a strong association between medical students’ career aspirations and their empathy levels [17, 18]. Specifically, students who aspired to pursue a medical career after graduation consistently presented higher JSE scores than those who were uncertain or did not plan to pursue medical careers. The present longitudinal findings are consistent with earlier results, further confirming that career aspiration is a key determinant of JSE scores. Mixed linear model analyses revealed that students who expressed a willingness to become doctors had significantly higher JSE scores than those who were undecided did, with a mean difference of 5.67 points (β = -5.67, p = 0.004). Although no significant difference in JSE scores was observed between the two groups at baseline (t (101) = -1.28, p = 0.203; Cohen's d = -0.41) and both groups exhibited similar rates of decline over time (time × career interaction effect, P > 0.05), students intending to pursue a medical career consistently demonstrated higher JSE scores throughout their medical education. This difference is unlikely to stem from divergent trajectories of empathy development but rather suggests that the divergence emerged and was sustained during the course of medical education.
Given the close association between career aspirations and professional identity formation (PIF) and the well-documented role of empathy as a core component of PIF with strong positive correlations [13], it is plausible that PIF may serve as a mediating factor in the relationship between students’ career aspirations and their empathy levels. PIF involves both the internalization of professional roles, values, and behaviors and the integration of individual characteristics with the expectations of the medical profession. Empathy acts as a key bridge in this process of integration and internalization [37]. We speculate that students with a clear intention to pursue a medical career are more likely to develop a stable and coherent professional identity throughout their training, which in turn helps to reinforce and sustain their empathic attitudes and behaviors. Although the JSE scores showed an overall decline during medical education, the positive effects of PIF may be retained and reflected in the consistently higher JSE scores observed among career-committed students. Future studies should incorporate validated PIF assessment tools to explore their mediating role in this relationship and to deepen our understanding of empathy development within the context of medical education.
Limitations
Despite the strengths of its five-year longitudinal design and the use of dual measurement tools, this study has several limitations. First, the sample was drawn from a single university, which may limit the generalizability of the findings. Second, both the JSE-S and the IRI are self-reported measures that assess empathy-related attitudes rather than actual behaviors and are subject to social desirability bias. Future studies should complement self-reports with additional methods such as direct observation, peer evaluations, and feedback from standardized patients. Finally, this study followed students only through the first semester of their final year, excluding the complete internship period. Since empathy and professional identity continue to evolve during clinical training, extending the observation period through graduation would provide a more complete view of the developmental trajectory.
Conclusion
This study is the first longitudinal investigation of empathy development among Chinese undergraduate medical students. The findings revealed a general decline in context-specific clinical empathy (JSE) over five years, reaching its lowest point during the internship, whereas trait-based empathy (IRI) remained relatively stable. This divergence underscores the significant impact of educational context on shaping the expression of clinical empathy. Age positively predicted JSE scores, supporting the theory of multidirectional age-related empathy development. Moreover, career aspiration emerged as an important factor, with students who clearly intended to become doctors consistently achieving higher JSE scores. These results indicate that empathy is shaped by both individual dispositions and educational exposures. Medical educators should therefore prioritize curriculum design, learning environments, and clinical training that foster the sustained development of empathy throughout medical education.
Abbreviation
JSE-S
Student Version of the Jefferson Scale of Empathy
IRI
Interpersonal Reactivity Index
PIF
Professional Identity Formation
A
Acknowledgement
The authors wish to thank Spencer XU for her critical role in data cleaning. During the preparation of this manuscript, the authors used ChatGPT-4 for language polishing. The authors take full responsibility for the content. We would also like to thank AJE for the English language check.
A
Author Contribution
DL and JL contributed to the establishment of the questionnaire database and data collection. HX performed the data analysis and interpretation. SM conceived and designed the study, conducted the data analysis, and drafted the manuscript. All the authors have read and approved the final manuscript.
Availability of data and materials
The data presented in this manuscript have not been published elsewhere. Data from this project will not be shared. Consent was not sought from participants to share the data more widely than for this study.
Reference
1.Kelly CP. Empathic instruction through literary narratives: A quasi-experimental study of an occupational therapy course [dissertation]. ProQuest Dissertations & Theses Global. 2021. Order No. 28681265.
2.Hojat M. The Jefferson scale of physician empathy development and preliminary psychometric data. Educ Psychol Meas. 2011;71(1):186–91.
3.Hojat M, Gonnella JS. What Matters More About the Interpersonal Reactivity Index and the Jefferson Scale of Empathy? Their Underlying Constructs or Their Relationships With Pertinent Measures of Clinical Competence and Patient Outcomes? Acad Med. 2017;92(6):743–5. 10.1097/ACM.0000000000001424.
4.Costa P, de Carvalho-Filho MA, Schweller M, Thiemann P, Salgueira A, Benson J, et al. Measuring Medical Students' Empathy: Exploring the Underlying Constructs of and Associations Between Two Widely Used Self-Report Instruments in Five Countries. Acad Med. 2017;92(6):860–7. 10.1097/ACM.0000000000001449.
5.Kim SS, Kaplowitz S, Johnston MV. The effects of physician empathy on patient satisfaction and compliance. Eval Health Prof. 2004;27(3):237–51. 10.1177/0163278704267037.
6.Hojat M, Louis DZ, Markham FW, Wender R, Rabinowitz C, Gonnella JS. Physicians' empathy and clinical outcomes for diabetic patients. Acad Med. 2011;86(3):359–64. 10.1097/ACM.0b013e3182086fe1.
7.Thirioux B, Birault F, Jaafari N. Empathy is a protective factor of burnout in physicians: new neuro-phenomenological hypotheses regarding empathy and sympathy in care relationship. Front Psychol. 2016;7:763. 10.3389/fpsyg.2016.00763.
8.Gleichgerrcht E, Decety J. Empathy in clinical practice: how individual dispositions, gender, and experience moderate empathic concern, burnout, and emotional distress in physicians. PLoS ONE. 2013;8(4):e61526. 10.1371/journal.pone.0061526.
9.Paro HB, Silveira PS, Perotta B, Gannam S, Enns SC, Giaxa RR, et al. Empathy among medical students: is there a relation with quality of life and burnout? PLoS ONE. 2014;9(4):e94133. 10.1371/journal.pone.0094133.
10.Chen H, Xuan H, Cai J, Liu M, Shi L. The impact of empathy on medical students: an integrative review. BMC Med Educ. 2024;24:455. 10.1186/s12909-024-05448-5.
11.Andersen FA, Johansen AB, Søndergaard J, Andersen CM, Assing Hvidt E. Revisiting the trajectory of medical students' empathy, and impact of gender, specialty preferences and nationality: a systematic review. BMC Med Educ. 2020;20:52. 10.1186/s12909-020-1964-5.
12.Cruess RL, Cruess SR, Steinert Y. Amending Miller's Pyramid to Include Professional Identity Formation. Acad Med. 2016;91(2):180–5. 10.1097/ACM.0000000000000913.
13.Baseer M, Mahboob U, Shaheen N, Mehboob B, Abdullah S, Siddique A. Effectiveness of empathy portfolios in developing professional identity formation in medical students: a randomized controlled trial. BMC Med Educ. 2024;24:600. 10.1186/s12909-024-05529-5.
14.Neumann M, Edelhäuser F, Tauschel D, Fischer MR, Wirtz M, Woopen C, et al. Empathy decline and its reasons: a systematic review of studies with medical students and residents. Acad Med. 2011;86(8):996–1009. 10.1097/ACM.0b013e318221e615.
15.Ferreira-Valente A, Monteiro JS, Barbosa RM, Salgueira A, Costa P, Costa MJ. Clarifying changes in student empathy throughout medical school: a scoping review. Adv Health Sci Educ Theory Pract. 2017;22(5):1293–313. 10.1007/s10459-016-9704-7.
16.Ponnamperuma G, Yeo SP, Samarasekera DD. Is empathy change in medical school geo-socioculturally influenced? Med Educ. 2019;53(7):655–65. 10.1111/medu.13819.
17.Li D, Xu H, Kang M, Ma S. Empathy in Chinese eight-year medical program students: differences by school year, educational stage, and future career preference. BMC Med Educ. 2018;18:241. 10.1186/s12909-018-1348-2.
18.Yi K, Kang M, Li D, Wang Z, Bai J, Xu H, et al. A multi-institutional and cross-sectional study on empathy in Chinese medical students: differences by student cadre or not, future career preference, and father's education status. BMC Med Educ. 2020;20:24. 10.1186/s12909-020-1935-x.
19.Academic Affairs Office of Fudan University. Curriculum framework for 5-year clinical medicine program. (2019 edition). In: Undergraduate Education Handbook. Shanghai: Fudan University; 2019. pp. 574–578.
20.Costa P, de Carvalho-Filho MA, Schweller M, Thiemann P, Salgueira A, Benson J, et al. Measuring Medical Students' Empathy: Exploring the Underlying Constructs of and Associations Between Two Widely Used Self-Report Instruments in Five Countries. Acad Med. 2017;92(6):860–7. 10.1097/ACM.0000000000001449.
21.Bratek A, Bulska W, Bonk M, Seweryn M, Krysta K. Empathy among physicians, medical students and candidates. Psychiatr Danub. 2015;27(Suppl 1):S48–52.
22.Quince T, Thiemann P, Benson J, Hyde S. Undergraduate medical students' empathy: Current perspectives. Adv Med Educ Pract. 2016;7:443–55. 10.2147/AMEP.S76800.
23.Quince TA, Parker RA, Wood DF, Benson JA. Stability of empathy among undergraduate medical students: a longitudinal study at one UK medical school. BMC Med Educ. 2011;11:90. 10.1186/1472-6920-11-90.
24.Stratton TD, Saunders JA, Elam CL. Changes in medical students' emotional intelligence: an exploratory study. Teach Learn Med. 2008;20(3):279–84. 10.1080/10401330802199625.
25.Kim KJ. Project-based learning approach to increase medical student empathy. Med Educ Online. 2020;25(1):1742965. 10.1080/10872981.2020.1742965.
26.Kataoka HU, Koide N, Ochi K, Hojat M, Gonnella JS. Measurement of empathy among Japanese medical students: Psychometrics and score differences by gender and level of medical education. Acad Med. 2009;84(9):1192–7. 10.1097/ACM.0b013e3181b180d4.
27.Kim K, Kim SH, Yoon H, Shin HS, Lee YM. Assessing the effects of an empathy education program using psychometric instruments and brain fMRI. Adv Health Sci Educ Theory Pract. 2020;25(2):283–95. 10.1007/s10459-019-09918-0.
28.Schweller M, Costa FO, Antônio MÂ, Amaral EM, de Carvalho-Filho MA. The impact of simulated medical consultations on the empathy levels of students at one medical school. Acad Med. 2014;89(4):632–7. 10.1097/ACM.0000000000000195.
29.Huang L, Thai J, Zhong Y, Peng H, Koran J, Zhao XD. The Positive Association Between Empathy and Self-Esteem in Chinese Medical Students: A Multi-Institutional Study. Front Psychol. 2019;10:1921. 10.3389/fpsyg.2019.01921.
30.Geng Y, Fei W, Tang Z, Wang S, Yu J, Zhang M, et al. Parental care and depressive symptoms among Chinese medical students: roles of empathy and gender. BMC Med Educ. 2022;22:451. 10.1186/s12909-022-03524-2.
31.Hojat M, Vergare MJ, Maxwell K, Brainard G, Herrine SK, Isenberg GA, et al. The devil is in the third year: a longitudinal study of erosion of empathy in medical school. Acad Med. 2009;84(9):1182–91. 10.1097/ACM.0b013e3181b17e55.
32.Shariat SV, Habibi M. Empathy in Iranian medical students: measurement model of the Jefferson scale of empathy. Med Teach. 2013;35(1):e913–8. 10.3109/0142159X.2012.714881.
33.Howick J, Dudko M, Feng SN, Ahmed AA, Alluri N, Nockels K, et al. Why might medical student empathy change throughout medical school? a systematic review and thematic synthesis of qualitative studies. BMC Med Educ. 2023;23:270. 10.1186/s12909-023-04165-9.
34.Richter D, Kunzmann U. Age differences in three facets of empathy: Performance based evidence. Psychol Aging. 2011;26(1):60–70. 10.1037/a0021048.
35.Park KH, Roh H, Suh DH, Hojat M. Empathy in Korean medical students: Findings from a nationwide survey. Med Teach. 2015;37(10):943–8. 10.3109/0142159X.2014.956058.
36.Dyrbye LN, Satele D, West CP. Association of Characteristics of the Learning Environment and US Medical Student Burnout, Empathy, and Career Regret. JAMA Netw Open. 2021;4(8):e2119110. 10.1001/jamanetworkopen.2021.19110.
37.Rachakonda VS, Addanki S, Nasef H, Rajput V. Comparing and Contrasting Professional Identity Formation Among Health Professional Students. Cureus. 2024;16(7):e65577. 10.7759/cureus.
Figure 1. Adjusted Marginal Means of JSE-S Scores by Key Predictors
Legend: Marginal means derived from linear mixed models adjusted for the IRI (fixed at 78.32), with age, entry year, and gender held at sample means. The error bars represent 95% confidence intervals. Significance: *p < .05, **p < .01 vs. reference group (1st year/≤19 years/Willing); ### p < .001 vs. 2nd year.
Table 1. Sample Characteristics at Baseline Measurement (N = 104)
Variable | Category | n | % |
|---|
Gender | Male | 38 | 36.54 |
| | Female | 66 | 63.46 |
Age group | ≤ 19 years | 86 | 82.69 |
| | 20–22 years | 18 | 17.31 |
School year | 1st year | 62 | 59.62 |
| | 2nd year | 35 | 33.65 |
| | 3rd year | 4 | 3.85 |
| | 4th year | 3 | 2.88 |
Student leadership role | Yes | 47 | 45.19 |
| | No | 57 | 54.81 |
Only child status | Yes | 61 | 58.65 |
| | No | 43 | 41.35 |
Hometown location | Municipality | 19 | 18.27 |
| | Provincial capital | 16 | 15.38 |
| | Small/medium city | 30 | 28.85 |
| | Town | 23 | 22.12 |
| | Rural area | 16 | 15.4 |
Career aspiration: willingness to become a doctor | Willing | 92 | 88.46 |
| | Unwilling | 1 | 0.96 |
| | Undecided | 11 | 10.58 |
Table 2. Tests of Fixed Effects in the Mixed Linear Model Predicting Empathy (JSE-S)
Predictor | df (num/den) | F | p |
|---|
Academic year | 4 / 170.60 | 4.68 | .001 |
Age group | 2 / 229.70 | 5.71 | .004 |
Career aspiration | 2 / 237.68 | 4.31 | .014 |
Entry year | 3 / 112.72 | 1.54 | .207 |
IRI score | 1 / 234.05 | 25.77 | < 0.001 |
Table 3. Fixed Effects Estimates from the Mixed Linear Model Predicting Empathy (JES), Adjusted for IRI
Parameter | Estimate | SE | df | t | p | 95% (CI) |
|---|
Intercept | 80.70 | 8.84 | 177.98 | 9.13 | < 0.001 | [63.25, 98.14] |
IRI | 0.40 | 0.08 | 234.05 | 5.01 | < 0.001 | [0.24, 0.55] |
Academic year |
First year (ref.) | – | – | – | – | – | – |
Second year | -0.59 | 1.55 | 141.68 | -0.39 | .706 | [-3.66, 2.49] |
Third year | -6.29 | 2.67 | 224.89 | -2.36 | .019 | [-11.55, -1.04] |
Fourth year | -5.03 | 3.11 | 219.72 | -1.62 | .107 | [-11.16, 1.10] |
Fifth year | -11.57 | 3.05 | 227.30 | -3.79 | < 0.001 | [-17.58, -5.56] |
Age group |
≤ 19 years (ref.) | – | – | – | – | – | – |
20–22 years | 5.65 | 2.17 | 236.88 | 2.60 | .010 | [1.36, 9.93] |
23–25 years | 12.27 | 3.70 | 250.99 | 3.21 | .001 | [4.97, 19.56] |
Career aspirations after graduation |
Willing to be a doctor(ref.) | – | – | – | – | – | – |
Unwilling to be a doctor | -2.75 | 4.48 | 232.57 | -0.62 | .539 | [-11.57, 6.07] |
Undecided | -5.67 | 1.93 | 242.80 | -2.94 | .004 | [-9.47, -1.87] |