Reconsidering the ‘Predictive’ Value of Subjective Aging for Longevity and
the Impact of Confounding – Insights from Epidemiology
A
Dr.
Adrian Richter 1,2,7✉
Phone(+49)03834–75 41 Email
Sarah K. Schäfer 3,4,5
Susanne Wurm 5
Wolfgang Hoffmann 6
Till Ittermann 1
1 Section SHIP-KEF: Clinical-Epidemiological Research, Institute for Community Medicine University Medicine Greifswald Greifswald Germany
2 Epidemiology and Health Services Research German Rheumatology Research Center Berlin Germany
3 Clinical Psychology and Psychotherapy for Children and Adolescents Technische Universität Braunschweig Braunschweig Germany
4 Leibniz Institute for Resilience Research Mainz Germany
5 Section for Prevention Research and Social Medicine, Institute for Community Medicine University Medicine Greifswald Greifswald Germany
6 Section Epidemiology of Health Care and Community Health, Institute for Community Medicine University Medicine Greifswald Greifswald Germany
7
A
Institute for Community Medicine Walther-Rathenau-Str. 48 17475 Greifswald
Adrian Richter1,2*, Sarah K. Schäfer3,4,5*, Susanne Wurm5, Wolfgang Hoffmann6
& Till Ittermann1
1 Section SHIP-KEF: Clinical-Epidemiological Research, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
2 Epidemiology and Health Services Research, German Rheumatology Research Center, Berlin, Germany
3 Clinical Psychology and Psychotherapy for Children and Adolescents, Technische Universität Braunschweig, Braunschweig, Germany
4 Leibniz Institute for Resilience Research, Mainz, Germany
5 Section for Prevention Research and Social Medicine, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
6 Section Epidemiology of Health Care and Community Health, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
Authors ORCID-IDs:
AR: https://orcid.org/0000-0002-3372-2021
SKS: https://orcid.org/0000-0001-9885-3252
SW: https://orcid.org/0000-0001-6924-8299
WH: https://orcid.org/0000-0002-6359-8797
TI: https://orcid.org/0000-0002-0154-7353
Dr. Adrian Richter
Section SHIP-KEF: Clinical-Epidemiological Research,
Institute for Community Medicine
Walther-Rathenau-Str. 48
17475 Greifswald
Tel.: (+ 49) 03834–75 41
Email: adrian.richter@med.uni-greifswald.de
Dr. Adrian Richter and Sarah K. Schäfer contributed equally to this work.
Corresponding author:
Abstract
Many studies found subjective aging, that is, how individuals perceive their own aging process (self-perceptions of aging, SPA) and subjective age being associated with longevity. While accounting for confounding in regression models, some studies suggested survival advantages between 6 and 13 years for those with more positive SPA, based on unadjusted survival curves. In press articles these effects were explained by the predictive role of SPA.
This study assesses the robustness of these findings by replicating the study of Wurm and Schäfer (2022) with an expanded study population of 14,548 individuals from the German Ageing Survey. We focused on the role of confounders, such as chronological age, in adjusted survival analyses and compared the predictive value of SPA for mortality against other covariates using resampling techniques and the concordance index.
SPA related to ongoing development (SPA-OD) was strongly associated with mortality, with an unadjusted survival benefit of 9.6 years for participants with above-median SPA-OD. However, the benefit reduced to 1.2 years when confounders were included, with chronological age being the most relevant. SPA-OD’s predictive value was notably lower than for age and sex, adding SPA-OD to age and sex as a covariate improved predictive performance no more than 1.06%.
While an association between SPA-OD and mortality is evident, the survival benefit due to more positive SPA-OD is considerably smaller than previously suggested. The results highlight the importance of distinguishing association, confounding, and prediction to disentangle genuine effects, being relevant in various domains of Psychology.
Keywords:
mortality
survival
self-perceptions of aging
subjective age
confounding
balancing
A
A
Introduction
A
Subjective aging includes both self-perceptions of aging (SPA) and which age an individual feels (subjective age; SA) (Diehl & Wahl, 2015). Both concepts reflect subjective experiences of aging, with SPA and SA only sharing small to moderate correlations (Spuling et al., 2020). Research has repeatedly shown associations between subjective aging and health (Alonso Debreczeni & Bailey, 2021; Westerhof et al., 2014; Westerhof et al., 2023), with more favorable SPA and younger SA being associated with better physical and mental health as well as longevity. While many studies examined the relationship between SA and health, analyses on its association with longevity (or mortality, vice versa) – as the most finite health outcome – gained attention in aging research (Westerhof et al., 2023) and beyond in public (Goldberg, 2022; Wurm, 2022).
Concrete information on how many years longer individuals may live if they perceive aging more positively were obtained from unadjusted models in which the sample was split into two groups for illustrative purposes, with either more positive or more negative subjective aging. This approach, however, might have led to wrong interpretation of the respective survival benefits because it conceals the role of confounders.
Confounding can be a major problem in observational studies (VanderWeele, 2019), and correcting for confounding is crucial but often difficult due to unobserved confounders. By definition, a confounder affects both exposure and outcome (Greenland et al., 1999) and can bias the true association between the two so that it appears larger, smaller or even absent. In order to obtain valid estimates of the true association, relevant confounders need to be controlled for. An example of confounding is the role of maternal age in the association between “birth rank order“ (exposure) and the prevalence of Down syndrome (outcome) (Rothman & Greenland, 2005). This association suggests that the probability of Down syndrome depends on the number of children. However, when models account for the maternal age, which is the true underlying effect, the association between birth rank and Down syndrome vanishes. Confounding bias also increases when there are differences between the groups being compared regarding the confounder. For subjective aging, this means that people with more positive subjective aging may differ from those reporting more negative subjective aging in many ways related to survival, e.g., in chronological age, sex, health status.
Starting with work of Maier and Smith (1999) and Levy et al. (2002) who examined the association between unidimensional SPA and longevity, a relevant number of longitudinal observational studies investigated the relationship between subjective aging and longevity (see Table SM1). A recent meta-analysis (Westerhof et al., 2023) identified 21 samples from 19 primary studies examining the association between subjective aging and longevity, finding an overall significant positive association.
Effect estimates in the included studies were mostly controlled for some confounding variables (cf. Supplementary Table 1). Out of 19 studies, 16 employed variants of Cox proportional hazard (Cox PH) models (Cox, 1972) which is the most widely used model for the analysis of survival (Harrell, 2015). The inclusion of covariates in Cox PH models is key to control for potential confounders. If a confounder is adequately operationalized the model than yields hazard ratios (HR) that are adjusted for the impact of this confounder (Denz et al., 2023). In research into subjective aging the number and nature of covariates varied widely, ranging from two additional covariates (Markides & Pappas, 1982) to 17 covariates (Kaspar et al., 2021). Chronological age (18 out of 19 studies), gender/sex (17/19 studies), and a quantitative indicator of comorbidities (15/19 studies) were most often included as covariates.
Most studies examined the association between subjective aging and longevity based on continuous measures of subjective aging (e.g., SPA measures or discrepancy scores for SA). Comparing groups derived from categorization of continuous measures of subjective aging, is more common in research into SA, where several studies contrasted groups of people feeling younger, similar or older compared to their chronological age (Lim et al., 2013; Rippon & Steptoe, 2015). For SPA, a small number of studies used group comparisons for illustrative purposes, e.g., based on means or medians differentiating between people with more negative or positive SPA.
A
Introducing groups based on differences in SA or SPA values enables to explore imbalances between the compared groups with regard to confounders in a table of baseline characteristics stratified by different levels of SA or SPA. Such a baseline table is recomended in epidemiological research according to the CONSORT statement (Schulz et al., 2010) and the STROBE recommendations (von Elm et al., 2007). However, only three out of 19 studies provided such a table (Levy & Myers, 2005; Lim et al., 2013; Uotinen et al., 2005).
For categorical variables, such as SA or SPA groups, survival analyses may also include the Kaplan-Meier survival graphs showing the estimated percentage of individuals surviving in a particular group at a given time (Bland & Altman, 1998). As neither SPA nor SA are categorical variables per se, Kaplan-Meier curves often include median splits (Levy et al., 2002) or other approaches to derive survival estimates for different levels of SPA or SA (Wurm & Schäfer, 2022). Kaplan-Meier survival curves are generally unadjusted, i.e. do not account for confounders. Adjusted survival curves, that account for covariate adjustment, can be obtained from the fitted Cox PH model or by using balancing techniques (Denz et al., 2023). Our review of previous research indicated that six primary studies presented survival curves, of which five were unadjusted Kaplan-Meier plots (Levy & Myers, 2005; Levy et al., 2002; Lim et al., 2013; Schroyen et al., 2020; Wurm & Schäfer, 2022), while one study used survival curves adjusted for sex, education and illness burden, but not chronological age (Benyamini & Burns, 2020). Based on unadjusted survival curves, Levy et al. (2002) as well as Wurm and Schäfer (2022), who replicated the hallmark study of Levy et al. (2002) with a multidimensional assessment of SPA, reported survival advantages of 7.5 years and 13 years, respectively.
Another limitation of previous research derives from the approaches to account for confounding introduced by specific comorbidities, with the vast majority of studies using sum scores reflecting the total number of comorbid diseases (Kotter-Grühn et al., 2009; Levy & Bavishi, 2018; Wurm & Schäfer, 2022). This approach combines a diverse range of diseases with varying impact on mortality. For example, a sum score of three can be caused by the presence of gastrointestinal disorders, back pain, and visual problems, as well as by the presence of diabetes, heart attack, and cancer. The latter have a significantly larger impact on mortality (Ahmad & Anderson, 2021; Hübner et al., 2021). As we have observed, using a sum score of heterogeneous diseases to adjust for comorbidities, it is possible that the presumed detrimental effect on mortality is replaced by an, yet non-significant, effect numerically trending into the direction of a protective effect (Wurm & Schäfer, 2022).
To summarize, some of the analytical and methodological approaches used in previous research on the link of subjective aging and longevity suggest further evaluation. The main challenges are: (1) the insufficient control of confounders in survival curves, (2) a lack of analyses suitable to examine potentially causal relationships, (3) missing analyses on the predictive value of subjective aging, (4) the reliance on unadjusted models, and (5) the inability to randomize the exposure.
Yet, the study of survival is not unique to aging research. Survival is a common outcome in epidemiological studies, which nowadays provide valuable insights for reconsidering the predictive value of subjective aging for survival (Newman & Murabito, 2013). In the field of Epidemiology, certain measures had to be implemented and shortcomings to be acknowledged following calls for more modest epidemiological research (Boffetta et al., 2008; Ioannidis, 2005). The use of causal language, for example, is often not recommended in the context of observational designs (Olarte Parra et al., 2021; VanderWeele, 2021) unless rigorous methodological approaches such as balancing techniques are applied (Hainmueller, 2012; Rosenbaum & Rubin, 1983; Stuart, 2010).
For the current project, researchers from Epidemiology and Psychology joint forces aiming to check previous findings on subjective aging and longevity for robustness. Based on the previous publication of Wurm and Schäfer (2022), we will reproduce the results in the 1996 sample of the German Ageing Survey (DEAS) and address the abovementioned shortcomings. We will focus on SPA due to the small (Westerhof et al., 2023) to non-significant (Wurm & Schäfer, 2022) associations of lower SA and longevity. Moreover, based on the results of Wurm and Schäfer (2022), we concentrate our analyses on positive SPA related to ongoing development (SPA-OD), which was found the only significant correlate of survival in the DEAS sample. We will further extend the study population to three independent DEAS samples in 2002, 2008, and 2014 to test the results for robustness. The predictive value of SPA will be assessed using methods for benchmarking prediction models. Based on these results, we aim at deriving implications for research into subjective aging and related psychological fields.
Methods
Data sources
Since 1996 the German Ageing Survey (DEAS) comprises several representative data collections from the German population 40 + years (Klaus et al., 2017; Vogel et al., 2021). A cohort-sequential study design is applied, i.e., similar samples of the population are drawn from population registration offices at equidistant time intervals using a multistage random procedure. The DEAS started in 1996 with recruitment of the first sample; in 2002, 2008, and 2014 further samples were drawn. Respective cross-sections of the general population are then repeatedly interviewed at predetermined intervals but only in those study participants who consented in follow-up examinations (Fig. 1). The data are collected in face-to-face interviews (usually in participants’ homes) and via paper-pencil questionnaires, which are to be completed by the participants either immediately or within a few days after the interview. The data collection comprises, among other topics, socio-demographic and health information. An overview over all DEAS samples is provided by Klaus et al. (Klaus et al., 2017) and detailed information can be obtained from the corresponding Research Data Center (https://www.dza.de/en/research/fdz/german-ageing-survey). Funding for the German Ageing Survey is provided by the Federal Ministry for Family Affairs, Senior Citizens, Women and Youth (BMFSFJ).
All-cause mortality
Deaths of DEAS participants are collected by the Institute for Applied Social Sciences (infas, https://www.infas.de/about-infas/?lang=en) on a regular basis for participants that had consented into participation in follow-up waves. The information on deceased DEAS participants is mostly obtained during routine data collection and reaffirmed by local registration offices for those cases where the last known residence of participants has changed. So far, 1208 from N = 3022 (39.97%) participants of the DEAS 1996 deceased (DEAS 2002 = 545 (25.70%), DEAS 2008 = 692 (14.95%), DEAS 2014 = 247 (5.17%)).
Fig. 1
Flowchart of panel sequence and number of participants with consent for follow-up.
Click here to Correct
Measures
Self-perceptions of aging
Self-perceptions of aging (SPA) were measured by established multidimensional scales (AgeCog scales) that differentiate between three SPA dimensions related to (1) physical losses, (2) social losses, and (3) ongoing development (Steverink et al., 2001; Wurm et al., 2007). The latter dimension consists of four items, all starting with: Aging means to me that: (i) … I continue to make plans, (ii) … I can still learn new things, (iii) … I can still put my ideas into practice, and (iv) … my capabilities are increasing. These statements are answered each on a 4-point Likert scale ranging from 1 = “strongly agree” to 4 = “strongly disagree”.
In this study, SPA ongoing development (SPA-OD) will be considered and operationalized similarly to previous studies based on DEAS (Diehl et al., 2021; Wurm & Schäfer, 2022), i.e. item coding (1 to 4) is reversed so that higher scores correspond to more positive SPA (range of sum scores: 4–16).
Risk factors of mortality
All samples were stratified by age, sex, and region (i.e., former West or East Germany). As additional risk factors for mortality we included household income (Chetty et al., 2016), and highest level of education (Sabanayagam & Shankar, 2012) as proxies of socioeconomic status.
While the Research Data Centre provides information on comorbidity by means of a sum score (Klaus et al., 2017), information was collected in greater detail during initial DEAS data collection. In the 1996 and 2002 DEAS samples, information on comorbidities is collected during the so-called drop-off questionnaire with the following categories: (1) cardiac and circulatory disorders, (2) bad circulation, (3) joint, bone, spinal or back problems, (4) respiratory problems, asthma, shortness of breath, (5) stomach and intestinal problems, (6) cancer, (7) diabetes, (8) gall bladder, liver or kidney problems, (9) bladder problems, (10) insomnia, (11) eye problems, vision impairment, and (12) ear problems, hearing problems. From the list of comorbidities, we included (1 + 2) as cardiovascular disorders, (6) cancer, and (7) diabetes. In the 2008 and 2014 DEAS samples, additional information on comorbidities was collected within the face-to-face interview with the survey on cardiovascular diseases being more specific including previous myocardial infarction or angina pectoris, heart failure, stroke, circulatory disorders in the brain/legs, high blood pressure, and high cholesterol. For those waves, we considered a DEAS participant as having cardiovascular disorder if one of these categories was affirmed during the interview.
For the DEAS samples 2002, 2008, and 2014 also body mass index (BMI) as a proxy for the measurement of obesity (Darbà et al., 2015) and smoking behavior (Jia & Lubetkin, 2010) were included which were not collected for the DEAS 1996.
Statistical analyses
For continuous covariates descriptive statistics of mean, median, standard deviations, minimum and maximum are presented. Discrete covariates are described using absolute numbers and percentages. The number of missing data is presented for each covariate in compliance with STROBE (von Elm et al., 2007). For some descriptive analyses dichotomization of SPA-OD according to a median split is conducted in line with previous studies that used this approach to illustrate mortality differences between individuals with more or less positive SPA (Levy et al., 2002; Wurm & Schäfer, 2022). Different to previous studies, in our descriptive analysis of raw data (without imputations) a separate stratum is created for observations with missing SPA-OD.
Missing data
To account for missing data multiple imputations were created using the R package mice (van Buuren & Groothuis-Oudshoorn, 2011) and parallelized computations (Weston & Microsoft Corporation, 2022) to reduce computational time. Imputations were conducted for each panel of the DEAS separately due to varying participant characteristics. Items of SPA-OD were imputed individually and sum scores were calculated afterwards (Eekhout et al., 2014). We also included items of SPA physical losses for a broader imputation model as in the study of Wurm & Schäfer (Wurm & Schäfer, 2022). To ensure correspondence of imputation and analysis models, all variables used for analysis were part of the imputation model. After 30 burn-in iterations, 20 datasets with imputations were created (Horton & Lipsitz, 2001; Sterne et al., 2009). Due to convergence issues of multiple imputation in all DEAS samples, univariate outliers in household incomes were removed which resolved these issues (DEAS 1996: n = 23 (0.76%), DEAS 2002: n = 13 (0.61%), DEAS 2008: n = 52 (1.12%), DEAS 2014: n = 64 (1.34%)),
A
see Supplementary Fig. 1.
Modelling all-cause mortality
We repeated the more basic approach to control for confounding using adjustment of Cox PH models (Therneau & Grambsch, 2013) as this has been done in all but one primary studies on the link between subjective aging and survival. Continuous covariates of age, household income, and BMI were modelled using natural splines with three degrees of freedom to allow for non-linear associations with all-cause mortality (Gauthier et al., 2020; Harrell, 2015). Due to non-interpretability of coefficients from spline functions, contrasts of model-predictions at certain increments of the respective covariate (Carriquiry et al., 2015; Shepherd et al., 2017) were computed using the R package emmeans (Lenth, 2016) (age: 10 year intervals, BMI: 5 unit intervals, household income: 500€ to 2000€ by 500€, 2000€ to 5000€ by 1000€) using the center of the distribution as reference category. Fitted Cox PH models were also used to calculate bivariate marginal distribution of the estimated probabilities to decease for covariates SPA-OD and chronological age.
All analyses were conducted on multiply imputed data; results were pooled according to Rubins’ rule (Rubin, 1996). Proportional hazard assumption was examined across all samples using Schoenfeld residuals as implemented in R package survival (Therneau & Grambsch, 2013) (Supplementary Fig. 2–5).
Graphical presentation of survival estimates is conducted using (a) unadjusted Kaplan-Meier estimates, in line with the studies of Wurm and Schäfer (2022), Levy et al. (2005; 2002), Lim et al. (2013), and Schroyen et al. (2020), and (b) adjusted survival curves resulting from Cox PH models (Denz et al., 2023; Therneau et al., 2015).
Prediction of all-cause mortality
According to the TRIPOD statement for multivariable prediction models (Collins et al., 2015), our approach represents a Type 2a modelling approach, i.e. data are randomly and repeatedly split into internal learning data (used for model fit) and internal validation data (used for prediction of the mortality risk). The bootstrap approach is used for resampling (Hastie et al., 2009).
To examine the (additive) predictive value of SPA-OD, five different Cox PH models are fitted to internal learning data (in-bag) and the fitted models are then used to predict the risk of death in internal validation data (out-of-bag). The first Cox-PH model comprised only the SPA-OD sum score, the second only age and sex, and the third model included age, sex, and SPA-OD. This specification was applied to determine the predictive value of SPA-OD alone, to compare it with the predictive value of a model adjusted only for age and sex, and to obtain an estimate of improvement if SPA-OD is added to age and sex as an additional covariate. The fourth model comprised all covariates but not SPA-OD that was included in the fifth model.
Harrels’ c-index was computed from the predicted vs. the observed risk (Rahman et al., 2017). The c-index can be considered the fraction of pairs in the data, in which model-based predictions, e.g. survival times or probabilities, correspond to observed values. The index is calculated by:
Here,
corresponds to observed survival time or probability and
corresponds to the model-based prediction of individual ith observation time (or probability) (Pencina & D'Agostino, 2004). A models’ c-index of 0.5 is equivalent to the accuracy of flipping a coin; a c-index close to 1 indicates very good predictive performance.
In addition, the integrated discrimination index (IDI; (Pencina et al., 2008) was calculated to provide information on the improvement in sensitivity and specificity in terms of discriminating those who survived and those deceased (Pencina et al., 2010; Yates, 1982). The IDI is calculated as follows:
Here,
represents the average over model-based predictions
, “new” refers to the new or updated model in either “events” (e.g., deceased participants) or “non-events” (e.g., survivors) (Pencina et al., 2008). The IDI is positive, if discrimination between deceased and survivors is better for the new model compared to the old model.
This procedures was applied to each of the 20 imputed data sets separately (100 replications per imputation and 2000 repetitions in total); aggregates of percent change in c-index/IDI and corresponding confidence intervals were calculated using Rubins’ rule (Rubin, 1987).
Sensitivity analysis
To evaluate the effect of a sum score of comorbidities on all-cause mortality, data preparation (incl. multiple imputation) was repeated while individual comorbidities were replaced by a sum score of (physical) comorbidities. Similarly, in adjusted Cox PH models’ diabetes, cardiovascular disorders, and cancer were replaced by the sum score.
Computation
All analyses of this study were conducted using R software (R Development Core Team, 2023) version 4.3.3. Table 1 of participant characteristics is created using R package Table1 (Rich, 2023) and rendered with flextable (Gohel & Skintzos, 2023). Graphical illustrations were generated using R packages ggplot (Wickham, 2016) and ggsci (Xiao, 2023). For adjusted survival plots the R package adjustedCurves was used (Denz et al., 2023). Parallelized computations were conducted using R packages foreach and doParallel (Weston & Microsoft Corporation, 2022). All R code is stored in a public repository to enable for reproducibility (Richter, 2024).
Results
Participant characteristics
Participant characteristics of the DEAS 1996 sample are shown in Table 1 using stratification according to a median split of the sum score of SPA ongoing development (SPA-OD) as this has been done in previous studies (see Table 1 Supplementary Materials). Participants not completing the paper-pencil questionnaire or with missing information on SPA are presented as a separate stratum. With respect to all characteristics, participants with SPA-OD sum score below the median are considerably different from those with SPA-OD above the median (except for cancer disease (p = 0.007), all t-test or Chi² test p-values < 0.001). The differences were particularly notable regarding age, the frequency of male sex, and number of comorbidities. Similar results are found for all other DEAS samples (Supplementary Table 1 to 3). Figure 2 illustrates the inverse association of chronological age and SPA-OD if the sum score is split by the median.
The amount of missing information is low to moderate, being highest for SPA-OD with 15.8% and for the comorbidities (only DEAS 1996 and 2002 sample). In the remaining DEAS samples, there is no missing information on comorbidities as this information was collected as part of the face-to-face interviews.
Table 1
Participant characteristics of the German Ageing Survey (DEAS) 1996 panel.
 
SPA: ongoing development
All
 
lower
(sum score < 12)
unknown
(no CRF or missing)
higher
(sum score ≥ 12)
 
N
1192
477
1353
3022
Age (years)
       
Mean (SD)
62.8 (11.9)
59.3 (12.4)
55.4 (10.8)
58.9 (12.0)
Median [Min, Max]
63.0 [40.0, 85.0]
58.0 [40.0, 85.0]
54.0 [40.0, 85.0]
58.0 [40.0, 85.0]
Sex
       
Male
709 (59.5%)
245 (51.4%)
676 (50.0%)
1630 (53.9%)
Female
483 (40.5%)
232 (48.6%)
677 (50.0%)
1392 (46.1%)
Birth region
       
West
701 (58.8%)
322 (67.5%)
916 (67.7%)
1939 (64.2%)
East
491 (41.2%)
155 (32.5%)
437 (32.3%)
1083 (35.8%)
Education (ISCED)
       
1 = low (ISCED 0–1)
192 (16.1%)
72 (15.1%)
125 (9.2%)
389 (12.9%)
2 = medium (ISCED 2–3)
678 (56.9%)
281 (58.9%)
780 (57.6%)
1739 (57.5%)
3 = high (ISCED 5–6)
319 (26.8%)
123 (25.8%)
447 (33.0%)
889 (29.4%)
Missing
3 (0.3%)
1 (0.2%)
1 (0.1%)
5 (0.2%)
Household income
       
Mean (SD)
1800 (927)
1860 (948)
2240 (1150)
2000 (1060)
Median [Min, Max]
1530 [153, 7670]
1530 [241, 6390]
2050 [460, 7670]
1790 [153, 7670]
Missing
75 (6.3%)
66 (13.8%)
115 (8.5%)
256 (8.5%)
Diabetes
       
yes
136 (11.4%)
14 (2.9%)
73 (5.4%)
223 (7.4%)
no
1038 (87.1%)
78 (16.4%)
1268 (93.7%)
2384 (78.9%)
Missing
18 (1.5%)
385 (80.7%)
12 (0.9%)
415 (13.7%)
Cardiovascular disorders
       
yes
669 (56.1%)
49 (10.3%)
531 (39.2%)
1249 (41.3%)
no
508 (42.6%)
39 (8.2%)
806 (59.6%)
1353 (44.8%)
Missing
15 (1.3%)
389 (81.6%)
16 (1.2%)
420 (13.9%)
Cancer disease
       
Yes
48 (4.0%)
1 (0.2%)
29 (2.1%)
78 (2.6%)
No
1126 (94.5%)
89 (18.7%)
1311 (96.9%)
2526 (83.6%)
Missing
18 (1.5%)
387 (81.1%)
13 (1.0%)
418 (13.8%)
Observation time
       
Mean (SD)
187 (89.3)
208 (84.6)
232 (71.4)
211 (83.3)
Median [Min, Max]
199 [1.00, 299]
218 [6.00, 299]
253 [2.00, 300]
220 [1.00, 300]
Missing
164 (13.8%)
79 (16.6%)
132 (9.8%)
375 (12.4%)
Vital status
       
Alive
564 (47.3%)
282 (59.1%)
968 (71.5%)
1814 (60.0%)
Deceased
628 (52.7%)
195 (40.9%)
385 (28.5%)
1208 (40.0%)
Fig. 2
Illustration of the inverse association of chronological age and self-perceptions of aging related to ongoing development (SPA-OD) if the sum score of SPA-OD is split at the median (median = 12 in all panel of the DEAS) as this has been done in previous studies for Kaplan-Meier curves (Levy et al., 2002; Levy & Myers, 2005; Schroyen et al., 2020; Wurm & Schäfer, 2022), the proportion of those with higher SPA-OD decreases consistently over age groups and, accordingly, the proportion of those with lower SPA-OD increases over age groups.
Click here to Correct
Models of all-cause mortality
Multivariable Cox PH models showed very consistent findings across all samples of the DEAS for most of the covariates (see Table 2). The risk to decease increased nonlinear with age and was lower for females. Similarly, the risk was higher with the presence of cancer disease and diabetes. Participants in the lowest household income group, compared to a household income of 2000 € (reference), had an increased risk to decease. The risk was attenuated with household incomes > 2000 €. Results were less consistent for education; in the DEAS 1996 and 2008 sample, the risk to decease was lower for higher educational level but not in DEAS 2002 and 2014. In the DEAS samples 2002, 2008, and 2014, current smoking was consistently associated with higher risk to decease. For the association of BMI and all-cause mortality a non-linear association was found but likely due to limited power in the tails of the distribution, confidence intervals include the null value. Lastly, higher scores of SPA-OD were consistently associated with lower risk to decease across all samples of the DEAS. But compared to the effect of chronological age, changes in the probability of death obtained from adjusted models vary only slightly with SPA-OD (Supplementary Fig. 6).
Table 2
Adjusted results from Cox-PH models of all-cause mortality for the four German Ageing Survey (DEAS) panels after multiple imputation.
Covariate
DEAS 1996
DEAS 2002
DEAS 2008
DEAS 2014
 
HR
95% CI
HR
95% CI
HR
95% CI
HR
95% CI
Age *
               
40
0.27
[0.11; 0.63]
0.14
[0.04; 0.47]
0.15
[0.04; 0.51]
0.27
[0.06; 1.31]
50
0.39
[0.24; 0.64]
0.42
[0.23; 0.78]
0.47
[0.25; 0.88]
0.46
[0.20; 1.05]
60 (ref.)
               
70
2.88
[1.81; 4.59]
2.52
[1.35; 4.69]
2.01
[1.09; 3.71]
2.59
[1.16; 5.81]
80
6.97
[3.89; 12.48]
8.45
[4.13; 17.3]
6.43
[3.33; 12.5]
5.99
[2.51; 14.3]
Female vs. male
0.49
[0.43; 0.56]
0.56
[0.45; 0.69]
0.57
[0.47; 0.68]
0.65
[0.48; 0.86]
Education (ISCED, ref.=low)
               
medium (ISCED 2–3)
0.96
[0.81; 1.15]
1.18
[0.90; 1.56]
0.95
[0.75; 1.20]
1.06
[0.67; 1.68]
high (ISCED 5–6)
0.81
[0.65; 1.00]
1.35
[0.98; 1.86]
0.71
[0.54; 0.94]
0.94
[0.56; 1.58]
Birth region (East vs. West)
1.10
[0.96; 1.25]
0.91
[0.75; 1.10]
1.03
[0.87; 1.23]
1.22
[0.91; 1.62]
Household income *
               
500
1.81
[0.91; 3.60]
1.72
[0.73; 4.04]
1.48
[0.68; 3.22]
2.69
[1.04; 6.97]
1000
1.34
[0.83; 2.15]
1.41
[0.79; 2.54]
1.29
[0.74; 2.25]
1.75
[0.87; 3.53]
1500
1.09
[0.75; 1.59]
1.17
[0.80; 1.72]
1.13
[0.80; 1.59]
1.24
[0.80; 1.91]
2000 (ref.)
               
3000
0.91
[0.56; 1.49]
0.79
[0.43; 1.45]
0.82
[0.47; 1.43]
0.97
[0.48; 1.94]
4000
0.83
[0.44; 1.57]
0.69
[0.33; 1.43]
0.73
[0.36; 1.48]
0.98
[0.39; 2.47]
5000
0.76
[0.32; 1.77]
0.65
[0.29; 1.49]
0.71
[0.33; 1.53]
0.87
[0.32; 2.35]
Diabetes
1.51
[1.26; 1.80]
1.75
[1.36; 2.24]
1.53
[1.24; 1.87]
1.35
[0.98; 1.85]
Cardiovascular disorders
1.18
[1.04; 1.35]
1.02
[0.84; 1.24]
1.38
[1.14; 1.66]
1.01
[0.74; 1.37]
Cancer
1.40
[1.05; 1.87]
1.49
[1.05; 2.12]
1.36
[1.07; 1.73]
1.64
[1.16; 2.30]
SPA: ongoing development
0.94
[0.92; 0.96]
0.92
[0.88; 0.96]
0.91
[0.88; 0.94]
0.92
[0.86; 0.98]
Smoking (ref.=never)
               
Previous
   
1.05
[0.83; 1.34]
1.22
[0.99; 1.49]
1.21
[0.84; 1.75]
Current
   
2.00
[1.50; 2.66]
2.38
[1.80; 3.13]
2.27
[1.51; 3.40]
Body mass index*
               
20
   
1.07
[0.51; 2.22]
1.59
[0.84; 3.05]
1.48
[0.61; 3.62]
25 (ref.)
               
30
   
1.18
[0.67; 2.06]
1.02
[0.61; 1.69]
0.88
[0.45; 1.73]
35
   
1.42
[0.68; 2.96]
1.11
[0.57; 2.16]
0.99
[0.42; 2.35]
40
   
1.72
[0.75; 3.96]
1.26
[0.58; 2.70]
1.37
[0.53; 3.53]
Note. * age, household income, and body mass index were modelled using natural splines with three degrees of freedom to examine nonlinear associations with all-cause mortality; covariate effects are shown as contrasts of marginal means with reference categories selected from the center of the distribution (Shepherd et al., 2017). DEAS = German Ageing Survey, HR = hazard ratio, CI = confidence interval (includes variance increase introduced from multiple imputation), ISCED = International Standard Classification of Education.
Survival estimates
Figure 2 compares unadjusted (left panel) vs. adjusted (right panel) survival estimates in the DEAS 1996 sample. The unadjusted model comprised only SPA-OD (median split) as a stratification variable and shows a considerable discriminatory effect of SPA-OD (Fig. 2, left panel). Comparing the difference in observation time at a survival probability of 0.75 (similar to Wurm & Schäfer (Wurm & Schäfer, 2022)), the median difference over all imputations between lower and higher SPA-OD is 115.5 months (min = 109; max = 123). After adjustment for covariates as shown in Table 2, the median difference over all imputations between lower and higher SPA-OD is 14 months (min = 7; max = 22). Similar results were found for all other DEAS samples (Supplementary Fig. 7 to 9).
Fig. 3
Survival estimates aggregated over 20 imputations (Denz et al., 2023) in the DEAS 1996 sample (left panel: unadjusted, i.e. except for a stratification variable of SPA-OD (median split), no further covariate entered the model; right panel: estimated survival differences between strata of SPA-OD obtained from a Cox PH model adjusted for age, sex, region of birth, education, household income, diabetes, cardiovascular disorders, and cancer). The figure does not provide an at-risk table as these numbers vary over 20 imputations. Time on the x-axis is measured by month.
Click here to Correct
Prediction of all-cause mortality
Regarding prediction of survival outcomes in internal validation data, Harrels’ c-index is shown for 2000 bootstrap replications in Fig. 3. Compared to a Cox PH model adjusted for age and sex only (Model 2), the predictive accuracy was reduced by SPA-OD (Model 1 in Fig. 4) on average by -17.3% [-18.4%; -16.3%] (percent change in c-index, Model 1 vs. Model 2 in Fig. 3) in the DEAS 1996 sample (DEAS 2002: -19.9 [-21.2%; -18.6%], DEAS 2008: -14.9% [-16.6%; -13.2%], DEAS 2014: -14.8% [-17.0%; -12.7%]).
Modeling SPA-OD in addition to age and sex (Model 3 vs. Model 2), improved prediction accuracy on average by 0.11% [-0.04%; 0.26%] in the DEAS 1996 sample. Improvements did not exceed 1.1% in other samples of the DEAS (2002, 2008, and 2014): 0.27% [0.10%; 0.45%], 1.06% [0.68%; 1.44%], and 0.96% [0.36%; 1.57%] respectively. The completely specified Cox PH model as shown in Table 2 vs. a model without SPA-OD (Model 5 vs. Model 4, Fig. 3), improved prediction accuracy on average for DEAS 1996 by 0.18% [0.09%; 0.29%], DEAS 2002 by 0.32% [0.15%; 0.48%], DEAS 2008 by 0.50% [0.27%; 0.73%], and for the DEAS 2014 by 0.29% [-0.06%; 0.65%]. Results regarding the integrated discrimination index (IDI) were similar (Supplementary Table 4). Largest improvement in discrimination of those who survived from those deceased was obtained when modeling age and sex alone; adding SPA-OD to the model led to marginal improvement being highest with 0.76% [0.53%; 0.99%] in the DEAS 2008 when comparing Model 3 vs. Model 1.
Fig. 4
Prediction accuracy measured by Harrels' c-index of five different specifications of a Cox PH model in internal validation data (bootstrap out-of-bag data). M1-5: Models 1 to 5.
Click here to Correct
Discussion
To focus on the role of confounding in the relationship between subjective aging and longevity, we re-examined the results from a recent publication by Wurm and Schäfer (2022). In line with previous research (Westerhof et al., 2023; Wurm & Schäfer, 2022), we found lower mortality with increasing SPA-OD. At the same time, the study showed that individuals with more positive SPA-OD were on average younger, more likely to be female, and healthier (Table 1). In an adjusted comparison, the survival benefit in those with more positive SPA-OD reduced from 115.5 months to 14 months (Fig. 3), i.e. about 88% of the crude survival difference is explained by confounders.
Albeit still relevant, this gain in lifetime is much smaller than the difference in lifetime suggested by previous studies, which reported survival advantages of 6 to 13 years (Levy et al., 2002; Schroyen et al., 2020; Wurm & Schäfer, 2022). Respective studies correctly accounted for confounders in regression models, e.g., 11 variables including chronological age and sex (Wurm & Schäfer, 2022). Yet, unadjusted survival curves neglected that individuals with more or less positive SPA also differed systematically in chronological age, sex, and health status. This methodological issue could have been noticed if a baseline table had been presented, which however was only included in three of theses studies (see Table SM-1). Thus, the use of unadjusted survival curves in research on subjective aging and longevity (Levy & Myers, 2005; Levy et al., 2002; Lim et al., 2013; Schroyen et al., 2020; Wurm & Schäfer, 2022), should be avoided.
This ties in with a general problem around causal language in Psychology, that is, language that suggests that an event, act or state initiates or permits a sequence of events resulting in a certain effect (Rothman, 1976). For health events, causal factors represent events, conditions, or characteristics which have a role in the health outcome (Bonita et al., 2006). The frequent use of the ambiguous term "predictor" in psychological research is part of the problem - due to the lack of methodological differentiation this potentially provokes erroneous conclusions.
In fact, in Psychology the term is mostly used to refer to covariates in a regression model, thereby describing associations in cross-sectional or longitudinal settings (Rohrer, 2024). Association studies are common but they lack means to verify model results (Yarkoni & Westfall, 2017). By contrast, prediction studies are the next step in model validation and require different methods. There is robust evidence that even strong associations do not necessarily imply good prediction (Lo et al., 2015; Shmueli, 2010), as this study has shown for the link between SPA-OD and longevity, which only transfers to a modest prognostic value (Fig. 4). This supports the need to disentangle association and prediction (Varga et al., 2020).
For many psychologists, the use of ‘predict’ or ‘predictor’ appears unproblematic and harmless (Vowels, 2023). Notwithstanding the aforementioned methodological inconsistencies, the respective terms may also prove problematic in the context of science communication. When disseminating research to the broader public, terms like ‘predict’ might evoke the concept causality (Bruine de Bruin & Bostrom, 2013). In fact, this was the case, though not intended by the authors, for the recent study by Wurm and Schäfer (2022) and previous work of Levy et al. (2002), where press articles report on the prediction of survival advantages of “13 years” and “7.5 years”, respectively (Goldberg, 2022; Levy et al., 2002; Stillman, 2024; Wurm, 2022; Wurm & Schäfer, 2022). Based on these experiences, we have derived recommendations for science communication (please see Supplementary Material).
Nonetheless, all the studies included in the systematic review (Westerhof et al., 2023) used terms such as ‘predict’, with only two studies pointing to potential misinterpretations (McLachlan et al., 2020; Uotinen et al., 2005). Such issues can also be found in other psychological fields. For example, recent studies on the association of longevity with self-rated health (Wuorela et al., 2020) and resilience factors (Assari, 2017; Craig et al., 2021) use similar approaches and comparable causal language.
Taken together, in line with previous findings (Westerhof et al., 2023) this study confirms an association between SPA and mortality. However, this association is subject to considerable confounding by chronological age, leading to misinterpretation and biased perception; especially, when using causal language. By means of more consistent consideration of confounding, we found the effect to be only small with a survival advantage of 1.2 years for those with above median SPA-OD. Moreover, prediction models suggested that the predictive value of SPA-OD beyond chronological age and sex is only modest compared to other risk factors (e.g., smoking (Bryazka et al., 2024) and diabetes (Duncan & Schmidt, 2023)).
Limitations
Our findings need to be interpreted in the light of some limitations. Our study also uses observational data that might be subject to residual confounding. In line with all previous studies on subjective aging and mortality, this analysis did not consider time-varying covariates. Situations of participants, particularly regarding comorbidities, can change over time, which is not taken into account in this study. Due to high panel attrition (e.g., 68.5% in the DEAS 1996 sample at 1st follow-up (Klaus et al., 2017)), we were unable to model time-varying covariates appropriately. Effects of covariates were not fully consistent across waves, with higher levels of education being associated with greater risks of mortality for the 2002 DEAS wave, which is inconsistent with the remaining samples and previous findings (Balaj et al., 2024). Similar inconsistencies emerged for birth region. Moreover, the definition of comorbidities might be criticized, especially for cardiovascular diseases as the definition applied here likely overestimates the prevalence of this disease type (Busch & Kuhnert, 2017). In addition, there were also between-wave differences in the assessment of comorbidities. These factors may result in mixed findings for cardiovascular risk factors. Continuous covariates such as chronological age, BMI, and household income were modelled using natural splines with an arbitrary default of three degrees of freedom (Gauthier et al., 2020; Harrell, 2015). Indeed, non-linear associations are often more plausible. In addition, non-linear modeling compared to linear modelling of covariates means slight overfitting in the case of linear associations, while linear modelling as the default can lead to the conclusion of a non-existent association (Nakatochi et al., 2023). Based on previous findings (Wurm & Schäfer, 2022), we focused on a single indicator of subjective aging, that is SPA-OD. Moreover, the analysis sample is biased as a substantial number of participants did not agree to participate in follow-up assessments which is a prerequisite for assessing their survival status. Wurm and Schäfer (2022) found that those participants included in follow-ups were on average younger, healthier, more educated, more likely to be male and live in Western Germany, and reported more positive SPA. Similar biases, however, existed in other previous studies.
Implications for future research
It is plausible to assume that other health-benefitting factors, that were found to be associated with mortality such as optimism (Kim et al., 2017) or self-efficacy (Assari, 2017), are subject to similar patterns of confounding. To illustrate, in a study on optimism (Kim et al., 2017), the results consistently moved towards a null effect when more confounders were included, supporting their critical role. Research in these areas will benefit from application of more robust methods and more “causal thinking” (Rohrer, 2024).
Moreover, there are implications for the analysis of comorbidities. So far, most studies use non-validated sum scores to measure comorbid diseases, neglecting differential associations of specific diseases with mortality and carrying at risk for confounding by mental health (Østergaard & Foldager, 2011). The selection and consideration of comorbidities should be based on the research question at hand. For some outcomes such as mortality validated sum scores can be used (Austin et al., 2015). For other research questions, specific comorbidities might be preferable.
A
In many psychological fields, the common explanatory research that generates effects of association should be extended by a subsequent step of causal thinking. In particular, it is recommended to include methods permitting the analysis of evidence pertaining to causal relationships (Stuart, 2010). Models should also be evaluated in independent data or using internal validation (Collins et al., 2015). Such a holistic approach has recently been described as integrative modelling (Hofman et al., 2021).
Conclusion
This study set out to join forces from Epidemiology and Psychology to check the relationship between subjective aging and longevity for robustness. Although SPA-OD is associated with all-cause mortality, this study has shown, that most of the previously reported survival benefit attributed to SPA-OD is explained by confounders, with chronological age being the most important. In terms of predicting mortality, the prognostic value of SPA-OD is only modest. The study underscored the importance of using adjusted survival curves, which are not yet standard in psychological research. Subjective aging, as many other psychological variables, is unlikely to produce strong causal effects on longevity. Future research is needed to explore the interplay between psychological variables, biopsychological and behavioral processes while simultaneously accounting for confounding.
A
Declaration of Sources of Funding
This research did not receive any external funding. The German Ageing Survey was funded under Grant No. 301-6083-05/003*2 by the German Federal Ministry of Family Affairs, Senior Citizens, Women and Youth. The content is the sole responsibility of the authors.
A
A
References
Ahmad FB, Anderson RN (2021) The Leading Causes of Death in the US for 2020. JAMA 325(18):1829–1830. https://doi.org/10.1001/jama.2021.5469
Alonso Debreczeni F, Bailey PE (2021) A Systematic Review and Meta-Analysis of Subjective Age and the Association With Cognition, Subjective Well-Being, and Depression. Journals Gerontology: Ser B 76(3):471–482. https://doi.org/10.1093/geronb/gbaa069
Assari S (2017) General Self-Efficacy and Mortality in the USA; Racial Differences. J Racial Ethnic Health Disparities 4(4):746–757. https://doi.org/10.1007/s40615-016-0278-0
Austin SR, Wong YN, Uzzo RG, Beck JR, Egleston BL (2015) Why Summary Comorbidity Measures Such As the Charlson Comorbidity Index and Elixhauser Score Work. Med Care 53(9):e65–72. https://doi.org/10.1097/MLR.0b013e318297429c
Benyamini Y, Burns E (2020) Views on aging: older adults’ self-perceptions of age and of health. Eur J Ageing 17(4):477–487. https://doi.org/10.1007/s10433-019-00528-8
Bland JM, Altman DG (1998) Survival probabilities (the Kaplan-Meier method). BMJ 317(7172):1572. https://doi.org/10.1136/bmj.317.7172.1572
Boffetta P, McLaughlin JK, La Vecchia C, Tarone RE, Lipworth L, Blot WJ (2008) False-positive results in cancer epidemiology: a plea for epistemological modesty. J Natl Cancer Inst 100(14):988–995. https://doi.org/10.1093/jnci/djn191
Bonita R, Beaglehole R, Kjellström T (2006) Basic epidemiology. World Health Organization
de Bruine W, Bostrom A (2013) Assessing what to address in science communication. Proceedings of the National Academy of Sciences, 110(supplement_3), 14062–14068. https://doi.org/10.1073/pnas.1212729110
Bryazka D, Reitsma MB, Abate YH, Al Magied A, Abdelkader AHA, Abdollahi A, Abdoun A, Abdulkader M, Zuñiga RSAbeldaño, Abhilash RA, Abiodun ES, Abiodun OO, Aboagye O, Abreu RG, Abtahi LG, Abualruz D, Abubakar H, Abu-Rmeileh B, Aburuz NME, Gakidou S, E (2024) Forecasting the effects of smoking prevalence scenarios on years of life lost and life expectancy from 2022 to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet Public Health 9(10):e729–e744. https://doi.org/10.1016/S2468-2667(24)00166-X
Carriquiry G, Fink V, Koethe JR, Giganti MJ, Jayathilake K, Blevins M, Cahn P, Grinsztejn B, Wolff M, Pape JW, Padgett D, Madero JS, Gotuzzo E, McGowan CC, Shepherd BE (2015) Mortality and loss to follow-up among HIV-infected persons on long-term antiretroviral therapy in Latin America and the Caribbean. J Int AIDS Soc 18(1):20016. https://doi.org/10.7448/IAS.18.1.20016
Chetty R, Stepner M, Abraham S, Lin S, Scuderi B, Turner N, Bergeron A, Cutler D (2016) The Association Between Income and Life Expectancy in the United States, 2001–2014. JAMA 315(16):1750–1766. https://doi.org/10.1001/jama.2016.4226
Collins GS, Reitsma JB, Altman DG, Moons KGM (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement. BMC Med 13(1):1. https://doi.org/10.1186/s12916-014-0241-z
Cox DR (1972) Regression Models and Life-Tables. J Roy Stat Soc: Ser B (Methodol) 34(2):187–202. https://doi.org/10.1111/j.2517-6161.1972.tb00899.x
Craig HJ, Ryan J, Freak-Poli R, Owen A, McNeil J, Woods R, Ward S, Britt C, Gasevic D (2021) Dispositional Optimism and All-Cause Mortality in Older Adults: A Cohort Study. Psychosom Med 83(8). https://doi.org/10.1097/PSY.0000000000000989
Darbà, J., Kaskens, L., Detournay, B., Kern, W., Nicolucci, A., Orozco-Beltrán, D.,& Ramírez de Arellano, A. (2015). Disability-adjusted life years lost due to diabetes in France, Italy, Germany, Spain, and the United Kingdom: a burden of illness study.ClinicoEconomics and Outcomes Research,7(null), 163–171. https://doi.org/10.2147/CEOR.S78132
Denz R, Klaaßen-Mielke R, Timmesfeld N (2023) A comparison of different methods to adjust survival curves for confounders. Stat Med 42(10):1461–1479. https://doi.org/10.1002/sim.9681
Diehl M, Wahl H-W (2015) Subjective aging: New developments and future directions. Springer Publishing
Diehl M, Wettstein M, Spuling SM, Wurm S (2021) Age-related change in self-perceptions of aging: Longitudinal trajectories and predictors of change. Psychol Aging 36(3):344. https://doi.org/10.1037/pag0000585
Duncan BB, Schmidt MI (2023) Many years of life lost to young-onset type 2 diabetes. Lancet Diabetes Endocrinol 11(10):709–710. https://doi.org/10.1016/S2213-8587(23)00255-3
Eekhout I, de Vet HCW, Twisk JWR, Brand JPL, de Boer MR, Heymans MW (2014) Missing data in a multi-item instrument were best handled by multiple imputation at the item score level. J Clin Epidemiol 67(3):335–342. https://doi.org/10.1016/j.jclinepi.2013.09.009
Gauthier J, Wu QV, Gooley TA (2020) Cubic splines to model relationships between continuous variables and outcomes: a guide for clinicians. Bone Marrow Transplant 55(4):675–680. https://doi.org/10.1038/s41409-019-0679-x
Gohel D, Skintzos P (2023) flextable: Functions for Tabular Reporting. In https://CRAN.R-project.org/package=flextable
Goldberg H (2022) 2022/04/23). Your attitude about aging could add 7.5 years to your life https://nypost.com/2022/04/23/your-bad-attitude-about-aging-could-add-7-5-years-to-your-life/
Greenland S, Pearl J, Robins JM (1999) Causal Diagrams for Epidemiologic Research. Epidemiology 10(1):37–48. http://www.jstor.org/stable/3702180
Hainmueller J (2012) Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies. Political Anal 20(1):25–46. https://doi.org/10.1093/pan/mpr025
Harrell FE Jr (2015) Regression Modeling Strategies: With Applications, to Linear Models, Logistic and Ordinal Regression, and Survival Analysis, 2nd ed. Springer Cham Heidelberg New York Dordtrecht London. https://doi.org/10.1007/978-3-319-19425-7
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction. Springer Science & Business Media
Hofman JM, Watts DJ, Athey S, Garip F, Griffiths TL, Kleinberg J, Margetts H, Mullainathan S, Salganik MJ, Vazire S, Vespignani A, Yarkoni T (2021) Integrating explanation and prediction in computational social science. Nature 595(7866):181–188. https://doi.org/10.1038/s41586-021-03659-0
Horton NJ, Lipsitz SR (2001) Multiple Imputation in Practice. Am Stat 55(3):244–254. https://doi.org/10.1198/000313001317098266
Hübner J, Mattutat J, Katalinic A (2021) [Years of life lost: known methods and a refined approach using the example of the most frequent causes of death in Germany]. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 64(11):1463–1472. https://doi.org/10.1007/s00103-021-03424-8(Verlorene Lebensjahre: Bekanntes und Neues zur Methodik am Beispiel der häufigsten Todesursachen in Deutschland.)
Ioannidis JP (2005) Why most published research findings are false. PLoS Med 2(8):e124. https://doi.org/10.1371/journal.pmed.0020124
Jia H, Lubetkin EI (2010) Trends in Quality-Adjusted Life-Years Lost Contributed by Smoking and Obesity. Am J Prev Med 38(2):138–144. https://doi.org/10.1016/j.amepre.2009.09.043
Kaspar R, Wahl H-W, Diehl M (2021) Awareness of Age-Related Change as a Behavioral Determinant of Survival Time in Very Old Age [Original Research]. Frontiers in Psychology, 12. https://doi.org/10.3389/fpsyg.2021.727560
Kim ES, Hagan KA, Grodstein F, DeMeo DL, De Vivo I, Kubzansky LD (2017) Optimism and Cause-Specific Mortality: A Prospective Cohort Study. Am J Epidemiol 185(1):21–29. https://doi.org/10.1093/aje/kww182
Klaus D, Engstler H, Mahne K, Wolff JK, Simonson J, Wurm S, Tesch-Römer C (2017) Cohort Profile: The German Ageing Survey (DEAS). Int J Epidemiol 46(4):1105–1105g. https://doi.org/10.1093/ije/dyw326
Kotter-Grühn D, Kleinspehn-Ammerlahn A, Gerstorf D, Smith J (2009) Self-perceptions of aging predict mortality and change with approaching death: 16-year longitudinal results from the Berlin Aging Study. Psychol Aging 24(3):654. https://doi.org/10.1037/a0016510
Lenth RV (2016) Least-squares means: the R package lsmeans. J Stat Softw 69(1):1–33. https://doi.org/10.18637/jss.v069.i01
Levy B, Bavishi A (2018) Survival advantage mechanism: Inflammation as a mediator of positive self-perceptions of aging on longevity. Journals Gerontology: Ser B 73(3):409–412. https://doi.org/10.1093/geronb/gbw035
Levy B, Myers LM (2005) Relationship between respiratory mortality and self-perceptions of aging. Psychol Health 20(5):553–564. https://doi.org/10.1080/14768320500066381
Levy B, Slade M, Kunkel S, Kasl S (2002) Longevity increased by positive self-perceptions of aging. J Personal Soc Psychol 83(2):261. https://doi.org/10.1037//0022-3514.83.2.261
Lim MY, Stephens EK, Novotny P, Price K, Salayi M, Roeker L, Peethambaram P, Jatoi A (2013) Self-perceptions of age among 292 chemotherapy-treated cancer patients: Exploring associations with symptoms and survival. J Geriatric Oncol 4(3):249–254. https://doi.org/10.1016/j.jgo.2013.02.001
Lo A, Chernoff H, Zheng T, Lo S-H (2015) Why significant variables aren’t automatically good predictors. Proceedings of the National Academy of Sciences, 112(45), 13892–13897. https://doi.org/10.1073/pnas.1518285112
Maier H, Smith J (1999) Psychological Predictors of Mortality in Old Age. Journals Gerontology: Ser B 54B(1):P44–P54. https://doi.org/10.1093/geronb/54B.1.P44
Markides KS, Pappas C (1982) Subjective Age, Health, and Survivorship in Old Age. Res Aging 4(1):87–96. https://doi.org/10.1177/016402758241004
McLachlan KJ, Cole JH, Harris SE, Marioni RE, Deary IJ, Gale CR (2020) Attitudes to ageing, biomarkers of ageing and mortality: the Lothian Birth Cohort 1936. J Epidemiol Community Health 74(4):377–383. https://doi.org/10.1136/jech-2019-213462
Newman AB, Murabito JM (2013) The Epidemiology of Longevity and Exceptional Survival. Epidemiol Rev 35(1):181–197. https://doi.org/10.1093/epirev/mxs013
Olarte Parra C, Bertizzolo L, Schroter S, Dechartres A, Goetghebeur E (2021) Consistency of causal claims in observational studies: a review of papers published in a general medical journal. BMJ open 11(5):e043339. https://doi.org/10.1136/bmjopen-2020-043339
Østergaard SD, Foldager L (2011) The association between physical illness and major depressive episode in general practice. Acta psychiatrica Scandinavica 123(4):290–296. https://doi.org/10.1111/j.1600-0447.2010.01668.x
Pencina MJ, Agostino Sr D, Agostino RBD Jr, R. B., Vasan RS (2008) Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond. Stat Med 27(2):157–172. https://doi.org/10.1002/sim.2929
Pencina MJ, D'Agostino RB (2004) Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Stat Med 23(13):2109–2123. https://doi.org/10.1002/sim.1802
Pencina MJ, D'Agostino RB, Vasan RS (2010) Statistical methods for assessment of added usefulness of new biomarkers. Clin Chem Lab Med 48(12):1703–1711. https://doi.org/10.1515/CCLM.2010.340
R Development Core Team (2023) R: A Language and Environment for Statistical Computing. In R Foundation for Statistical Computing. https://www.R-project.org/
Rahman MS, Ambler G, Choodari-Oskooei B, Omar RZ (2017) Review and evaluation of performance measures for survival prediction models in external validation settings. BMC Med Res Methodol 17(1):60. https://doi.org/10.1186/s12874-017-0336-2
Rich B (2023) table1: Tables of Descriptive Statistics in HTML. R package version 1.4.3. In https://CRAN.R-project.org/package=table1
Richter A (2024) Public GitLab project: SPA Mortality. GitLab. Retrieved 2024/11/13 from https://gitlab.com/Adrian_HGW/SPA_mortality
Rippon I, Steptoe A (2015) Feeling Old vs Being Old: Associations Between Self-perceived Age and Mortality. JAMA Intern Med 175(2):307–309. https://doi.org/10.1001/jamainternmed.2014.6580
Rohrer JM (2024) Causal inference for psychologists who think that causal inference is not for them. Soc Pers Psychol Compass 18(3):e12948. https://doi.org/10.1111/spc3.12948
Rosenbaum PR, Rubin DB (1983) The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika 70(1):41–55. https://doi.org/10.2307/2335942
Rothman KJ (1976) CAUSES Am J Epidemiol 104(6):587–592. https://doi.org/10.1093/oxfordjournals.aje.a112335
Rothman KJ, Greenland S (2005) Causation and Causal Inference in Epidemiology. Am J Public Health 95(S1):S144–S150. https://doi.org/10.2105/ajph.2004.059204
Rubin DB (1987) Multiple imputation for survey nonresponse. Wiley, In: New York
Rubin DB (1996) Multiple Imputation after 18 + Years. J Am Stat Assoc 91(434):473–489. https://doi.org/10.1080/01621459.1996.10476908
Sabanayagam C, Shankar A (2012) Income is a stronger predictor of mortality than education in a national sample of US adults. J Health Popul Nutr 30(1):82–86. https://doi.org/10.3329/jhpn.v30i1.11280
Schroyen S, Letenneur L, Missotten P, Jérusalem G, Adam S (2020) Impact of self-perception of aging on mortality of older patients in oncology. Cancer Med 9(7):2283–2289. https://doi.org/10.1002/cam4.2819
Schulz KF, Altman DG, Moher D (2010) & the, C. G. CONSORT 2010 Statement: updated guidelines for reporting parallel group randomised trials. Trials, 11(1), 32. https://doi.org/10.1186/1745-6215-11-32
Shepherd BE, Rebeiro PF, the, Caribbean C, epidemiology S (2017) A. n. f. H. Brief Report: Assessing and Interpreting the Association Between Continuous Covariates and Outcomes in Observational Studies of HIV Using Splines. JAIDS Journal of Acquired Immune Deficiency Syndromes, 74(3), e60-e63. https://doi.org/10.1097/qai.0000000000001221
Shmueli G (2010) To Explain or to Predict? Stat Sci 25(3):289–310. https://doi.org/10.1214/10-STS330
Spuling SM, Klusmann V, Bowen CE, Kornadt AE, Kessler E-M (2020) The uniqueness of subjective ageing: convergent and discriminant validity. Eur J Ageing 17(4):445–455. https://doi.org/10.1007/s10433-019-00529-7
Sterne JAC, White IR, Carlin JB, Spratt M, Royston P, Kenward MG, Wood AM, Carpenter JR (2009) Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ, 338, b2393. https://doi.org/10.1136/bmj.b2393
Steverink N, Westerhof GJ, Bode C, Dittmann-Kohli F (2001) The Personal Experience of Aging, Individual Resources, and Subjective Well-Being. Journals Gerontology: Ser B 56(6):P364–P373. https://doi.org/10.1093/geronb/56.6.P364
Stillman J (2024) 2024/08/26). A Yale Psychologist Says This Simple Mindset Change Helps People Live 7.5 Years Longer on Average https://www.inc.com/jessica-stillman/yale-psychologist-simple-mindset-change-helps-people-live-7.5-years-longer-average.html
Stuart EA (2010) Matching methods for causal inference: A review and a look forward. Stat Sci 25(1):1–21. https://doi.org/10.1214/09-sts313
Therneau TM, Crowson CS, Atkinson EJ (2015) Adjusted survival curves. CiteSeerx. https://cran.r-project.org/web/packages/survival/vignettes/adjcurve.pdf
Therneau TM, Grambsch PM (2013) Modeling Survival Data: Extending the Cox Model. Springer, New York. https://books.google.de/books?id=oj0mBQAAQBAJ
Uotinen V, Rantanen T, Suutama T (2005) Perceived age as a predictor of old age mortality: a 13-year prospective study. Age Ageing 34(4):368–372. https://doi.org/10.1093/ageing/afi091
van Buuren S, Groothuis-Oudshoorn K (2011) mice: Multivariate Imputation by Chained Equations in R. J Stat Softw 45(3):1–67. https://doi.org/10.18637/jss.v045.i03
VanderWeele TJ (2019) Principles of confounder selection. Eur J Epidemiol 34(3):211–219. https://doi.org/10.1007/s10654-019-00494-6
VanderWeele TJ (2021) Can Sophisticated Study Designs With Regression Analyses of Observational Data Provide Causal Inferences? JAMA Psychiatry 78(3):244–246. https://doi.org/10.1001/jamapsychiatry.2020.2588
Varga TV, Niss K, Estampador AC, Collin CB, Moseley PL (2020) Association is not prediction: A landscape of confused reporting in diabetes – A systematic review. Diabetes Res Clin Pract 170:108497. https://doi.org/10.1016/j.diabres.2020.108497
Vogel C, Klaus D, Wettstein M, Simonson J, Tesch-Römer C (2021) German Ageing Survey (DEAS). In D. Gu & M. E. Dupre (Eds.), Encyclopedia of Gerontology and Population Aging (pp. 2152–2160). Springer International Publishing. https://doi.org/https://doi.org/10.1007/978-3-030-22009-9_1115
von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP (2007) The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet 370(9596):1453–1457. https://doi.org/10.1016/S0140-6736(07)61602-X
Vowels MJ (2023) Misspecification and unreliable interpretations in psychology and social science. Psychol Methods 28(3):507–526. https://doi.org/10.1037/met0000429
Westerhof GJ, Miche M, Brothers AF, Barrett AE, Diehl M, Montepare JM, Wahl H-W, Wurm S (2014) The influence of subjective aging on health and longevity: a meta-analysis of longitudinal data. Psychol Aging 29(4):793. https://doi.org/10.1037/a0038016
Westerhof GJ, Nehrkorn-Bailey AM, Tseng H-Y, Brothers A, Siebert JS, Wurm S, Wahl H-W, Diehl M (2023) Longitudinal effects of subjective aging on health and longevity: An updated meta-analysis. Psychol Aging 38:147–166. https://doi.org/10.1037/pag0000737
Weston S, Microsoft Corporation (2022) &. doParallel: Foreach Parallel Adaptor for the 'parallel' Package. R package version 1.0.17. In https://CRAN.R-project.org/package=doParallel
Wickham H (2016) ggplot2: elegant graphics for data analysis. Springer Cham. https://doi.org/10.1007/978-3-319-24277-4
Wuorela M, Lavonius S, Salminen M, Vahlberg T, Viitanen M, Viikari L (2020) Self-rated health and objective health status as predictors of all-cause mortality among older people: a prospective study with a 5-, 10-, and 27-year follow-up. BMC Geriatr 20(1):120. https://doi.org/10.1186/s12877-020-01516-9
A
Wurm S (2022) 2022/09/05). Altersforscherin: Das ist die Formel für ein langes Leben https://www.morgenpost.de/ratgeber-wissen/article401488046/altersforscherin-rentner-rente-ziele-langes-leben.html
Wurm S, Schäfer SK (2022) Gain- but not loss-related self-perceptions of aging predict mortality over a period of 23 years: A multidimensional approach. Journal of Personality and Social Psychology, No Pagination Specified-No Pagination Specified. https://doi.org/10.1037/pspp0000412
Wurm S, Tesch-Römer C, Tomasik MJ (2007) Longitudinal findings on aging-related cognitions, control beliefs, and health in later life. Journals Gerontol Ser B: Psychol Sci Social Sci 62(3):P156–P164. https://doi.org/https://doi.org/10.1093/geronb/62.3.P156
Xiao N (2023) ggsci: Scientific Journal and Sci-Fi Themed Color Palettes for 'ggplot2'. R package version 3.0.0. In https://CRAN.R-project.org/package=ggsci
Yarkoni T, Westfall J (2017) Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning. Perspect Psychol Sci 12(6):1100–1122. https://doi.org/10.1177/1745691617693393
Yates JF (1982) External correspondence: Decompositions of the mean probability score. Organizational Behav Hum Perform 30(1):132–156. https://doi.org/10.1016/0030-5073(82)90237-9
1
For reasons of clarity, we use the term ‚covariate’ to refer to independent variables included in regression models. In Psychology those variables are often referred to as ‘predictors’. We refrain from using this term as we aim at differentiating between associations and proper prediction in this manuscript.
Total words in MS: 6764
Total words in Title: 10
Total words in Abstract: 244
Total Keyword count: 6
Total Images in MS: 4
Total Tables in MS: 2
Total Reference count: 94