Introduction

Due to the global rise in life expectancy and that the incidence of most cancers increases with age [1], a better understanding on the risk factors of cancer is needed. While advancing age is the single most important risk factor for cancer overall and several cancer types [2], there is a large variability in the health status of older adults [3]. Independent of chronological age, other aging-related traits such as high blood pressure [4], sarcopenia [5], and impaired lung function [6], as well as biological age measures [7, 8], might be associated with the risk of cancer, too.

Frailty, a geriatric syndrome associated with multisystem dysfunction, is a manifestation of decline across several homeostatic systems [9]. It can also be considered as a measure of biological aging, such that it captures the heterogeneity in how individuals age [10]. Frailty correlates moderately with other markers of biological age, such as epigenetic clocks and telomere length, and explains a unique part of mortality risk not explained by other markers [11]. In addition to mortality, frailty is predictive of other adverse outcomes such as cardiovascular diseases [12], falls [13], hospitalizations [14], and disability [15]. Frailty is most commonly defined by the Rockwood frailty index (FI) [16] and the Fried frailty phenotype (FP) [17]. The FI is a multidimensional definition that operationalizes frailty as an accumulation of deficits across various health domains, such as diseases, symptoms, physical functioning, and mental well-being [16]. The FP views frailty as a physical syndrome with distinct physiological and functional manifestations characterized by weight loss, slowness, weakness, exhaustion, and low physical activity [17]. Despite the different definitions, both the FI and FP are valid predictors of adverse outcomes [18] and tap the same root causes of the syndrome [19]. The overall prevalence of frailty in individuals aged ≥ 50 is 24% using the FI and 12% using the FP [20].

Since cancer diagnosis and treatment are stressors that can deplete physiological reserves, frailty is of particular concern among cancer patients [21]. Studies have shown that 42% of older cancer patients are frail [22], and that frailty confers a high risk of chemotherapy intolerance, postoperative complications, disease progression, and mortality in cancer patients [21,22,23]. Having a cancer diagnosis may also lead to a higher risk of frailty [24], although the mechanisms are not understood. Meanwhile, there is a paucity of research on whether higher baseline frailty scores increase the risk of incident cancer. Existing literature mainly suggests no significant association between frailty and overall cancer incidence, yet most of these studies had relatively small sample sizes and did not analyze the risk of specific cancers [25,26,27]. As both frailty and cancer are closely linked to aging, we hypothesize that there could be a bidirectional relationship between frailty and cancer, such that a multisystem physiological dysregulation may also lead to an increased susceptibility to cancer risk. Improved understanding on the relationship between frailty and cancer may also help to inform clinical decisions and provide insights into the biological mechanisms underlying aging and cancer.

To this end, we aimed to assess whether higher baseline frailty scores predict the incidence of any cancer and the five most common cancers in Europe [28], namely breast, prostate, lung, colorectal, and melanoma skin cancer in two large population-based cohorts in the UK and Sweden. As a secondary aim, we used a co-twin control method to analyze whether the observed associations are explained by familial factors, i.e., genetics and shared environment.

Methods

Study population

Data were drawn from two prospective cohorts: the UK Biobank (UKB) study [29], and the Screening Across the Lifespan Twin (SALT) study in Sweden [30]. Both studies were approved by the local research ethics committees. All participants provided a written informed consent prior to data collection.

Between 2006 and 2010, postal invitations were sent to over 9 million adults registered with the UK’s National Health Service and lived close to one of the 22 assessment centers throughout England, Wales, and Scotland [31]. In total, > 500,000 participants aged 38–73 years from the general population were recruited to UKB (response rate 5.5%) [29, 31]. Participants completed a touch-screen questionnaire, had physical measurements taken, and provided biological samples during the baseline assessment. SALT is part of the population-based Swedish Twin Registry [30]. Between 1998 and 2002, all twins born in 1958 or before were invited to participate in the survey with an aim to screen for common diseases [32]. In total, 44,919 twin individuals aged 41–103 years were recruited (response rate was 65% for those born in 1886–1925, and 74% for those born in 1926–1958) [30, 32]. SALT participants completed a computer-assisted telephone interview at baseline with questions on diseases, symptoms, medication use and lifestyle.

In this analysis, we excluded UKB and SALT participants who had a history of cancer diagnosis (except non-melanoma skin cancer) before baseline and those with missing data on frailty. For feasible comparison, we further excluded SALT participants aged > 73 years at baseline to match with the age range of UKB participants. In total, we included 453,144 participants from UKB and 36,888 participants from SALT in the analytical samples (Supplementary Fig. 1).

Frailty assessment

Frailty was measured using the FI and FP in UKB, and only the FI in SALT. Following the deficit accumulation model [16], we have previously constructed and validated FIs for both cohorts [33, 34]. Briefly, 49 and 44 self-reported deficit items, including signs, symptoms, and diseases in various physiological and mental domains, were selected for construction of the FIs in UKB and SALT, respectively (Supplementary Table 1). Participants who had missing data on > 20% of the frailty items were excluded. We calculated the FI as the sum of deficits divided by the number of non-missing items (e.g., an individual with 7 deficits out of 44 items would receive an FI of 7/44 = 0.16). Following the cut-offs that have been used in our previous work [34, 35], we considered four FI categories: relatively fit (≤ 0.03), less fit (> 0.03–0.1), least fit (> 0.1–0.21), and frail (> 0.21). The FI was used as both continuous (per 10% increase) and categorical variables in the analysis.

A modified FP was previously created for UKB participants using the five frailty criteria [17]: weight loss, exhaustion, slowness, low physical activity, and weakness [36]. The first four criteria were assessed by self-reported questionnaires, and weakness was determined by the grip strength measured using a Jamar J00105 hydraulic hand dynamometer (Supplementary Table 2). Participants were categorized into non-frail (met none of the five criteria), pre-frail (1–2), and frail (≥ 3) [17]. We excluded those who had missing data on any of the five criteria.

Cancer ascertainment

Incident cancers in UKB were identified through a linkage to the UK national cancer registries; complete follow-up was available from the baseline through February 29, 2020. Incident cancers in SALT were identified through a linkage to the Swedish National Patient Register; we used a comparable follow-up period as in UKB and followed SALT participants up to December 31, 2011. We defined cancers according to the International Classification of Diseases, 10th revision codes: breast cancer (C50), prostate cancer (C61), lung cancer (including trachea, C33-34), colorectal cancer (C18-20), melanoma of skin (C43); and any cancer diagnosis, excluding non-melanoma skin cancer (C00-97, except C44). The analysis on breast and prostate cancer was performed only in women and men, respectively.

Covariates

We considered age, birth year, sex, baseline assessment center, ethnic background, body mass index (BMI), smoking, alcohol consumption, education, and deprivation index as the common confounders for all cancer sites in UKB. BMI was derived from the weight and height measured by trained nurses at baseline. These variables, except baseline assessment center, ethnic background, and deprivation index, were also available and adjusted for in SALT. We also considered the following covariates for site-specific cancers in UKB: models for breast, prostate, lung, and colorectal cancer were additionally adjusted for family history of the corresponding cancer, while the models for melanoma were additionally adjusted for physical activity, time spent outdoors during summer, use of ultraviolet protection, childhood sunburns, solarium/sunlamp use, ease of skin tanning, skin color, and hair color (these variables were not available in SALT). Family history of melanoma was not available in UKB. Details on the categorization and descriptive statistics of the covariates are presented in Supplementary Tables 34.

Statistical analysis

Analyses were performed separately in UKB and SALT. Participants were followed from the date of baseline assessment to the date of cancer diagnosis, death, or end of follow-up, whichever came first.

Survival analysis. Kaplan–Meier curves were first plotted to evaluate the probabilities of cancer incidence across frailty categories. To account for the competing risk of death, we also calculated cumulative incidences of the cancers using the Aalen-Johansen estimator [37].

We calculated hazard ratios (HRs) and 95% confidence intervals (CIs) using Cox proportional-hazard models, with attained age as the underlying timescale. All models were adjusted for sex, birth year, and the other covariates described above, as relevant for each cancer. Indicator variables for the missing data of covariates were created and included in the models when necessary. We also assessed whether frailty was associated with the two most common subtypes of lung cancer, adenocarcinoma and squamous cell carcinoma, in UKB. Cluster-robust standard errors were used in SALT to account for the relatedness of individuals in twin pairs. Discrimination ability of the Cox models was assessed using the Harrell’s C-statistics [38].

For the cancers that had statistically significant associations with the baseline frailty scores in both cohorts, we additionally stratified the analysis by baseline age (< 60 vs. ≥ 60 years), sex, BMI (< 25 vs. ≥ 25), and smoking status (non-smokers vs. ever-smokers). We also performed several sensitivity analyses. First, we tested for non-proportional hazards over time by fitting flexible parametric survival models, where the baseline hazard function was modelled using a 5 degrees-of-freedom natural cubic spline, and the time-dependent effect of frailty using a 3 degrees-of-freedom spline. Second, since the FIs included self-reported cancer items (two items in UKB and one in SALT), we removed these items from the FIs to assess whether they had any effect on the results. Similarly, as we observed an association between the FI and lung cancer in UKB, we removed seven lung cancer-related items from the UKB FI (i.e., wheezing, pneumonia, chronic lung disease, asthma, chest pain, any cancer diagnosis, and multiple cancers diagnosed) to examine whether the association was influenced by these items. Finally, instead of using the missing-indicator approach, we performed a complete-case analysis by excluding individuals with missing data on any covariates from the Cox models.

Within-twin-pair analysis. In SALT, we fitted stratified Cox models within dizygotic (DZ) and monozygotic (MZ) twin pairs to examine whether the significant associations observed in the population-level analysis were due to familial factors [39]. The co-twin control design rests on the assumption that MZ twins are genetically identical and DZ twins share ~ 50% of their segregating genes. Both MZ and DZ twins share the same family environment (early rearing environment and everything that makes twins in a pair similar to each other). If the association is not affected by familial influences (i.e., in line with a causal hypothesis), the effect size should remain similar in the within-twin-pair analysis compared to the population-level analysis [39]. In contrast, if the association between frailty and cancer is explained by genetic factors, we would expect an attenuation of the association in DZ twins, and an even greater attenuation in MZ twins. If the association is explained by shared environmental factors, a similar attenuation is in both DZ and MZ twins.

To account for multiple comparisons (two frailty measures and five cancer sites), we applied the Bonferroni adjustment and considered p < 0.005 (0.05/10) as statistically significant. All analyses were performed in Stata 16 and R 4.0.5.

Results

Sample characteristics

For the included 453,144 UKB participants and 36,888 SALT participants, the median follow-up periods for any cancer were 10.9 years (interquartile range [IQR] 10.0–11.6) and 10.7 years (IQR 9.7–11.9), respectively. A total of 53,049 (11.7%) incident cancers were documented in UKB and 4,362 (11.8%) in SALT. The cumulative incidence of the site-specific cancers was similar in both cohorts (Table 1). The mortality rate in SALT was higher than that in UKB (9.7% vs. 5.0%), possibly because SALT participants were mostly from an older birth cohort than UKB participants. The proportion of frail individuals defined by FI was 11.9% in UKB and 14.4% in SALT, and by FP was 3.6% in UKB (Table 1). There was a weak-to-moderate correlation between FI and FP in UKB (Spearman’s correlation = 0.35) (Supplementary Fig. 2).

Table 1 Characteristics of study participants. Data are numbers (%) unless otherwise indicated

Frailty and risk of cancer

Kaplan–Meier curves showed that higher baseline frailty scores were associated with an increased probability of any cancer in both cohorts, and a higher probability of lung cancer and a lower probability of prostate cancer and melanoma in UKB (log-rank p < 0.005) (Fig. 1). Results were similar when accounted for the competing risk of death, except for that the cumulative incidence of any cancer appeared to be lower in frail individuals defined by the UKB FP, possibly because the most frail individuals may have already died before getting cancers during the follow-up (Supplementary Fig. 3).

Fig. 1
figure 1

Probability of cancer incidence over attained age, stratified by frailty status in UK Biobank and SALT. A Incidence of any cancer, except non-melanoma skin cancer; B incidence of breast cancer in women; C incidence of prostate cancer in men; D incidence of lung cancer; E incidence of colorectal cancer; F incidence of melanoma. Frailty was assessed using a 49-item FI and a modified FP in UK Biobank, and by a 44-item FI in SALT. Probability of cancer incidence was based on the Kaplan–Meier estimates. P-values were based on log-rank tests. Abbreviations: FI, frailty index; FP, frailty phenotype; SALT, Screening Across the Lifespan Twin Study

In multivariable-adjusted Cox models, we observed significant associations for baseline FI (HR for frail vs. relatively fit = 1.22; 95% CI = 1.17–1.28) and FP scores (HR for frail vs. non-frail = 1.16; 95% CI = 1.11–1.21) with any cancer in UKB (Table 2). Similarly, in SALT, frail vs. relatively fit participants as defined by the FI also had an increased risk of any cancer (HR = 1.31; 95% CI = 1.15–1.49) (Table 3). For specific cancer sites in UKB, higher FI and FP scores were associated with increased risks of all lung cancer (Table 2), as well as its subtypes, adenocarcinoma and squamous cell carcinoma, although the association between FP and adenocarcinoma was not statistically significant in the fully-adjusted model (Supplementary Table 5). Higher FI was also associated with a reduced risk of melanoma in UKB, although no significant association was observed between FP and melanoma (Table 2). The FI was not statistically significantly associated with the specific cancers in SALT after accounting for multiple comparisons (Table 3).

Table 2 Associations between baseline frailty scores by the frailty index and frailty phenotype and cancer incidence in UK Biobank (n = 453,144). Data are hazard ratios (95% confidence intervals) unless otherwise indicated
Table 3 Associations between frailty index and cancer incidence in SALT (n = 36,888). Data are hazard ratios (95% confidence intervals) unless otherwise indicated

We examined the discrimination ability of frailty scores for cancers using the Harrell’s C-statistics (Supplementary Table 6). Compared to the models including age, sex, and traditional cancer risk factors as explanatory variables, adding frailty scores did not improve discrimination for any cancer and most cancer types. There was only slight improvement in the C-statistic (0.784 vs. 0.779) when adding FI to the model including common risk factors (e.g., age, sex, and smoking) in predicting lung cancer in UKB (Supplementary Table 6).

In subgroup analysis, the associations between baseline frailty scores and incidence of any cancer were stronger in men vs. women (Pinteraction < 0.001 for FI and Pinteraction = 0.040 for FP) and in ever-smokers vs. non-smokers (Pinteraction < 0.001 for both FI and FP) in UKB (Supplementary Table 7). No significant interactions were observed across the subgroups in SALT (Supplementary Table 8).

When allowing for time-varying hazards, we observed relatively stable associations between both the FI and FP and any cancer over the follow-up period in UKB (Supplementary Fig. 4). However, in SALT, the association between FI and any cancer was the strongest at approximately one year after baseline and was attenuated after four years since follow-up (Supplementary Fig. 4). The association between FI and any cancer remained essentially unchanged after removing self-reported cancer items from the FI in both samples (Supplementary Table 9). Moreover, when removing lung cancer-related items from the FI in UKB, the FI-lung cancer association remained robust and statistically significant (Supplementary Table 10). Finally, we observed largely consistent results in the complete-case analysis (Supplementary Table 11).

Within-twin-pair analysis

To test whether the association between FI and risk of any cancer in SALT was due to shared familial confounding, we performed a within-twin-pair analysis in DZ and MZ twins. We first obtained population-level estimates in multivariable Cox models for DZ (HR per 10% increase in FI = 1.13; 95% CI = 1.08–1.19) and MZ twins (HR per 10% increase in FI = 1.04; 95% CI = 0.95–1.14) and found that they were comparable to the population-level estimate in the full SALT sample (Fig. 2). Compared to population-level estimates, the association between FI and risk of any cancer was similar in the within-pair analysis in DZ twin pairs (HR per 10% increase in FI = 1.13; 95% CI = 1.04–1.24), but was attenuated in MZ pairs (HR per 10% increase in FI = 0.86; 95% CI = 0.71–1.04) (Fig. 2).

Fig. 2
figure 2

Association between the frailty index and risk of any cancer from the multivariable Cox models and with-pair analyses in the full SALT cohort (n = 36,888), complete DZ twin pairs (n = 9,920 pairs) and complete MZ twin pairs (n = 3,610 pairs). Error bars represent the 95% confidence intervals. Closed symbols represent p < 0.005. Abbreviations: DZ, dizygotic; FI, frailty index; MZ, monozygotic; SALT, Screening Across the Lifespan Twin Study

Discussion

Using data from two large population-based cohorts, we showed that higher baseline frailty scores, measured using the FI and FP, were associated with an increased risk of any cancer after adjusting for common cancer risks such as age, sex, BMI, smoking, alcohol consumption and education. We also observed an elevated risk of lung cancer and its two main subtypes, adenocarcinoma and squamous cell carcinoma, with higher frailty scores in UKB. However, no statistically significant associations between frailty and the site-specific cancers were found in SALT. Frailty scores also provided limited added discriminative ability for cancers on top of age, sex, and traditional cancer risk factors. For any cancer, we performed a within-pair analysis in SALT and found that the association within DZ twins remained comparable to the population-level estimate, whereas the association within MZ twins was attenuated, indicating that genetic factors may in part explain the increased risk of incident cancer associated with frailty.

Given that both frailty and cancer are closely linked to aging and may share some of the underlying mechanisms such as chronic inflammation and immunosenescence [40,41,42], it is conceivable that there could be a bidirectional relationship between frailty and cancer. Nevertheless, prior research has primarily focused on the impact of frailty in older cancer patients [21]. Whether frailty may predict the risk of cancers in individuals without a history of cancer remain largely unexplored. The handful of studies to date mainly suggested no significant association between frailty scores and incident cancer [25,26,27]. For example, Aguayo et al. analyzed the association between 35 frailty scores, including the FI and FP, and incident self-reported cancer in 5,294 participants of the English Longitudinal Study of Ageing [26]. They found no significant associations in the fully adjusted models, and none of the frailty scores improved the C-statistic of the basic prediction model [26]. Klein et al. observed non-significant cross-sectional association between a functional frailty measure and self-reported cancer in 2,515 participants of the Beaver Dam Eye Study [25]. Petermann-Rocha et al. assessed the joint association of sarcopenia and FP on incident cancer in 341,668 UKB participants, but found no significant associations [27]. On the other hand, studies have suggested that some of the subcomponents of the FP may be associated with cancer risks. For example, a previous UKB study found that lower grip strength was associated with increased incidence of colorectal, lung, breast, and prostate cancers [43]. Similarly, low physical activity was found to be associated with higher risks of lung, breast, and colon cancers [44].

We can only speculate the reasons why we detected significant associations between frailty scores and incident cancers, while previous studies did not. The most likely explanations include the large sample and long follow-up. However, consistent with a previous study [26], we found that adding FI and FP to the model on any cancer resulted in no improvement of the predictive accuracy, while to the model on lung cancer resulted only in small improvement. It should also be noted that we found mixed results for lung cancer, with a strong association found in UKB but not in SALT. The discrepancy could be attributed to cohort or country-related differences other than the age of the participants or the follow-up time that were similar for both cohorts. We also found a reduced risk of melanoma with higher FI scores in UKB, which could be explained by the fact that frail individuals are usually less physically active and may spend less time outside and be exposed to ultraviolet radiation. Although we carefully controlled for several sun exposure and physical activity-related variables, there was a relatively high proportion of missing data in some of these variables, which may have led to misclassification. We were also unable to adjust for family history of melanoma as it was not available in UKB.

We performed a within-twin-pair analysis in SALT to assess if the observed association between frailty scores and any cancer may be influenced by shared genetic and/or environmental factors. The attenuation of the FI-cancer association within MZ twins implies that it could be attributable to genetic influences underlying both traits, instead of being causal. Intervening on frailty is therefore unlikely to result in significant improvement of cancer outcomes. However, this result should also be interpreted with caution. The attenuation of the association between the FI and any cancer in the within-pair analysis of MZ twins to < 1 is somewhat unexpected, given that the corresponding DZ estimate remained similar to population-level estimate in the full sample. In addition, when we performed the population-level analysis in the DZ and MZ twins, the estimates remained similar to the population-level estimate in the full sample, indicating that characteristics of the MZ twin sample per se are unlikely to explain the estimate of < 1. Of note, within-pair-estimates may be biased in the presence of confounding not shared by twins and measurement error in the exposure [45]. The CI was also wide for the within-pair estimate in MZ twins due to the relatively small sample size (n = 3,610 complete pairs). Hence, more studies are warranted to explore to which extent genetic factors account for the association between frailty and risk of cancer.

This study has strengths and limitations. The large sample consisting of two independent population-based cohorts enabled us to have enough statistical power to delineate the relationship between frailty and cancers that has thus far been understudied. The use of two different frailty measures also allowed us to compare and confirm the associations. However, UKB may not be representative to the general population due to healthy selection [31]. In particular, cancer incidence in UKB has shown to be lower than that in the general population, especially for older adults and women [31]. Besides, as we included mostly white participants, our results may not be generalizable to other populations. As in other observational studies, although we adjusted for several socio-economic and lifestyle factors, there may still be residual or unmeasured confounding. We also could not account for the stage or severity of cancer due to a lack of related information.

To conclude, this study is the largest so far to assess the predictive value of frailty on incident cancer, demonstrating that baseline frailty scores are significantly associated with a higher incidence of any cancer and lung cancer. Meanwhile, the associations may be confounded by genetic factors, and frailty scores provide only limited incremental discriminative ability over common cancer risk factors. Therefore, the currently available frailty scores may not be the optimal tools for cancer risk stratification at the population level.