Introduction

Radon is an established lung carcinogen and an important occupational and environmental cause of lung cancer (UNSCEAR 2020). This was demonstrated in residential radon studies in the general population and in studies of uranium and other radon-exposed underground miners. Cohorts of uranium miners continue to form an important basis for radiation protection standards for radon progeny. They consistently show that the excess relative rate (ERR) of lung cancer mortality increases linearly with increasing cumulative exposure to radon progeny (in the following abbreviated to “radon”) in WLM and that the ERR/WLM is modified by attained age, time since exposure or age at exposure, and exposure rate (NRC 1999, UNSCEAR 2020). The calculation of the lifetime excess absolute risk (LEAR) allows the comparison of estimates of the ERR/WLM obtained from different studies with different characteristics while using the same exposure scenario and baseline mortality rates. Such estimates are also useful for policy considerations. The LEAR of lung cancer related to exposure to radon for example has been used in the past as the basis for the dose conversion convention for radon (ICRP 1993, 2010). This method has been used to convert measured radon activity concentrations into an effective dose in mSv, which is important to check the compliance with occupational radiation limits given in mSv. In this "epidemiological" approach of dose conversion, the LEAR of lung cancer per unit of exposure to radon progeny is divided by the detriment (representing the harm) per unit of effective dose (ICRP 2010). Determining the most appropriate value of this dose conversion coefficient has been the subject of much controversy in recent years (Harrison et al. 2020, 2021; Laurier et al. 2020; Marsh et al. 2021).

While previously an LEAR of 2.8 × 104 per WLM was assumed based on risk models derived from a meta-analysis of 7 miners cohort studies (ICRP 1993), this value was revised to 5 × 104 per WLM by the International Commission on Radiological Protection (ICRP) in 2010 (ICRP 2010) based on new risk models from a pooled analysis of 11 miners studies (BEIR VI study) (NRC 1999) and a pooled Czech/French study (Tomasek et al. 2008a). Both LEAR calculations used a mixed male/female Euro-American-Asian population for baseline rates of lung cancer mortality (ICRP 2007) and assumed an exposure scenario of 2 WLM per year between age 18 and 64 years. In 2020, the United Nations Scientific Committee on the Effects of Atomic Radiation (UNSCEAR) reviewed epidemiological studies and calculated the LEAR per WLM for death from lung cancer in a similar manner using data from four miners studies (UNSCEAR 2020; Tomasek 2020), among them for the first time the large German Wismut cohort (Kreuzer et al. 2018). The LEAR ranged from 2.4 (Wismut cohort) to 7.5 (Eldorado cohort) × 104 per WLM. Heterogeneity in radiation risk estimates between studies may explain differences in the LEAR and could be due to several factors: differences in the range of cumulative exposure and exposure rate, concomitant exposures to other lung cancer carcinogens, duration of follow-up and employment, methods of mortality follow-up, composition of the study population, existence and control for potential confounders, measurement error, loss to follow-up and competing risks for mortality, statistical power, and also statistical methods.

A major step forward was therefore the worldwide pooling of uranium miners studies, the Pooled Uranium Miners Analysis (PUMA) study (Rage et al. 2020; Richardson et al. 2021), which aims to get more precise estimates of the lung cancer risk associated with radon based on standardized statistical analyses of existing cohorts. PUMA includes twice as many uranium miners and about three times as many lung cancer deaths (Rage et al. 2020) as the pooled BEIR VI study (NRC 1999). The majority of included studies have an updated mortality follow-up and all studies follow a common study protocol and statistical methods. Recently, two papers on radon-lung cancer mortality associations among men in PUMA have been published, addressing: (1) the 1960 + sub-cohort of miners hired in 1960 or later (Richardson et al. 2022) with chronic low radon exposures and exposure rates mostly based on measurements, and (2) the full PUMA cohort (Kelly-Reif et al. 2023) including very high radon exposures from the early years of mining and low radon exposures in the later years.

The aim of the present paper is to calculate the LEAR per WLM for death from lung cancer using the new risk models based on the pooled data of the PUMA study and the risk models of previously published uranium miners studies, including the recently updated German Wismut cohort (Kreuzer et al. 2023), while using the same methods for all analyses. To be comparable to previous LEAR calculations as in UNSCEAR (2020) and ICRP (2010), the exposure scenario was defined as 2 WLM per year from age 18–64 years, and baseline mortality rates of the ICRP mixed Euro-American-Asian population (ICRP 2007) were chosen.

Methods

PUMA data

The PUMA study includes seven cohorts from Canada, the Czech Republic, France, Germany, and USA, which have been previously described in detail (Rage et al. 2020; Richardson et al. 2022; Kelly-Reif et al. 2023). The ERR/WLM was estimated in analyses of men included in PUMA based on the BEIR VI exposure–age–concentration model (NRC 1999, UNSCEAR 2020) and an alternative risk model (i.e., the BEIR VI model, but with age at exposure instead of time since exposure). The corresponding statistical methods and findings have been published for the full cohort (Kelly-Reif et al. 2023) and the PUMA 1960 + sub-cohort of miners hired in 1960 or later (Richardson et al. 2022). Main characteristics of both cohorts are described briefly in Table 1.

Table 1 Characteristics of the PUMA full cohort and PUMA 1960 + sub-cohort of miners hired in 1960 or later

Statistical methods

Lifetime risks reflect the probability of developing or dying from a specific disease of interest (here: lung cancer mortality) in the course of a lifetime. The lifetime excess absolute risk (LEAR) is defined as the difference between the lifetime risk \({\text{LR}}_E\) for an individual with exposure \(E\) (here: exposure to occupational radon) and the lifetime risk \({\text{LR}}_0\) for an individual without exposure

$$\begin{aligned} {\text{LEAR}} & = {\text{LR}}_E - {\text{LR}}_0 \\ & = \mathop \int \limits_0^\infty r_E \left( a \right)S\left( a \right)da - \mathop \int \limits_0^\infty r_0 \left( a \right)S\left( a \right)da, \\ \end{aligned}$$

with survival function \(S\left( a \right) = e^{ - \mathop \smallint \limits_0^a q_0 \left( u \right)du}\) describing the probability to survive until age \(a\), and baseline mortality rates for all causes of death \(q_0 \left( a \right)\) and for lung cancer \(r_0 \left( a \right)\) at age \(a\) in absence of exposure. The lung cancer mortality rate \(r_E (a)\) at age \(a\) under exposure is assumed to follow the typical general model structure \(r_E (a)\) = \(r_0 \left( a \right)\left( {1 + {\text{ERR}}\left( a \right)} \right)\) with excess relative risk term \({\text{ERR}}\left( a \right)\). Based on this assumption, the LEAR can be approximated and finally technically calculated by

$$\begin{aligned} {\text{LEAR}} & \approx \mathop \sum \limits_{a = 0}^{a_{\max } } r_E \left( a \right)\tilde{S}(a) - \mathop \sum \limits_{a = 0}^{a_{\max } } r_0 \left( a \right)\tilde{S}(a) \\ & = \mathop \sum \limits_{a = 0}^{a_{\max } } r_0 \left( a \right){\text{ERR}}\left( a \right)\tilde{S}\left( a \right), \\ \end{aligned}$$

where \(\tilde{S}(a) = e^{ - \mathop \sum \limits_{u = 0}^{a - 1} q_0 (u)}\) approximates the survival function \(S\left( a \right)\). The \({\text{ERR}}\left( a \right)\) depends on an exposure pattern and a specific risk model, e.g., with a structure as in the BEIR VI exposure–age–concentration model, and a lag time. The final summary result is reported as the LEAR per WLM, obtained by dividing the calculated LEAR by the cumulative exposure accrued over the entire exposure scenario (here: 94 WLM). For example, an LEAR for lung cancer mortality per WLM of 5 × 10–4 means that among 100 people with a cumulative occupational radon exposure of 100 WLM five additional (excess) lung cancer deaths would occur due to this exposure during lifetime.

Lag time

A lag is assumed between exposure to radon and any observed change in the lung cancer mortality rate. In the risk models employed here, the lag is either directly described by the model structure (e.g., BEIR VI exposure–age–concentration model with \({\text{ERR}}(a) = 0\) for \({\text{TSE}}(a) < 5\)) or by the data grouping process prior to any model fit as for the parametric risk models with continuous effect modifying variables (e.g., for the Czech/French cohort, Tomasek et al. 2008b). In that analysis, a miner’s exposure is lagged a priori by \(L = 5\) years. A lag assumption may be incorporated in the LEAR calculation by calculating \({\text{ERR}}(a)\) at age \(a\) only with information about radon exposure until age \(a - L\). However, doing so would violate the important equation \(a = {\text{AME}}(a) + {\text{TSME}}(a)\), with \({\text{AME}}\) being the time-varying age at median exposure and \({\text{TSME}}\) the time since median exposure. This is technically solved by considering \({\text{AME}}(a - L)\) and \({\text{TSME}}\left( {a - L} \right) + L\) in the calculation of \({\text{ERR}}(a)\).

LEAR calculations

For the calculation of LEAR, the maximum age was set to \(a_{\max } = 94\), i.e., the LEAR was calculated up to age < 95 years. The baseline lung cancer mortality rates \(r_0 (a)\) and all-cause mortality rates \(q_0 (a)\) were taken from the ICRP mixed Euro-American-Asian population (ICRP 2007) to be comparable with previous publications (UNSCEAR 2020, Tomasek et al. 2008b). According to UNSCEAR (2020) and other LEAR calculations (ICRP 2010, Tomasek et al. 2008b), the exposure scenario was defined as 2 WLM per year from age 18 to 64 years with a lag of \(L = 5\) years.

To compare LEAR estimates for mortality from lung cancer of the PUMA study with those from previous studies, the LEAR per WLM for all published risk models of uranium miners studies that include time- and age-related effect modifiers have been re-estimated, while using the same exposure scenario, baseline rates, and survival function. For this reason, some estimates may slightly differ from previously published LEAR values. The coefficients describing the relative risk model were the values as reported in the original papers, and are described in Tables 2 and 3 for the PUMA study and in Supplementary Tables 1–3 for other studies. The LEAR for the complete exposure scenario (i.e., 2 WLM per year from age 18 to 64 years, resulting in a cumulative exposure of 94 WLM) can be obtained by multiplying the value for the LEAR per WLM with 94. All LEAR calculations were performed with the statistical software R (R Core Team 2022).

Table 2 Lifetime excess absolute risk (LEAR) estimates obtained using a model with effect modifiers defined by categories of time since exposure, attained age, and annual exposure rate (BEIR VI exposure–age–concentration model) in the full PUMA cohort and the PUMA 1960 + sub-cohort
Table 3 Lifetime excess absolute risk (LEAR) estimates obtained using a model with effect modifiers defined by categories of age at exposure, attained age, and annual exposure rate in the full PUMA cohort and the PUMA 1960 + sub-cohort

Results

Table 2 shows the radon-related lung cancer risk in the full PUMA cohort and in the PUMA 1960 + sub-cohort based on the BEIR VI exposure–age–concentration model with categorical effect modifiers time since exposure, attained age, and annual exposure rate. The ERR/100 WLM at attained age < 55 years, 5–14 years since exposure, and exposure rate < 0.5 WL was 4.68 (95% CI: 2.88, 6.96) and 6.98 (95% CI: 1.97, 16.15) in the full cohort (Kelly-Reif et al. 2023) and 1960 + sub-cohort (Richardson et al. 2022), respectively. The estimated ERR/100 WLM decreased with increasing attained age, radon exposure rate and time since exposure, the latter decrease, however, is only present in the full cohort and not the 1960 + sub-cohort. The estimated LEAR per WLM is slightly higher in the 1960 + sub-cohort compared with the full cohort (7.50 × 104 vs 5.38 × 104, respectively). This is also illustrated in Fig. 1 (upper part) where the \({\text{ERR}}(a)\) is plotted as a function of attained age, \(a\), under the exposure scenario of interest (i.e., 2 WLM per year from age 18 to 64 years). Notably, using the model coefficients derived for the 1960 + sub-cohort, the ERR/100 WLM increases slightly after age 75 years, which is mainly due to the value of the parameter estimate for the effect modifier time since exposure. The estimated value of the coefficient for this modifier was highest for the category 5–14 years after exposure (reference category 1.0), decreased for the category 15–24 years after exposure to 0.64, and increased again for the category 25 years or more after exposure to 0.89. The bottom part of Fig. 1 shows the corresponding age-specific contribution to LEAR, \(r_0 \left( a \right){\text{ERR}}(a)\tilde{S}(a)\) for each age \(a\). Within the full PUMA cohort, the largest LEAR contribution is observed at ages 70–75 years, which is 5–10 years after the maximum cumulative exposure is reached. From age 75 years onwards, there is a strong decrease in the age-specific contribution to the LEAR which reflects the decreasing baseline lung cancer mortality rates \(r_0 \left( a \right)\), the decrease in \({\text{ERR}}(a)\) with increasing time since exposure, and the decreasing fraction of the cohort who remains at risk of lung cancer (Supplementary Fig. 1). For the PUMA 1960 + sub-cohort, a similar pattern is observed; however, the peak in the contribution to LEAR is between 60 and 65 years, thus 10 years earlier than in the full PUMA cohort.

Fig. 1
figure 1

LEAR components by attained age (Upper part: \({\text{ERR}}(a)\), Bottom part: age-specific contribution to LEAR, \(r_0 \left( a \right){\text{ERR}}(a)\tilde{S}(a)\)) predicted in the full PUMA cohort (Kelly-Reif et al. 2023, solid line) and the PUMA 1960 + sub-cohort (Richardson et al. 2022, dashed line) for the exposure scenario of 2 working level months (WLM) per year from age 18 to 64 up to age < 95 years, assuming a 5-year lag for the BEIR VI exposure–age–concentration model, and using baseline mortality rates derived from the ICRP mixed Euro-American-Asian population (ICRP 2007)

Table 3 presents the LEAR per WLM based on an alternative risk model for the PUMA study with categorical effect modifiers age at exposure, attained age, and exposure rate (i.e., the BEIR VI model, but with age at exposure instead of time since exposure). The ERR/WLM decreases with increasing attained age and increases with increasing age at exposure in both cohorts. The corresponding LEAR per WLM is 7.66 × 104 in the 1960 + sub-cohort and 5.57 × 104 in the full cohort, respectively, and thus comparable to that based on the BEIR VI exposure–age–concentration model. Again, the estimated LEAR is higher in the 1960 + sub-cohort compared to the full cohort. Supplementary Fig. 2 (upper part) shows that the \({\text{ERR}}(a)\) is highest at younger attained ages, however no further decrease in \({\text{ERR}}(a)\) is observed after age 65 years in the full cohort or after age 75 years in the 1960 + sub-cohort. Supplementary Fig. 2 (bottom part) provides a similar pattern as for the BEIR VI model. Again, in the full PUMA cohort, there is an increase in the age-specific contribution to LEAR up to age 70–75 years and then a strong decrease. This corresponding peak in the 1960 + sub-cohort is again at ages 60–65 years.

To compare the LEAR per WLM of the PUMA study with those estimated in the previous studies, the LEAR per WLM for all published risk models of uranium miners studies that include time- and age-related effect modifiers have been re-estimated. Table 4 provides an overview of these studies, their characteristics, and related LEAR per WLM. The very first published study providing a relative risk model was a meta-analysis of 7 cohorts, the so-called “Jacobi study” (ICRP 1993; Chmelevsky et al. 1994) with a re-estimated LEAR per WLM of 3.20 × 104 based on 1047 lung cancer deaths and 31,486 miners. The cohort included a wide range of exposures and exposure rates; however, risk models did not account for exposure rate. This may have introduced an underestimation of true risk at low exposures due to ignoring the well-established inverse exposure-rate effect (NRC 1999, UNSCEAR 2020). In 1999, the results of the pooled analyses of the 11 miners cohort study were published (NRC 1999), including more than twice the number of miners (n = 67,897) and three times more lung cancer deaths (n = 2787) than the Jacobi study. In addition, as a new model, the BEIR VI exposure–age–concentration model was developed and applied (NRC 1999). For this risk model, the estimated LEAR per WLM was 5.97 × 104 and thus two times higher compared to the Jacobi study.

Table 4 Lifetime excess absolute risk (LEAR) estimates obtained using previously published risk models of uranium miners cohort studies that include time- and age-related effect modifiers

The BEIR VI pooled analysis did not include the newly established large German Wismut cohort (Grosche et al. 2006). The full Wismut cohort comprises 58,974 workers and 3942 lung cancer deaths at end of follow-up in 2013 (Kreuzer et al. 2018) and resulted in an LEAR per WLM of 2.50 × 104. With the extended mortality follow-up to end of 2018 and additional baseline stratification by duration of employment like in the PUMA cohort, the LEAR per WLM increased to 3.13 × 104 (Kreuzer et al. 2023). Two smaller individual studies, the Czech (UNSCEAR 2020) and Eldorado (Lane et al. 2010) cohorts, showed LEAR estimates per WLM of 4.22 × 104 and 8.20 × 104, respectively. The PUMA full cohort is currently the largest study with 119,709 miners and 7754 lung cancer deaths and integrates most of the updated studies included in BEIR VI and the Wismut cohort. The estimated LEAR per WLM of 5.38 × 104 or 5.57 × 104 (depending on choice of model) is consistent with that of the BEIR VI study and two times higher than that for the full Wismut cohort.

Table 4 additionally provides information on cohorts restricted to chronic low exposures and exposure rates. The estimated LEAR per WLM was around 4.6 × 104 for two smaller studies, the pooled analyses of full Czech and French cohorts with restriction of person-years at risk to measured radon exposure (Tomasek et al. 2008a) and of the Czech, French and Eldorado sub-cohorts with restriction to more recent years and exposures less than 100 WLM (Lane et al. 2019). The LEAR in the Wismut 1960 + sub-cohort with end of follow-up 2013 and 2018 (Kreuzer et al. 2023) were 9.22 × 104 and 6.10 × 10−4, respectively. Among these low exposure/exposure-rate studies, the PUMA 1960 + sub-cohort is by far the largest study (57,873 miners and 1217 lung cancer deaths) and involves the lowest average radon exposure (13 WLM), the corresponding LEAR was around 7.50 × 104. Compared to the respective full cohorts, the LEAR of the 1960 + sub-cohorts of the PUMA and the Wismut study were somewhat higher.

Discussion

PUMA provides the largest and most informative database to date to estimate the risk of death from lung cancer due to cumulative radon exposure in studies of uranium miners. The LEAR per WLM is estimated to lie between 5.38 × 104 and 7.66 × 104 depending on the choice of model and the use of the full cohort or the 1960 + sub-cohort with a focus on more recent periods of chronic low exposure. While the choice of model within a given cohort has a nearly negligible effect on the resulting LEAR, the consideration of either the full cohort or 1960 + sub-cohort makes a difference, with somewhat higher LEAR results for the latter cohort. In contrast to the PUMA full cohort, in the 1960 + sub-cohort the estimated parameters of the relative risk model have less heterogeneity between studies, but wider confidence intervals.

Comparison of results from full and 1960 + sub-cohorts

In the full PUMA cohort, heterogeneity in risk estimates between studies has been reported by Kelly-Reif et al. (2023), which was in part attributed to the Wismut study, which forms half of the data of PUMA (2.2 out of 4.3 million person-years at risk). The PUMA full cohort excluding the Wismut study would result in an LEAR per WLM of 8.78 × 104. This restricted cohort differs from the PUMA Wismut cohort in some characteristics, e.g., appreciably lower exposures and shorter duration of employment (see also Table 1). For example, within the full PUMA study, 82% of the person-years at risk accrued from radon exposures above 250 WLM and about 70% of all person-times at risk with duration of employment more than 10 years are from the Wismut cohort (Kelly-Reif et al. 2023 Suppl. Table 1), respectively. It is unclear whether this difference has some influence on the risk estimates. The overview on LEAR estimates from published uranium miners studies in Table 4 shows that the findings of the full Wismut cohort are at the lower end of the range of all calculated LEAR. Possible reasons for this observed lower estimated risk like competing risk of silicosis, measurement error in exposure assessment, or possibly incomplete follow-up in the very early years (1946–1960) were addressed in detail in Kreuzer et al. (2023).

In contrast to the analyses based on full cohorts (Kelly-Reif et al. 2023), PUMA analyses of the 1960 + sub-cohorts did not provide any evidence of heterogeneity in risk estimates between studies (Richardson et al. 2022). The 1960 + sub-cohorts allow direct estimation of health effects of chronic exposure to low radon concentrations at low exposure rates which is of interest for radiation protection today. It also allows to exclude miners with extreme levels of exposure (estimated effective doses for some miners employed in the early years could reach several hundreds or thousands of mSv per year) (Laurier 2020). In addition, no complex modeling of exposure rate is necessary as compared to the full cohort; in several of the component studies, exposure rates were one or two orders of magnitude higher in the early years compared to 1960 or later. Furthermore, exposure assessment in these later years was often based on measurements rather than on expert rating. A higher quality of exposure assessment decreases measurement error and thus the potential for underestimation of risk. However, the 1960 + sub-cohorts involve lower statistical power due to smaller size, high uncertainty in parameter estimates, shorter duration of follow-up, and younger age compared to the full cohorts. The observed increase in ERR at older ages in the PUMA 1960 + sub-cohort (Fig. 1 upper part) and particularly in the Wismut 1960 + sub-cohort with end of follow-up 2013 (Kreuzer et al. 2023, Supplementary Fig. 3) seems implausibly high. In these young 1960 + sub-cohorts, lung cancer deaths are still rare at ages over 75 years and at more than 35 years since exposure (see Tables 2 and 3). Consequently, it is likely that the decrease in ERR/WLM with increasing time since exposure and attained age cannot be completely described by the data of 1960 + sub-cohorts. For example, in the Wismut 1960 + sub-cohort, the extension of end of follow-up from 2013 to 2018 led to a decrease of LEAR per WLM from 9.22 to 6.10 × 104 (Kreuzer et al. 2023 Suppl. Table 3). Thus, further follow-up of individual PUMA studies will allow refining risk estimates derived from 1960 + cohorts in the future.

Strengths and limitations

The current calculations of the LEAR for lung cancer due to radon from various uranium miners studies offer several strengths. First, similar methods have been used, and thus, LEAR values based on different studies and relative risk models are directly comparable. Second, for the first time, LEAR was calculated based on the worldwide largest and most informative study PUMA. More than 4.3 million person-years at risk and nearly 8000 lung cancer deaths with a long duration of follow-up form the basis for PUMA (Rage et al. 2020; Richardson et al. 2021, 2022; Kelly-Reif et al. 2023). This large database allows—in contrast to many individual studies—for detailed consideration of relevant effect modifiers age, time since exposure and exposure rate in the risk model, a recommendation that was recently reinforced by UNSCEAR (2020). Third, the LEAR were determined for cohorts restricted to low exposures and exposure rates including all three effect modifiers in the risk models.

A limitation of the current LEAR analyses is that many factors with potential influence on the LEAR have not yet been evaluated. This concerns (1) the use of different and more suitable baseline mortality rates as well as evaluation of effects of the increasing survival trend for lung cancer, (2) consideration of smoking (interaction of smoking with radon, change of smoking patterns over time), (3) application of other scenarios from occupational or residential radon exposure, (4) consideration of annual instead of average exposure rates in risk models (Tomasek 2020), (5) use of different risk projection models (relative/additive/mixed), and (6) evaluation of uncertainties associated with LEAR estimates (e.g., confidence intervals). A general limitation of all the uranium miners studies considered in this paper is that they include only men and that only mortality and no incidence data for lung cancer are available.

Conclusion

PUMA clearly strengthens evidence on the shape of the exposure–response relationship between radon exposure and lung cancer mortality in uranium miners and thus the estimation of the LEAR. The range of currently available LEAR values for lung cancer at low exposures and exposure rates derived from different models and previous publications based on smaller studies is 2.5 to 9.2 × 10–4 per WLM, with the current PUMA findings (5.4 up to 7.7 × 104 per WLM) being in the upper half of this range. Continued mortality follow-up of the studies included in PUMA, particularly of the 1960 + sub-cohorts, is expected to provide additional insights and is therefore strongly recommended.