Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Prevalence of lung cancer in Colombia and a new diagnostic algorithm using health administrative databases: A real-world evidence study

  • Javier Amaya-Nieto ,

    Contributed equally to this work with: Javier Amaya-Nieto, Gabriel Torres, Giancarlo Buitrago

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Visualization, Writing – original draft, Writing – review & editing

    jamayan@unal.edu.co

    Affiliations Department of Research and Innovation, Hospital Universitario Nacional de Colombia, Bogotá, Colombia, Health Systems and Services Research Group, Universidad Nacional de Colombia, Bogotá, Colombia, Instituto de Investigaciones Clínicas, Universidad Nacional de Colombia, Bogotá, Colombia

  • Gabriel Torres ,

    Contributed equally to this work with: Javier Amaya-Nieto, Gabriel Torres, Giancarlo Buitrago

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Health Systems and Services Research Group, Universidad Nacional de Colombia, Bogotá, Colombia

  • Giancarlo Buitrago

    Contributed equally to this work with: Javier Amaya-Nieto, Gabriel Torres, Giancarlo Buitrago

    Roles Conceptualization, Formal analysis, Funding acquisition, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Research and Innovation, Hospital Universitario Nacional de Colombia, Bogotá, Colombia, Health Systems and Services Research Group, Universidad Nacional de Colombia, Bogotá, Colombia, Instituto de Investigaciones Clínicas, Universidad Nacional de Colombia, Bogotá, Colombia

Abstract

Reliable, timely and detailed information on lung cancer prevalence, mortality and costs from middle-income countries is essential to policy design. Thus, we aimed to develop an electronic algorithm to identify lung cancer prevalent patients in Colombia by using administrative claims databases, as well as to estimate prevalence rates by age, sex and geographic region. We performed a cross-sectional study based on national claim databases in Colombia (Base de datos de suficiencia de la Unidad de Pago por Capitación and Base de Datos Única de Afiliados) to identify lung cancer prevalent patients in 2017, 2018 and 2019. Several algorithms based on the presence or absence of oncological procedures (chemotherapy, radiotherapy and surgery) and a minimum number of months that each individual had lung cancer ICD-10 codes were developed. After testing 16 algorithms, those with the closest prevalence rates to those rates reported by aggregated official sources (Global Cancer Observatory and Cuenta de Alto Costo) were selected. We estimated prevalence rates by age, sex and geographic region. Two algorithms were selected: i) one algorithm that was defined as the presence of ICD-10 codes for 4 months or more (the sensitive algorithm); and ii) one algorithm that was defined by adding the presence of at least one oncological procedure (the specific algorithm). The estimated prevalence rates per 100,000 inhabitants ranged between 11.14 and 18.05 for both, the contributory and subsidized regimes over years 2017, 2018 and 2019. These rates in the contributory regime were higher in women (15.43, 15.61 and 17.03 per 100,000 for years 2017, 2018 and 2019), over 65-years-old (63.45, 56.92 and 61.79 per 100,000 for years 2017, 2018 and 2019) who lived in Central, Bogota and Pacific regions. Selected algorithms showed similar aggregated prevalence estimations to those rates reported by official sources and allowed us to estimate prevalence rates in specific aging, regional and gender groups for Colombia by using national claims databases. These findings could be useful to identify clinical and economical outcomes related to lung cancer patients by using national individual-level databases.

Introduction

Lung cancer has progressed to be one of the most frequent and deadly variations of cancer worldwide, even in low-middle-income countries (LMICs) [1]. Despite advances in diagnosing and treating this disease, lung cancer prevalence and mortality have increased. According to the last report of The Global Cancer Observatory (GLOBOCAN), this type of cancer represents the leading cause of mortality in cancer, with 23 deaths per 100,000 being reported globally [2, 3]. The low prevalence rate in LMIC is a concern because 80% of smokers live in these countries [4].

Health outcomes are worse in Latin American countries than in other regions because of their fragmented and underbudgeted health systems [5]. For optimal cancer care, LMICs must demonstrate significant prevention and early diagnosis strategies, as well as high quality measures, in medication, surgery and radiation delivery backed by patient support. However, health systems in LMICs are precarious, and most people with this condition are diagnosed in late stages if their cancer occurs, thus causing higher mortality rates [6].

Lung cancer in Colombia has an important impact, as it represents the fourth place in mortality among men and the fifth place among women [7]. Furthermore, a demographic change in the population has been described, which indicates an increase in individuals aged over 60-years-old [8]. It has also been described that lung cancer risk increases over time, especially in elderly people over 65-years-old [9, 10]. All of these combined factors can cause an increase in prevalence rates in some regions of the country due to the high prevalence of smokers or mining activities [11, 12].

The Colombian health system covers 99% of Colombian citizens [13]. It has a centralized fund controlled by the government that gathers payroll taxes and contributes an additional amount of money to ensure health care for the population, including the poorest individuals of this population. At the level of the insurers, there are public and private health institutions (known as Empresas Promotoras de Salud [EPS]) that act as intermediate payers between the centralized fund and health service providers (known as Instituciones Prestadoras de Salud [IPS]). These IPSs compete to be included in each EPS service network. Moreover, the insured population can be classified into a group that contributes to the health system via payroll taxes (known as the contributory regime) and a group funded by the national government (known as the subsidized regime) [14, 15].

This study aimed to develop an electronic algorithm to identify lung cancer-prevalent patients over 20-years-old in Colombia by using official databases, as well as to estimate prevalence rates of lung cancer by age, sex, and geographic region. This information could provide valuable information to Colombian policymakers, clinicians and the world to improve knowledge about lung cancer in LMICs.

Methods

Ethics and approval

This research was evaluated and approved by the ethics committee of the Universidad Nacional de Colombia as required by the national law [16]. This ethics committee waived the requirement of inform consent considering it was retrospective research and all data was anonymized in the datasets used for the study.

Design and population

We performed a cross-sectional study based on national claims databases in Colombia (Base de datos de suficiencia de la Unidad de Pago por Capitación [UPC] and Base de Datos Única de Afiliados [BDUA]) to identify lung cancer-prevalent patients in 2017, 2018 and 2019. These databases contain information on the population that was registered yearly from 2015 to 2019; additionally, these databases provide information used for this research under academic agreements between the Ministry of Health (MoH) and Universidad Nacional de Colombia. All the populations over 20-years-old were evaluated to identify individuals with lung cancer.

Database characteristics

Several sources of information were used for this study. The first utilized source included the databases used by the Ministry of Health to calculate the sufficiency of the premiums that represents the amount of money that the government pays to insurers for each enrollee per year. It contained individual-level information about all the health services that were used by each person in the system reported yearly by each health provider. Data provided in this database include the location of each service that was used, the date of utilization, the type of health service, the diagnostic code (ICD-10) associated with each health service, the cost of the service, the health provider and the municipality. This information was available at an individual level and provided for the 10 largest insurers in the contributory regime that represent 88% of the contributory regime nationwide. In Colombia the contributory regime represents 45.8% (23.5 million people) of the population covered by the system. The system covers 99.06% of the population nationwide [17]. Information registered in the premium sufficiency databases was verified through a validation process designed by the Ministry of Health that involves correcting inconsistencies found in the data and the use of double validation with other databases used by the government. Information for the subsidized regime for Bogotá was also available by using the same information from the largest insurer in the city. The other database used includes sociodemographic and insurer (i.e., regime) information of all the enrollees in Colombia. These datasets were anonymized by the Ministry of Health before authorization of use was awarded to the Clinical Research Institute at Universidad Nacional de Colombia and they are further described in the supporting document (S1 File). In case datasets are required you can contact The Ministry of Health of Colombia using the official website: www.minsalud.gov.co or correo@minsalud.gov.co to get authorization for use.

Other secondary, aggregated and publicly available data sources were used. This information was collected from GLOBOCAN [18], National Cancer Institute [19] in Colombia and Cuenta de Alto Costo [20]. These data were used to verify the aggregated prevalence rates that were calculated in this study after having estimated differentiated prevalence rates over different age, region, and sex groups.

Algorithms of case identification

To identify individuals over 20-years-old with lung cancer diagnoses, this study used 16 different electronic algorithms. Some of the algorithms were based on identifying any ICD-10 codes for tumors of the trachea, bronchi or lungs with malignant behavior (for example, roots C33 and C34) during a minimum number of months and at least one oncological procedure in the same individual in the previous three years (e.g., 2017, 2016 and 2015 for identifying people with lung cancer in 2018). We called these algorithms as “specific”. The second type of algorithms were called as the “sensitive” algorithms by our team and aimed to identify patients by solely using the ICD-10 codes for tumors of the trachea, bronchi or lungs with malignant behavior during a minimum period of time. Specific and sensitive algorithms were compared, and two of them were chosen by selecting the closest prevalence rates to those rates reported by aggregated official sources (GLOBOCAN [18], National Cancer Institute [19] in Colombia and Cuenta de Alto Costo-Colombia [20]). Prevalence estimations were calculated for the contributory by different strata of age, geographical region, and sex. For the subsidized regime this estimations were performed by different strata of age and sex.

Prevalence method description

For global prevalence, the following equation summarizes the method that was used in this paper: where Pg is the general prevalence for Colombia for each year; Nglobal is the total number of patients affiliated to the system by June 2017, June 2018 and June 2019 in those insurers apporting information to the premium estimation(Unidad de Pago por Capitación); nijk is the number of prevalent cases identified for each region i, age group j and sex k in the contributory regime; and mijk is the number of prevalent cases identified for each age group j and sex k in the subsidized regime.

Results

After testing 16 algorithms, two electronic algorithms were selected, as they had the closest prevalence rates to those rates reported by aggregated official sources (GLOBOCAN [18], National Cancer Institute [19] in Colombia and Cuenta de Alto Costo-Colombia [20]). These comparisons were evaluated by comparing point estimates provided the mentioned sources to the confidence intervals calculated using bootstrapping with 1 000 subsamples as shown in Tables 1 and 2. The first selected algorithm was defined as the presence of ICD-10 lung cancer codes for four months or more (the sensitive algorithm), and the second selected algorithm was defined as the same criteria from the first algorithm and the presence of at least one oncological procedure (the specific algorithm).

thumbnail
Table 1. Comparison between the prevalence rates identified in this study versus official aggregated data sources for 2017, 2018 and 2019 in the contributory regime.

https://doi.org/10.1371/journal.pone.0269079.t001

thumbnail
Table 2. Comparison between the prevalence rates identified in this study versus official aggregated data sources for 2017, 2018 and 2019 in the subsidized regime.

https://doi.org/10.1371/journal.pone.0269079.t002

By using previously selected algorithms, the identified general prevalence rates in the contributory regime were 11.15(IC 95 7.87 to 14.59), 11.14 (IC95 8.78 to 13.95) and 12.37(IC 95 9.41; 14.83) per 100,000 people with the use of the sensitive algorithm for the years 2017, 2018 and 2019, respectively. Whereas prevalence rates estimated using the specific algorithm were 7.09(IC 95 5.07 to 9.40), 7.37 (IC95 5.89 to 9.35) and 8.82(IC 95 6.94; 10.55) per 100,000 people for years 2017, 2018 and 2019 respectively (Table 1).

General prevalence rates in the subsidized regime for the years 2017, 2018 and 2019 were 14.75 (IC95 1.90 to 26.34), 18.05 (IC95 2.38 to 34.70) and 15.78 (IC95 1.32 to 24.54) per 100,000 people. Using the specific algorithm, general prevalence rates were 8.77 (IC95 0.95 to 13.88), 10.48 (IC95 0.95 to 17.30) and 10.39 (IC95 0.88 to 14.21), respectively. Distribution of estimations by sex and age groups is show in Table 2.

Discussion

Using prevalence estimates is a useful strategy to design public policy and calculate attributable cost [21]. In this research two out of sixteen algorithms were selected as they had the closest prevalence rates to those rates reported by aggregated official sources. These algorithms were defined for this study as a sensitive algorithm and a specific algorithm because of their characteristics for identifying lung cancer cases, as described above. Using these two algorithms, the global prevalence rates for the contributory regime via the sensitive and specific algorithms ranged from 11.14 to 12.37 per 100,000 affiliates between 2017 and 2019 using the sensitive algorithm. In the case of the subsidized regime, general prevalence rates using the sensitive algorithm were 14.75, 18.05 and 15.78 cases per 100,000 affiliates for years 2017, 2018 and 2019, respectively.

Estimation methods used by the first two institutions were based on data obtained from samples of a limited number of patients with lung cancer attended at various providers around the country, thus introducing possible bias to the calculated rates [22, 23]. Furthermore, prevalence rates reported by Cuenta de Alto Costo were based on data provided by insurers using clinical reports as data source; although it included information from 98% of the total population affiliated with the health system, these reported data could have underestimated the real prevalence, as it depended on passive reporting [24, 25]. In the case of our research, the identification process involved the use of algorithms that take information from the usage of the reported services registered in national administrative health claims databases. This represents an advantage in comparison to the other sources, as our estimates rely on service usage rather than an active process of reporting or an estimation performed over a limited sample.

In this study, the estimations obtained by the two algorithms offer some additional advantages for policy-makers and clinicians. First, it allowed us to calculate prevalence rates differentiating between gender, regions and age groups. Second, it considered almost the total population affiliated with the contributory regime and the reported services that were utilized in the health system. Third, these estimations relied on the UPC administrative database, which represented consistent and validated data over the system, as it contains information with economical purposes. Fourth, it is possible to actively identify cases with lung cancer over the entire of population of patients attending to different providers. Finally, it is paramount to point out that these methods for identifying and estimating prevalence rates could help other LMICs to optimize their epidemiological surveillance system and to understand its impact in a context with different characteristics, compared to the developed world [2629].

As LMIC countries enact changes to the population aging pattern, there is also a change in the burden of disease patterns from communicable to noncommunicable diseases (such as lung cancer), and efficient estimations are required to follow up on trends in public health [28, 3032]. Authors such as Sierra et al. [33] have conducted research to address this problem and collected information from thirteen different countries in Central and South America. They found that lung cancer ranged from first to third regarding causes in incidence rates over these countries, thus showing concordance with a change in the aging patterns. In 2018, a study performed by Pilleron et al. [34] estimated that there were 679,000 new cases of cancer in Latin America and the Caribbean, and the most frequent variants were prostate, colorectal and lung cancer. While for 2020 the number of new cases estimated with lung cancer was 2,294,438 for the region according to GLOBOCAN and representing the second most frequent type of cancer [18]. The findings in this study showed a general prevalence in the contributory and the subsidized regimes ranging between 11.14 and 18.05 (using the sensitive algorithm) cases per 100,000 that positions the disease in the top ten list of prevalent cancer in the country and that corresponds to the prevalence that was identified in our study.

Another important aspect of the lung cancer prevalence rates is inequality and differences in its distribution. It has been observed that developed countries have increased prevalence rates in comparison to developing countries [35]. There is also evidence showing that at the country scale, there is a trend of higher prevalence rates within lower socioeconomic groups. This situation can be explained by differences in smoking rates or other exposure factors, such as chemical exposure, mining activities or higher exposure to air pollution that usually affects lower income population groups or states within the country [3639]. For this study, the results showed two important patterns related to his idea. The first result showed that there is a higher prevalence rate of the disease in regions with highly polluted cities or concentrations of mining activities. The second trend found that prevalence rates that were calculated in the subsidized regime were higher than in the contributory regime. This finding follows the literature and could be explained because of the inequalities in health care access that lead to worse healthcare conditions and there fore, more case of lung cancer being detected and treated. This ideas require more research in the future.

This study had several strengths and limitations. As a strength, this research used real-world evidence from administrative claim databases that were considered to be highly reliable regarding its quality and validation procedure. Information calculated via the sensitive and specific algorithms showed results that were similar to those reported by other institutions and a range of possible prevalence rates, rather than an absolute number. In contrast, there were some concerns regarding the estimation of prevalence rates in the subsidized regime because these data were calculated based on information provided by the largest insurer in the capital city; however, this is the best information available, and it could not be easily extrapolated to the country level. There is a natural challenge in estimating real prevalence rates of lung cancer via the selected algorithms because they identify only patients with the disease and are detected by the system. However, there is a proportion of patients with the disease that are not detected by the system; hence, they were not included in our estimation. When taking into account that health coverage in Colombia reaches 99% of the total population [13], we assume that this difference was very small. Finally, it is also possible that information bias affected the study; however, quality and validation processes that were implemented by MoH helped to control this risk.

Conclusion

Selected algorithms showed similar prevalence estimations to those reported by official sources and allowed us to estimate prevalence rates in specific aging, regional and gender groups for Colombia by using national claims databases. These findings could be useful to formulate new public health strategies and to measure prevalence trends in lung cancer at a national scale by using administrative databases.

Supporting information

S1 File. Data sources description and Stata code used.

https://doi.org/10.1371/journal.pone.0269079.s001

(PDF)

Acknowledgments

The authors thank Juan Camilo Forero and Juan Sebastian Castillo for their commentaries and support during the research process.

References

  1. 1. de Groot PM, Wu CC, Carter BW, Munden RF. The epidemiology of lung cancer. Transl Lung Cancer Res. 2018;7: 220–233. pmid:30050761
  2. 2. All cancers. 2020. Available: https://gco.iarc.fr/today
  3. 3. Cancer. [cited 24 Jan 2021]. Available: https://www.who.int/news-room/fact-sheets/detail/cancer
  4. 4. Bade BC, Dela Cruz CS. Lung Cancer 2020: Epidemiology, Etiology, and Prevention. Clin Chest Med. 2020;41: 1–24. pmid:32008623
  5. 5. Raez LE, Nogueira A, Santos ES, Dos Santos RS, Franceschini J, Ron DA, et al. Challenges in Lung Cancer Screening in Latin America. J Glob Oncol. 2018;4: 1–10. pmid:30241252
  6. 6. Nwagbara UI, Ginindza TG, Hlongwana KW. Health systems influence on the pathways of care for lung cancer in low- And middle-income countries: A scoping review. Global Health. 2020;16: 1–11. pmid:32188467
  7. 7. Costo C de A. Situación del cáncer en la población adulta atendida en el SGSSS de Colombia 2019. 2019.
  8. 8. Henríquez G, de Vries E. El efecto del envejecimiento para la carga de cáncer en Colombia: proyecciones para las primeras cinco localizaciones por departamento y sexo en Colombia, 2020 y 2050. Rev Colomb Cancerol. 2017;21: 104–112.
  9. 9. Venuta F, Diso D, Onorati I, Anile M, Mantovani S, Rendina EA. Lung cancer in elderly patients. J Thorac Dis. 2016;8: S908–S914. pmid:27942414
  10. 10. Junjun L, Pei W, Ying Y, Kui S. Prognosis and risk factors in older patients with lung cancer and pulmonary embolism: a propensity score matching analysis. Sci Rep. 2020;10: 1272. pmid:31988400
  11. 11. Ministerio de Salud y Protección Social. Guía de Práctica Clínica para la detección temprana, diagnóstico, estadificación y tratamiento del cáncer de pulmón. Bogotá; 2016.
  12. 12. Barta JA, Powell CA, Wisnivesky JP. Global epidemiology of lung cancer. Ann Glob Heal. 2019;85: 1–16. pmid:30741509
  13. 13. Páginas—Comportamiento del aseguramiento. [cited 27 Apr 2022]. Available: https://www.minsalud.gov.co/proteccionsocial/Regimensubsidiado/Paginas/coberturas-del-regimen-subsidiado.aspx
  14. 14. Lamprea E, García J. Closing the gap between formal and material health care coverage in Colombia. Health Hum Rights. 2016;18: 49–65. pmid:28559676
  15. 15. Prada SI, Perez AM, Nieto-Aristizábal I, Tobón GJ. Direct cost of lupus care in the developing world: the case of Colombia. Lupus. 2019;28: 970–976. pmid:31204587
  16. 16. Colombia. ministerio de salud y protección social de. Resolución 8430 de 1993. Bibl Digit. 1993.
  17. 17. Ministerio de salud y protección social de Colombia. Páginas—Cifras de aseguramiento en salud. [cited 1 Mar 2023]. Available: https://www.minsalud.gov.co/proteccionsocial/Paginas/cifras-aseguramiento-salud.aspx
  18. 18. Globocan. Cancer Today. In: 2018 [Internet]. [cited 16 Nov 2020]. Available: https://gco.iarc.fr/today/online-analysis-table?v=2018&mode=cancer&mode_population=continents&population=900&populations=900&key=asr&sex=0&cancer=39&type=0&statistic=5&prevalence=0&population_group=0&ages_group%5B%5D=0&ages_group%5B%5D=17&group_cancer=1&i
  19. 19. Cáncer en Cifras INC—Instituto Nacional de Cancerología. [cited 27 Apr 2022]. Available: https://www.cancer.gov.co/conozca-sobre-cancer-1/informacion-sobre-cancer-para-profesionales/cancer-cifras-inc
  20. 20. Cuenta de alto costo. Situación del cáncer en la población adulta atendida en el SGSSS de Colombia 2018. Bogotá D.C.; 2018.
  21. 21. Mariotto AB, Enewold L, Zhao J, Zeruto CA, Robin Yabroff K. Medical care costs associated with cancer survivorship in the United States. Cancer Epidemiol Biomarkers Prev. 2020;29: 1304–1312. pmid:32522832
  22. 22. Cancerología IN de. Análisis de situación del cáncer en Colombia: 2015. 2017.
  23. 23. Globocan. Fact sheet Tracheal, Bronchus and Lung Cancer. 2020.
  24. 24. Valencia O, Lopes G, Sánchez P, Acuña L, Uribe D, González J. Incidence and Prevalence of Cancer in Colombia: The Methodology Used Matters. J Glob Oncol. 2018;4: 1–7. pmid:30241220
  25. 25. Hernández Vargas JA, Ramírez Barbosa PX, Gil Quijano AM, Valbuena AM, Acuña L, González JA. Patterns of breast, prostate and cervical cancer incidence and mortality in Colombia: an administrative registry data analysis. BMC Cancer. 2020;20: 1097. pmid:33176754
  26. 26. Mills A. Health Care Systems in Low- and Middle-Income Countries. N Engl J Med. 2014;370: 552–557. pmid:24499213
  27. 27. Hung YW, Hoxha K, Irwin BR, Law MR, Grépin KA. Using routine health information data for research in low- and middle-income countries: a systematic review. BMC Health Serv Res. 2020;20: 790. pmid:32843033
  28. 28. Fanelli S, Salvatore FP, De Pascale G, Faccilongo N. Insights for the future of health system partnerships in low- and middle-income countries: a systematic literature review. BMC Health Serv Res. 2020;20: 571. pmid:32571317
  29. 29. Piñeros M, Abriata MG, Vries E, Barrios E, Bravo LE, Cueva P, et al. Progress, challenges and ways forward supporting cancer surveillance in Latin America. Int J Cancer. 2020; 1–9. pmid:33231289
  30. 30. Lanas F, Seron P. Understanding life and death in Latin America. Nat Med. 2021;27: 373–375. pmid:33664489
  31. 31. Curcio C-L, Pineda A, Quintero P, Rojas Á, Muñoz S, Gómez F. Successful Aging in Colombia: The Role of Disease. Gerontol Geriatr Med. 2018;4: 2333721418804052–2333721418804052. pmid:30397638
  32. 32. The Demography of Aging in Low- and Middle-Income Countries: Chronological versus Functional Perspectives—Future Directions for the Demography of Aging—NCBI Bookshelf. [cited 28 Apr 2021]. Available: https://www.ncbi.nlm.nih.gov/books/NBK513069/
  33. 33. Sierra MS, Soerjomataram I, Antoni S, Laversanne M, Piñeros M, de Vries E, et al. Cancer patterns and trends in Central and South America. Cancer Epidemiol. 2016;44: S23–S42. pmid:27678320
  34. 34. Pilleron S, Soerjomataram I, Soto-Perez-de-Celis E, Ferlay J, Vega E, Bray F, et al. Aging and the cancer burden in Latin America and the Caribbean: Time to act. J Geriatr Oncol. 2019;10: 799–804. pmid:30853302
  35. 35. Wang Z, Hu L, Li J, Wei L, Zhang J, Zhou J. Magnitude, temporal trends and inequality in global burden of tracheal, bronchus and lung cancer: Findings from the Global Burden of Disease Study 2017. BMJ Glob Heal. 2020;5: 1–10. pmid:33028698
  36. 36. Soerjomataram I, Barendregt JJ, Gartner C, Kunst A, Møller H, Avendano M. Reducing inequalities in lung cancer incidence through smoking policies. Lung Cancer. 2011;73: 268–273. pmid:21333375
  37. 37. Jenkins WD, Christian WJ, Mueller G, Robbins KT. Population cancer risks associated with coal mining: a systematic review. PLoS One. 2013;8: e71312–e71312. pmid:23977014
  38. 38. Luo J, Hendryx M, Ducatman A. Association between six environmental chemicals and lung cancer incidence in the United States. J Environ Public Health. 2011/07/10. 2011;2011: 463701. pmid:21776439
  39. 39. Lepeule J, Laden F, Dockery D, Schwartz J. Chronic exposure to fine particles and mortality: an extended follow-up of the Harvard Six Cities study from 1974 to 2009. Environ Health Perspect. 2012;120: 965–970. pmid:22456598