The role of causal inference in health services research II: a framework for causal inference

Moser, André; Puhan, Milo A.; Zwahlen, Marcel

doi:10.1007/s00038-020-01334-1

The role of causal inference in health services research II: a framework for causal inference

Hints & Kinks
Open access
Published: 12 February 2020

Volume 65, pages 367–370, (2020)
Cite this article

Download PDF

You have full access to this open access article

International Journal of Public Health

The role of causal inference in health services research II: a framework for causal inference

Download PDF

André Moser¹,
Milo A. Puhan¹ &
Marcel Zwahlen²

3246 Accesses
1 Citation
4 Altmetric
Explore all metrics

Introduction

In a previous Hints and Kinks, we discussed the role of causal inference in tasks of health services research (HSR) using examples from health system interventions (Moser et al. 2020). In the present Hints and Kinks, we more formally introduce a principled framework for causal inference. Specifically, we discuss in more detail the role of counterfactuals for the definition of a causal effect and the ‘association is not causation’ adage. We continue on the example of a hospital merger (HM) as a health system intervention.

Counterfactuals and causal effect

We introduced counterfactuals as hypothetical outcomes which are actually not observed in a real-world setting (Hernán 2004). We used an example of a HM, where we were interested in the causal question whether a HM reduces hospital readmissions (Moser et al. 2020). To answer this question, we need to define a causal effect, a statistical measure which relates probabilities of hospital readmissions when (1) every patient is treated under the situation of a HM versus (2) the HM would not have been implemented. Note that we never observe one of the two situations, because either the HM is implemented or not, but not both. We now introduce a formal notation for causal inference which allows us to mathematically define a causal effect.

For each patient, we would like to know his or her outcome (here, a hospital readmission) if the HM had not been implemented (denoted as Y^noHM) together with the outcome under the HM (denoted as Y^HM). The superscripts denote the counterfactual outcomes we can formalize, but which are actually not observed: Only Y^HM can be observed if the HM is implemented. An average causal effect in the study population can then be defined by the risk difference Probability(Y^HM = 1)–Probability(Y^noHM = 1), abbreviated as RD^Causal. Note that we could also use other risk measures, for example a relative risk, for the definition of a causal effect. The choice of the used effect measure depends on the research question because the underlying scale (i.e., an additive scale for a risk difference or multiplicative scale for a risk ratio) influences its final interpretation (Hernán and Robins 2020).

An important question remains: How can we assess an effect measure based on outcomes which are actually not observed? One could compare the outcomes in the region with HM to outcomes in a 'control' region with no HM. Table 1 shows hypothetical patients with (known) counterfactual outcomes and actually observed outcomes (denoted with subscriptsY_noHM, Y_HM, Y_Observed). For example, the patient with ID 5 was treated in the HM region with no observed hospital readmission (Y_Observed = 0). The observed outcome is equal to the counterfactual outcome in the HM region (Y_Observed = Y_HM = Y^HM = 0). Note that if this patient would have been treated in the control region, he or she would have had a readmission (Y^noHM = 1). Because this patient is actually only observed in the HM region, one will never observe the outcome of the control region (Y_noHM is missing). The mathematical notation for counterfactuals might be initially confusing, yet it is a necessary component for a causal inference framework.

Table 1 Study population of five patients

Full size table

What is the average causal effect in the study population from Table 1? We get that the risk difference RD^Causal is zero, because Probability(Y^HM = 1) = 3/5 and Probability(Y^noHM = 1) = 3/5. Thus, the HM does not reduce hospital readmissions.

Association versus causation

An associational effect measure generally compares risks in subsets of a study population by conditioning on certain study characteristics (see Fig. 1) (Hernán 2004). In the example of Table 1, one relates the risk of hospital readmissions among patients in the HM region with the risk among patients in the control region. Let us define

$$\begin{gathered} {\text{RD}}^{{{\text{Associational}}}} := {\text{Probability}}\left( {Y^{{{\text{Observed}}}} = {1\text{ among patients in the HM region}}} \right) \hfill \\ - {\text{Probability}}\left( {Y^{{{\text{Observed}}}} = {1\text{ among patients in the control region}}} \right), \hfill \\ \end{gathered}$$

as the associational risk difference in the study population. We obtain from Table 1 that the first expression of RD^{Associational} is 0 (two patients were treated in the HM region without an observed hospital readmission) and the second expression 1/3 (three patients were treated in the control region with one hospital readmission). Thus, RD^{Associational} is equal to 0–1/3 = –1/3, i.e., the risk of hospital readmissions in the HM region is lower compared to the risk in the control region.

The difference between the derived causal effect RD^Causal and the associational effect RD^{Associational} leads to the famous ‘association is not causation’ adage. Likely because of this adage, many researchers in HSR avoid any causal terminology, especially when they use ‘only’ observational data (Hernán 2018). They argue that the above comparison of outcomes between an ‘intervention’ and a ‘control’ region does not allow for any causal conclusions because the regions differ in several ways, for example, due to the case mix of treated patients, the skill-grade mix of medical personnel or the availability of health care services. When a study design randomly allocates patients before hospital entry to either the HM region or the control region (and patients and health care providers perfectly comply with that assignment), researchers would interpret statistical findings as causal. But in fact, many studies in HSR are observational studies without a random allocation of patients to treatment groups. Still, often only ‘descriptive’ and ‘modeling’ approaches are then used to support decision-making in health systems, even if the background is inherently causal. Whether the reported effect measure should be used from a causal inference approach or from descriptive and modeling approaches strongly depends on the intended HSR question.

How can researchers integrate ‘causality’ in HSR? Our above introduced components of a framework for causal inference is the backbone for modern causal inference. Modern causal inference allows for inference which mimics a situation as if patients would have been assigned by random allocation, despite using an observational study design. Topics for recent calls of causal inference approaches in HSR include, for example, comparative effectiveness research, payment scheme evaluations, health care utilization or the use of simulation studies (see Table 2). Principles of modern causal inference are described and explained in several textbooks (van der Laan and Sherri 2011; Pearl et al. 2016; Hernán and Robins 2020).

Table 2 Selected study examples using causal inferences approaches in health services research

Full size table

Discussion

In the present Hints and Kinks, we introduced components for a principled framework for causal inference in HSR. Because ‘causal inference’ is conceptually different from ‘description’ or ‘modeling’, HSR needs the integration of a causal inference framework which includes a specific notation, definitions and analysis techniques to extend the traditional tasks of ‘description’ and ‘modeling’. Public health decision-making which solely relies on associational effect measures might lead to inappropriate decisions because questions about optimal decision-making are inherently causal. We plea that students and researchers in the field of HSR are aware of the different available frameworks to successfully address ‘description’, ‘modeling’ and ‘causal inference’, depending on the intended research question.

References

Danaei G, García Rodríguez LA, Cantero OF et al (2018) Electronic medical records can be used to emulate target trials of sustained treatment strategies. J Clin Epidemiol 96:12–22. https://doi.org/10.1016/j.jclinepi.2017.11.021
Article PubMed PubMed Central Google Scholar
Dickerman BA, García-Albéniz X, Logan RW et al (2019) Avoidable flaws in observational analyses: an application to statins and cancer. Nat Med 25:1601–1606. https://doi.org/10.1038/s41591-019-0597-x
Article CAS PubMed Google Scholar
García-Albéniz X, Hsu J, Hernán MA (2017) The value of explicitly emulating a target trial when using real world evidence: an application to colorectal cancer screening. Eur J Epidemiol 32:495–500. https://doi.org/10.1007/s10654-017-0287-2
Article PubMed PubMed Central Google Scholar
Gaughan J, Gutacker N, Grašič K et al (2019) Paying for efficiency: Incentivising same-day discharges in the English NHS. J Health Econ 68:102226. https://doi.org/10.1016/j.jhealeco.2019.102226
Article PubMed Google Scholar
Hernán MA (2004) A definition of causal effect for epidemiological research. J Epidemiol Community Health 58:265–271. https://doi.org/10.1136/jech.2002.006361
Article PubMed PubMed Central Google Scholar
Hernán M (2018) The C-word: the more we discuss it, the less dirty it sounds. Am J Public Health 108:625–626. https://doi.org/10.2105/AJPH.2018.304392
Article PubMed PubMed Central Google Scholar
Hernán M, Robins J (2020) Causal inference: what if. CRC Press, Boca Raton
Google Scholar
Héroux J, Moodie EEM, Strumpf E et al. (2014) Marginal structural models for skewed outcomes: identifying causal relationships in health care utilization. Stat Med 33:1205–1221. https://doi.org/10.1002/sim.6020
Article PubMed Google Scholar
Kuehne F, Jahn B, Conrads-Frank A et al (2019) Guidance for a causal comparative effectiveness analysis emulating a target trial based on big real world evidence: when to start statin treatment. J Comp Eff Res 8:1013–1025. https://doi.org/10.2217/cer-2018-0103
Article PubMed Google Scholar
Moser A, Puhan MA, Zwahlen M (2020) The role of causal inference in health services research I: tasks in health services research. Int J Public Health. https://doi.org/10.1007/s00038-020-01333-2
Article PubMed PubMed Central Google Scholar
Murray EJ, Robins JM, Seage GR et al. (2018) Using observational data to calibrate simulation models. Med Decis Mak 38:212–224. https://doi.org/10.1177/0272989X17738753
Article Google Scholar
Neugebauer R, Fireman B, Roy JA et al (2012) Dynamic marginal structural modeling to evaluate the comparative effectiveness of more or less aggressive treatment intensification strategies in adults with type 2 diabetes. Pharmacoepidemiol Drug Saf 21:99–113. https://doi.org/10.1002/pds.3253
Article PubMed Google Scholar
O’Neill S, Kreif N, Grieve R et al (2016) Estimating causal effects: considering three alternatives to difference-in-differences estimation. Heal Serv Outcomes Res Methodol 16:1–21. https://doi.org/10.1007/s10742-016-0146-8
Article Google Scholar
Pearl J, Glymour M, Jewell NP (2016) Causal inference in statistics: a primer. Wiley, Hoboken
Google Scholar
Reed ME, Huang J, Brand RJ et al (2019) Patients with complex chronic conditions: health care use and clinical events associated with access to a patient portal. PLoS ONE 14:e0217636. https://doi.org/10.1371/journal.pone.0217636
Article CAS PubMed PubMed Central Google Scholar
Sofrygin O, van der Laan MJ, Neugebauer R (2017) Simcausal R Package: conducting transparent and peproducible simulation studies of causal effect estimation with complex longitudinal data. J Stat Softw. https://doi.org/10.18637/jss.v081.i02
Article PubMed PubMed Central Google Scholar
Sofrygin O, Zhu Z, Schmittdiel JA et al (2019) Targeted learning with daily EHR data. Stat Med 38:3073–3090. https://doi.org/10.1002/sim.8164
Article PubMed Google Scholar
van der Laan MJ, Sherri R (2011) Targeted learning—causal inference for observational and experimental data. Springer, New York
Google Scholar
Zhang Y, Young JG, Thamer M, Hernán MA (2018) Comparing the effectiveness of dynamic treatment strategies using electronic health records: an application of the parametric g-formula to anemia management strategies. Health Serv Res 53:1900–1918. https://doi.org/10.1111/1475-6773.12718
Article PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Hirschengraben 84, 8001, Zurich, Switzerland
André Moser & Milo A. Puhan
Institute of Social and Preventive Medicine, University of Bern, Mittelstrasse 43, 3012, Bern, Switzerland
Marcel Zwahlen

Authors

André Moser
View author publications
You can also search for this author in PubMed Google Scholar
Milo A. Puhan
View author publications
You can also search for this author in PubMed Google Scholar
Marcel Zwahlen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to André Moser.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Moser, A., Puhan, M.A. & Zwahlen, M. The role of causal inference in health services research II: a framework for causal inference. Int J Public Health 65, 367–370 (2020). https://doi.org/10.1007/s00038-020-01334-1

Download citation

Received: 16 July 2018
Revised: 17 January 2020
Accepted: 21 January 2020
Published: 12 February 2020
Issue Date: April 2020
DOI: https://doi.org/10.1007/s00038-020-01334-1

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The role of causal inference in health services research II: a framework for causal inference

Introduction

Counterfactuals and causal effect

Association versus causation

Discussion

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation