Original Article
Strong instrumental variables biased propensity scores in comparative effectiveness research: A case study in oncology

https://doi.org/10.1016/j.jclinepi.2023.01.002Get rights and content

Abstract

Background and Objectives

Some medications require specific medical procedures in the weeks before their start. Such procedures may meet the definition of instrumental variables (IVs). We examined how they may influence treatment effect estimation in propensity score (PS)-adjusted comparative studies, and how to remedy.

Study Design and Setting

Different covariate assessment periods (CAPs) did and did not include the month preceding treatment start were used to compute PS in the French claims database (Sytème National des Données de Santé-SNDS), and 1:1 match patients with metastatic castration resistant prostate cancer initiating abiraterone acetate or docetaxel. The 36-month survival was assessed.

Results

Among 1, 213 docetaxel and 2, 442 abiraterone initiators, the PS distribution resulting from the CAP [-12; 0 months] distinctly separated populations (c = 0.93; 273 matched pairs). The CAPs [-12;-1 months] identified 765 pairs (c = 0.81). Strong docetaxel treatment predictors during the month before treatment start were implantable delivery systems (1% vs. 59%), which fulfilled IV conditions. The 36-month survival was not meaningfully different under the [-12; 0 months] CAP but differed by 10% points (38% vs. 28%) after excluding month −1.

Conclusion

In the setting of highly predictive pretreatment procedures, excluding the immediate pre-exposure time from the CAP will reduce the risk of including potential IVs in PS models and may reduce bias.

Introduction

Nonexperimental and pharmacoepidemiological studies relying on longitudinal health care databases are considered as a valuable complementary source of evidence to randomized controlled trial (RCT) evidence. Measurement issues aside, inferring causality from nonexperimental studies requires adjustment for confounding, i.e., the potential distortion of the true relationship between the assessed drug and the outcome of interest that may result from imbalance in outcome predictors between the treatment groups [[1], [2], [3]]. Propensity scores (PSs) are frequently used to minimize confounding in the analysis of health care databases as they can adjust for many proxies of the patients’ health status observable in secondary data [4]. A variety of ways to implement PS analyses in nonexperimental studies exists, including matching, weighting, stratification, and modeling [[5], [6], [7], [8]]. A PS is defined as the probability for a patient to start a treatment according to its baseline characteristics. Under some assumptions [5], the PS can act as a balancing score that mimics randomization in RCTs conditional on the observable factors, resulting in comparative cohorts with similar distribution of baseline characteristics.

At the time of comparing the effect of two treatments, pharmacy dispensing records and by extension their associated claims are considered reliable markers for drug exposure [9]. Consequently, in comparative cohort studies based on claims data, dates of initial dispensing are usually used as the beginning of exposure, as well as the end of the covariate assessment period (CAP) essential to the construction of a PS [6,10]. However, it is worth remembering that in its original definition, PS refers to the “probability of assignment to a particular treatment” [5], and so, to the time when the decision to treat was made. In some situations, this decision to treat may occur well before the actual treatment onset because of a specific pretreatment check (e.g., specific laboratory tests), or drug administration-related medical procedure (e.g., implantable chamber for chemotherapy). In such situations, treatment-specific services occurring in patient journeys before the treatment start but after the decision to treat was made may be strongly associated with the exposure (i.e., treatment proxies) but poorly associated with patient characteristics. Measurements of such pretreatment services therefore have properties similar to instruments (or instrumental variables–IVs) [11].

Adjusting for preexposure patient characteristics that have properties of instruments may amplify any residual confounding [[12], [13], [14]], especially strong treatment predictors [15]. In practical settings, it is sometimes difficult to identify such IVs and if so then the impact on treatment effect estimates remains unclear. Although the theoretical process of variable selection in PS construction have been broadly discussed [13,[16], [17], [18]], few illustrative examples of results issued from designs based on an IV-biased PS have been published and contextualized. The objective of this article is to illustrate a situation of strong treatment predictors in oncology through a comparative cohort study conducted in the French Nationwide health care database (Sytème National des Données de Santé–SNDS) [19,20], to exemplify how including IVs in a PS can alter the composition of the comparator groups and impact effect estimates, and to introduce solutions to address this issue.

Section snippets

General design

As part of the project “TherapeutiC strAtegy in MEtastatic castration-Resistant pRostate cAncer” (CAMERRA), a comparative cohort study was setup to assess the effectiveness of two first-line treatments used for metastatic castration–resistant prostate cancer (mCRPC) management in association with prednisone/prednisolone—abiraterone acetate and docetaxel [19,20]. Patients with mCRPC and initiating abiraterone acetate or docetaxel as first-line treatment in 2014 were identified and extracted from

Population

A total of 386,127 prevalent prostate cancer cases were identified in the French population covered by the general health insurance scheme in 2014, including 12,951 mCRPC. Among them, 1, 213 initiated docetaxel and 2, 442 abiraterone acetate, with a similar distribution of initiation according to calendar months (chi-Square test P-value = 0.52).

High-dimensional propensity score assessment

The two different CAPs–whose only difference is the ending point–were used to compute the hdPS and to execute the 1:1 matching on hdPS ±0.01.

The CAP

Discussion

In this comparative cohort study, two different treatments against prostate cancer were compared using a PS matching strategy. The CAP running over the year before treatment initiation led to a poor overlap of the populations to compare. The exclusion of the month prior to treatment initiation drastically improved the overlap of the PS distribution between the two groups.

The limited overlap between populations as observed on Figure 2 should not stay unnoticed when performing comparative

Conclusion

The current work illustrated that strong treatment predictors that act like IVs may be frequent in select conditions especially when comparing different treatment modalities like intravenous vs. oral oncology treatment, and their adjustment in PSs or other models may lead to biased findings. Excluding the immediate pre-exposure time from the pre-exposure CAP is one option to reduce the risk of bias. However, the suitability of this approach, and where appropriate, the period of time to be

References (33)

  • S. Schneeweiss et al.

    High-dimensional propensity score adjustment in studies of treatment effects using health care claims data

    Epidemiol

    (2009)
  • P.C. Austin

    An introduction to propensity score methods for reducing the effects of confounding in observational studies

    Multivariate Behav Res

    (2011)
  • E. Williamson et al.

    Propensity scores: from naive enthusiasm to intuitive understanding

    Stat Methods Med Res

    (2012)
  • P. Blin et al.

    Comparative effectiveness and safety of standard or reduced dose dabigatran vs. Rivaroxaban in nonvalvular atrial fibrillation

    Clin Pharmacol Ther

    (2019)
  • J. Pearl
  • D.B. Rubin

    Estimating causal effects from large data sets using propensity scores

    Ann Intern Med

    (1997)
  • Cited by (0)

    Conflicts of interest: NH Thurin, J Jové, R Lassalle, M Rouyer, S Lamarque, P Blin, C Droz-Perroteau are researchers at Bordeaux PharmacoEpi, a research platform of the Bordeaux University and its subsidiary the ADERA, which performs financially supported studies for public and private partners in compliance with the ENCePP Code of Conduct. P Bosco-Levy is now employed by Horiana, Bordeaux, France. C. Segalas has declared no conflicts of interest. Dr. Schneeweiss (ORCID# 0000-0003-2575-467X) is participating in investigator-initiated grants to the Brigham and Women's Hospital from Boehringer Ingelheim and UCB unrelated to the topic of this study. He is a consultant to Aetion Inc., a software manufacturer of which he owns equity. He is an advisor to Temedica GmbH, a patient-oriented data generation company. His interests were declared, reviewed, and approved by the Brigham and Women's Hospital in accordance with their institutional compliance policies.

    Funding: The authors received no financial support for the research, authorship, and publication of this article. However, the example used was drawn from the TherapeutiC strategy in Metastatic castration-Resistant pRostate cAncer: target population and changes between 2012 and 2014 (CAMERRA) study, which was funded by Janssen-Cilag, France and carried out by the Bordeaux PharmacoEpi platform under the supervision of a Scientific Committee.

    Author contributions: NH Thurin contributed to conceptualization, investigation, methodology, validation, roles/writing—original draft; J Jové contributed to data curation, formal analysis, methodology, software, visualization, roles/writing—original draft; R Lassalle contributed to methodology, validation, writing—review and editing; M Rouyer contributed to conceptualization, investigation, project administration, writing—review and editing; S Lamarque contributed to conceptualization, investigation, project administration, writing—review and editing; P Bosco-Levy contributed to methodology, writing—review and editing; C Segalas contributed to methodology, visualization, roles/writing—original draft; S Schneeweiss contributed to methodology, writing—review and editing; P Blin contributed to conceptualization, investigation, methodology, writing—review and editing; C Droz-Perroteau contributed to project administration, supervision, writing—review and editing.

    View full text