FormalPara Key Points for Decision Makers

In methotrexate (MTX)-tolerant patients, the addition of tocilizumab (TCZ) to the standard biologic, disease-modifying anti-rheumatic drug, rheumatoid arthritis treatment sequence in the UK was associated with similar costs and a modest improvement in quality of life per patient.

In patients contraindicated to MTX, for whom there are fewer available treatments, the estimated quality-of-life benefit was more pronounced.

Overall, the addition of TCZ was estimated to be a cost-effective strategy, with a lower incremental cost-effectiveness ratio if used at first compared with second line.

1 Introduction

Rheumatoid arthritis (RA) is a chronic, progressive and disabling inflammatory condition typically causing symmetrical chronic arthritis characterised by joint pain, stiffness and swelling. It affects approximately 0.5–1 % of the UK population and affects nearly three times as many women as men [1]. RA is associated with increased mortality, attributable at least in part to a higher risk of ischaemic heart disease as well to other factors, including infections related to co-morbidities, other systemic manifestations of the disease and immunosuppressive therapy [24].

Counting its direct, indirect and work-related disability costs, RA is estimated to cost the UK economy between £3.8 and £4.75 billion annually [5]. In early RA, these costs are driven by indirect costs, including the paid employment forgone by informal caregivers [6, 7]. As RA progresses and pain, discomfort and physical impairment worsen, healthcare utilisation and medication costs become the principal contributors to overall cost [8].

In the absence of a curative treatment for RA, the focus of RA treatment is currently the prevention or control of joint damage, minimisation of loss of function and potential disability, avoidance of pain and improvement of quality of life (QoL). Certain drugs such as glucocorticoids and non-steroidal anti-inflammatory drugs (NSAIDs) are effective in controlling RA symptoms; however, disease-modifying anti-rheumatic drugs (DMARDs), alone or in combination, are the mainstay of RA management, and are used to slow progression of disease and improve function. They are divided into two categories: synthetic DMARDs (sDMARDs)—including methotrexate (MTX), leflunomide, sulfasalazine, azathioprine, ciclosporin and hydroxychloroquine—and biologic DMARDs (bDMARDs)—including abatacept, adalimumab, certolizumab, etanercept, golimumab, infliximab, rituximab and tocilizumab (TCZ). bDMARDs are licensed for the treatment of RA, but their use in the UK is currently restricted to patients who have failed to respond to (or tolerate) at least two sDMARDs.

An important clinical subgroup encompasses those patients in whom bDMARDs cannot be given in combination with MTX [9]. Therefore, this analysis focuses on both combination treatment as well as biologic monotherapy.

Tocilizumab is a humanised monoclonal antibody against the interleukin-6 receptor. It is currently licensed for the treatment of RA and juvenile idiopathic arthritis (systemic juvenile idiopathic arthritis and polyarticular juvenile idiopathic arthritis) in combination with MTX, or as monotherapy in the case of intolerance to MTX or where continued treatment with MTX is inappropriate. A positive recommendation from the National Institute for Health and Care Excellence (NICE) and Scottish Medicines Consortium (SMC) in 2010 led to TCZ being reimbursed by the National Health Service (NHS) in the UK [10, 11].

The objective of this cost-effectiveness analysis was to determine whether the addition of TCZ at the beginning of a treatment pathway is cost effective in the treatment of moderate to severe RA.

2 Methods

2.1 Model Overview

The design of the economic analysis followed guidelines set by the Outcome Measures in Rheumatoid Arthritis Clinical Trials (OMERACT) Economics Working Group [12], and the structure was designed to closely represent clinical practice.

An individual sampling model, designed in MS Excel®, was used to analyse transitions of patients through a sequence of treatments and, by maintaining patient history, translate this into economic and QoL outcomes.

The analysis compared the cost effectiveness of different strategies containing a sequence of biologic therapies licensed for use in patients with moderate to severe RA who had had an inadequate response (IR) to one or more sDMARDs. Two scenarios were explored separately: patients contraindicated to MTX and those who were MTX tolerant.

We defined the current standard of care (SoC) as a sequence of bDMARDs. In the MTX-tolerant population, the first treatment was an anti-tumour necrosis factor alpha (aTNF) drug; certolizumab pegol has the lowest cost and therefore we assumed it would be offered first. Rituximab was offered following IR to the first aTNF, in accordance with the NICE guidance [13]. Thereafter, bDMARDs were ranked on the basis of their annual cost—assuming the one with the lowest cost would be used first. The MTX-contraindicated population model followed a similar ranking, including only treatments with a licence for monotherapy. Alternative sequences for the SoC were explored during the development of the model; the current one was demonstrated to be the most conservative.

We considered the addition of TCZ to the current SoC strategy, and explored its effect at first line and second line (see Tables 1 and 2).

Table 1 Strategies for MTX-contraindicated population
Table 2 Strategies for MTX-tolerant population (biologics + MTX)

The characteristics of the model cohort were based on data from the British Society for Rheumatology Biologics Register (BSRBR) [14]; they are consistent with the drug licence and are representative of the UK RA population (see Table 3). The average patient weight was assumed to be 70 kg [13, 15]. The analysis time horizon was patient lifetime.

Table 3 Baseline characteristics of model cohort

American College of Rheumatology (ACR) response probabilities were used to measure response to treatment as these are readily available from the randomised controlled trial (RCT) evidence. The model used a 6-month cycle length, which is in line with the timing of most RCT outcomes included in the analysis. Disease severity was represented by changes in Health Assessment Questionnaire (HAQ) score and Visual Analogue Scale (VAS) pain score, two surrogate health outcomes which can be translated to utility scores and ultimately to quality-adjusted life-years (QALYs). Once patients exhausted all treatments in the sequence, they moved into palliative care, where they remained until death. Patients could transition to death on the basis of a mortality risk adjusted for RA [16]. The possible transitions and the structure of the model are illustrated in Fig. 1.

Fig. 1
figure 1

Model structure, health states and transitions. ACR American College of Rheumatology, HAQ Health Assessment Questionnaire, QoL quality of life, sDMARD synthetic disease-modifying anti-rheumatic drugs, VAS Visual Analogue Scale

The analysis took a UK NHS perspective, with costs expressed in 2011/2012 pound sterling and health benefits measured in QALYs. Both costs and benefits were discounted at an annual rate of 3.5 %, and half-cycle correction was applied. The performance of alternative treatment sequences was estimated using incremental cost-effectiveness ratios (ICERs), defined as the added cost of a given strategy divided by its added benefit compared with the next most expensive strategy.

2.2 Treatment Efficacy

Efficacy in the model was determined by response to treatment, which was subsequently translated to a drop in HAQ score and change in VAS pain. Responders were categorised according to their level of ACR response—ACR00 (no response), ACR20, ACR50 and ACR70. The proportion of patients achieving each level of response differed by treatment and was derived through a network meta-analysis (NMA) of RCT data (see Table 4) [17, 18]. MEDLINE® and EMBASE® databases were searched simultaneously for articles published in English, from 1990 to October 2011, using a predefined search strategy. To allow comparison with the recent TCZ studies, the literature review focused on studies that included only patients with sDMARD inadequate response (sDMARD-IR). Twenty-two relevant studies were identified with ACR response rates at 24 weeks. Despite some variation in the patient characteristics across the identified studies (i.e. duration of disease), there were no observed systematic differences, indicating the feasibility of the indirect comparison. Because of limited data identified for aTNFs used in monotherapy, response data for all aTNF treatments were pooled, assuming exchangeability of the efficacy profile for those agents. An NMA was conducted (Bayesian framework) to synthesise the identified evidence and to obtain effect estimates for the bDMARDs. The probability of ACR response at 24 weeks for the bDMARDs is presented in Table 4. Rituximab was used in the model in an aTNF-IR position (in line with its label). Data from a study on aTNF-IR patients was used for the ACR response probability with rituximab [18].

Table 4 Key model parameters and assumptions

Response to treatment was assumed to impact disease severity and pain, as measured by HAQ and VAS pain score, respectively. The impact of response to treatment on HAQ and VAS pain score was assumed to be treatment independent. Patient-level data from three phase III clinical trials were pooled and analysed to determine the relationship of ACR response with HAQ (2,204 observations) and VAS pain score (2,342 observations) (details presented in Appendix A in the Electronic Supplementary Material) [1921]. These studies were selected because their respective patient characteristics were similar to those assumed in the economic model and patient-level data were available for analysis during the model development. Results suggest a negative correlation between both ACR response and HAQ score and ACR response and VAS pain score; that is, the higher the observed ACR response the greater the drop in HAQ and VAS pain scores (see Table 4).

The relationship between ACR response, HAQ and VAS pain was assumed to be the same across all treatments. A patient’s disease was assumed to remain stable whilst on biologic treatment and progress whilst on palliative care, an approach taken by previous evidence reimbursement submissions in the UK [10, 11] and a recent NICE multiple technology appraisal [22] (see Table 4). The level of VAS pain for patients in palliative care was assumed to be 55, on the basis of clinical expert opinion (AF, TH).

2.3 Treatment Discontinuation

Since patients were assumed to try all drugs in a given strategy, treatment discontinuation became an important driver of the model outcomes. An analysis of a large biologics data registry (n = 1,464 without rituximab sample) in Europe (Swiss Clinical Quality Management in Rheumatic Diseases [SCQM-RA]) showed that discontinuation can be response related. Since the SCQM-RA data set did not have ACR response evidence for many patients, we used European League Against Rheumatism (EULAR) response data as a proxy for this input [23]. A range of parametric survival models (Weibull, exponential, loglogistic, lognormal) were explored. We selected the best fitting model based on Akaike Information Criteria (AIC) and visual inspection of the resulting curves against the SCQM-RA Kaplan–Meier curves and UK observational data [24]. The loglogistic model had the best fit for bDMARD-naive patients, and the lognormal model for the bDMARD-IR patients. These were used as a base case while alternative models were tested in the sensitivity analysis (details of the treatment discontinuation analysis are presented in Appendix B in the Electronic Supplementary Material).

Little evidence exists on the long-term sustained benefit of treatment after patients withdraw because of a lack of efficacy. Some evidence suggests that a “rebound” occurs when therapy is withdrawn [25]. In our analysis, we assumed that HAQ worsening was equal to the initial HAQ improvement and that it occurred immediately at the point of withdrawal. That is, patients who withdrew because of a lack of efficacy immediately returned to their starting HAQ score, until they began their next treatment.

2.4 Treatment Safety

The safety of biologic treatments was derived from a recent systematic review and NMA [26]. The pooled results for all biologic treatments indicated that the odds of serious infection (defined as those associated with death, hospitalisation and use of intravenous antibiotics) were significantly greater than for control [odds ratio 1.37, 95 % confidence interval (CI) 1.04–1.82]. Singh and colleagues [26] also presented results for each individual biologic therapy, among which only the odds ratio for certolizumab pegol reached statistical significance (4.75, 95 % CI 1.52–18.65). In order to reflect this difference, the probability of serious infection for certolizumab pegol was derived from its individual odds ratio (4.75) and for all other biologics from the generic odds ratio (1.37). The resulting absolute risks, which were applied at every cycle whilst on a given treatment, are shown in Table 4.

The safety of drugs typically used as part of palliative care was also reviewed. A survey of clinical experts indicated that most patients who have failed all previous biologic treatments would be on a mixture of therapies including glucocorticoids and sDMARDs for disease management, selective and non-selective NSAIDs for symptom control, and opioids and analgesics for pain relief. Reviews of those treatments indicated that the most frequent adverse events (AEs) include dyspepsia, nausea, constipation, infections, headaches, dizziness, skin rash/allergy and alopecia [19, 2731]. The incidence of these events was combined with associated costs and utilities for inclusion in the economic model (details of the palliative care survey are presented in Appendix C in the Electronic Supplementary Material).

2.5 Health-Related Quality of Life

Change in HAQ, VAS pain and the associated utility gain were used in the model to determine the impact of treatment on overall health. Estimates of utility gain were calculated using a regression model for mapping HAQ scores to the EuroQol 5-Dimensions (EQ-5D) score [32]. This model represents an improvement to previous mapping algorithms in that it accommodates not only the ceiling effect and the gap in the data but also the observed multi-modality in EQ-5D. It also considers the effect of VAS pain, which is one of the most heavily weighted items in the EQ-5D. The EQ-5D score for each simulated individual was estimated based on the Alava model, the simulated HAQ score which is linked to ACR response, VAS pain scores from the clinical trials (Table 4), and patient age in each cycle.

For the disutility from serious AEs while on a bDMARD, we reviewed six of the largest studies identified from the Cochrane safety review [26]. We found that the most common and significant serious infection was pneumonia [19, 2731]. A utility for pneumonia was identified in the literature—0.21 [33]; it was adjusted for the expected duration of the event (7 days) and the baseline age and gender of the Sisk et al. [33] cohort. The resulting utility multiplier (0.9858) was combined with the incidence of serious infections and applied to the baseline utility derived from the Alava model.

Disutility for AEs associated with palliative care was identified through a targeted review of published economic evaluations, and multipliers were derived in the same fashion (details in Appendix D in the Electronic Supplementary Material). Utility multipliers were then combined with the incidence of each AE to estimate a weighted average utility multiplier for palliative care; note that this is slightly different for patients receiving MTX and those for whom MTX is contraindicated (Table 4).

2.6 Treatment-Related Costs

The treatment cost for each drug was sourced from the British National Formulary [35], and the dose schedules for each treatment were taken from the electronic Medicines Compendium (eMC) website [36] (see Table 5). For weight-based dosing, the average patient was assumed to weigh 70 kg. A patient access scheme (PAS) for certolizumab pegol was considered in the calculations; the manufacturer provides the first 12 weeks (ten pre-loaded 200-mg syringes) free of charge to all patients starting treatment.

Table 5 Treatment, AE and disease-related costs

Ten per cent of subcutaneous injection treatment was assumed to be administered by a district nurse. Monitoring patients who receive subcutaneously injected medicines was assumed to involve an outpatient visit or a general practitioner visit and certain examinations and tests, such as full blood count, erythrocyte sedimentation rate, C-reactive protein, liver function test, chest X-ray, urea, electrolytes and creatinine. The resource use assumed in this analysis was consistent with previous evidence reimbursement submissions [10, 11]. The cost of administering and monitoring intravenous drugs was assumed to be £171.33 per infusion [37].

The drug cost for palliative care was based on a weighted average of treatments (glucocorticoids, sDMARDs, selective and non-selective NSAIDs, opioids). No costs were assumed for administering palliative care, but monitoring costs were assumed to be high, driven by a greater number of primary and secondary care visits and blood tests.

2.7 Adverse Event-Related Costs

The cost for pneumonia was sourced from the NHS reference costs [38]. The costs of other AEs associated with drugs used in palliative care included those of dyspepsia and constipation. No costs were reported for experiencing nausea, hypertension, headaches, dizziness, skin rash/allergy or alopecia. These were synthesised with incidence figures and relative usage of different drugs from the expert survey to arrive at a set of values for palliative care (see Table 5).

2.8 Disease-Related Costs

In addition to treatments received and the monitoring included therein, many patients were assumed to require inpatient care associated with their RA. The model assumed six different bands of HAQ score to reflect mild, moderate and severe disease [39, 40]. Data from the Norfolk Arthritis Register (NOAR) in the UK were analysed such that mean days in hospital over a 12-month period could be estimated for each HAQ score category. Based on a cost of £240 per inpatient day, each HAQ score category was assigned a cost (see Table 5).

2.9 Sensitivity Analysis and Parameter Uncertainty

One-way sensitivity analysis (OWSA) was performed to identify the model drivers and the magnitude of their impact on the model results. In order to explore uncertainty around the model’s input, probabilistic sensitivity analysis (PSA) was also performed (1,000 samples).

3 Results

Aggregate results were based on 3,000 simulations of hypothetical patient pathways. Table 6 displays the cost-effectiveness results for the monotherapy strategies. The estimated total cost for the SoC strategy was £139,000; the total QALYs accrued were 8.0162 over an average 21.5 years in the model. The addition of TCZ to the SoC after sDMARD-IR increased the estimated total cost to £143,000 and produced 8.4987 QALYs. In the TCZ strategy, time on biologic treatment was extended by an average of 2.5 years compared with the SoC strategy. The ICER was estimated to be £7,300 per QALY gained. If TCZ was added in the sequence in later positions (e.g. second line), the costs and QALYs increased. The ICER was estimated to be £11,400 per QALY gained.

Table 6 Base-case cost-effectiveness results for monotherapy strategies

Table 7 presents the cost-effectiveness results for the combination treatment strategies. The three strategies were similar in terms of total costs and total QALYs gained. The number of patients receiving palliative care was reduced substantially across all comparator strategies, on average by 3 and 6 % of simulated individuals on TCZ or SoC strategy, respectively. In the SoC strategy, patients were estimated to stay on biologics for 21 years. In the strategy by which TCZ was offered first after sDMARD-IR, patients stayed on biologics for almost the duration of the model (21.5 years on average).

Table 7 Base-case cost-effectiveness results for combination therapy strategies

The OWSA is presented in Tables 8 and 9. Respective changes to model parameters have the same impact in both the monotherapy and the MTX-combination analyses. Overall, the model was sensitive to changes in the TCZ treatment cost. Changes to the assumed patient weight affected the TCZ cost and consequently the final ICER. Given that the results between the two comparator strategies are similar in the combination model, the increase in cost changes the conclusion of the cost-effectiveness analysis; the TCZ strategy no longer produces cost-savings and the result is a high ICER.

Table 8 One-way sensitivity analysis of cost-effectiveness results for monotherapy strategies
Table 9 One-way sensitivity analysis of cost-effectiveness results for combination therapy strategies

Assumptions around palliative care did not have a substantial impact on the overall cost-effectiveness results; neither did changes to the assumptions on AEs of biologics or ACR response rates [9]. The choice of discontinuation risk (time dependent or constant) and the choice of QoL mapping model had a moderate effect on the model results.

Figures 2 and 3 present the results of the PSA with cost-effectiveness acceptability curves for the monotherapy and the combination with MTX analyses. In both cases, the probability of cost effectiveness was estimated to be higher when TCZ was added first in the treatment sequence. Details of the parameters, ranges and distributions assumed for the PSA are presented in Appendix E in the Electronic Supplementary Material.

Fig. 2
figure 2

Cost-effectiveness acceptability curve for monotherapy strategies. SoC standard of care, TCZ tocilizumab

Fig. 3
figure 3

Cost-effectiveness acceptability curve for combination treatment strategies. SoC standard of care, TCZ tocilizumab

4 Discussion

To our knowledge, this is the first analysis investigating the cost effectiveness of adding TCZ to SoC, compared as monotherapy or in combination with MTX, in treating adults with moderate to severe RA in the UK. The analysis was based on the reimbursement application to NICE in the UK; all assumptions follow the NICE reference case [41].

Our economic evaluation synthesised evidence of patient characteristics of a UK population [14], evidence from an NMA of randomised clinical trials for drug efficacy [17], registry data analysis for discontinuation (SCQM-RA), and cost estimates from the NHS UK perspective, with a PAS where applicable. With regard to QoL estimates, as noted in the “Methods” section, the analysis used a mapping model that the authors here consider an improvement on previous algorithms [25, 4244].

In both populations (monotherapy and MTX combination), the addition of TCZ to the current treatment sequence in the UK appeared to be a cost-effective strategy. However, its impact in QALYs gained was more pronounced in the monotherapy than in the combination model. Part of this was due to the additional benefits in QALYs driven by TCZ efficacy, and another part was due to the displacing of time on palliative care. Since the latter is more likely to happen in the monotherapy model, where there are fewer treatment options for an RA patient, the impact is more pronounced there and less in the combination treatment model, where, on average, patients will spend significantly less time on palliative care, in the last months of their lives.

The assumptions on the risk of discontinuation were an important driver of the model, especially since we assumed that patients would try all treatments in a strategy. We ran a post hoc analysis of observational data to estimate the probability of discontinuation (assumed time dependent). It is a limitation of this study that we did not have access to a UK registry for this analysis, and therefore the input data could suffer from generalising from a Swiss cohort of patients. Nevertheless, when we compared the data with evidence from Hyrich et al. [14] and Soliman et al. [24] from the UK, the resulting risk was found to be similar. Another limitation regarding the SCQM-RA data analysis is that because of a very small number of ACR response data, we used EULAR response as a proxy.

Our model used the ACR response criteria to weigh the benefit of treatment (HAQ, VAS pain scores and QALYs). Ideally, the model could use Disease Activity Scores using 28 joints or EULAR response, in line with NICE decision rules. The use of ACR response criteria ensures a wider choice of studies that inform our NMA and, on balance we considered this to be more important for the economic model input.

With regard to the treatment cost calculations, after the addition of administration and monitoring costs, the average total cost for a biologic was just over £10,000 per year. Since our comparison was one of treatment strategies, it was the cost of TCZ (solely based on patients’ weight) that had the most impact on the cost-effectiveness results—as shown from the sensitivity analysis.

The research conducted on the cost of palliative care (patient monitoring and safety) added to the completeness of the overall analysis, but had no notable impact on the model results; its overall cost was very low compared with the add-on bDMARD (TCZ). At the development phase of the palliative care survey, we anticipated greater convergence amongst the individual responses. However, we found large variation amongst responses, and this prohibited any statistical analysis or summarisation of the survey findings. This could be due to methodological issues: the type of questions, the format of the interview, the rounds of revision or clarifications, etc. It also appears likely that since clinicians are less and less likely to face those cases (patients who run out of treatment options), they tend to consider extreme examples in their responses. Furthermore, clinical practice is often determined by the number of treatments available to clinicians, which may vary a lot from one jurisdiction to another. Since the data survey on palliative care involved only six clinical experts and responses varied significantly, the results should be interpreted with caution.

Further research is necessary for a number of parameters of the economic model. In particular, with regard to discontinuation, one could explore different hypotheses that test treatment-related discontinuation, age, gender and other patient characteristics. Our analysis assumed that discontinuation was response related, which fitted well with our model structure and produced plausible results. Furthermore, as new and more bDMARDs become available to RA patients, research could identify subgroups where treatments are more efficacious, potentially producing more cost savings for the healthcare provider.

Overall, although we found that the model was sensitive to changes that affect the cost of TCZ and time on treatment, we believe that further refinement of those parameters would not significantly change the analysis conclusions.

5 Conclusion

The addition of TCZ to the SoC is estimated to be a cost-effective strategy in the treatment of patients with moderate to severe RA. In 1,000 samples in the PSA, the strategy with TCZ at first line had a higher estimated probability of being cost effective than TCZ at second line.