Introduction

Globally, breast cancer is the most common cancer in women, with more than 2 million new cases diagnosed in 2018 and 0.5 million deaths worldwide [1]. Approximately 25–30% of breast cancers harbor a gene mutation that produces an excess of the human epidermal growth factor receptor 2 (HER2) protein [2]. HER2-positive tumors tend to be more aggressive, with an increased probability of relapse and increased risk of brain metastases, and less likely to respond to treatment compared with other breast cancer subtypes [3,4,5]. HER2-positive breast cancer is associated with decreases in overall survival (OS) and time in remission [6]. Several treatment options specifically for HER2-positive breast cancer are available, including trastuzumab, pertuzumab, the antibody–drug conjugate ado-trastuzumab emtansine (T-DM1), and, more recently, trastuzumab deruxtecan and tucatinib. Other traditional chemotherapy used to treat this patient population following T-DM1 treatment includes eribulin, capecitabine, vinorelbine, and lapatinib [7,8,9]. These treatments have improved progression-free survival (PFS) and OS [10]. Nevertheless, most patients with HER2-positive breast cancer eventually do not respond to these treatments [11].

New treatments for HER2-positive breast cancer that offer longer survival benefits have been approved, but these treatments have different safety profiles. Little is currently known about patient preferences for the benefit and risk attributes associated with advanced breast cancer treatments or the willingness of patients with advanced disease to take on safety risks associated with continued treatment to obtain additional PFS benefits. Using a discrete-choice experiment (DCE), we aimed to quantify patients’ benefit-risk preferences for the attributes that differentiate HER2-targeted treatments for advanced or metastatic breast cancer, estimate the relative importance of efficacy and safety attributes evaluated in clinical trials, and calculate the minimum acceptable benefit (MAB) in terms of PFS to accept the risk of a given side effect. The sample included women with self-reported stage III/IV breast cancer from the United States (US), the United Kingdom (UK), and Japan. We also evaluated differences in preferences between subgroups defined by disease stage, prior treatment experience, HER2 status, whether the respondent has children, whether the respondent has ever worked in the medical field, and the respondent’s country of residence.

Materials and methods

Survey development

A DCE survey was administered to patients with self-reported stage III/IV breast cancer in the US, the UK, and Japan. Development of the survey followed good research practice guidelines [12,13,14] and Center for Devices and Radiological Health guidelines [15].

The survey instrument asked respondents to assume that, after visiting their doctor for a checkup, their doctor had told the respondent that their cancer was getting worse and they needed to start a new treatment. Respondents answered a series of nine DCE questions. Each question offered a choice between two hypothetical treatment profiles, Medicine A or Medicine B. The hypothetical treatments in the choice questions were defined by six attributes with a range of varying levels (Table 1). Figure 1 presents an example DCE question in which each hypothetical treatment profile reflected one of the levels for each attribute. Previous DCEs conducted in oncology identified the importance to patients of efficacy, including OS and PFS, mild-to-moderate adverse events (AEs) that affect daily quality of life, and the risk of severe AEs [16]. Consistent with the purpose of this study, attributes were selected to characterize the key benefit as well as important AEs associated with HER2-targeted therapies for advanced or metastatic breast cancer, including newer treatments and AEs that affect daily quality of life. The ranges of attribute levels were based on available data associated with approved and investigational HER2-targeted therapies and were described to correspond to the risk of serious AEs and to grades defined by the Common Terminology Criteria for Adverse Events [17,18,19,20,21,22,23]. Efficacy was described in terms of PFS, rather than OS, because PFS data were available for all treatments of interest at the time the survey was designed [18,19,20,21,22,23]. In addition, the survey instrument contained questions on the respondent’s disease and treatment experience, demographic questions, and questions to assess respondent comprehension of the treatment attributes and DCE questions. The Metastatic Breast Cancer Alliance, a patient advocacy group, reviewed and provided input on the study protocol. Before the survey was administered to the study samples, the survey instrument was qualitatively pretested in a series of telephone interviews, each lasting approximately 60 min, with 14 eligible participants in the US, 5 in the UK, and 5 in Japan. The pretests confirmed that the survey text and questions were easily understood and the attributes and levels were relevant and appropriately described.

Table 1 Attributes and levels for the discrete-choice experiment survey
Fig. 1
figure 1

Example discrete-choice experiment question

The profiles were experimentally designed to have statistical properties that allowed estimation of the main-effect preference weights of interest using a random-parameters logit (RPL) model [14]. The experimental design was generated in SAS 9.4 software (SAS Institute Inc.; Cary, North Carolina) using a D-optimal algorithm to construct a fractional factorial experimental design [24, 25]. The experimental design included 72 DCE questions, which were used to create eight blocks of nine DCE questions each. Respondents were randomly assigned to one block of nine questions.

A survey was considered complete and was included in the final sample for analysis if the respondent answered at least one DCE question. Respondents were excluded if they did not show variability in their answers to the DCE choice questions or if they completed the survey too quickly (in < 6 min). Missing data on DCE questions and respondents’ demographics were not included in the DCE or subgroup models.

Study population

Global Perspectives, an international market research firm with a speciality in healthcare, invited potential respondents in the US, the UK, and Japan via email to participate in the online survey instruments through Global Perspectives’ web-based partner panels. A quota sample of 200 respondents in the US, 50 in the UK, and 50 in Japan was targeted. Eligible individuals had a self-reported physician diagnosis of advanced breast cancer (stage III or IV), were aged ≥ 18 years, and were able to read and understand the language of the study country. All respondents provided electronic informed consent.

Statistical analyses

The DCE data for the US, the UK, and Japan were pooled and analyzed using an error component (EC) RPL model following good research practices [14]. Before pooling the data, RPL models were estimated for each country separately, but the small sample size for the UK prevented the model from converging and that for Japan precluded drawing conclusions because of wide confidence intervals. Preliminary models suggested that the US and UK respondents had similar preferences, but that Japanese respondents might place less relative importance on an increase in months of PFS. Using the approach of Hensher et al. [26], the final EC RPL model included a term equal to 1 if the data were from Japan and equal to 0 if the data were from the US and UK that was interacted with an effects-coded alternative dummy variable that was equal to 1 if the observation was associated with the first medicine alternative presented in the DCE question. The EC RPL model helps control for potential differences in preferences between Japan and the US and UK. The EC RPL model related the choices respondents made in the DCE survey to the differences in the attribute levels across the alternatives in each choice question [14]. All attribute levels were effects coded [27].

The model yields a set of relative preference weights for the attribute levels. Relative preference weights are a measure of the relative effect of an attribute level on the utility or preference for a hypothetical treatment. A Wald \({\chi }^{2}\) test was used to determine the statistical significance of differences between adjacent attribute levels. The conditional relative importance of each attribute, or the maximum change in utility achievable with that attribute relative to all other included attributes, was calculated as the difference between the EC RPL preference weights for the most- and least-preferred levels for that attribute. In other words, conditional relative importance indicates the overall relative importance of the attribute over the range of attribute levels included in the survey. Estimates from the EC RPL model were also used to calculate the MAB as additional months of PFS required for respondents to accept worsening in the levels of each of the AEs included in the survey.

To test for observable characteristics that may be systematically associated with differences in preferences, we explored preferences among subgroups defined by breast cancer stage, prior breast cancer treatment experience, HER2 status, whether respondents had children, employment in the medical field, and the respondent’s country of residence. For each mutually exclusive set of subgroups, we created a dummy variable that was equal to 1 if the respondent belonged to the subgroup and interacted the dummy variable with each of the variables for the attribute levels in the model. The parameter on each of these interaction terms can be interpreted as the difference between the two mutually exclusive subgroups. Separate subgroup models were estimated using the EC RPL model, and differences in preferences between subgroups were tested through a log-likelihood \({\chi }^{2}\) test of joint statistical significance of all the interaction terms (P < 0.05).

Results

Respondent characteristics

The final sample (Table 2) included 302 respondents (US, 200; UK, 52; Japan, 50) who met the inclusion criteria, provided informed consent, and whose surveys were considered complete. The mean patient age was 48 years; approximately 60% had stage IV breast cancer, 40% were HER2 positive, and 76% had children. Approximately 87% of the respondents were currently receiving treatment for their breast cancer; approximately 78% had received treatment for breast cancer in the past. The most common treatments differed by country.

Table 2 Demographic and disease characteristics of respondents (N = 302)

Preference weights and conditional relative importance

Preferences for attribute levels were ordered as expected, with better levels being preferred to worse levels. Figure 2 shows the normalized mean preference weight estimates for each attribute level. The change in utility associated with a change in the level of each attribute is represented by the vertical difference between the preference weights for those levels; larger differences between preference weights indicate that respondents viewed the change as having a relatively greater effect on overall utility. Most of the levels within each attribute were statistically different from one another (P < 0.05), except for the difference between a 0% and 1% risk of heart failure.

Fig. 2
figure 2

Attribute preference weights for respondents (N = 302). Note: The vertical bars surrounding each mean preference weight denote the 95% confidence interval of the point estimate (preference weights computed by the delta method for the level omitted in estimation for each attribute)

Figure 3 shows the conditional relative importance of changing each attribute from the least-preferred to the most-preferred level, scaled such that the change in PFS is set to 10. Over the ranges presented in the survey, patients placed the most importance on a change in PFS from 5 to 26 months, and the utility of the 21-month change in PFS was estimated to be statistically significantly higher than the changes in other attributes presented in the survey. This was followed by, in order, a 15% reduction in risk of heart failure, a 15% reduction in the risk of serious lung damage and infections, avoiding the possibility of severe liver function problems, avoiding severe nausea and vomiting, and avoiding severe diarrhea. The reduction in the risk of heart failure, risk of serious lung damage and infections, possibility of liver function problems, and avoiding severe nausea and vomiting were of approximately similar relative importance while avoiding severe diarrhea was statistically significantly less important than the reduction in the risk of all side effects.

Fig. 3
figure 3

Conditional relative attribute importance for respondents (N = 302). PFS progression-free survival. Note: The vertical bars surrounding each bar denote the 95% confidence interval (computed by the delta method). The bars are scaled such that PFS is set to 10, and the conditional importance of each of the other attributes is scaled relative to the conditional importance of PFS

Minimum acceptable benefit

MABs, shown in Fig. 4, provide another way to quantify the relative importance of changes from one level of an attribute to another. Specifically, the MAB is the number of months of PFS that would offset a change in another attribute. Accepting a 15% risk of heart failure required the largest MAB in additional months of PFS. Specifically, to accept a 15% risk of heart failure compared with 0% or 1%, respondents would require approximately a 5.8- or 5.6-month increase in PFS, respectively. The next largest MAB was an increase of approximately 4.6 months of PFS for moving from no risk of serious lung damage and infections to 15% risk, followed by an increase of approximately 4.2 months of PFS for moving from no liver function problems to possible severe liver function problems. The MAB to move from no nausea and vomiting to severe nausea was an increase of approximately 3.7 months of PFS, and the MAB to move from no diarrhea to severe diarrhea 4 or 5 days of the month was an increase of about 2.3 months of PFS.

Fig. 4
figure 4

Minimum acceptable benefit in months of progression-free survival for a change in adverse event severity or risk (N = 302). PFS progression-free survival. Note: The horizontal bars surrounding each bar denote the 95% confidence interval (computed by the delta method)

Subgroup analyses

Among the six subgroups evaluated, only the differences between women based on HER2 status, whether the respondent had children, and the respondent’s country of residence were statistically significantly different using a test for the joint significance of all the interaction terms (P = 0.044, P = 0.001, and P = 0.001, respectively).

In the subgroup analysis by HER2 status, both HER2-positive respondents (n = 122) and HER2-negative respondents (n = 152) placed the most relative importance on the change in PFS from 5 to 26 months. However, additional months of PFS were more important to respondents with HER2-negative disease relative to the other attributes than for respondents with HER2-positive disease (Fig. 5a).

Fig. 5
figure 5

Conditional relative attribute importance: Subgroup analyses. a Respondents with HER2-positive disease (n = 122) and respondents with HER2-negative disease (n = 152). b Respondents who do not have children (n = 72) and respondents who have children (n = 230). c US or UK respondents (n = 252) and Japan respondents (n = 50). HER2 human epidermal growth factor receptor 2, UK United Kingdom, US United States. Note: The vertical bars surrounding each relative importance weight estimate denote the 95% confidence interval (computed by the delta method). The bars for each subgroup are scaled so that the bar for each attribute is the percentage of the total importance of that attribute for the subgroup (scaled such that each attribute is divided by the total sum of relative attribute importance coefficients multiplied by 100 within each subgroup)

In the subgroup analysis by whether respondents have children, both respondents who did not have children (n = 72) and respondents who had children (n = 230) placed the most relative importance on the change in PFS from 5 to 26 months (Fig. 5b). However, relative to the other attributes, respondents with children placed more importance on increasing PFS than did respondents without children. Compared with respondents who had children, respondents without children had a stronger preference for avoiding liver function problems, diarrhea, and nausea and vomiting relative to the other attributes presented in the study.

In the subgroup analysis by country of residence, respondents in the US and UK (n = 252) and in Japan (n = 50) placed the most relative importance on the change in PFS from 5 to 26 months (Fig. 5c). However, relative to the other attributes, respondents residing in the US and UK placed more importance on increasing PFS than did respondents residing in Japan. Compared with respondents residing in the US and UK, respondents residing in Japan had a stronger preference for avoiding the risk of heart failure, liver function problems, and the risk of serious lung damage and infections relative to the other attributes presented in the study.

Discussion

New treatments for HER2-positive metastatic breast cancer offer the possibility of better efficacy but also carry risks of different side effects than existing treatments have. This DCE study explored the importance of changes in treatment benefit (PFS) relative to treatment-related AEs (nausea and vomiting, diarrhea, the possibility of liver function problems, the risk of heart failure, and the risk of serious lung damage and infections) among women with advanced breast cancer in the US, UK, and Japan. The treatment attributes and levels were selected based on clinical outcomes and data for new and existing treatments available for HER2-positive metastatic breast cancer. The study provides insights into the willingness, on average, of women from three countries to accept AE risks in exchange for additional months of PFS and the characteristics that affected preferences in our sample.

Given the set of attributes and the range of levels for these attributes included in the survey, patients generally placed more weight on PFS gain than on the potential risk of identified AEs when deciding between hypothetical breast cancer treatments. Specifically, the change in PFS from 5 to 26 months was the most important change to respondents relative to the improvements in treatment-related AEs presented in the survey for the sample as a whole and for each of the subgroups tested. As a measure of the importance of the treatment-related AEs, we calculated the additional months of PFS required to offset a change from the best to the worst level of each of the treatment-related AEs. In general, patients required an increase of approximately 2–6 months of PFS to accept the highest level of AE risk compared with no risk. Compared with no risk, to accept a 15% risk of heart failure required the largest amount of additional months of PFS (5.8 months), followed by a 15% risk of serious lung damage and infections (4.6 months), possible severe liver function problems (4.2 months), and severe nausea and vomiting (3.7 months), although the differences were not statistically significant. Compared with no diarrhea, accepting severe diarrhea required statistically significantly fewer additional months of PFS (2.3 months) compared with accepting a 15% risk of heart failure.

In subgroup analyses, we observed that, compared with women with HER2-positive breast cancer, women with HER2-negative breast cancer placed greater weight on PFS gains relative to the other attributes, although both groups placed the greatest weight on gains in PFS relative to other attributes. There are few targeted treatments for an HER2-negative population in comparison with HER2-positive disease, where the overall prognosis is much improved in comparison with patients with HER2-negative disease. Similarly, while both groups placed the most importance on PFS gains, compared with women who do not have children, women with children placed greater priority on PFS gains relative to the other attributes. It is challenging to speculate about the drivers for these differences, but these findings suggest that HER2 status and having children may play a role in how respondents weighed additional months of PFS relative to AEs. Anecdotal evidence from the pretest interviews suggested that women with young children prioritized increased PFS. In addition, respondents in the US and UK placed greater importance on increasing PFS than did respondents residing in Japan, who had stronger preferences for avoiding AEs and AE risks. A possible explanation for this finding is that respondents in the US and UK completed the survey before the coronavirus disease 2019 (COVID-19) outbreak, while respondents in Japan completed the survey during the outbreak, potentially enhancing their concerns about AE risks. Specifically, during pretest interviews for Japan, several participants mentioned the COVID-19 virus when discussing the attribute describing the risk of serious lung infections.

According to a review of DCE studies for cancer treatment, approximately 30% of health preference studies on cancer treatment related to breast cancer [16]. Multiple DCEs have explored patients’ preferences for breast cancer treatment and their willingness to trade off side effects against other treatment outcomes and burdens in both early stage and metastatic breast cancer [28,29,30,31,32,33]. To our knowledge, this study is the first to have explored respondents’ tolerance for certain safety risks associated with HER2-targeted treatment in exchange for slowing disease progression, as well as the first study to evaluate preferences for breast cancer treatments across the US, UK, and Japan [16].

The results of this study should be interpreted in the context of several limitations. All data were self-reported. Treatment features were based on HER2-targeted treatments for metastatic breast cancer, but the sample was composed of women with stage III and IV breast cancer with any HER2 status. While we did not observe any statistically significant differences between women with stage III and women with stage IV breast cancer in their preferences for the attributes evaluated in this study, these groups may have different treatment priorities and different preferences for treatment features not evaluated. Moreover, while OS remains the gold standard for the demonstration of clinical benefit to patients, PFS was selected as the benefit attribute because the PFS data were available for the treatments of interest when the study was designed. Evaluating the importance of OS in patient preferences for HER2-targeted treatments is a direction for future research, although there is generally a lack of consistent and comparable OS data, making it a more difficult attribute to analyze. In addition, DCE surveys can include only a limited number of attributes, and factors not evaluated in the survey may further influence preferences. Although there were no missing DCE data, some individuals did not respond to all demographic questions, hence variables used to identify subgroups may be incomplete. The treatment profiles were hypothetical and may not replicate the experience of patients talking to their doctor about treatments where other considerations may come into play, such as cost or access to care. The preferences of respondents residing in the US, UK, and Japan cannot necessarily be generalized to patients in other countries, nor can the preferences of respondents with stage III or IV disease be generalized to individuals with early-stage breast cancer.

The study used a quota sample recruited from online panels and may not be representative of all women with advanced breast cancer. The final survey was administered online. Research has shown that results from online stated preference surveys are, in general, not statistically significantly different from those elicited through face-to-face interviews or mail-based surveys but yield superior response rates [34,35,36,37]. However, the online setting of the survey also may have influenced respondents’ choices. Finally, respondents in Japan completed the survey during the COVID-19 outbreak. The survey did not contain questions about COVID-19, so we could not explore the extent to which the results for the sample in Japan were affected by the COVID-19 pandemic.

Conclusions

Preferences for HER2-targeted treatments for advanced or metastatic breast cancer among women in the US, the UK, and Japan vary by individual and disease-related characteristics. We found that patients generally valued PFS gain higher than the potential risk of identified AEs when deciding between hypothetical breast cancer treatments. The importance of increased PFS was greater for women who were HER2 negative, for those who had children, and for residents of the US and UK relative to residents of Japan. These findings provide insights into the trade-offs between slowing disease progression and treatment-related AEs that patients with breast cancer consider important for advanced or metastatic breast cancer treatments.