A call to action to harmonize patient-reported outcomes evidence requirements across key European HTA bodies in oncology
Abstract
Patient-reported outcome (PRO) data are increasingly being included in Health Technology Assessment (HTA) submissions for oncology drugs. This study aims to provide differences in PRO evidence requirements in oncology across key HTA bodies and calls for its harmonization. Method guidance provided by HTA bodies in Germany, France and the UK, and analysis of HTA reports of 20 oncology case studies were evaluated in this review. Differences exist between HTA bodies regarding guidance on how PRO data should be collected, reported and analyzed as well as how the data are reviewed and considered in oncology HTAs. HTA bodies can play a key role to harmonize PRO method guidance in collaboration with regulators and sponsors.
Plain language summary
Patient-reported outcomes (PRO) are information provided directly by the person who is experiencing a disease or undergoing a treatment, without additional interpretation by a clinician or caregiver. Along with other outcome measures, PROs may be included in the body of evidence used by health technology assessment bodies in their review. In this article, the authors summarize the guidance documents published by key health technology assessment agencies and reviewed 20 past cancer drug case studies to understand how different agencies use PROs when deciding on recommendations for new cancer treatments.
Historically, key end points in oncology trials were restricted to survival and tumor reduction [1]. However, in the era of patient-focused drug development, developers of new drugs and biologics for the treatment of cancer are increasingly collecting information directly from patients about outcomes that are important to them in defining treatment success using patient-reported outcomes (PROs) [1]. PROs are reports about the status of a patient's health condition or treatment that come directly from the patient [2]. PROs can measure a variety of concepts, including health-related quality of life (HRQoL), functioning (e.g., physical, psychological and social), symptoms (including disease and treatment related), patient satisfaction, treatment adherence and utility [2,3]. It is a unique way of capturing how the patient feels or functions, which may not be fully captured through traditional end points such as overall survival (OS), progression-free survival (PFS), biomarker measures or adverse events (AEs) [2]. The European Medicines Agency (EMA) [2], US FDA [4], International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use (ICH) [5] and the UK's Medicines and Healthcare Products Regulatory Agency (MHRA) [6] all consider PRO data derived robustly using fit-for-purpose PRO instruments as key components of decision-making during the benefit–risk appraisal of new drugs and biologic products in oncology.
PROs provide key perspectives from patients to inform the benefit–risk profile of new products, especially in oncology trials in which benefits are often based on surrogate end points when OS data are immature or in which treatment benefit is associated with additional safety risks [7–9]. Here, PROs are used as secondary or exploratory end points and, in some cases, in the statistical testing hierarchy to measure the impact of drug from the patient's perspective [10].
PROs have been reported not to play such an important role in health technology assessments (HTAs), however [11]. Most HTA bodies assess new drugs using comparative clinical benefit assessment and health economic assessment [12–14], but the opportunities for PRO data within these assessments is not entirely clear or consistent across HTA bodies. Given the increase in PRO data generation by developers of new drugs and biologics for the treatment of cancer, it is important that these developers have clear and consistent guidance as well as precedence from HTA bodies to inform their PRO strategy, implementation, analysis and submissions. The authors evaluated method guidance published by the HTA bodies in the three largest European oncology markets with well-established HTA processes – Germany's Gemeinsamer Bundesausschuss (G-BA) and Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen (IQWiG), the UK's National Institute for Health and Care Excellence (NICE) and France's Haute Autorité de Santé (HAS) – and critically appraised HTA reports of selected case studies in both solid and hematological tumors to identify the similarities and differences in how PROs were assessed by the HTA bodies.
Methods
Method guidance documents and HTA oncology case studies from G-BA/IQWiG, NICE and HAS were analyzed to evaluate published information relating to several areas of interest, including trial design considerations such as blinding and multiplicity adjustment requirements for PROs, PRO data collection and reporting requirements such as preferred instruments, missing data acceptance threshold, post-progression data collection and PRO data analysis such as clinically meaningful change threshold, preference for time-to-event end point definitions and missing data sensitivity analysis.
As a first step, HTA method guidance, position papers and technical support documents were identified from the HTA bodies' websites and searched for any information relating to PROs, using keywords including ‘patient-reported-outcomes,’ ‘PRO,’ ‘PROs,’ ‘quality of life,’ ‘HRQoL,’ ‘EQ-5D,’ and ‘patient experience.’ The HTA method guidance, position papers and technical support documents with details on PROs were then reviewed to evaluate areas of interest described above. Table 1 shows a list of method guidance that contain information related to PROs.
Country | HTA body | Method guidance | Ref. |
---|---|---|---|
Germany | IQWiG | • General methods version 6.0, Nov 2020 • General methods version 6.1, Jan 2022 | [15,16] |
G-BA | • G-BA rules of procedure, chapter 5, Jan 2011 • G-BA submission dossier module 4, Feb 2019 | [17,18] | |
France | HAS | • Transparency Committee Doctrine, Principles of medicinal products assessment and appraisal for reimbursement purposes, Dec 2020 • Authorization for early access to medicinal products: HAS assessment doctrine, Jun 2021 • Choices in Methods for Economic Evaluation | [19–21] |
England | NICE | • Guide to the methods of technology appraisal 2013 • Position statement on the use of EQ-5D-5L value set for England, Oct 2019 • NICE DSU Technical Support document: The use of health state utility values in decision models • NICE health technology evaluations: the manual | [22–25] |
The second step involved analysis of published HTA reports of selected case studies by the four HTA bodies between October 2015 and November 2020. This analysis was conducted between December 2020 and February 2021 using IQVIA's proprietary database HTA Accelerator [26], which was used in multiple peer-reviewed publications [27–29]. HTA Accelerator contains more than 36,000 HTA report publications covering 40 countries and 100 HTA bodies published since 2011. Data in HTA Accelerator are collated from publicly available HTA reports and structured into a framework of 250 available data elements that also includes information related to PROs. Details related to evaluation of the PRO evidence in these HTA reports on the topics of the role of PRO data in HTA as well as trial design considerations and PRO data collection, reporting and data analysis recommendations as noted earlier were captured. On the basis of these available details, 20 case studies covering more common indications of interest to the study sponsor in both solid and hematological cancer were selected, including lung cancer, ovarian cancer, cervical cancer, breast cancer, prostate cancer, melanoma, multiple myeloma and non-Hodgkin lymphoma. After excluding non-submissions by companies and ongoing assessments by HTA bodies, a total of 70 published assessments across the four HTA bodies for the 20 case studies were retrieved (Appendix Table 1 & 2).
Results
HTA method guidance
HTA bodies in Europe follow different assessment methodologies, mostly based on clinical comparative effectiveness and cost–effectiveness analysis [12–14]. The details on trial design, PRO data collection, reporting and analysis in the method guidance for the 4 HTA bodies in scope are summarized in Table 2.
Particulars | G-BA/IQWiG [15,16,30] | HAS [19,21] | NICE [22,23,25] | |
---|---|---|---|---|
Trial design | Open-label study | Open-label studies are of limited validity but acceptable in situations when blinding of physicians and patients is not possible | PRO data from open-label studies are not acceptable. Double-blinded conditions are required for demonstrating improvement in HRQoL that leads to clinical added value | No guidance is provided on acceptance of PRO data from open-label studies |
Multiplicity adjustments | No direct commentary is provided regarding multiplicity adjustment requirements in the method guidance | Management of multiplicity analyses is required in submission | No direct commentary is provided regarding multiplicity adjustment requirements in the method guidance | |
PRO data collection and reporting | Preferred instruments | Use of validated or established instruments is a prerequisite. In addition to the use of a generic instrument, disease-specific instruments to determine HRQoL in clinical studies should be applied | Use of validated instruments appropriate to the objective (preferentially specific) | There is a preference for EQ-5D instrument for deriving utility values; however, other methods are acceptable if data collection using EQ-5D is unsuitable as documented in the updated NICE manual |
Missing data threshold | Results are not acceptable if proportion of study participants not included in the analysis is >30% | Limited missing data is required to demonstrate improvement in HRQoL; however, no specific missing data threshold is mentioned in method guidance | No clear guidance is provided on acceptable threshold for missing data | |
Post-progression data collection | Data collection is preferred until dropout or death | There is no mention of post-progression PRO data collection in the method guidance | ||
PRO data analysis | Clinically meaningful change threshold | Outcome considered clinically meaningful if responder definition is ≥15% of the scale range | No specific threshold in method guidance is specified | |
Preference for time-to-event end point definitions | Preference is given for time-to-event analysis with use of time to deterioration or time to improvement for HTA benefit assessment | No preference in method guidance is specified | ||
Missing data sensitivity analysis | Sensitivity analysis is used to explore robustness of missing-at-random assumption | No analysis method in method guidance is specified | Sensitivity analysis needs to be used to demonstrate the use of alternative utility values if EQ-5D data are available from both trial and literature |
Role of PRO data in HTA
The impact of PROs on HTA decision-making is clearest in Germany. In assessments by the IQWiG and G-BA, PRO data alone can result in added benefit based on patient-relevant end points such as morbidity and HRQoL [30,31]. This was observed in seven of the 20 case studies reviewed in this study. For example, in the assessment of crizotinib for advanced anaplastic lymphoma kinase (ALK)-positive non-small-cell lung cancer (NSCLC), a hint of minor added benefit in the final G-BA assessment was supported by significant improvement in symptom scales of European Organization for Research and Treatment of Cancer 30-item QoL Questionnaire – Core 30 (EORTC QLQ-C30) and European Organization for Research and Treatment of Cancer 13-item module, the QoL Questionnaire – Lung Cancer 13 (EORTC QLQ-LC13) as well as HRQoL scales of EORTC QLQ-C30. This is despite being an open-label study and OS data not being statistically significant versus cisplatin in combination with pemetrexed or carboplatin in combination with pemetrexed [32]. Details of other case studies are included in Supplementary Appendix Table 3.
In the UK, NICE's decision-making is driven by cost–effectiveness analysis (CEA) in which the EQ-5D: EuroQol – 5 Dimension (EQ-5D) is a key input to determine health state utilities [22]. For example, in the assessment of pembrolizumab for advanced/unresectable metastatic melanoma, although both EORTC QLQ-C30 and EQ-5D data were collected by the manufacturer in the KEYNOTE-006 trial, only EQ-5D data were used by NICE in its appraisal to derive utility values. In comparison, in the G-BA assessment, minor improvements in fatigue, nausea and vomiting scales and the social functioning scale as measured by the EORTC QLQ-C30 supported a considerable added benefit rating in some subgroups. In the NICE report, details on the impact of PRO data other than the EQ-5D to inform the CEA are limited [33,34].
HAS Transparency Committee guidance states that besides efficacy and safety data, evidence demonstrating an improvement in HRQoL can lead to a higher clinical added value rating [19]. Its importance is also stated in the early access HAS doctrine, in which HRQoL is included as a part of the efficacy demonstration [20]. However, in oncology HTAs, PRO data are often not considered by HAS due to several methodological issues [11]. In the case studies analyzed, PRO data were considered only in two cases: abiraterone in metastatic castration-sensitive prostate cancer (mCSPC) and castration-resistant prostate cancer (mCRPC) [35,36]. In the mCSPC assessment, median time until deterioration of quality-of-life measured using FACT-P was longer in the abiraterone group (12.9 months) versus placebo (8.3 months). However, HAS commented that the quality-of-life data measured via FACT-P and other instruments such as BPI-SF and BFI should be interpreted with caution due to the exploratory nature of the analysis and the large number of missing data, for which approximately 50–70% of the questionnaires were completed during treatment, depending on the group from cycle 17 onward [35]. In the mCRPC assessment, the median time until deterioration of quality of life measured using FACT-P was longer in the abiraterone group (12.7 months) versus placebo (8.3 months) [36]. Of the other 17 case studies analyzed, PRO data were rejected in 13 cases due to open-label design, missing data or exploratory nature of HRQoL analysis with no method of controlling alpha risk inflation related to multiplicity. PRO data were not mentioned in four cases [26].
Specific requirements for HTA body to accept PRO data
Trial design
Open-label study
Although IQWiG guidance mentions that PRO data from open-label studies are of limited validity, in practice both G-BA and IQWiG accept PRO data from open-label oncology studies to inform benefit ratings. This was observed in all 12 case studies analyzed in this study in which PRO data submitted were from open-label studies. However, the extent of benefit is downgraded to a hint only [15,26]. Similarly, NICE accepts PRO data from open-label studies as observed in all 10 case studies analyzed [26,37]. On the contrary, HAS does not accept PRO data from open-label studies, as double-blinded condition is one of the criteria for quality-of-life data to be considered in clinical added benefit assessment [19]. This is further confirmed by case studies assessed in which PRO data of all 11 case studies from open-label trials were rejected by HAS [26].
Multiplicity adjustment
The need to prospectively plan for multiplicity adjustment for PRO data was not directly mentioned in guidance for G-BA, IQWIG and NICE [15,17,18,22,25]. However, the HAS Transparency Committee guidance mentioned management of multiplicity of analyses as a key factor for acceptance of HRQoL data [19]. In seven case studies researched, PROs were rejected due to a lack of method to control for type I error due to multiplicity. For example, this was one of the key reasons for rejection of PRO data in the assessment of pomalidomide plus bortezomib plus dexamethasone for adult patients with multiple myeloma [38].
PRO data collection & reporting
Preferred instrument
G-BA and IQWiG accept both generic and disease-specific PRO instruments for benefit assessment if they are validated or established. Cost–effectiveness analysis is not a part of G-BA and IQWiG assessments and hence only the visual analog scales of the EQ-5D instrument are considered as a measure of patient-reported morbidity [15,30]. HAS requires the HRQoL benefit to be demonstrated with the use of validated scales appropriate or specific to the objective and prefers EQ-5D or Health Utility Index for cost–utility analysis [19,21,39].
NICE assesses PRO data mostly as part of its economic evaluation to derive utilities and EQ-5D is the preferred instrument. The EQ-5D-3L value set is considered the reference case and all EQ-5D-5L data should be mapped onto the EQ-5D-3L value set using mapping function developed by Van Hout et al. [23,40]. In the absence of EQ-5D data collection in trials, EQ-5D data can be sourced from the literature, if justification for choosing a particular data set is clearly explained. Alternatively, EQ-5D data can be estimated by mapping other HRQoL measures or health-related benefits observed in the relevant clinical trials to EQ-5D [22,23]. For example, in the assessment of carfilzomib plus lenalidomide plus dexamethasone for previously treated patients with multiple myeloma, NICE preferred to use utility values mapped from EORTC QLQ-C30 trial data using a mapping algorithm from Proskorovsky et al. [41,42]. NICE has published a new manual providing a hierarchy of preferred HRQoL methods when EQ-5D is not available or not considered appropriate [24,25].
Missing data threshold
G-BA and IQWiG have specified a threshold for missing data for the PRO data to be accepted [30]. IQWiG stated in the 2020 method guidance that results will generally not be considered in the benefit assessment if the proportion of study participants (intention to treat) not included in the analysis is >30%. For example, in the assessment of nivolumab for unresectable or metastatic melanoma, PRO data collected using EORTC QLQ-C30 and EQ-5D were not considered by G-BA and IQWiG, as the proportion of evaluable patients in the study arms were too low and causes of missing values or sensitivity analysis were not provided [15,43,44]. In addition, results will not be considered in the benefit assessment if the difference in the proportion of study participants missing is >15% between the study arms. These thresholds are considered as guidance, and exceptions are made if data are demonstrated to be missing at random. G-BA and IQWiG also evaluate missing data due to loss in follow-up: the number of trial participants, time point of losses and reasons for the loss should be evaluated and adequate replacement strategy or statistical analysis methods need to be implemented [15]. In practice, mixed-effects models with repeated measures (MMRM) are considered to be the favorable method compared with last observation carried forward (LOCF) to address uncertainties resulting from missing data [30].
For NICE, there is no missing data threshold provided in its guidance, and the completion rate is generally not critiqued in its assessments [22,26]. Similarly, there is no missing data threshold provided in the HAS guidance; nonetheless, it is an important attribute for PRO data acceptance for HAS [19]. In the assessment of abiraterone for mCSPC, HAS critiqued that PRO data results should be interpreted with caution when missing data are >30%, but this critique did not lead to the data being rejected [35].
Post-progression data collection
IQWiG suggests post-progression PRO data be collected until dropout or death in oncology clinical trials [30]. Although post-progression PRO data were collected in 11 analyzed case studies, collection until death or final survival analysis was conducted only in three cases: in nivolumab in metastatic melanoma [43,44], palbociclib plus aromatase inhibitor or plus fulvestrant trial in metastatic breast cancer [45,46] and ixazomib plus lenalidomide plus dexamethasone trial in relapsed or refractory multiple myeloma [47,48]. Nonetheless, the data were deemed unusable in these cases due to high proportion of missing data, over the 30% threshold.
NICE considers post-progression EQ-5D data as input to utility values in economic modeling. Although post-progression data were collected in eight analyzed case studies, EQ-5D data from clinical trials were only used in three cases to derive utility values. These include nivolumab in metastatic melanoma [49], olaparib in ovarian cancer [50] and ixazomib plus lenalidomide plus dexamethasone in relapsed or refractory multiple myeloma [51]. Nonetheless, NICE prefers to use post-progression data from trials instead of literature as commented in the obinutuzumab plus bendamustine assessment in follicular lymphoma [37]. For HAS, no guidance regarding post-progression data collection is available. This was not mentioned in any of the case studies either because PRO data from most case studies were rejected [19,26].
PRO data analysis
Clinically meaningful change threshold
IQWiG introduced new guidance in 2020 as well as an update in 2022 with regard to within-patient clinically meaningful change in PROs [15,16]. Outcomes are considered clinically meaningful if the responder definition is ≥15% of the scale range (e.g., 15 points on a 0- to 100-point visual analog scale). When response criteria are <15% of the scale range or no response criteria are prespecified, standardized mean difference (Hedges' g) is used. Here, an irrelevance threshold of 0.2 is used for the results to be considered clinically meaningful.
Because case studies analyzed for this study were pre-November 2020, published literature was the common evidence source provided for clinically meaningful change thresholds for G-BA/IQWiG assessments [52,53]. G-BA tends to accept Pickard et al. as a source for minimally important difference (MID) for EQ-5D, whereas IQWiG typically rejects it [53]. For example, in the assessment of olaparib for breast cancer gene 1/2 (BRCA1/2)+ ovarian, fallopian tube and primary peritoneal cancer, IQWiG rejected the time to deterioration of ≥7–10 points threshold for EQ-5D because the Pickard study was not considered suitable to prove the validity of the MID. Hence, IQWiG assessed a mean difference end point in this case. However, G-BA accepted the responder analyses, citing that responder analyses based on MID have general advantages for a clinical evaluation of effects compared with an analysis of mean differences and the validation study in question has already been used in previous assessments [52–55].
For NICE, predefined clinically meaningful change thresholds for EQ-5D are irrelevant for health economic modeling. However, in case studies in which time to deterioration is used to demonstrate a drug's clinical benefit, thresholds from published literature (e.g., Osoba et al.) were accepted by NICE [52]. For example, in the assessment of pembrolizumab for metastatic NSCLC, the sponsor submitted EORTC QLQ-C30 data with a mean difference of ≥10 points being considered clinically meaningful quoting the Osoba et al. and King et al. studies [52,56]. It was also mentioned that clinically meaningful mean difference as low as 4 points have been reported for EORTC QLQ-C30 in NSCLC trials as quoted in the Maringwa et al. study [57]. However, the evidence review group critiqued that it is unclear whether the result of Maringwa analysis is clinically important [58].
For HAS, no predefined clinically meaningful change threshold is provided in its guidance or case studies reviewed in this study [19,26].
Preference for time-to-event (TTE) endpoint definitions
Oncology studies generally have different follow-up times between different arms. Hence, G-BA and IQWiG prefer time-to-event analyses such as time to deterioration or time to improvement for evaluation of PRO data [30].
Some of the accepted definitions are demonstrated in the case study of daratumumab plus bortezomib plus thalidomide plus dexamethasone for multiple myeloma. Here, time to deterioration measured with the EORTC QLQ-C30 and EQ-5D visual analog scale with data collected until day 100 after autologous stem cell therapy was accepted by G-BA. End-point definitions used were ‘Time to deterioration by ≥10 points’ for EORTC QLQ-C30 and ‘Time to deterioration by ≥7 and ≥10 points’ for EQ-5D visual analog scale [59,60].
For NICE appraisals, changes in baseline scores are most common, but time to deterioration is used in some assessments. For example, for the assessment of crizotinib in ALK+ advanced NSCLC, time to deterioration was evaluated as a composite end point for the symptoms of cough, dyspnea and pain in chest using the EORTC QLQ-LC13 module. It was defined as the time from randomization to the earliest date that the patient's scale scores showed a ≥10-point increase after baseline (indicating a worsening of symptoms) in any of the three symptoms. Although the evidence review group mentioned that statistically significant benefits were observed with EORTC QLQ-LC13, only EQ-5D data were used, given it is a preferred instrument to derive utility values [61].
There is no explicit preference by HAS for specific PRO end-point definition, with no comments on end-point definitions assessed in the abiraterone assessments [26,35,36].
Missing data sensitivity analysis
In the G-BA/IQWiG assessments, sensitivity analysis is used to explore robustness of missing-at-random (MAR) assumption [30]. For example, in case of carfilzomib plus lenalidomide plus dexamethasone in multiple myeloma, the scale ‘overall health/quality of life’ was compared between the treatment groups using a likelihood-based MMRM under the MAR assumption during primary analysis. A sensitivity analysis, including a pattern mixture model (missing not at random - MNAR) with an auxiliary variable to consider missing data and a pattern mixture model that considered the time of the last available evaluation, was used to check the robustness [62,63].
NICE specifies use of sensitivity analysis to demonstrate the use of alternative utility values if EQ-5D data are available from both trials and literature. For example, in the original assessment of obinutuzumab plus bendamustine in follicular lymphoma, the sponsor provided sensitivity analysis in which EQ-5D utility scores from the GADOLIN trial as well as literature from Wild et al. were used [26,64]. Sensitivity analysis was conducted to demonstrate the use of literature over trial data as the GADOLIN trial did not capture EQ-5D data from trial participants with advance stages of disease progression [37]. For HAS, there is no mention of missing data sensitivity analysis either in guidance or analyzed case studies [26].
The summary of the PRO evidence requirements for HTA observed from the 20 case studies is shown in Table 3.
Particulars | G-BA/IQWiG | HAS | NICE | |
---|---|---|---|---|
Role of PRO data in HTA assessment | Key role in driving benefit assessment based on morbidity and HRQoL patient-relevant end points | Limited role due to frequent rejection of PRO data owing to stringent evaluation methodology | Key role in deriving utility values for cost–effectiveness analysis using EQ-5D-3L value set | |
Trial design | Open-label study | PRO data from open-label studies accepted to inform HTA decisions in practice; however, extent of effect downgraded to hint only | PRO data from open-label studies not accepted by HAS in practice | PRO data from open-label studies accepted to inform HTA decisions in practice |
Multiplicity adjustments | No direct commentary regarding multiplicity adjustment requirements in the case studies analyzed | PRO data rejected when no method for controlling type I error linked to multiplicity was specified | No direct commentary regarding multiplicity adjustment requirements in the case studies analyzed | |
PRO data collection and reporting | Preferred instruments | No preference specified among generic or disease-specific scales in the observed case studies | No preference specified in the observed case studies | Preference for EQ-5D-3L instrument for deriving utility values |
Missing data threshold | Up to 30% missing data acceptable as observed in the case studies | Missing data is one of the key critiques for rejection of PRO data; 30% threshold cited in one selected case study | No clear comments on acceptable threshold for missing data observed in case studies | |
Post-progression data collection | Data collection is preferred until death, but missing data become more problematic | No preference observed in selected case studies | Trial-derived post-progression data preferred but literature use also accepted | |
PRO data analysis | Clinically meaningful change threshold | Responder analysis with MID sourced from literature accepted; going forward a unique responder definition of ≥15% of the scale range will be preferred for all PRO data | No specific threshold specified in selected case studies | |
Preference for time-to-event end-point definitions | Preference for time-to-event analysis with use of time to deterioration or time to improvement for HTA benefit assessment observed in selected case studies | No preference observed in selected case studies | Change-from-baseline analysis is more commonly observed than time-to-event end point | |
Missing data sensitivity analysis | Sensitivity analysis used to explore robustness of missing-at-random assumption in the selected case studies | No analysis method specified in selected case studies | Sensitivity analysis utilized to demonstrate the use of alternative utility values if EQ-5D data are available from both trial and literature |
Conclusions
A major reason for increased use of PROs in oncology clinical trials can be attributed to the recognition of their importance by regulators including the FDA and EMA and the vision of incorporating the patient voice in reimbursement decisions by HTA bodies, especially IQWiG/G-BA and NICE [1,7]. However, as highlighted by this study, HTA bodies have provided varying levels of detail regarding PRO evidence requirements in their current method guidance. In particular, there is a lack of detail from NICE and HAS on how the HTA bodies would prefer PRO evidence to be analyzed and reported for HTA. Furthermore, there is a lack of consistency across HTA bodies on how PRO evidence is reviewed and considered in oncology HTAs. For instance, PRO evidence alone can lead to added benefit ratings in G-BA and IQWiG assessments, but the same data would be rejected by HAS due to methodological issues. For NICE, the most common impact for PRO data is the use of EQ-5D data in cost–effectiveness analysis.
Discrepancies highlighted in the guidance were also evident in the oncology case studies analyzed. For example, HAS rejected the PRO data in 17 of 19 selected oncology case studies citing methodological considerations such as rejection of data from open-label studies and lack of method for controlling type I error linked to multiplicity. In contrast, PRO data led to added benefit ratings in 7 of 20 selected oncology case studies assessed by G-BA and IQWiG. Oncology case studies assessed by NICE reflected what was provided in the guidance with utilities derived primarily from EQ-5D or literature values. Requirements and preference for missing data thresholds, post-progression data collection and time-to-event end points were clearly highlighted in the G-BA/IQWiG guidance and the same was observed in the case studies.
Some of the key discrepancies across HTA bodies require further exploration. For instance, HAS does not accept HRQoL data from open-label studies. However, studies have found no evidence of significant bias for PROs due the absence of blinding in randomized-controlled trials [65]. The EMA recognizes that blinded studies are not always possible, and PRO data from unblinded trials can be considered for benefit–risk assessment [2]. In another example, IQWiG's requirement of a responder definition of 15% of the range of the respective scale is not aligned with both EMA and FDA guidance [2,4]. Although IQWiG has defined this threshold based on systematic reviews of MIDs, this universal responder definition applied to all PROs has raised concerns among some questionnaire developers and sponsors, as conceptually, the responder definition cannot be the same for symptoms and for more indirect concepts such as quality of life.
Such discrepancies across European HTA bodies make it challenging for sponsors to meet the varying requirements of all HTA bodies and may reduce incentives for the sponsors to generate and provide comprehensive PRO evidence for oncology HTA submissions, despite regulator's proposed actions for patient-focused drug development. Sponsors may also have to include multiple PRO instruments in the same trial to satisfy the requirements by different regulatory and HTA bodies, which may lead to an unnecessary increase in patient burden during the clinical trial. HTA bodies can play a key role in harmonizing PRO evidentiary requirements in collaboration with regulators and sponsors. Further research is required to highlight whether similar discrepancies exist in other therapeutic areas.
Future perspective
Currently, there are therapeutic area-agnostic collaborative efforts between the EMA and FDA, which issued a joint reflection paper on Patient-Focused Drug Development that sets out a list of proposed actions of new ICH guideline for the industry [66]. To encourage the collection of PROs in clinical trials, European HTA bodies have an opportunity to issue uniformed PRO evidence requirements at the European level given the recent acceptance by the EU council for joint HTA clinical assessments (JCA) and the ongoing Setting International Standards of Patient-Reported Outcomes and Quality of Life Endpoints in Cancer Clinical Trials – Innovative Medicines Initiative (SISAQOL-IMI) project, which aims to develop standardized methods to analyze and report findings of PRO evidence from oncology trials [67–71].
Limitations of our study
The study was focused on only four key HTA bodies with three key European HTA processes. To manage the scope of the study, the results were based on the 20 solid tumor or hematological oncology PRO case studies of HTA reports published between October 2015 and November 2020; these case studies may not be representative of all solid tumor and hematological oncology HTA evaluations of PROs. The analysis in the current study was also limited to publicly available information in the method guidance published by the HTA bodies and commentary by the agencies in the HTA reports of the case studies. A more comprehensive review of differences in guidance and PRO data use among regulators, HTA bodies and payers across the US and Europe needs to be undertaken to streamline the PRO evidence requirements.
Aims
Patient-reported outcomes (PROs) are increasingly collected in clinical trials and used in health technology assessment (HTA) submissions to measure how new treatments impact patient-relevant outcomes.
However, the opportunities for PRO data within HTAs is not entirely clear or consistent across HTA bodies.
This study aims to highlight differences in PRO evidence requirements across HTA bodies in Germany, France and the UK as well as calls for its harmonization.
Methods
Method guidance documents published by Germany's G-BA and IQWiG, France's HAS and the UK's NICE were analyzed to evaluate PRO-related commentary.
This was supplemented by analysis of 20 oncology cases studies pertaining reports published by the four HTA bodies between October 2015 and November 2020.
Results
PRO evidence assessment and its role in driving HTA assessment decisions vary across HTA agencies.
Impact of PRO data collection is most clear in Germany because it is an integral part of G-BA and IQWiG's outcome analysis and can alone drive added benefit.
Although French HAS guidance emphasizes importance of Health-related quality of life in driving clinical value added rating, methodological hurdles often lead to non-assessment of such data.
The UK, driven by cost–effectiveness, only relies on EuroQol – 5 Dimension data to determine health state utilities.
Conclusions
Discrepancies across European HTA bodies increases challenges for the sponsors to meet their requirements and may reduce incentives to generate PRO data for oncology HTA submissions.
To mitigate these challenges, HTA bodies could play a key role in harmonizing PRO evidentiary requirements in collaboration with regulators and sponsors.
Supplementary data
To view the supplementary data that accompany this paper please visit the journal website at: www.futuremedicine.com/doi/suppl/10.2217/fon-2022-0374
Author contributions
All authors contributed to the entire content of the manuscript.
Acknowledgments
The authors thank Dr J Sloan and Dr Y-B Böhler for their input in the concept of the manuscript.
Financial & competing interests disclosure
The research was funded by Regeneron Pharmaceuticals, Inc. Authors who are employed by Regeneron Pharmaceuticals, Inc., also participated in the study design and the analysis and interpretation of findings. IQVIA was funded by Regeneron Pharmaceuticals, Inc., to conduct the research, perform the analysis and write this manuscript. All authors were involved at all stages of the manuscript in collaboration with the sponsor company. O Chassany has received fees from Novartis, Bristol-Myers Squibb, Takeda and IQVIA. A van Engen, L Lai, K Borhade and M Ravi are employed by IQVIA. J Harnett, C-I Chen and RGW Quek are employees and shareholders of Regeneron Pharmaceuticals, Inc. J Harnett is also a shareholder of Pfizer Inc. RGW Quek is also a shareholder of Amgen Inc. and Pfizer Inc. The authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.
Medical writing support was provided by IQVIA and was funded by was funded by Regeneron Pharmaceuticals, Inc.
Open access
This work is licensed under the Attribution-NonCommercial-NoDerivatives 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/
Papers of special note have been highlighted as: • of interest; •• of considerable interest
References
- 1. Clinical trial patient-reported outcomes data: going beyond the label in oncology. Clin. Ther. 38(4), 811–820 (2016).
- 2. European Medicines Agency. Appendix 2 to the guideline on the evaluation of anticancer medicinal products in man. The use of patient-reported outcome (PRO) measures in oncology studies (2016). www.ema.europa.eu/en/documents/other/appendix-2-guideline-evaluation-anticancer-medicinal-products-man_en.pdf
- 3. . Patient-reported outcomes (PROs) and patient-reported outcome measures (PROMs). Health Serv. Insights. 6, 61–68 (2013).
- 4. US FDA. Guidance for industry. Patient-reported outcome measures: use in medical product development to support labeling claims (2009). www.fda.gov/media/77832/download
- 5. International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use. ICH Reflection Paper on Patient Focused Drug Development (PFDD) (2020). www.ich.org/page/reflection-papers#1-1
- 6. Medicines & Healthcare Products Regulatory Agency. Patient involvement strategy 2021–25 (2021). https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/1022370/Patient_involvement_strategy.pdf
- 7. . Patient-reported outcome measures (PROMs) as proof of treatment efficacy. BMJ Evid.-Based Med.
10.1136/bmjebm-2020–111573 (2021). - 8. . Surrogate endpoints need complementary patient reported outcomes. BMJ Opin. (2021). https://blogs.bmj.com/bmj/2021/09/15/surrogate-endpoints-need-complementary-patient-reported-outcomes/
- 9. . Surrogate end points and patient-reported outcomes for novel oncology drugs approved between 2011 and 2017. JAMA Oncol. 5(9), 1358–1359 (2019).
- 10. . Patient-reported outcomes (PRO) in cancer trials submitted to the FDA from 2012–2015. J. Clin. Oncol. 35(Suppl. 15), e14024–e14024 (2017).
- 11. Place of patient-reported outcomes & experiences measurements (PROMS/PREMS) in the assessment and pricing of health technologies in France. Therapie. 77(1), 103–115 (2022).
- 12. . Using health technology assessment to assess the value of new medicines: results of a systematic review and expert consultation across eight European countries. Eur. J. Health Econ. 19(1), 123–152 (2018).
- 13. . Variation in health technology assessment and reimbursement processes in Europe. Value Health. 20(1), 67–76 (2017).
- 14. . A comparison of reimbursement recommendations by European HTA agencies: Is there opportunity for further alignment? Front. Pharmacol. 8 (2017). www.frontiersin.org/articles/10.3389/fphar.2017.00384
- 15. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. IQWiG general methods version 6 (2020). www.iqwig.de/methoden/general-methods_version-6-0.pdf •• Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen (IQWiG) version 6 method guidance document with substantial details on PROs,
- 16. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. IQWiG general methods version 6.1 (2022). www.iqwig.de/methoden/allgemeine-methoden-v6-1.pdf •• Latest IQWiG method guidance document with few updates on version 6,
- 17. Gemeinsamer Bundesausschuss. G-BA rules of procedure (2021). www.g-ba.de/downloads/62-492-2597/VerfO_2021-07-15_iK-2021-08-28.pdf
- 18. Gemeinsamer Bundesausschuss. G-BA submission dossier module (2021). www.g-ba.de/downloads/17-98-4827/2021-08-03_Anl2_1_Erstellung-Einreichung-Dossier.pdf
- 19. Transparency Committee, Haute Autorité de Santé. Transparency Committee Doctrine, Principles of medicinal products assessment and appraisal for reimbursement purposes (2020). www.has-sante.fr/upload/docs/application/pdf/2019-07/doctrine_de_la_commission_de_la_transparence_-_version_anglaise.pdf •• HAS method guidance document with methodological considerations for PROs.
- 20. . Authorisation for early access to medicinal products: HAS assessment doctrine (2021). www.has-sante.fr/upload/docs/application/pdf/2021-09/doctrine_de_la_ceesp_version_anglaise_2021-09-29_11-14-2_803.pdf
- 21. Haute Autorité de Santé. Choices in methods for economic evaluation (2012). www.has-sante.fr/upload/docs/application/pdf/2012-10/choices_in_methods_for_economic_evaluation.pdf
- 22. National Institute for Health and Care Excellence. Guide to the methods of technology appraisal (2013). www.nice.org.uk/process/pmg9/resources/guide-to-the-methods-of-technology-appraisal-2013-pdf-2007975843781 •• NICE method guidance document with methodological considerations for PROs.
- 23. National Institute for Health and Care Excellence. Position statement on use of the EQ-5D-5L value set for England | NICE Technology appraisal guidance (2019). www.nice.org.uk/about/what-we-do/our-programmes/nice-guidance/technology-appraisal-guidance/eq-5d-5l •• NICE position statement on utilization of EQ-5D-5L set for the sponsors.
- 24. NICE Decision Support Unit. Utilities TSD series – NICE Decision Support Unit (2011). http://nicedsu.org.uk/technical-support-documents/utilities-tsd-series/
- 25. National Institute for Health and Care Excellence. NICE health technology evaluations: the manual (2022). www.nice.org.uk/process/pmg36/resources/nice-health-technology-evaluations-the-manual-pdf-72286779244741
- 26. IQVIA HTA Accelerator. https://hta.quintiles.com/ •• Key source used for prioritizing and analyzing the 20 oncology case studies along with respective HTA country websites.
- 27. Use of external comparators for health technology assessment submissions based on single-arm trials. Value Health. 24(8), 1118–1125 (2021).
- 28. PCV24 Health technology assessment submissions for cardiovascular interventions in the ASIA-Pacific region. Value Health Reg. Issues. 22, S30 (2020).
- 29. . OP30 Impact of patient reported outcomes data on health technology assessments of acute myeloid leukemia. Int. J. Technol. Assess. Health Care. 35(S1), 7–7 (2019).
- 30. Patient-reported outcomes in the context of the benefit assessment in Germany (2021). https://arxiv.org/ftp/arxiv/papers/2107/2107.03249.pdf • Key source providing details on the PRO data assessments by G-BA and IQWiG.
- 31. . PCN315 – Impact of PRO data on IQWiG benefit ratings in oncology. Value Health. 21, S67–S68 (2018).
- 32. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient crizotinib (new area of application: non-small cell lung cancer, ALK +, first line) - Federal Joint Committee. https://www.g-ba.de/bewertungsverfahren/nutzenbewertung/209/
- 33. National Institute for Health and Care Excellence. Pembrolizumab for advanced melanoma not previously treated with ipilimumab. Technology appraisal guidance [TA366]. (2017). https://www.nice.org.uk/guidance/ta366
- 34. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient pembrolizumab (melanoma) – Federal Joint Committee (2016). www.g-ba.de/bewertungsverfahren/nutzenbewertung/184/
- 35. Haute Autorité de Santé. ZYTIGA (acétate d'abiratérone), inhibiteur de la synthèse des androgènes (2018). www.has-sante.fr/jcms/c_2857377/fr/zytiga-acetate-d-abiraterone-inhibiteur-de-la-synthese-des-androgenes
- 36. Haute Autorité de Santé. ZYTIGA (abiratérone), inhibiteur de la synthèse des androgènes (2013). www.has-sante.fr/jcms/c_1622341/fr/zytiga-abiraterone-inhibiteur-de-la-synthese-des-androgenes
- 37. National Institute for Health and Care Excellence. Obinutuzumab with bendamustine for treating follicular lymphoma refractory to rituximab. Technology appraisal guidance [TA472] (2017). https://www.nice.org.uk/guidance/ta472
- 38. Haute Autorité de Santé. IMNOVID (pomalidomide) (2020). www.has-sante.fr/jcms/p_3161436/fr/imnovid-pomalidomide
- 39. A French value set for the EQ-5D-5L. PharmacoEconomics. 38(4), 413–425 (2020).
- 40. Interim scoring for the EQ-5D-5L: mapping the EQ-5D-5L to EQ-5D-3L value sets. Value Health J. Int. Soc. Pharmacoeconomics Outcomes Res. 15(5), 708–715 (2012).
- 41. National Institute for Health and Care Excellence. Carfilzomib for previously treated multiple myeloma Technology appraisal guidance [TA657]. (2020). www.nice.org.uk/guidance/ta657
- 42. Mapping EORTC QLQ-C30 and QLQ-MY20 to EQ-5D in patients with multiple myeloma. Health Qual. Life Outcomes. 12(1), 35 (2014).
- 43. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. [A15-27] Nivolumab – Nutzenbewertung gemäß § 35a SGB V (Dossierbewertung) (2015). www.iqwig.de/projekte/a15-27.html
- 44. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient nivolumab (melanoma) – Federal Joint Committee (2016). www.g-ba.de/bewertungsverfahren/nutzenbewertung/179/
- 45. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient palbociclib (breast cancer) - Federal Joint Committee (2018). www.g-ba.de/bewertungsverfahren/nutzenbewertung/269/
- 46. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. [A16-74] Palbociclib (Mammakarzinom) - Nutzenbewertung gemäß § 35a SGB V (2017). www.iqwig.de/projekte/a16-74.html
- 47. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient ixazomib (multiple myeloma, at least 1 previous therapy, combination with lenalidomide and dexamethasone) - Federal Joint Committee. (2017). www.g-ba.de/bewertungsverfahren/nutzenbewertung/275/
- 48. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. [G17-02] Ixazomib (multiples Myelom) – Bewertung gemäß § 35a Abs. 1 Satz 10 SGB V (2017). www.iqwig.de/projekte/g17-02.html
- 49. National Institute for Health and Care Excellence. Nivolumab for treating advanced (unresectable or metastatic) melanoma technology appraisal guidance [TA384] (2016). https://www.nice.org.uk/guidance/ta384
- 50. National Institute for Health and Care Excellence. Olaparib for maintenance treatment of BRCA mutation-positive advanced ovarian, fallopian tube or peritoneal cancer after response to first-line platinum-based chemotherapy technology appraisal guidance [TA598] (2019). www.nice.org.uk/guidance/ta598
- 51. National Institute for Health and Care Excellence. Ixazomib with lenalidomide and dexamethasone for treating relapsed or refractory multiple myeloma technology appraisal guidance [TA505] (2018). www.nice.org.uk/guidance/ta505
- 52. Interpreting the significance of changes in health-related quality-of-life scores. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 16(1), 139–144 (1998).
- 53. . Estimation of minimally important differences in EQ-5D utility and VAS scores in cancer. Health Qual. Life Outcomes. 5, 70 (2007).
- 54. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient olaparib (new area of application: ovarian carcinoma, fallopian tube carcinoma or primary peritoneal carcinoma, BRCA-mutated, FIGO stages III and IV, maintenance therapy) – Federal Joint Committee (2020). www.g-ba.de/bewertungsverfahren/nutzenbewertung/474/
- 55. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. [A19-56] Olaparib (Ovarialkarzinom) – Nutzenbewertung gemäß § 35a SGB V (2019). www.iqwig.de/projekte/a19-56.html
- 56. . The interpretation of scores from the EORTC quality of life questionnaire QLQ-C30. Qual. Life Res. Int. J. Qual. Life Asp. Treat. Care Rehabil. 5(6), 555–567 (1996).
- 57. Minimal important differences for interpreting health-related quality of life scores from the EORTC QLQ-C30 in lung cancer patients participating in randomized controlled trials. Support. Care Cancer. 19(11), 1753–1760 (2011).
- 58. National Institute for Health and Care Excellence. Pembrolizumab for untreated PD-L1-positive metastatic non-small-cell lung cancer technology appraisal guidance [TA531] (2018). https://www.nice.org.uk/guidance/ta531
- 59. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient daratumumab (new area of application: multiple myeloma, newly diagnosed, patients suitable for autologous stem cell transplantation, combination with bortezomib, thalidomide and dexamethasone) – Federal Joint Committee (2020). www.g-ba.de/bewertungsverfahren/nutzenbewertung/522/
- 60. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. [A20-15] Daratumumab (multiples Myelom, Stammzelltransplantation) – Nutzenbewertung gemäß § 35a SGB V (2020). www.iqwig.de/projekte/a20-15.html
- 61. National Institute for Health and Care Excellence. Crizotinib for untreated anaplastic lymphoma kinase-positive advanced non-small-cell lung cancer technology appraisal guidance [TA406] (2016). www.nice.org.uk/guidance/TA406
- 62. Gemeinsamer Bundesausschuss. Benefit assessment procedure for the active ingredient carfilzomib – Federal Joint Committee (2016). www.g-ba.de/bewertungsverfahren/nutzenbewertung/204/
- 63. Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen. [G15-16] Carfilzomib – Bewertung gemäß § 35a Abs. 1 Satz 10 SGB V (2016). www.iqwig.de/projekte/g15-16.html
- 64. . PCN62 Utility elicitation in patients with follicular lymphoma. Value Health. 9(6), A294.
- 65. Investigating the impact of open label design on patient-reported outcome results in prostate cancer randomized controlled trials. Cancer Med. 9(20), 7363–7374 (2020).
- 66. European Medicines Agency. EMA/FDA – patient-focused drug development ICH reflection paper (2021). www.ema.europa.eu/en/documents/presentation/presentation-ema/fda-patient-focused-drug-development-ich-reflection-paper-mbonelli-ema_en.pdf
- 67. Council of the European Union. Council green lights new rules on health technology assessment, improving access to medicines and simplifying procedures (2021). www.consilium.europa.eu/en/press/press-releases/2021/11/09/council-green-lights-new-rules-on-health-technology-assessment-improving-access-to-medicines-and-simplifying-procedures/
- 68. Regulation on Health Technology Assessment. (2021). https://health.ec.europa.eu/health-technology-assessment/regulation-health-technology-assessment_en
- 69. BEUC, The European Consumer Organisation. Good for consumers: deal on EU-wide coordination for assessing new medicines and medical devices (2021). www.beuc.eu/publications/good-consumers-deal-eu-wide-coordination-assessing-new-medicines-and-medical-devices/html
- 70. European Organisation for Research and Treatment of Cancer. Establishing international standards in the analysis of patient reported outcomes and health-related quality of life data in cancer clinical trials (2021). https://cordis.europa.eu/project/id/945052
- 71. Innovative Medicines Initiative. Topic: establishing international standards in the analysis of patient reported outcomes and health-related quality of life data in cancer clinical trials (2020). www.imi.europa.eu/sites/default/files/uploads/documents/apply-for-funding/future-topics/Topic_PROcancer.pdf