Long-awaited results from the the Southwest Oncology Group (SWOG) trial comparing bortezomib–lenalidomide-dexamethasone (VRd) versus lenalidomide-dexamethasone (Rd) as induction treatment for previously untreated multiple myeloma (MM) patients were recently presented (S0777).1 Results justified the early adoption of VRd in the frontline setting over 5 years ago by the NCCN based on Level 2A evidence according to their guidelines.2 The randomized study E1A05 also sought to evaluate VRd superiority over a doublet (Vd), but in the consolidation setting. The trial was closed to enrollment prematurely due to slow accrual. Results reported here include 48 enrolled patients.

Patients with symptomatic MM who had completed a minimum of one cycle and maximum of six cycles of dexamethasone-based induction therapy were eligible provided no more than 8 weeks had elapsed since induction end. Patients were stratified by prior lenalidomide-dexamethasone use during induction and by complete response (CR) status at registration. Patients in the VRd arm received bortezomib (V) 1.3 mg/m2 IV d1, 4, 8 and 11, lenalidomide (R) 15 mg/day PO days 1–14 plus dexamethasone (d) 40 mg/day PO d1, 8 and 15 every 21 days for 8 cycles; patients in the Vd arm received bortezomib 1.3 mg/m2 IV d1, 4, 8 and 11 plus dexamethasone 40 mg/day PO d1, 8, 15 every 21 days for 8 cycles. The study was designed to detect a 50% improvement in median progression-free survival (PFS) from 21 months (m) to 31.5 m using a stratified log-rank test given 392 patients, 212 failures, 82% power and 2.5% one-sided Type I error.

Descriptive statistics were used to characterize patients at study entry. Response evaluation was based on the International Myeloma Working Group (IMWG) response criteria.3 Toxicity was evaluated according to CTCAEv3, including only events with at least possible treatment attribution. Response and toxicity rates were estimated with 95% exact binomial confidence intervals. The odds ratio for objective response (OR), defined as partial response (PR) or better, was estimated with 95% CIs. PFS was defined as the time from randomization to the earliest documentation of disease progression (PD) or death from any cause. Patients who were alive without evidence of PD were censored at the date of last disease assessment. Time-to-event distributions were estimated using the Kaplan-Meier (KM) method and compared using the log-rank test in all randomized patients according to assigned treatment (intention to treat, ITT).4 Cox proportional hazards regression was used to assess treatment hazard ratio (HR, VRd/Vd).5 Restricted mean survival time, which represents the area under the survival curves when cutting follow-up at a certain time, was also calculated.6 Quality of life (QoL) was evaluated using the FACT-Neurotoxicity Trial Outcome Index, which is the sum of the FACT physical, functional and neurotoxicity instruments (25 questions, score 0–100). A minimal absolute difference in mean change score between arms of 6-8 or half observed standard deviation (s.d.) was considered clinically significant.7 QoL was measured at baseline, during treatment (cycle 5), end of treatment (cycle 8 or earlier) and in long-term follow-up (months 9, 12, 15 and 18 post registration).

Between January 2008 and May 2010, 48 patients from 15 institutions (85% ECOG) were randomized, with equal allocation to VRd (n=23) and Vd (n=25). Of the study cohort, 56% were 65 years, 44% female, 85% white, 58% PS>0 and 19% ISS Stage III. Median prior dose of dexamethasone was 640 mg on both arms, with 22 patients receiving Rd. On study, mean duration of treatment was 6.5 cycles. 56% patients completed eight cycles (VRd 65%, Vd 48%). The main reason for early discontinuation was due to adverse event/complications (66%). Three patients underwent transplant within 6 m of ending treatment, but no further data on subsequent therapy were collected. The primary response analysis data set comprised 32 patients (16/arm) with confirmed measurable disease at baseline. Response rates were higher on the VRd arm, including OR rates of 81% vs 63% (odds ratio 2.6 (95% CI (confidence interval): 0.52, 13.0)) and very good partial response rates of 63% vs 19%. Sensitivity analysis excluding only patients not in CR at registration (n=3) added 14 more unevaluable patients to the existing 3. In this subset, OR and very good partial response rates on VRd and Vd, respectively, were 59% vs 43% and 45% vs 13%. Grade 3 or higher treatment-related, non-hematological toxicity rates between arms were similar (VRd 65 vs Vd 64%) (Table 1). Grade 3-4 neurologic/pain/gastrointestinal toxicity rates were 35%/13%/9% on VRd vs 28%/16%/16% on Vd. Five second primary malignancies were reported (3 Vd, 2 VRd). Median PFS was similar: VRd 20 m (95% CI: 11–43 m) vs Vd 18 m (95% CI: 9–37 m) (Figure 1). The PFS HR was 0.96 (95% CI: 0.53–1.75), with 44 failures (92%) overall. Median survival follow-up was 72 months (52% of patients have died). Median OS was 69 m on VRd and 60 m on Vd. The OS HR was 0.77 (95% CI: 0.35–1.70). Five-year OS probability was 65% (95% CI: 48–88%) on VRd and 52% (95% CI: 36–76%) on Vd. Setting follow-up at 60 m, patients on VRd survived for 50.1 m on average, a survival savings of 4.8 m (95% CI: −4.4 to −14 m). The FACT Ntx TOI mean change from baseline to 6 m post treatment end (primary) was −7.9 VRd (n=12) vs −2.6 Vd (n=8) with a s.d. of change of 15 in both arms. The mean change to treatment end was −14.9 VRd (n=19) vs −8.0 Vd (n=20). Patients on average experienced −3 to −5 and −1 to −2 point decreases over treatment in FACT physical and functional score, respectively, on the VRd arm compared with little change on the Vd arm. The two arms had parallel adverse trends over treatment in the FACT Ntx domain: −4 to −8 VRd vs −3 to −6 Vd.

Table 1 Incidence of Treatment-Related Toxicity by Treatment Arm
Figure 1
figure 1

Overall Survival by Treatment Arm.

The trial results in terms of comparing treatments are limited because of the small sample size. Our study demonstrated that VRd was not substantially less tolerated than Vd, but appeared inferior in terms of QoL driven by differences in FACT physical and less so functional domains comprising the FACT Ntx TOI. There was a trend towards improved long-term survival on the VRd arm with 5-year OS probability 65% (95% CI: 48–88%) vs Vd 52% (95% CI: 36–76%), echoing S0777 results. PFS was overlapping between arms in our study and short relative to S0777 due to the consolidation setting and lack of maintenance treatment on protocol. VRd neurologic/pain toxicity as well as OR rates paralleled S0777 results despite lower lenalidomide dosing. Response analyses show that accurate representation of efficacy can be compromised due to the stringent nature of coding disease at baseline and in follow-up, wherein a missing test can lead to ineligible/unevaluable status. Not only are response rates often underestimated but also randomization is not upheld.

The premature closure of E1A05 due to weak accrual was not uncharacteristic of clinical trial research in the cooperative group setting. Research has shown that a phase III therapeutic trial takes on average 2.5 years from formal concept review to submission and that length of development time is adversely associated with accrual performance.8, 9 Activated in September 2007, E1A05 was on par, with about a 2-year development time. The final E1A05 study design centered on comparing a two- vs three-drug regimen using the most promising novel agents of the time.10 At concept initiation, however, the goal was to compare VRd against a regimen that included VRd and high-dose therapy with autologous stem cell transplant (HDT), but a design with transplant randomization was not palatable for reviewers at that time. The value of chemotherapy-based consolidation for myeloma compared with HDT in terms of OS is still unclear, but is being tested in an international trial. The E1A05 experiment became obsolete fairly quickly, with VRd superiority a foregone conclusion based initially on reports of a Phase I/II study and confirmed later in other small studies showing unprecedented results for VRd.11, 12, 13

At ASCO 2013 the NCI MM Accrual Working Group published 10 Barriers to Accrual (BtA) and presented strategies to overcome these.14 E1A05 represents an example of BtA 2 (spectrum of available treatment options), 6 (competing trials including S0777 among others) and 7 (lack of interest in a trial evaluating commercially available agents only). Adding in the larger context of weak patient accrual metrics in the US, the problem was intensified.15 If E1A05 had met its accrual rate and event targets, definitive information on VRd would have been available as early as March 2011. The NCI has made serious efforts to improve the process of clinical trial activation with the establishment of more aggressive timelines to be met (OEWG) as well as the creation of a central IRB (CIRB). Furthermore, the development and deployment of disease-specific steering committees within the NCI was pursued with the intent to coordinate clinical trial development and, specifically, to prevent competing clinical trials within the NCI Clinical Trial Network. The cooperative group oncology clinical research enterprise provides the platform for independent, practice-changing research in MM, but the operational environment is very challenging and initiatives to enhance accrual need to continue.