Main

Tumour recurrence occurs in almost all patients with epithelial ovarian cancer (EOC) at a median of 15–18 months from diagnosis (du Bois et al, 2009; Hennessy et al, 2009). In the past two decades, great efforts have been taken to improve the survival time for platinum-sensitive recurrent (PSR) disease. The most active chemotherapy regimens in the treatment of PSR EOC were reported with a median survival of 29 months (Parmar et al, 2003). Although a randomised study, MRC OV05/EORTC 55955 trial, in ovarian cancer of early treatment of relapse based on CA125 level alone vs delayed treatment based on conventional clinical indicators showed no survival benefit from early treatment, it was argued that this trial did not appropriately consider the role of SCR since patients with PSR EOC were just assigned to receive chemotherapy (Rustin et al, 2010).

There was a reported series of surgery for PSR disease with survival of 20–63.2 months in optimal cytoreductive groups (Berek et al, 1983; Morris et al, 1989; Jänicke et al, 1992; Segna et al, 1993; Vaccarello et al, 1995; Eisenkop et al, 2000; Zang et al, 2004; Chi et al, 2006; Harter et al, 2006; Salani et al, 2007; Oksefjell et al, 2009; Tian et al, 2010), which showed a prolonged survival when parallelly compared with the patients received chemotherapy alone after recurrence (Parmar et al, 2003; Pfisterer et al, 2006; Alberts et al, 2008; Power et al, 2009). These data supported the inclusion of secondary cytoreductive surgery (SCR) as a considerable and acceptable therapeutic approach for PSR EOC.

However, the role of SCR remains in controversial because selection bias remains in the surgical cohort and there is no level I/II evidence for SCR in the management of PSR EOC. There also existed some limitations in the publications about SCR because the majority of the literature was retrospective; all of the publication were from single institute; SCR was carried out less frequently in patients with recurrent disease; and most of the published series had fewer than 100 patients. The study with the largest sample was reported by the Arbeitsgemeinschaft Gynaekologische Onkologie (AGO) (Harter et al, 2006). About 2500 patients with regard to SCR for PSR EOC were reported in the past three decades.

The aims of this study were to redefine the role of SCR and to establish a predictive model for survival based on the pooled data from an international collaborative cohort. We tried to collect as many individual data as possible, and the final data set used for the pooled analysis included nine studies with a total of 1252 cases from seven centres or groups around the world.

Materials and methods

Study design and data collection

Study protocol was developed before data collection. Platinum-sensitive recurrent is defined as the first recurrence after completion of primary treatment of at least 6 months without any evidence of disease. Secondary cytoreductive surgery is performed to remove as much of the tumour as possible in order to prolong survival in patients with PSR EOC.

A systematic search of literature was performed to identify all published studies with no restriction on language. This study protocol was started in December 2008, followed by literature searched in PubMed/Medline, Ovid, Web of knowledge and Embase, with additional MESH and free text terms for ‘secondary cytoreductive surgery and ovarian cancer or secondary cytoreductive surgery and ovarian carcinoma’, were supplemented by hand searches of conference proceedings, reference lists in the publications and review articles. We sent the invitation letters to all available investigators or groups of studies we identified, who had reported articles with regard to SCR. An international collaborative study group was then set up. Our collaborators were closely involved with the main international gynaecological cancer researchers through the Gynecologic Cancer Intergroup and were asked whether they knew of additional published or unpublished studies.

Individual patient data were collected from all participating groups in which these involved already complete data sets. The data collected were all factors according to the review of relative literatures, their clinical relevance, as well as the pilot results obtained based on the studies from the lead group. Data provided by each centre should include at least the following variables: age at recurrence, histology type at primary diagnosis, grade at primary diagnosis, FIGO (the International Federation of Gynecology and Obstetrics) stage at primary diagnosis, residual disease at primary cytoreduction (0 vs >0 cm), progression-free interval (PFI), ECOG (the Eastern Cooperative Oncology Group) performance before SCR, ascites at recurrence, CA125 at recurrence, extent status of recurrent tumour (categorised as localised which defined as 3 lesions at recurrence vs multiple defined as >3 lesions), residual disease after SCR, the largest diameter of the maximal recurrent tumour, bowel resection at SCR, blood loss at SCR, length time of SCR operation from ‘knife to skin’, complications of SCR, follow-up status, survival after SCR and survival after primary therapy. The year of the data was defined as the median year of the data in each study. The updated survival data from original reports were collected if they were available. The residual disease after SCR was defined as: R0, complete resection of all visible disease; R1, remaining small volume disease of 0.1–1 cm; R2, remaining disease >1 cm or residual disease could not be evaluated as used in our previous study (Tian et al, 2010).

Data that had been reported previously and some additional updates from each centre were collected and merged after the approval by institutional review board at each participating centre. Data of 2002–2006 from Fudan University Cancer Hospital and Nord-Ostdeutsche Gesellschaft für Gynäkologische Onkologie (NOGGO) were recently published and updated (Sehouli et al, 2008; Supplementary Table S1). Eligibility was reviewed for all patients collected. The extensive data validity and consistency were checked for every study before combining for the final analysis. Patients were excluded if the histology of the tumour was not epithelial type, PFI <6 months or the survival data were not available.

Statistical analysis, derivation and validation of the risk model

The primary outcome measure was survival time from the date of SCR to the date of death. Patients alive were censored on the date of the last follow-up. The univariate analysis for all SAS software (version 9.2, SAS Institute Inc., Cary, NC, USA) was used to develop the risk model for survival. The median survival was computed using the Kaplan–Meier method. The differences in survival of 14 variables were assessed by a log-rank test. Estimated survival rates were calculated by Life Table. We also evaluated hazard ratio (HR) using both log-rank method and the Life Table. The Cox regression was conducted to establish the risk model of survival and estimate the HR. CA125 and PFI were treated as continuous variables, and the cut-off points were evaluated by the receiver-operating characteristics (ROC) curve area and the Youden index (Chmura Kraemer, 1992; Schisterman et al, 2005).

Multiple imputation has emerged as an appropriate and flexible way of handling missing data. Some researchers avoid imputation approaches because of fears of ‘making up data’. In fact, complete-case analyses require stronger assumptions than does imputation. Multiple imputation is one technique becoming increasingly advocated to deal with missing data because of its improved performance over alternative approaches (Rubin, 1987; Klebanoff and Cole, 2008; Graham, 2009). The sequential regression multiple imputation (SRMI) method was used to estimate missing values, and the analyses were performed using the multiple-imputed data sets (He et al, 2010).

In the final multivariate model, potential predictive variables of a P-value <0.05 were considered as risk factors, and a risk score was calculated based on its beta-coefficients from the multivariate analysis. The risk model combining all scores was then developed, and a score for every patient represented the sum of scores for each risk factor.

We adopted an internal validation to assess the performance of the risk model for survival. Youden index is a global measure of diagnostic effectiveness, which calculated by sensitivity+specificity−1. This index represents the maximum vertical distance between the ROC curve and the diagonal line. As it occurs at the cut-point that optimises the variable's differentiating ability when equal weight is given to sensitivity and specificity, we used both the Youden index and ROC curves area to determine the cut-off of the continuous variables included in the risk model.

Results

Description of study cohort

By April 2009, seven centres from China (the lead group), Germany, the United States, Norway, Italy and Turkey had joined the international collaborative study group. There were 1252 patients initially collected from nine studies. However, 152 patients were excluded, of which 5 patients with were excluded because of the histology of the tumour being malignant mixed mullerian type, 114 patients for PFI <6 months and the other 33 patients for unavailable survival data. As a result, 1100 patients were finally entered into the study cohort, in which 60 (1986–1997), 87 (1998–2001) and 140 (2002–2006) were from Fudan University Cancer Hospital (FDUCH), respectively; 235 (2000–2003) from AGO; 204 (1999–2004) from NOGGO; 148 (1987–2001) from Memorial Sloan-Kettering Cancer Center; 128 (1985–2000) from Norwegian Radium Hospital; 54 (1990–2001) from Hacettepe University; and 44 (1982–1994) from University of Bari (Supplementary Table S1).

The clinicopathologic characteristics of these patients are presented in Table 1. There were no missing data of PFI, but for other variables the percentage of missing data ranged from 0.2% to 37.9%. The median age at disease recurrence was 56 years, ranging from 16 to 84 years. A median PFI was 19 months (range, 6–326 months). Complete SCR was achieved in 433 (39.4%) patients.

Table 1 Patient characteristics

Predictors of survival

At the time of data analysis, 685 (62.3%) patients had died. The median follow-up time was 20.5 months (range, 0.1–278.8 months) (Supplementary Table S1). The median survival after SCR was 28 months.

On the multivariate analysis, four variables PFI, ascites at recurrence, extent of recurrent tumours and residual disease after SCR were finally identified as the predictors for survival (Supplementary Tables S2, S3). The cut-off of PFI determined by ROC was 23.1 months. The median survival after SCR for patients who had PFI of >23.1 months and 23.1 months was 45.0 months and 21.0 months, respectively (HR: 1.72, 95% CI: 1.45–2.03; P<0.0001; Supplementary Figure S1). The cut-off of CA 125 level at recurrence was 251.0 U ml–1, but it was not a survival determinant by the Cox regression analysis. Patients with localised lesions had a median survival of 43.9 months and the survival in those with multiple lesions found at SCR decreased to 20.0 months (HR: 1.38, 95% CI: 1.16–1.64; P<0.0001; Supplementary Figure S2).

Residual disease after SCR was the strongest survival determinant among almost all those variables, with a median survival of 57.7 months for those without gross residual disease, compared with 27.0 months in R1, and 15.6 months in R2 (R1 vs R0: HR: 1.90, 95% CI: 1.54–2.34; χ2=70.32, P<0.0001; R2 vs R0: HR: 3.00, 95% CI: 2.43–3.70; χ2=237.26, P<0.0001; R1 vs R2: χ2=36.79, P<0.0001; Figure 1).

Figure 1
figure 1

Survival by residual disease after SCR in patients with recurrent ovarian cancer with HR estimated by univariate analysis.

The model for survival

Assignment of risk score to each predictor was dividing the coefficient from the Cox regression model by 0.3. Patients with R2 after SCR were given the highest score of 4. Consequently, the total scores for patients with PSR EOC ranged from 0 to 8 points. (Supplementary Table S3)

The model performance was internally validated for discrimination. In the risk model, the ROC area was 0.729 (95% CI: 0.698–0.759) with a cut-off point at 2.5 (Table 2, Figure 2). According to the cut-point and the distribution of cases of each score, we divided patients into two classes of low risk (0–2 points) and high risk (3–8 points). In the cohort, there were 418 (38.0%) patients at low risk and 682 patients at high risk. The median survival after SCR for these two classes was 63.0 and 19.1 months, respectively (HR=3.65, P<0.0001) (Table 3). Additional analyses the hazard functions of the risk model are further described in Figure 3.

Table 2 Scoring system for survival in patients with recurrent ovarian cancer undergoing secondary cytoreductive surgery
Figure 2
figure 2

Derivation and validation of the risk model for survival in patients with recurrent ovarian cancer who underwent secondary surgical cytoreduction. (A) Risk ratios of all variables. (B) ROC curve of validation. The area under the curve is 0.729. The cut-off point of the risk model is 2.5 with the sensitivity, 1-specificity and Youden index (Sen.+Spe.−1) of 76.3%, 38.3% and 0.38, respectively. RD1=residual disease after first surgery; RD2=residual disease after secondary surgery.

Table 3 Performance of survival risk model in the validation cohorts
Figure 3
figure 3

A comparison of survival hazard between low-risk and high-risk groups. (A) By log-rank test. (B) By the Life Table. Orange line or marker=the curve of patients with scores 0–2 (low risk). Blue line or marker=the curve of patients with scores 3–8 (high risk).

Discussion

The risk model and predictors of survival in patients with recurrent ovarian cancer

This risk model suggested residual disease after SCR was the strongest survival determinant when compared with other three factors. It is an evidence to demonstrate to role of secondary cytoreduction. In our pooled analysis, the value of area under the ROC curve suggested that this risk model had a reasonable discriminatory ability and may be used to stratify patients into risk classes for surgical outcomes. Through internal validation, the model showed the feasibility (the ROC curve area was 0.729) in estimating survival determinants for an individual patient with PSR EOC. Using the cut-point scoring system, one patient at low risk with a cut-point value of 0–2 will have a prolonged medial survival of 63 months. To our knowledge, it is the first model of survival for patients with PSR EOC. In this model, PFI, ascites at recurrence, extent of recurrent disease were three intrinsic variables that were well balanced by ethnicities. DESKTOP I found three variables impact on survival after surgery for recurrence: complete resection (residual tumour 0 vs >0 mm: HR: 2.94; P<0.001), ascites (<500 vs >500 ml: HR: 2.30; P=0.004) and postoperative chemotherapy (platinum-containing chemotherapy yes vs non: HR: 1.84; P=0.015). It was different from the results of DESKTOP I that we found extent of recurrent disease was a survival determinant by multivariate analysis (localised lesions vs multiple lesions: HR: 1.38, P<0.0001). According to the literature, the cut-off value of PFI was generally accepted as 12 months, herein, we demonstrated the cut-off point of PFI being 23.1 months determined by the ROC curve. Of 418 women at low risk, 379 (90.7%) were with R0 after SCR and 39 (9.3%) with R1. That is, no gross residual disease after SCR contributed most to the risk model. Therefore, we believe that SCR should be offered for patients with an estimated complete SCR.

In order to be used easily by clinicians, we tried to establish the model using factors before SCR, but the performance of the model was not satisfactory because of a lower sensitivity and a higher false-positive rate (data not shown). That is, we cannot ignore the variables of the intervention in the derivation of the model. On the multivariate analysis, Figure 2A shows patients with R2 experienced the highest HR of 3.0 (95% CI: 2.43–3.70).

In this cohort, we compared the difference in survival among seven centres, and the P-value was <0.0001, but the HR was 0.986 (95% CI: 0.982–0.990) (data was not shown). In our analysis, the year of the data was not a survival determinant, and it was different from that in the meta-analysis by Bristow et al (2009). Surgical experience is very important, but this factor has been balanced by patient selection criteria.

There existed some limitations in this study, and we tried to settle those problems before our data analysis. First was about the missing data. It had been acknowledged there would be missing data as not all variables defined in the protocol were collected in all original published studies. It was reasonable to assume the missing data were random and distributed evenly among categories. Compared with simpler approaches such as complete-case analysis, multiple imputation techniques will generally provide more accurate estimates of associations in the data (Rubin, 1987). When we performed the statistical analysis, we used the SRMI method to reduce the impact of missing data.

Second, we did not include salvage chemotherapy before or after SCR as a prognostic factor when gathered the SCR data from each centre. Only a few series reported the systemic postoperative chemotherapy (Jänicke et al, 1992; Segna et al, 1993; Zang et al, 2000; Zang et al, 2004; Harter et al, 2006; Oksefjell et al, 2009). And in most of the publications involved salvage chemotherapy after surgery, there was significant difference in survival when compared with the cycles of salvage chemotherapy, but we considered that those published studies were heterogeneous and the study duration was different. As there are varied regimens for recurrent ovarian cancer, even for one patient, the regimens would change based on the response to chemotherapy. Thus, it is hard to well evaluate the role of salvage chemotherapy in this study cohort. Therefore, we did not consider salvage chemotherapy as a prognostic factor but rather a confounder to have been controlled in the model.

The role of complete secondary cytoreduction (R0 vs R1 and R2)

In this collaborative cohort, survival difference was observed in each stratum among R0, R1 and R2. Surely, a R0 patient most likely is going to live longer after secondary cytoreduction than a R1 or R2 patient. Patients with R1 could also get a significant survival advantage from SCR when compared with those with R2. In 2004, we had reported a prospective study of 117 patients with the same results between R1 and R2, but there was no significant statistical difference in survival between patients with R0 and R1 (Zang et al, 2004). So SCR with residual disease 1 cm was defined as optimal cytoreduction in that study. Chi et al (2006) set 0.5 cm as a cut-off point and defined residual disease 0.5 cm as optimal result. Data from their centre showed no statistically significant differences in survival between patients with R0 and patients who had residual disease of 0.1–0.5 cm, as well as among patients who had residual disease of 0.6–1.0 cm, 1.1–2.0 cm and >2 cm. So those results suggested 0.5 cm should be the cut-off of optimal SCR. However, in the report of 267 patients by AGO, Harter et al (2006) found that just women with R0 after SCR could get a survival benefit. For patients with R0, the median survival was 45.2 months compared with 19.6 months for patients with R1 and 19.7 months for R2. Nevertheless, in a recent report, we found that when we did the paired comparison of survival for patients with R0, R1 and R2, they all showed significant differences, which was coincident with the results of the present pooled analysis (Tian et al, 2010).

In the present pooled-analysis, we demonstrated that R0 patients had a contribution of >90% to the group of patients at low risk for survival. So it is important how to find out those will have complete SCR. Our recent publication has presented another model to predict which patients will undergo a successful SCR (Tian et al, 2011).

Conclusions

The role of SCR is first demonstrated by a risk model for survival for patients with recurrent ovarian cancer. Based on the international multi-centre database, this risk model for survival can well predict that which patients would benefit most. R0 should be goal of SCR. However, external validation and randomised trials are needed to test the prognostic model and the role of SCR for patients with PSR ovarian cancer.