Introduction

As an increasing number of women live with a previous diagnosis of cancer, the importance of supporting the ongoing physical and psychosocial health of this population is an important area of clinical care and research. Women who have completed treatment for cancer experience physical and psychological sequelae that can negatively affect their quality of life (QoL) and health as they recover from treatment [1]. While research in this area is well-established, “recovery” care to help women manage these symptoms is not usual care in practice. As such, there are calls for interventions to improve these symptoms so that women can recover well from their treatment experience [1]. One of the most common persistent health complaints reported by women after cancer treatment is sleep disturbance [1,2,3]. Poor sleep in women previously treated for cancer has been associated with a range of symptoms and other outcomes including fatigue [4], poorer daytime functioning, mood disturbances, poorer health and diminished health-related QoL (HRQoL) [1, 4].

Sleep is a plausible candidate to explain variability in women’s outcomes, including QoL, after receiving treatment for cancer. Sleep is hypothesised to serve a restorative and recuperative function [5]. Insufficient and disturbed sleep post-cancer treatment could impede recovery, disrupt rehabilitation efforts and compound other psychological, cognitive and health problems, ultimately affecting QoL. Although the association between sleep and HRQoL has been explored in women’s cancer contexts before, much of this research has been conducted in diagnosed women before or during their cancer treatment [6, 7]. Given how common sleep disturbance is after treatment completion [1, 3], more comprehensive research is needed to understand its impact on HRQoL during this stage of the cancer care trajectory. This understanding could inform effective interventions to improve sleep and QoL after cancer treatment.

The years following treatment for cancer are likely a critical period for the development and maintenance of health-improving behaviours that can influence proximal and long-term physical and psychological health outcomes in this population. The Women’s Wellness after Cancer Program (WWACP) was developed to enhance women’s HRQoL after the completion of their breast, blood or gynaecological cancer treatment [8]. The program targets a suite of behaviours, including body composition, physical activity, diet and alcohol consumption via evidence-based health education and health promotion strategies. Among these, strategies for sleep management are also provided. The effectiveness of the program was examined via a multi-centre randomised control trial. Analysis of other trial outcomes of the WWACP intervention (reported elsewhere) found a significant effect of the intervention on HRQoL [9], a significant effect of the intervention for less alcohol intake at 12 weeks, but not at the 24-week follow-up [10], and no significant effect of the intervention for exercise recommendations [11]. The impact of this intervention on sleep has not yet been reported.

The purpose of this paper was to examine the association between sleep problems and HRQoL. We also sought to examine the effectiveness of the WWACP on improving participants’ sleep duration and other sleep problems. It was hypothesised that sleep problems would significantly predict self-reported physical and mental HRQoL, and compared to usual care, women undertaking the WWACP intervention would show greater long-term adherence to sleep recommendations (i.e. sleeping > 7 h per night).

Methods

Study design, participants and procedure

The WWACP was a single-blinded, multi-centre randomised controlled trial. Participant recruitment, trial design, measures and procedure have been described in full previously [8]. The protocol was registered with the Australian and New Zealand Clinical Trials Registry: Trial ID: ACTRN12614000800628. In summary, 351 women who had completed chemotherapy (primary or adjuvant) and/or radiotherapy for breast, gynaecological or blood cancers within the previous 24 months were enrolled in this study. Data were collected at baseline (prior to randomisation to the intervention or usual care), 12 weeks (at completion of the intervention) and 24 weeks. In total, 175 (49.9%) participants were randomised to the intervention group and 176 (50.1%) were randomised to the usual care (control) group. Recruitment occurred via five major hospital sites in Queensland, New South Wales, Victoria and Western Australia, and through promotion through partner organisations (e.g. Wesley Hospital Choices Cancer Support Center) and consumer groups (e.g. the National Breast Cancer Foundation’s Register, the Breast Cancer Network of Australia).

Inclusion criteria were age ≥ 18 years, English language proficiency, completion of surgery and primary or adjuvant chemotherapy and/or radiotherapy for breast, blood or gynaecological cancer within the previous 24 months. Access to the internet via a device (mobile phone, laptop or desktop computer) was also required. Participants with metastatic or advanced cancer, inoperable or active locoregional disease or undertaking maintenance chemotherapy for blood cancers were excluded from participation.

Data were collected via online surveys and face to face, depending on the information type. Participants were randomly assigned to either usual care or the WWACP using permuted-block randomisation after completing the baseline data collection assessment. Follow-up data collection occurred at 12 weeks (at completion of the intervention) and 24 weeks (to assess sustained behaviour change) after the baseline assessment.

The WWACP was a 12-week multimodal health education and promotion intervention for women previously treated for cancer that aims to enhance HRQoL. The program was delivered via an e-health-enabled platform, with individual virtual consultations (~ 1 h at weeks 0, 6 and 12) with a cancer nurse occurring for 12 weeks. Participants all received a hard copy journal and or iBook divided into weeks and steps to assist with developing positive lifestyle habits. Topics included managing exercise, nutrition, sleep, stress, menopause and sexuality concerns. There was access throughout the intervention to a website (including podcasts), discussion groups on forums and interactive WWACP iBook. A range of delivery strategies were used to help participants attend to the intervention information, including consultations, motivational coaching and mobile phone text messaging. The nurses worked with participants flexibly to set realistic goals within the participants’ individual physical and psychological capabilities. Fidelity was enhanced by the expert cancer nurses receiving intervention training, including receiving a self-directed protocol manual, and participation in two full days of skills development sessions. Further information about the WWACP intervention content and delivery strategies have been reported previously [8]. The sleep-related content in the program included a section in the book that discussed sleep management strategies and encouraged the use of the six rules for ‘BETTER’ sleep [12]. A podcast discussing sleep approaches was also available for participants. Where participants specifically indicated that improving sleep was a key goal for them, the cancer care nurse would offer support and discussion in this specific area. This might include (but was not limited to) reasons for sleep issues, maintaining a sleep diary, referral back to the general practitioner for ongoing referral to sleep specialists, counselling support and recommendation of other written sleep resources.

This study was approved by Queensland University of Technology Human Research Ethics Committee (Approval No: 1300000335). All participants provided written informed consent.

Measures

Sleep problems

Sleep quality was measured using the Pittsburgh Sleep Quality Index (PSQI) [13]. The PSQI has 19 self-rated items that measure sleep quality and disturbances. For each item, participants were asked to reflect on their usual sleep habits for the previous month. The response formats for each of the items vary, but scoring instructions are used to assign scores from 0 to 3 for each of the 7 subscales or “components”. The 7 component scores represent the following constructs: (1) subjective sleep quality, (2) sleep latency, (3) sleep duration, (4) habitual sleep efficiency, (5) sleep disturbances, (6) use of sleep medication and (7) daytime dysfunction. Scores on these component scores range from 0 to 3, with higher scores indicating more severe difficulty (0 = no difficulty; 3 = severe difficulty). The seven component scores were added together to create one Global Sleep Quality score ranging from 0 to 21, with a higher score indicating more severe difficulty across all areas. The additional 5 roommate/bed partner-rated items were not scored in this study. This scale has demonstrated adequate psychometric properties [13], including in cancer patients [4, 14]. The scale is the most commonly used in cancer studies [15]. A cut score of > 5 was used for the Global Sleep Quality score to indicate clinically significant sleep problems, consistent with previous recommendations and research [16, 17]. Other sleep outcomes derived from this measure included insufficient sleep duration (< 7 h; i.e. component score > 0); poor sleep quality (“fairly bad” and “very bad” sleep quality; i.e. component score of ≥ 2); poor sleep efficiency (habitual sleep efficiency of “65–74%” and “ < 65%; i.e. component score ≥ 2); frequent sleep disturbance (i.e. “once or twice a week” and “three or more times a week”; i.e. component score ≥ 2).

Health-related quality of life

The 36-Item Short Form Health Survey (SF-36) [18, 19] is a widely used non-specific measure of HRQoL. The measure meets high psychometric standards [20, 21]. The 36 items represent eight scales: Physical functioning, role-physical, bodily pain, general health, vitality, social functioning, role-emotional and mental health. These eight scales form two higher order constructs: physical health (the first four scales) and mental health (the latter four scales). These constructs are represented by two summary measures: Physical Component Summary (PCS) measure and Mental Component Summary (MCS) measure. Scoring was completed as per standard instructions, using norm-based scoring algorithms. Lower scores on the MCS reflect “frequent psychological distress, substantial social and role disability due to emotional problems; health in general rated ‘poor’”. Lower scores on the PCS reflect “Substantial limitations in self-care, physical, social, and role activities; severe bodily pain; frequent tiredness; health rated ‘poor’” (p.72) [22].

Covariates

Sociodemographic and clinical data (e.g. age, education, employment status, marital status, self-reported body mass index (BMI)) were collected via online surveys and face to face, depending on information type. Menopausal symptoms were assessed with the 21-item self-rated Greene Climacteric Scale (GCS) [23], which includes three main scales: psychological (11 items), physical (7 items) and vasomotor (2 items: hot flushes, sweating at night), and single-item “sexual dysfunction” measure that elicits information about “loss of interest in sex”. Pain was measured using the bodily pain subscale of the SF-36. Other data collected included the International Physical Activity Questionnaire Short Form (IPAQ-SF), the Center for Epidemiologic Studies Depression Scale and the Zung Self-rating Anxiety Scale.

Data analysis

To account for the baseline difference in the sleep outcome prevalence, we used the difference-in-difference regression model approach to examine the intervention effect on the five sleep outcomes. The difference-in-differences estimate is a widely used statistical technique for pre-post intervention control design for efficacy analysis [24, 25]. In this approach, the intervention effect was measured as the difference-in-differences in the proportion of sleep outcomes. The estimate is zero if there is no intervention effect and negative if there is a reduction in the proportion for participants in the intervention group compared to the control participants. The regression model was structured as follows to estimate the intervention effect having adjusted by the difference in the proportion at the baseline:

$$\mathrm{Sleep}\;\mathrm{outcomes}=\mathrm{Intercept}+\mathrm a^\ast\mathrm{Group}+\mathrm b^\ast\mathrm{Time}+\mathrm c^\ast\mathrm{Interaction}+\mathrm{error}$$

The variables were coded as follows: for group, intervention = 1 and control = 0; for time, follow-up (12 or 24 weeks) = 1 and baseline = 0; and interaction = 1 for the measurements of intervention group at follow-up time points, and all others = 0. The c-coefficient is considered as the estimate of the intervention effect. As the sleep outcomes were binary variables, difference-in-differences logistic regression models were used to compare the proportion of sleep outcomes between the intervention and control groups at week 12 and week 24 with reference to their baseline estimates. These models were further extended to the adjusted models by including the predictors of these sleep outcomes, which were identified in another study by Edmed and colleagues’ (submitted). Specifically, the models adjusted for age, education, employment status, marital status, GCS psychological symptoms, GCS vasomotor symptoms, GCS sexual dysfunction and pain.

To examine the association between sleep outcomes and HRQoL (both physical (PCS) and mental (MCS)), a single-exposure-outcome model was used. Due to multicollinearity issues, each of the sleep variables was assessed separately with HRQoL. The outcome variables of PCS and MCS were continuous; with data collected at three time points, we used linear panel regression (xtreg command in STATA, panel at participants’ ID level) to examine the association between sleep outcomes and HRQoL after accounting for the dependence due to each participant having repeated observations at three different time points. For adjusted analyses, results were adjusted for covariates identified as predictors of sleep as well as the SF36 PCS and MCS (results not reported), which included age, marital status, education, self-reported body mass index (BMI) and the GCS psychological, vasomotor and sexual dysfunction scales, as identified in another study by Edmed and colleagues (submitted).

Missing data

There were substantial missing values on the sleep disturbance component score (and consequently the Global Sleep Quality score) of the PSQI and for some covariates. As such, we conducted analyses in three different samples: (1) available sample (unadjusted model); (2) complete sample (adjusted model on complete sample after excluding missing values at covariates); (3) imputed full sample (complete sample after imputing missing values). Missing values were imputed using multiple imputations. We used chained equations, a sequence of univariate imputation methods with fully conditional specification of prediction equations using STATA command mi impute chained. The imputation routine consisted of 1000 iterations to create 30 imputed data sets. These imputations were then validated. To assess the accuracy of the imputation, several parameters were also examined, such as relative increase in variance, fraction of missing information, degrees of freedom, relative efficiency and the between-imputation and the within-imputation variance estimates. Given that we did not observe any substantial differences between the imputed sample and the complete case sample, we report results from the imputed sample in the main text. The results that report the analyses using complete cases are presented in supplementary Tables 1 and 2.

Results

Table 1 presents the characteristics of the full sample at the baseline measurement. This table shows that nearly all participants had a previous diagnosis of breast cancer, and most participants were born in Australia. Table 2 shows the distribution of participants’ characteristics across the intervention (n = 175) and the treatment as usual (i.e. control; n = 176) groups. The between group comparisons show that there were no significant differences for the sample characteristics between the intervention and control groups (Table 2).

Table 1 Characteristics of participants at baseline
Table 2 Distribution of participant characteristics between intervention and control group at baseline

Association between sleep and HRQoL

Table 3 shows the association between the sleep outcomes and HRQoL as measured by the SF-36 MCS and PCS. These results show that all the sleep indicators statistically significantly predict physical HRQoL, except for insufficient sleep duration, in both the unadjusted and adjusted analyses. Although the sleep outcomes predict mental HRQoL in the unadjusted analyses, the results were not statistically significant when the results were adjusted for age, marital status, education, self-reported BMI and the GCS psychological, vasomotor and sexual dysfunction scales (Table 3). The statistically significant results of the adjusted analyses show that for women who had sleep problems, the predicted PCS score would be 2.1–3.0 points lower than women who did not have sleep problems, holding all other covariates constant (Table 3).

Table 3 Association between sleep and health-related quality of life (HRQoL) of cancer-treated women: regression analysis

Effectiveness of the intervention on indicators of sleep disturbance

Figure 1 depicts the prevalence of each sleep outcome at baseline, 12 weeks and 24 weeks. The proportion of participants meeting criteria for insufficient sleep duration at baseline is 38% for the intervention group and 40% for the control group. By week 24, 39% of the intervention group and control group meet criteria for insufficient sleep. A similar pattern of results can be observed for other outcomes (Fig. 1 and Table 3). Table 4 reports the unadjusted and adjusted intervention effects at different times. Non-significant trends from the difference-in-difference analysis show that the proportion of sleep problems reduced at 12 weeks, but not at the 24-week follow-up. Overall, there was no statistically significant effect of the intervention on any of the sleep outcomes at any of the follow-up time points.

Fig. 1
figure 1

Prevalence of each sleep outcome at baseline, 12 weeks and 24 weeks for Intervention (left) and Control (right) groups

Table 4 Efficacy of the intervention on sleep outcomes among cancer-treated women: difference-in-differences regression analysis

Discussion

This paper examined the relationship between sleep problems and HRQoL, and the potential effectiveness of the WWACP intervention. As hypothesised, sleep efficiency, sleep quality, sleep disturbance and clinically significant sleep disturbance significantly predicted physical HRQoL. However, contrary to our hypothesis, these sleep outcomes did not predict mental HRQoL after adjustment for covariates. Also, sleep duration did not predict physical or mental HRQoL after adjustment. Contrary to our hypothesis, participants in the intervention group did not show greater long-term adherence to sleep recommendations (i.e. sleeping > 7 h). Overall, there was no significant effect of the intervention on the sleep duration outcome or any of the other sleep outcomes measured in our study.

We found that all the sleep outcomes except insufficient sleep duration were associated with physical HRQoL, but not mental HRQoL in analyses adjusted for covariates. This may indicate that other factors such as the climacteric-related psychological or vasomotor symptoms contribute more to poorer HRQoL in the mental health domain. Although not explored in this study, there are likely complex pathways between sleep, climacteric symptoms and mental health-related QoL. Further research is needed to understand these complex pathways. Previous studies have also observed a relationship between sleep disturbance and general HRQoL in breast cancer patients at varying stages of the clinical care trajectory [1, 4, 6, 7], including with objectively measured sleep [7]. Our findings contribute to this research by demonstrating the independent association between self-reported sleep problems and physical HRQoL in women treated for cancer using standardised measures of these constructs. The magnitude of this effect was that in women experiencing poor sleep after completing cancer treatment, SF-36 PCS scores were 2.1–3.0 points lower than in women who did not have sleep problems, holding all other covariates constant. As such, unassessed and untreated, sleep problems may undermine physical domains of HRQoL across the cancer care trajectory.

Overall, we found that there was no statistically significant effect of the intervention on any of the sleep outcome measures. There was no support for the a priori hypothesis originally described in the study protocol, which was that compared to usual care, women undertaking the WWACP intervention will show greater long-term adherence to sleep recommendations. The sleep duration recommendations provided by the National Sleep Foundation [26] advise that adults aged between 26 and 65 years should obtain between 7 and 9 h of nightly sleep. In this study, at least 40% of the sample did not meet these recommendations at baseline (38% intervention, 40% control) and there was no statistically significant reduction in the proportion of participants who were exposed to the intervention not meeting these recommendations at 12 weeks (at the end of the intervention) or at 24 weeks (to assess sustained behaviour changed). When interpreting the trends in the effectiveness analysis, the proportion of participants in the intervention group meeting criteria for all poor sleep outcomes reduced at week 12 for many of the outcomes (although not statistically significantly), but this direction of the effect was not sustained at 24 weeks. These trends suggest that any behaviour change or impact of the intervention was not sustained over the longer term.

Although the WWACP included strategies for sleep management, it is likely that the intervention was not effective for this outcome because these strategies were a small component of a larger multimodal lifestyle intervention targeting a broad range of behaviours. In this program, certain behaviours were targeted and worked through intensively when identified as a participant goal. Despite the high prevalence of sleep problems in this sample, women might have prioritised other problems for more intensive focus, which is likely given the array of other symptom concerns in this population (e.g. vasomotor symptoms). Analysis of other trial outcomes of the WWACP intervention (reported elsewhere) did find a significant effect of the intervention on HRQoL at week 12 and sustained at week 24 [9], despite non-sustained improvement of other study outcomes (i.e. alcohol use [10] and exercise recommendations [11]). Overall, although we found no significant effect of the intervention on any of our sleep outcomes, the results from this study can contribute to future meta-analyses. More targeted interventions to improve sleep are needed, where sleep problems are indicated. Further research exploring such interventions, or adaptions to the WWACP are needed.

The results from this study should be interpreted with respect to its limitations. Sleep is a complex construct. Sleep–wake disturbance can be measured subjectively (e.g. self-report) or objectively (e.g. actigraphy, polysomnography). Both ways of measuring sleep provide different, and often complementary information about the nature and impact of the sleep disturbance on the patient. For instance, subjective report provides information about how problematic sleep disturbance is for the patient. However, objective measurement of sleep can overcome limitations associated with self-report, such as recall bias, and reveal potentially important physiological features of sleep. The PSQI includes a range of subscales examining different sleep constructs (e.g. sleep timing, sleep efficiency), but there is lack of specificity by clinically diagnosable disorders. One implication of this lack of diagnostic information is that the experience of “poor sleep” could reflect different sleep disorders that require different treatments. Greater clarity about the nature of sleep disturbance after cancer treatment would be gained if future studies undertook a comprehensive approach to assessing sleep, including the inclusion of objective sleep measurement. We also encountered a significant amount of missing data with this measure on the sleep disturbance component subscale, and as such reported analyses using imputed missing data. The phenomena of obtaining substantial missing data on the PSQI when self-administered has been recognised previously (e.g. 21% of cases lost due to cumulative impact of missing items when calculating the Global Sleep Quality score) [14]. Future uses of the PSQI in this population should seek to verify missing participant responses where possible. More broadly, this study utilised a short follow-up period. Longer term monitoring of outcomes might have resulted in different findings. Finally, these results likely do not generalise to women with types of cancers other than breast. Despite these limitations, the study included strong design features of a randomised control trial with repeated follow-up assessments. The primary analyses controlled for the impact of potential confounders. Finally, sleep was assessed using a validated scale. Although previous studies have examined sleep–wake disturbances during recovery from cancer, they have mostly examined sleep–wake disturbance as one of a broader range of symptoms, or commonly as part of QoL measures [27], rather than as a primary outcome with validated scales.

Disturbed sleep after cancer treatment can impede recovery, disrupt rehabilitation efforts and compound other psychological, cognitive and health problems. As a modifiable factor, the treatment of sleep–wake disturbance across the cancer care trajectory can have numerous downstream effects, including improving HRQoL, as suggested by the results of our study. Opportunities to optimise and better target sleep interventions should continue to be explored in this population.