Scolaris Content Display Scolaris Content Display

Pharmacological treatment for psychotic depression

This is not the most recent version

Collapse all Expand all

Abstract

available in

Background

Evidence is limited regarding the most effective pharmacological treatment for psychotic depression: combination of an antidepressant plus an antipsychotic, monotherapy with an antidepressant or monotherapy with an antipsychotic. This is an update of a review first published in 2005 and last updated in 2009.

Objectives

1. To compare the clinical efficacy of pharmacological treatments for patients with an acute psychotic depression: antidepressant monotherapy, antipsychotic monotherapy and the combination of an antidepressant plus an antipsychotic, compared with each other and/or with placebo.

2. To assess whether differences in response to treatment in the current episode are related to non‐response to prior treatment.

Search methods

A search of the Cochrane Central Register of Controlled Trials and the Cochrane Depression, Anxiety and Neurosis Group Register (CCDANCTR) was carried out (to 12 April 2013). These registers include reports of randomised controlled trials from the following bibliographic databases: EMBASE (1970‐), MEDLINE (1950‐) and PsycINFO (1960‐). Reference lists of all studies and related reviews were screened and key authors contacted.

Selection criteria

All randomised controlled trials (RCTs) that included participants with acute major depression with psychotic features, as well as RCTs consisting of participants with acute major depression with or without psychotic features, that reported separately on the subgroup of participants with psychotic features.

Data collection and analysis

Two review authors independently extracted data and assessed risk of bias in the included studies, according to the criteria of the Cochrane Handbook for Systematic Reviews of Interventions. Data were entered into RevMan 5.1. We used intention‐to‐treat data. For dichotomous efficacy outcomes, the risk ratio (RR) with 95% confidence intervals (CIs) was calculated. For continuously distributed outcomes, it was not possible to extract data from the RCTs. Regarding the primary outcome of harm, only overall dropout rates were available for all studies.

Main results

The search identified 3659 abstracts, but only 12 RCTs with a total of 929 participants could be included in the review. Because of clinical heterogeneity, few meta‐analyses were possible. The main outcome was reduction of severity (response) of depression, not of psychosis.

We found no evidence for the efficacy of monotherapy with an antidepressant or an antipsychotic.

However, evidence suggests that the combination of an antidepressant plus an antipsychotic is more effective than antidepressant monotherapy (three RCTs; RR 1.49, 95% CI 1.12 to 1.98, P = 0.006), more effective than antipsychotic monotherapy (four RCTs; RR 1.83, 95% CI 1.40 to 2.38, P = 0.00001) and more effective than placebo (two identical RCTs; RR 1.86, 95% CI 1.23 to 2.82, P = 0.003).

Risk of bias is considerable: there were differences between studies with regard to diagnosis, uncertainties around randomisation and allocation concealment, differences in treatment interventions (pharmacological differences between the various antidepressants and antipsychotics) and different outcome criteria.

Authors' conclusions

Psychotic depression is heavily understudied, limiting confidence in the conclusions drawn. Some evidence indicates that combination therapy with an antidepressant plus an antipsychotic is more effective than either treatment alone or placebo. Evidence is limited for treatment with an antidepressant alone or with an antipsychotic alone.

PICOs

Population
Intervention
Comparison
Outcome

The PICO model is widely used and taught in evidence-based health care as a strategy for formulating questions and search strategies and for characterizing clinical studies or meta-analyses. PICO stands for four different potential components of a clinical question: Patient, Population or Problem; Intervention; Comparison; Outcome.

See more on using PICO in the Cochrane Handbook.

Plain language summary

available in

Pharmacological treatment for psychotic depression

Psychotic depression is a severe depression with psychotic features (i.e. delusions and/or hallucinations). Uncertainty surrounds the most effective drug treatment for psychotic depression: with an antidepressant alone, with an antipsychotic alone or with the combination of an antidepressant plus an antipsychotic.

The aim of this review is to compare the efficacy of the various forms of drug treatment that have been used to treat psychotic depression. We did this by analysing all randomised controlled trials (RCTs) that investigated drug treatments for psychotic depression. We searched for these trials in a wide‐ranging way. The search identified 3659 studies, but in the end, we found only 12 RCTs that met our inclusion criteria. These trials involved a total of 929 people.

From these trials, we found evidence that the combination of an antidepressant plus an antipsychotic provides more effective treatment for psychotic depression than either treatment alone. However, our confidence in this conclusion is limited because the information came from only a small number of RCTs, which included small numbers of people. In addition, the types of people involved varied between RCTs, and the RCTs differed in design, which means that we cannot confidently generalise their findings.

Authors' conclusions

Implications for practice

Psychotic depression is heavily under‐studied, limiting confidence in the conclusions drawn. Evidence suggests that combination therapy with an antidepressant plus an antipsychotic is more effective than either treatment alone or placebo. Evidence for treatment with an antidepressant alone or for an antipsychotic alone is lacking.

Implications for research

Further studies are needed:

  1. to study the efficacy of other combinations of an antidepressant plus an antipsychotic. Regarding the antidepressants: combinations with a TCA, with an SSRI or with other newer antidepressants such as mirtazapine; regarding the antipsychotics: combinations with other so called atypical antipsychotics (aripiprazole, risperidone, olanzapine, etc.);

  2. to compare the effect of the combination of an antidepressant with antipsychotics with other pharmacological options, such as the augmentation of an antidepressant with lithium or more experimental treatments such as ketamine;

  3. to compare the effect of the combination of an antidepressant plus an antipsychotic with ECT; and

  4. to evaluate the efficacy of stepwise approaches or algorithms encompassing the previous steps after each other.

Background

Description of the condition

Psychotic depression is a severe condition that is defined as a depressive episode with psychotic features (i.e. delusions and/or hallucinations) in the context of a (unipolar) major depressive disorder. Psychotic depression is not uncommon. In the US Epidemiologic Catchment Area Study (Johnson 1991), 14% of participants who met the criteria for major depression had a history of episodes with psychotic features. In a European general population study, 18.5% of respondents with a major depressive episode had psychotic features; the prevalence of psychotic depression was 0.4% and of non‐psychotic depression 2.0% (Ohayon 2002). In a US study of hospitalised participants with major depression, 25% met the criteria for psychotic depression (Coryell 1984). Compared with non‐psychotic depression, psychotic depression is marked by greater severity, increased incapacity, a lower likelihood of placebo response, longer duration of episodes and recurrence of psychotic features in subsequent episodes (Coryell 1998).

Description of the intervention

Guidelines (APA 2010; NICE 2009) recommend electroconvulsive therapy (ECT) or pharmacotherapy as treatment for psychotic depression. Pharmacotherapy for psychotic depression could consist of an antipsychotic, an antidepressant or a combination of both. Most guidelines recommend treatment that combines an antidepressant with an antipsychotic (APA 2010; NICE 2009. However, discussion continues regarding whether the combination of an antipsychotic plus an antidepressant is more effective than monotherapy with an antidepressant or an antipsychotic ( Dutch Guideline 2009; Mahli 2009; Parker 1992; Wijkstra 2005; Wijkstra 2007). The intervention studied in this review is pharmacological treatment for psychotic depression, especially the question of whether the combination of an antipsychotic plus an antidepressant is more effective than either treatment given as monotherapy.

How the intervention might work

All antidepressants enhance the activity of serotonin and/or noradrenaline, and some of them (also) dopamine (Sadock 2009). Most antidepressants achieve this via inhibition of reuptake of these neurotransmitters in the presynaptic neuron (tricyclic antidepressants (TCAs), selective serotonin reuptake inhibitors (SSRIs) and serotonin‐noradrenaline reuptake inhibitors (SNRIs)), although some antidepressants have other working mechanisms (e.g. blockade of postsynaptic serotonin‐2 receptors such as mirtazapine or inhibition of their breakdown via inhibition of the enzyme monoamine oxidase (MAOIs)). Nevertheless, their noradrenergic and serotonergic effects do not completely explain their efficacy, as these effects occur already within hours after first intake, but it takes days to weeks before antidepressants begin to exert their effects in participants with depression or anxiety (Sadock 2009).

Almost all antipsychotics (classical as well as atypical antipsychotics, with the exception of clozapine) are blockers of the postsynaptic dopamine‐2 receptor, and their therapeutic efficacy is correlated with their affinity for dopamine‐2 receptors in vivo. However, other effects may contribute to their efficacy, such as their affinity for presynaptic serotonin‐1 receptors, postsynaptic serotonin‐2 receptors and histamine receptors, as can be seen with some atypical antipsychotics (e.g. olanzapine and quetiapine). Similar to the antidepressants, these effects do not completely explain their efficacy because they also occur already within hours after first intake, but it takes days to weeks for antipsychotics to begin to work (Sadock 2009).

The traditional view is that antidepressants work against depression and antipsychotics work against psychosis. Therefore, it seems appropriate in psychotic depression to treat the psychotic symptoms with an antipsychotic and the depressive symptoms with an antidepressant. However, when psychotic depression is considered as the most severe form of depression, and when psychosis is viewed as the distal consequence of that severity (as is the case in the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM‐IV‐TR);APA 2000), treatment with an antidepressant alone seems logical. On the other hand, treatment with an antipsychotic alone, especially one of the newer atypical antipsychotics with possibly antidepressant effects, cannot be ruled out.

Why it is important to do this review

Clinical practise is characterised by uncertainty as to whether it is most appropriate to start treatment in this patient group with antidepressant monotherapy or with the combination of an antidepressant and an antipsychotic because of the potential adverse effects of antipsychotics (especially extrapyramidal side effects, hyperprolactinaemia and the risk of metabolic syndrome, including weight gain). A previous meta‐analysis did not find a statistically significant difference between TCA monotherapy and combination therapy (Parker 1992). However, the findings and conclusions of that meta‐analysis were limited by the inadequate methodology of many of the included studies, which were often retrospective, uncontrolled and/or not randomised. Some international guidelines on the pharmacological treatment of psychotic depression (in the United States: Nelson 1997; in the Netherlands: Dutch Guideline 2009) and those presented in reviews (Wheeler 2000) suggest that one may consider TCA monotherapy before adding an antipsychotic. However, in contrast, the American Psychiatric Association (APA) Practice Guideline for the Treatment of Patients with Major Depressive Disorder (APA 2010) and the National Institute for Health and Clinical Excellence (NICE 2009) recommend initial combination therapy. The same recommendation is made in a review by Coryell (Coryell 1998). This variation between guidelines reflects the limited evidence on which these guidelines are based. In a review about evidence used in practice guidelines (Wijkstra 2007), we concluded that physicians (and patients) should be aware that guidelines for treatment recommendations may be less evidence‐based than asserted, even when it is stated that treatment recommendations are based on the highest level of evidence.

Treatment with a classical antipsychotic alone is not recommended, primarily because of the findings of a study by Spiker (Spiker 1985), in which treatment with perphenazine alone was less effective than treatment with perphenazine plus amitriptyline. However, the atypical antipsychotics may be worth reconsidering now because of the reduced risk of extrapyramidal side effects and potential antidepressant properties of some of the atypical antipsychotics (Rothschild 2004a).

This review is an update of our Cochrane review first published in 2005 (Wijkstra 2005). Our conclusion in 2005 was as follows: ''Treatment with an antipsychotic alone is not a good option. Starting with the combination of an antidepressant and an antipsychotic, as well as starting with an antidepressant alone and adding an antipsychotic if the patient does not respond, both appear to be appropriate options for patients with psychotic depression.''

Since 2005, a few more studies have been conducted, leading to a different conclusion regarding treatment with an antidepressant alone or with the combination of an antidepressant and an antipsychotic.

Another important clinical issue is that differences in response to specific treatments may be explained in relation to non‐response to prior treatment(s). Generalising from observations across medical disciplines, it would be expected that patients who did not respond to an adequate treatment will respond less to subsequent treatment. Some data are available on this topic with regard to pharmacological treatment of major depressive disorder (Sackeim 2001). Two studies (Prudic 1990; Prudic 1996) showed that a greater degree of treatment resistance predicts an inferior response to ECT.

Objectives

  1. To compare the clinical efficacy of pharmacological treatments for patients with an acute psychotic depression: antidepressant monotherapy, antipsychotic monotherapy and the combination of an antidepressant plus an antipsychotic, compared with each other and/or with placebo.

  2. To assess whether differences in response to treatment in the current episode are related to non‐response to prior treatment.

Methods

Criteria for considering studies for this review

Types of studies

We included randomised controlled trials (RCTs) of the pharmacological treatment of participants with acute psychotic depression.

As we expected to identify very few RCTs assessing the treatment of psychotic depression as the primary focus, we decided a priori to also include RCTs assessing the treatment of major depression with or without psychotic features. Effects in the subgroup of participants with psychotic features should then be reported separately, irrespective of whether the subgroup with psychotic features was stratified before randomisation.

No language restrictions were applied for included studies.

Types of participants

Participants were of any age in any setting (both inpatients and outpatients) who had a major depressive disorder and a current episode with psychotic features (delusions and/or hallucinations appearing in the context of a full major depressive episode) according to the Diagnostic and Statistical Manual of Mental Disorders, Third Edition (DSM‐III)/DSM, Third Edition, Revised (DSM‐III‐R)/DSM, Fourth Edition (DSM‐IV)/DSM, Fourth Edition, Text Revision (DSM‐IV‐TR) (APA 1980; APA 1987; APA 1994; APA 2000) or International Classification of Diseases (ICD) codes for the same.

We also included studies in which participants had comorbidities, as comorbidity was not a reason for exclusion.

As patients with a major depressive episode in the context of a bipolar disorder (bipolar depression) are at increased risk of switching to mania (Licht 2008), we intended to include only trials that assessed participants with unipolar depression. If trials had included participants with both unipolar and bipolar depression, we decided a priori to include the trial only when results in the unipolar group were reported separately, or when the percentage of participants with bipolar depression did not exceed 20% of the total study population.

Types of interventions

We included any pharmacological treatment of a current (i.e. acute) episode. Treatment had to be given for at least four weeks with the intention of treating the current episode.

Where possible, the following pairwise comparisons were considered.

  1. Antidepressant versus placebo.

  2. Antipsychotic versus placebo.

  3. Antidepressant versus antidepressant.

  4. Antipsychotic versus antipsychotic.

  5. Antidepressant versus antipsychotic.

  6. Antidepressant plus antipsychotic versus placebo.

  7. Antidepressant plus antipsychotic versus placebo plus antidepressant.

  8. Antidepressant plus antipsychotic versus placebo plus antipsychotic.

Types of outcome measures

Primary outcomes

1. Efficacy: clinical response of depression based on observer‐rated symptom reduction: reduction of at least 50% on the Hamilton Rating Scale for Depression (HRSD, Hamilton 1960) or the Montgomery Åsberg Depression Rating Scale (MADRS, Montgomery 1979) or any other observer depression severity rating scale, or a change score on the Clinical Global Impression‐Change (CGI‐C, Guy 1976) of 'much improved' or 'very much improved'.

2. Harm: overall dropout rate during acute treatment as a proxy measure of overall acceptability of treatment.

Secondary outcomes

3. Remission of depression as defined in the reports and based on HRSD or MADRS or other observer depression severity rating scale.

4. Change from baseline in score on the HRSD, MADRS or any other observer depression severity rating scale or change in severity on Clinical Global Impression‐Severity (CGI‐S).

5. Quality of life, as defined in the reports.

6. Dropout rate due to adverse effects.

Search methods for identification of studies

CCDAN's Specialised Register (CCDANCTR)

The Cochrane Depression, Anxiety and Neurosis Group (CCDAN) maintains two clinical trials registers at its editorial base in Bristol, UK; a references register and a studies‐based register. The CCDANCTR‐References Register contains over 31,500 reports of trials in depression, anxiety and neurosis. Approximately 65% of these references have been tagged to individual, coded trials. The coded trials are held in the CCDANCTR‐Studies Register and records are linked between the two registers through the use of unique Study ID tags. Coding of trials is based on the EU‐Psi coding manual. Reports of trials for inclusion in the Group's registers are collated from routine (weekly), generic searches of MEDLINE, EMBASE and PsycINFO; quarterly searches of the Cochrane Central Register of Controlled Trials (CENTRAL) and review specific searches of additional databases. Reports of trials are also sourced from international trials registers c/o the World Health Organization’s trials portal (ICTRP), drug companies, and the handsearching of key journals, conference proceedings and other (non‐Cochrane) systematic reviews and meta‐analyses. Details of CCDAN's generic search strategies can be found on the Group's website.

Electronic searches

The CCDANCTR‐Studies Register was searched (all years to 12 April 2013) using the following terms:
Condition = (depressi* or “affective disorder*” or “affective symptoms”)
AND
Condition or Comorbidity = (psychosis or psychoses or psychotic* or delusion*)

The CCDANCTR‐Studies Register was searched (all years to 12 April 2013) using the following terms to identify additional untagged references:
Title/Abstract/Keywords = ((depressi* or “affective disorder*” or “affective symptoms”)
AND
Free‐Text=(psychosis or psychoses or psychotic* or delusion* or hallucin* or antipsychotic* or psychotropic*))

An additional search of the Cochrane Central Register of Controlled Trials (CENTRAL) was carried out (Issue 4, 2010) (Appendix 1).

A summary of searches carried out for the earlier version of this review can be found in Appendix 2.

Searching other resources

The reference lists of all studies, related reviews and relevant conference proceedings were screened and key authors contacted.

Data collection and analysis

Selection of studies

In step 1, all abstracts of identified publications were screened independently by two review authors (JL and JW), and studies were selected if they met the following criteria.

  1. RCT.

  2. Included participants with a major depressive disorder.

  3. Investigated the efficacy of pharmacological treatment.

  4. Concerned acute phase treatment (minimum of four weeks treatment), not continuation or maintenance treatment.

If any doubt or disagreement arose between the review authors, the publication was included in step 2. Full articles were obtained for the selected abstracts.

In step 2, selected full articles were screened (JL and JW) according to the following criteria.

  1. Participants with psychotic depression were not excluded.

  2. Results in the subgroup of psychotic depressed participants were reported separately.

If any doubt arose about the article, it was included in step 4.

In step 3, the reference lists of related reviews and of included publications, conference abstracts and personal communications were searched.

Finally, in step 4, two review authors (JL and JW) independently reviewed all identified publications according to the full inclusion criteria of the review. Any disagreement was resolved by consensus discussion with another review author (WN).

Data extraction and management

Extracted data included the following: participant characteristics (age, gender, setting: inpatients/outpatients); diagnosis (which diagnostic instrument, system of classification, number of bipolar participants); prior treatment of the current episode; intervention; length of illness; suicide attempts; treatment details (treatment period, washout period, additional medication, blood levels, doses) and outcome measures. Data were extracted independently by two review authors (JW and JL).

All data (from 2004 and recent data) were entered into RevMan 5.1 (RevMan 2011).

Assessment of risk of bias in included studies

In the original version of this review, we assessed methodological quality of included studies by criteria set out in the Cochrane Handbook for Systematic Reviews of Interventions (Alderson 2004); however, after publication of the revised and expanded Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011), we updated our methods accordingly. Working independently, JW and JL assessed risk of bias of included studies using the tool described in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011). The following items were assessed.

  1. Sequence generation: Was the allocation sequence adequately generated?

  2. Allocation concealment: Was allocation adequately concealed?

  3. Blinding of participants, personnel and outcome assessors for each main outcome or class of outcomes: Was knowledge of the allocated treatment adequately prevented during the study?

  4. Incomplete outcome data for each main outcome or class of outcomes: Were incomplete outcome data adequately addressed?

  5. Selective outcome reporting: Were reports of the study free of suggestion of selective outcome reporting?

  6. Other sources of bias: Was the study apparently free of other problems that could put it at high risk of bias?

We included quotations from the text of included studies and comments on how we assessed the risk of bias; we judged each study to be of low, unclear or high risk of bias.

See risk of bias figures (Figure 1; Figure 2).


Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.

Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.


Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

If disputes arose as to which judgement should be given, resolution was achieved after consultation with the third review author (WN).

Measures of treatment effect

Binary outcomes

For binary efficacy outcomes, such as response, remission and dropouts, the risk ratio (RR) (with 95% confidence intervals (CIs)) was calculated for each comparison using the numbers randomly assigned and the numbers of events.

Continuous outcomes

For continuously distributed outcomes, we calculated the standardised mean difference (SMD).

Skewed and non‐quantitative outcome data were presented descriptively.

Unit of analysis issues

We identified neither cluster‐randomised nor cross‐over trials. However, if found, we would not have included them in our review, as we were interested in differences not between clusters (e.g. clinics) but between drugs or classes of drugs; nor were we interested in the results of a cross‐over phase, as the outcome of the first phase might have had an impact on the outcome of the second (i.e. cross‐over) phase.

In case a study had multiple intervention groups, only the data for the pairwise comparison in question were included. Further, if a study compared two or more medications of the same type (e.g. venlafaxine and imipramine), data were combined into a single category, for example, the category 'antidepressant'.

Dealing with missing data

We used intention‐to‐treat (ITT) response data in the analyses, as ITT data are less biased and because they address a more pragmatic and clinically relevant question (Higgins 2011). When necessary, we converted response data from the trials into ITT response data by using the total number of randomly assigned participants per group that had started with treatment as the denominator. So participants who had started with medication but were withdrawn before endpoint were assumed not to have experienced response.

When data were missing, we contacted the study authors to request the required data.

No other imputation techniques were used to deal with missing data.

Assessment of heterogeneity

First, we evaluated whether clinical homogeneity could be assumed by evaluating any between‐study dissimilarities regarding participants, interventions and outcome measures. Studies that were considered to threaten the clinical homogeneity assumption were excluded from the meta‐analysis.

The I2 statistic supplied with a 95% CI was used to assess the magnitude of statistical heterogeneity when values exceeding 0.40 were considered possibly relevant. We did not perform the Q test to determine heterogeneity because of its low power in our meta‐analysis resulting from the low numbers of studies in all of our comparisons.

Pre‐stated subgroup analyses were planned to explore sources of any heterogeneity. No meta‐regression was performed.

Assessment of reporting biases

When data from at least 10 studies became available, the presence of publication bias was assessed using contour‐enhanced funnel plots in which treatment effects expressed as RR (Relative Risk) from individual studies were plotted against each study's sample size. We did not perform Egger's regression test in view of low statistical power in our meta‐analyses, again resulting from the low number of studies included.

It must be noted that asymmetry of funnel plots does not necessarily indicate publication bias but may result from other biases such as inflated results in smaller studies due to poorer methodological quality, or true heterogeneity.

Data synthesis

We primarily used the Mantel‐Haenszel fixed‐effect method to calculate pooled risk ratios with 95% confidence intervals. Risk ratios are preferred over odds ratios because of their more straight‐forward interpretation (i.e. the number of times the outcome is more likely to occur given one treatment over another).

In cases in which evidence of statistical heterogeneity was found, we used DerSimonian and Laird's random‐effects models (DerSimonian 1986).

Subgroup analysis and investigation of heterogeneity

If sufficient data were available, subgroup analyses were planned as follows.

  1. Participants who were non‐refractory to prior treatment(s) during the current episode.

  2. Participants with mood congruent psychotic features only.

Because all psychotically depressed patients are considered to be severely depressed, it was not considered appropriate to evaluate baseline severity in a subgroup analysis.

Sensitivity analysis

If sufficient data were available, sensitivity analyses were planned as follows.

  1. Studies focusing on psychotic depression only.

  2. Studies in which participants with psychotic depression were separately randomised.

  3. Studies of lower methodological quality to assess robustness of results.

  4. Smaller versus larger studies.

Results

Description of studies

Results of the search

In 2004 from the Cochrane Central Register of Controlled Trials (CENTRAL) search, we identified 1782 publications: in the searches in MEDLINE 720 and in EMBASE 831; in total, 3333. With some overlap from 2004, we have since found an additional 326 publications. The first step of screening the abstracts of these publications in 2004 resulted in 798 relevant publications (749 from CENTRAL, 38 from MEDLINE and 11 from EMBASE); in 2011, we found 40 additional studies (three from CENTRAL, 32 from CCDANCTR‐References Register and 5 from CCDANCTR‐Studies Register). The second step of screening the full articles of these publications in 2004 resulted in the identification of 52 publications (47, three and two respectively) and now 17 (one from CENTRAL, 13 from CCDANCTR‐References Register and three from CCDANCTR‐Studies Register). In 2004, handsearching of reference lists from relevant reviews resulted in the identification of one other publication (Bellini 1994), and handsearching of the included publications did not lead to identification of any further relevant publications. Now, this resulted in no publications. In 2004, the final fourth step of reviewing these 53 publications resulted in seven included studies, and in 2004 we added two other publications, which we knew were in press: one by van den Broek (van den Broek 2004a) and one reporting two similar trials (Rothschild 2004a; Rothschild 2004b). Thus, in 2004 in total, nine publications with 10 RCTs were included. Now, in the fourth step, by reviewing 17 publications, we found two additional studies to be included.

In additional searches on the CCDANCTR and CENTRAL (December 2011 to April 2013), we found another 288 references. Of these 288 references, we did not find any other publication to include.

So, taking together both searches, we found 3974 publications through the electronic search strategies, from which we finally included 11 publications with 12 RCTs (see Figure 3).


Study flow diagram.

Study flow diagram.

In 2004, 13 studies needed a consensus discussion in the fourth step of the search strategy before they were excluded, and now five additional studies needed such a discussion before they were excluded, resulting in a total of 18 excluded studies (see Excluded studies).

Included studies

See the table Characteristics of included studies for a description of the 12 included RCTs (Anton 1990; Bruijn 1996; Meyers 2009; Mulsant 2001; Rothschild 2004a; Rothschild 2004b; Spiker 1985; Spiker 1988; Zanardi 1996; Zanardi 2000; van den Broek 2004a; Wijkstra 2010).

Design

All studies were RCTs comparing the effects of pharmacological interventions for the treatment of psychotic depression or for the treatment of depression with and without psychotic features but with data about the participants with psychotic features published separately.

Sample sizes

Sample sizes were as follows: Anton 1990: 46; van den Broek 2004a: 48; Bruijn 1996: 30; Mulsant 2001: 54; Rothschild 2004a: 124; Rothschild 2004b: 125; Spiker 1985: 58; Spiker 1988: 27; Zanardi 1996: 32; Zanardi 2000: 22; Meyers 2009: 259; and Wijkstra 2010: 122, for a total of 947 participants,

Setting

In most studies, inpatients were included, except Meyers 2009, Rothschild 2004a and Rothschild 2004b. In Meyers 2009, 69.1% of participants entered as inpatients. In the studies of Rothschild, participants were treated for at least one week as inpatients.

Seven of the studies originate from the United States, three from the Netherlands (Bruijn 1996; van den Broek 2004; Wijkstra 2010) and two from Italy (Zanardi 1996; Zanardi 2000).

Participants

All participants fulfilled criteria for major depressive disorder with psychotic features, classified according to a formal classification system (RDC: Research Diagnostic Criteria, DSM‐III, DSM‐IV, DSM‐IV‐TR). Six studies explicitly used a semi‐structured diagnostic interview (van den Broek 2004a: SADS: Schedule for Affective Disorders and Schizophrenia; Bruijn 1996: SADS; Mulsant 2001: BPRS: Brief Psychiatric Rating Scale; Spiker 1985: SADS; Meyers 2009: SCID IV: Structured Clinical Interview for DSM‐IV; Wijkstra 2010: SCID IV).

Diagnostic procedures were different between the studies. In all studies, participants were diagnosed according to a formal classification system (RDC, DSM‐III, DSM‐IV, DSM‐IV‐TR). Six studies explicitly used a semi‐structured diagnostic interview (van den Broek 2004a: SADS; Bruijn 1996: SADS; Mulsant 2001: BPRS; Spiker 1985: SADS; Meyers 2009: SCID IV; Wijkstra 2010: SCID IV). These different procedures could have led to possible differences in participant categories.

Seven studies included only participants with unipolar psychotic depression. In the study of Zanardi 1996, it was possible to exclude bipolar participants from the data. In the study of Anton 1990, 15.8% (six out of thirty eight) bipolar participants were included in the data that the author used in his analysis. However, it was unclear how many of the eight dropouts, who were excluded before analysis, were bipolar participants. The author was not able to give additional information. Therefore, we decided to assume a random dropout rate. In the study of Spiker 1985, 15.5% of participants were bipolar (Anton 1990; Spiker 1985). In the studies of Bruijn 1996 and Zanardi 2000, we were able to exclude the bipolar participants with additional information provided by the authors. In the studies of Bruijn 1996, Meyers 2009 and Wijkstra 2010, the types of psychotic symptoms were described in greater detail. In Spiker 1985, only participants with mood congruent delusions were included. This indicates some heterogeneity in diagnosis with regard to bipolarity.

Interventions

1. Antidepressant versus placebo was studied in one study (Spiker 1988): amitriptyline (three weeks at least 150 mg) versus placebo; treatment period was four weeks.

2. Antipsychotic versus placebo was the subject in one arm of the two identical studies of Rothschild (Rothschild 2004a; Rothschild 2004b): olanzapine (mean 11.9 mg; respectively, 14.0 mg) versus placebo; treatment period was eight weeks.

3. Antidepressant versus antidepressant was examined in five studies. In van den Broek 2004: imipramine (plasma levels imipramine plus its metabolite desmethylimipramine 192 to 521 ng/mL) versus fluvoxamine (plasma level 109 to 325 ng/mL); treatment period was four weeks after predefined blood levels were reached; in Bruijn 1996: imipramine (plasma levels imipramine plus its metabolite desmethylimipramine 199 to 400 ng/mL) versus mirtazapine (plasma level 49 to 93 ng/mL); treatment period was four weeks after predefined blood levels were reached; in Zanardi 1996: sertraline (150 mg from day eight) versus paroxetine (50 mg from day eight); treatment period was five weeks; in Zanardi 2000: venlafaxine (300 mg from day eight) versus fluvoxamine (300 mg from day eight); treatment period was five weeks; and in one arm of the study of Wijkstra 2010: imipramine (plasma levels imipramine plus its metabolite desmethylimipramine 200 to 300 ng/mL) versus venlafaxine (375 mg); treatment period was seven weeks.

4. Antipsychotic versus antipsychotic was not available in any study.

5. Antidepressant versus antipsychotic was the subject in one arm of the study by Spiker 1985: amitriptyline (mean dose 218 mg; 130 to 500 ng/mL) versus perphenazine (mean dose 50 mg; blood level 19 to 113 ng/mL); treatment period was four weeks.

6. Antidepressant plus antipsychotic versus placebo was studied in one arm of both identical studies of Rothschild (Rothschild 2004a; Rothschild 2004b): olanzapine (12.4 mg; respectively, 13.9 mg) plus fluoxetine (23.5 mg; respectively, 22.6 mg) versus placebo; treatment period was eight weeks.

7. Antidepressant plus antipsychotic versus placebo plus antidepressant was studied in four studies. In Anton 1990: amitriptyline (150 to 250 mg) plus perphenazine (24 to 40 mg) versus amoxapine (300 to 500 mg); treatment period was four weeks; in Mulsant 2001: nortriptyline (mean 63 mg) plus perphenazine (mean 19 mg) versus nortriptyline (mean dose 76 mg); treatment started with nortriptyline, and once nortriptyline blood level was between 50 and 50 ng/mL, random assignment followed; treatment period with nortriptyline plus perphenazine (or placebo) after random assignment was 2 to 16 weeks (total treatment at least four weeks); in one arm of the study by Spiker 1985: amitriptyline (mean 170 mg; 18 to 128 ng/mL) plus perphenazine (mean 54 mg; 157 to 690 ng/mL) versus amitriptyline (mean 218 mg; 130 to 500 ng/mL); treatment period was four weeks; and in two arms of Wijkstra 2010: venlafaxine (375 mg) plus quetiapine (600 mg) versus imipramine (plasma levels imipramine plus its metabolite desmethylimipramine 200 to 300 ng/mL) and versus venlafaxine (375 mg); treatment period was 7 weeks.

8. Antidepressant plus antipsychotic versus placebo plus antipsychotic was the subject in three studies. In one arm of both identical studies of Rothschild (Rothschild 2004a; Rothschild 2004b): olanzapine (12.4 mg; respectively, 13.9 mg) plus fluoxetine (23.5 mg; respectively, 22.6 mg) versus olanzapine (mean 11.9 mg; respectively, 14.0 mg); treatment period was eight weeks; in one arm of Spiker 1985: amitriptyline (mean 170 mg; 18 to 128 ng/mL) plus perphenazine (mean 54 mg; 157 to 690 ng/mL) versus perphenazine (mean dose 50 mg; blood level 19 to 113 ng/mL); treatment period was four weeks; and in Meyers 2009, olanzapine (15 to 20 mg) plus sertraline (150 to 200 mg) versus olanzapine (15 to 20 mg); treatment period was 12 weeks.

Most of these studies had a washout period before the start of treatment or random assignment, varying from four to seven days. One study had a washout period of two weeks (Spiker 1988). Because of the design of the study of Mulsant 2001 (all participants used nortriptyline before random assignment to additional perphenazine or placebo), this study was considered a trial without a washout period. The two trials Rothschild 2004a and Rothschild 2004b had a 'screening period' of three to nine days, leaving unclear whether this was a period in which medication was not allowed. In Meyers 2009, psychotropics were tapered before random assignment without a washout period. So, heterogeneity is seen in the medication‐free period before treatment.

Dosage of psychotropics used in the different trials was considered reasonably adequate. However, differences in dosing strategies led to possible bias. In four studies re TCAs, doses were adjusted according to predefined therapeutic plasma levels (Bruijn 1996; Mulsant 2001; van den Broek 2004a; Wijkstra 2010). In the study of Spiker 1985, dose adjustment was not based on plasma levels, but afterwards it was found that the plasma levels were therapeutic in most participants, although plasma levels of participants receiving TCAs alone were lower compared with those of participants receiving TCA plus antipsychotic. In the two other trials with TCAs, no plasma levels were determined. Dosages in the Spiker 1988 study were at least 150 mg/d, but only during three of the four study weeks. In the study of Anton 1990, participants received at least 150 mg/d from the third day of the four‐week study period. Amitriptyline 150 mg/d is in the low range of an adequate dosage. In the studies of van den Broek 2004 and Bruijn 1996, SSRIs (fluvoxamine and mirtazapine) were dosed according to predefined plasma levels. Fixed doses were used in the studies of Zanardi (Zanardi 1996; Zanardi 2000): sertraline 150 mg, paroxetine 50 mg, venlafaxine 300 mg and fluvoxamine 300 mg. In the studies of Rothschild (Rothschild 2004a; Rothschild 2004b), doses were clinically adjusted: olanzapine 5 to 20 mg and fluoxetine 20 to 80 mg.

Differences in additional medication strategies were also noted. In most studies, additional medication was used, such as benzodiazepines (flurazepam up to 30 mg or lorazepam as needed) and anticholinergics. In the studies of van den Broek 2004a and Bruijn 1996, some participants were treated with additional haloperidol, if clinically needed. With information provided by the authors, we were able to identify these participants in the results, and we counted them as dropouts, as in these studies the focus was the comparison of two antidepressants.

Outcomes

The primary efficacy outcome was the response rate in each study. It was not possible to transfer the authors' defined response data into rates based on one definition (i.e. at least 50% reduction of the HRSD score). Some authors used response definitions based on what is often considered remission. In addition, some authors included psychotic symptoms in their response definition. In the absence of a better option, we decided to use the response data as reported by the authors.

Differences in outcome measures were noted. In most trials, the HRSD was used as an outcome measure. However, different versions of the HRSD were used, and the authors used different definitions of response. In six trials, the response definition was a reduction of at least 50% of the HRSD score compared with baseline (Anton 1990; Bruijn 1996; Rothschild 2004a; Rothschild 2004b; van den Broek 2004; Wijkstra 2010). In four studies, the authors' definition of response was actually comparable with a frequently used definition of remission (Spiker 1985; Spiker 1988; Zanardi 1996; Zanardi 2000).

In five studies, the response definition also includes psychotic symptoms (Meyers 2009; Mulsant 2001; Spiker 1985; Zanardi 1996; Zanardi 2000). Bruijn 1996 and van den Broek 2004a used a response criterion of HRSD‐17 < 50%; Anton 1990 ≤ 50%; Rothschild 2004a and Rothschild 2004b HRSD‐24 < 50%; Wijkstra 2010 < 50% plus < 15; Spiker 1985 HRSD‐17 < 7 and no delusions; Spiker 1988 HRSD‐17 < 7 and not psychotic, or HRSD‐17 6.5 to 9.5 and not psychotic and a third less of entering score; and Mulsant 2001 HRSD‐17 < 11 and BPRS score of the items 11, 12 and 15 of 1 or 2 (i.e. not psychotic). In Meyers 2009, remission was defined as HRSD‐17 score < 11 and no psychosis; Zanardi 1996 HRSD‐21 < 8 and DDERS (Dimensions of Delusional Experience Rating Scale) = 0; and Zanardi 2000 HRSD‐21 < 9 and DDERS = 0. In the studies of Zanardi 1996 and Zanardi 2000, no minimum HRSD score was applied as an inclusion criterion.

In addition to response rates based on the above criteria used by study authors, several studies reported remission rates separately (van den Broek 2004a: HRSD‐17 < 8, Rothschild 2004a and Rothschild 2004b: HAMD‐24 < 9 for two consecutive visits, Wijkstra 2010: HRSD‐17 < 8). In two studies, the authors' definition of response is the same as what is nowadays considered the definition of remission (Spiker 1985 and Spiker 1988: HRSD‐17 < 7).

Dropout rates

Overall dropout rates for the primary outcome were available for all studies. Dropout rates due to adverse effects were available for six studies (Anton 1990; Meyers 2009; Mulsant 2001; Spiker 1985; van den Broek 2004a; Wijkstra 2010). Dropout rates due to adverse effects were not based on ITT analysis for three studies (Bruijn 1996; Rothschild 2004a; Rothschild 2004b); were unavailable for one study (Spiker 1988); and were the same as the overall dropout rates for two studies (Zanardi 1996; Zanardi 2000). Dropouts specifically due to mortality or suicide were not reported, so we decided to extract only overall dropout rates.

The overall dropout rates for the studies varied from 9% to 45%. In the Bruijn 1996 study, the reported dropout rate was 10%. However, when haloperidol‐treated participants were included as dropouts, as was our approach, the dropout rate was 40%. In the two multicenter trials with olanzapine/fluoxetine (Rothschild 2004a; Rothschild 2004b), dropout rates were 41%, and even higher non‐completion rates (56%) were reported. Mostly, no statistically significant differences in overall dropout rates were noted between any of the treatments, neither in individual studies nor after pooling of studies.

Prior treatment

The Bruijn 1996 study provided information on prior treatment of the current episode. However, these data were not available for the subgroup with psychotic depression. In Wijkstra 2010, some data about prior treatment are available. The other studies did not provide information on prior treatment of the current episode. Therefore, it was not possible to examine differences in treatment response in relation to non‐response to prior treatment(s).

Excluded studies

See the Characteristics of excluded studies table.

Reasons for exclusion involved not fulfilling inclusion criteria: open study design, problems with diagnosis (unclear diagnostic procedure, more then 20% bipolar participants included and no additional data to exclude bipolar participants), low numbers of included participants (3 to 15 per group), problems with treatments (continuation of mood stabilisers, treatment with diverse psychotropics), no possibility to extract ITT data and no adequate data on the MDD subgroup.

Risk of bias in included studies

See Figure 1 and Figure 2 and the table Characteristics of included studies.

Allocation

All included studies were randomised studies. Randomisation was fully described in van den Broek 2004a and Wijkstra 2010. Randomisation was described in part in Meyers 2009, Spiker 1985 and Spiker 1988. In the other eight studies, randomisation was mentioned as such, but the methods of randomisation were not delineated.

(Random sequence generation: seven studies unclear risk and five low risk of bias; allocation concealment: 10 unclear risk and two low risk.)

Blinding

All studies were double‐blind studies, but blinding itself was not always adequately described in the methods section of the study. Blinding was adequately described in Anton 1990, Bruijn 1996, Meyers 2009,Mulsant 2001, Spiker 1985, Spiker 1988, van den Broek 2004a and Wijkstra 2010. The method of blinding was not explicitly described in Rothschild 2004a, Rothschild 2004b, Zanardi 1996 and Zanardi 2000. However, the authors explicitly state the double‐blind condition of their studies; we have no reason to doubt that these double‐blind conditions pertained to both investigators/assessors and participants.

(Blinding of participants: two studies unclear risk and 10 studies low risk; blinding of personnel: two studies unclear risk and 10 studies low risk; blinding of outcome assessors: five studies unclear risk and seven studies low risk.)

Incomplete outcome data

The primary efficacy outcome was the response rate (of depression). It was not possible to transfer the authors' defined response data into rates based on a single definition (i.e. 50% reduction of the HRSD score). Some authors used response definitions based on what is often considered remission (e.g. HAMD < 8 or 10), and some authors included psychotic symptoms in their response definition. In the absence of a better option, we decided to use response data as reported by the authors, with the preference to use response of depression rather than response of psychosis.

In eight of the 12 studies, we recalculated intention‐to‐treat response rates using all randomly assigned participants as the denominator. Of 46 participants in the study of Anton 1990, eight were dropped from the study before receiving two full weeks of active medication. These participants were excluded from the analysis by the author, but we included them in our ITT analysis. In the studies of van den Broek 2004a and Bruijn 1996, we counted participants treated with haloperidol as dropouts because haloperidol treatment in these participants was started after random assignment, in part to keep them in the study. Thus, treatment with haloperidol is considered a potential bias with regard to the effect of study medication as well as dropout. Mulsant 2001 excluded six dropouts after random assignment to perphenazine or placebo. We included them in our ITT analysis. In the studies of Rothschild 2004a and Rothschild 2004b, seven per cent and nine per cent, respectively, of randomly assigned participants were lost before baseline plus one visit. These participants were excluded from the study's analysis but were included in our ITT analysis. In both studies of Spiker (Spiker 1985; Spiker 1988), seven dropouts were excluded from their analysis, but we included them in our ITT analysis.

For the secondary outcome of change in symptom severity, we decided to refrain from using continuous data from observer depression severity scales. In two studies (Bruijn 1996; van den Broek 2004), continuous data were available for the total group (psychotic and non‐psychotic) but not from the psychotic subgroup. In four studies, continuous data were not available (Anton 1990; Mulsant 2001; Spiker 1988; Zanardi 1996). In one study (Spiker 1985), baseline and final mean HRSD data are given, but it was impossible to exclude bipolar participants from these data and convert the data to ITT data. In the studies of Rothschild (Rothschild 2004a; Rothschild 2004b), only LOCF continuous data are available, and in this study, dropout is very high. Pooling of data from the three remaining studies is useless because three different comparisons are studied: AD versus AD (Zanardi 2000) with no ITT data; AD + AP versus AP (Meyers 2009); and AD + AP versus AD (Wijkstra 2010).

(Incomplete data: one study high risk, three studies unclear risk and eight studies low risk.)

Selective reporting

All studies used generally accepted outcomes. From two recent studies (Meyers 2009 and Wijkstra 2010), the study protocol is available; in these two studies, no post‐protocol changes in outcome measures had been made (except in Wijkstra 2010, which used remission as an outcome not stated in the protocol but with reasonable argument that remission has become a generally accepted outcome measure).

(Selective reporting: 10 studies unclear risk and two studies low risk.)

Other potential sources of bias

The subgroup of psychotic depressed participants (Bruijn 1996; Spiker 1988; van den Broek 2004a) was not stratified before random assignment in any of these studies. Subgroup analysis are more likely to be carried out if the results for primary outcomes are not significant or are more likely to be reported for groups for whom a significant result was found. However, in the case of this review, we ourselves analysed subgroup data in studies that primarily reported on participants with depression with and without psychotic features.

As described under 'Description of included studies', clinical heterogeneity is seen within the results.

In five studies, response definition included response with regard to psychotic symptoms (Meyers 2009; Mulsant 2001; Spiker 1985; Zanardi 1996; Zanardi 2000), and in the other studies (Anton 1990; Bruijn 1996; Rothschild 2004a; Rothschild 2004b; Spiker 1988; van den Broek 2004; Wijkstra 2010; Zanardi 1996), response rates concerned only change in severity of depression. In our analysis, we looked only for response of depression leading to a possible bias favouring antidepressants over antipsychotics.

(Other biases together: seven studies high risk and five studies unclear risk.)

Effects of interventions

For the secondary outcome of change in symptom severity, extracting continuous data from observer depression severity scales was not possible because in seven studies, we were not able to convert these data according to intention‐to‐treat analysis (Anton 1990; Bruijn 1996 ; Rothschild 2004a; Rothschild 2004b; Spiker 1985; Spiker 1988; van den Broek 2004a), and in two other studies, no continuous data were reported (Zanardi 1996; Zanardi 2000).

Comparison 1. Antidepressant versus placebo

Primary outcomes
1.1 Efficacy response rates

An antidepressant was compared with placebo in only one study.

In this study (Spiker 1988; Analysis 1.1), no evidence suggested that amitriptyline was superior to placebo (RR 8.40, 95% CI 0.50 to 142.27, P = 0.14).

1.2 Harm: overall dropout rate during acute treatment

No difference (Spiker 1988; Analysis 1.2) (RR 1.24, 95% CI 0.34 to 4.51).

Secondary outcomes
1.3 Remission

No data.

1.4 Change from baseline

No ITT data.

1.5 Quality of life

No data,

1.6 Dropout rate due to adverse effects

No data.

Comparison 2. Antipsychotic versus placebo

Primary outcomes
2.1 Efficacy response rates

Two studies compared an antipsychotic with placebo (Rothschild 2004a; Rothschild 2004b). Pooling of these studies (Analysis 2.1) does not reveal a difference between olanzapine and placebo (RR 1.13, 95% CI 0.74 to 1.73, P = 0.57).

2.2 Harm: overall dropout rate during acute treatment

Pooling of these two studies (Rothschild 2004a; Rothschild 2004b; Analysis 2.2) reveals no difference (RR 0.79, 95% CI 0.57 to 1.08, P = 0.14).

Secondary outcomes
2.3 Remission

No data.

2.4 Change from baseline

For the secondary outcome of change in symptom severity, extracting continuous data from observer depression severity scales was not possible; we were not able to convert these data according to ITT analysis (Rothschild 2004a; Rothschild 2004b).

2.5 Quality of life

No data.

2.6 Dropout rate due to adverse effects

No data.

Comparison 3. Antidepressant versus antidepressant

Primary outcomes
3.1 Efficacy response rates

In five studies, two different antidepressants were compared directly with each other (Bruijn 1996; van den Broek 2004a; Wijkstra 2010; Zanardi 1996; Zanardi 2000).

Differences in efficacy were found in three of these studies. In the study by van den Broek 2004a (Analysis 3.1), imipramine was more effective than fluvoxamine (RR 2.10, 95% CI 1.06 to 4.17); in the study by Bruijn 1996 (Analysis 3.1), imipramine was near more effective than mirtazapine (RR 3.00, 95% CI 1.01 to 8.95); and in the study by Zanardi 1996 (Analysis 3.1), sertraline was better than paroxetine (RR 3.37, 95% CI 1.19 to 9.57, P = 0.02). The study by Zanardi 2000 (Analysis 3.1) did not reveal a difference between fluvoxamine and venlafaxine (RR 1.50, 95% CI 0.82 to 2.75). In Wijkstra 2010 (Analysis 3.1), no difference was noted between imipramine and venlafaxine (RR 1.57, 95% CI 0.93 to 2.67).

Pooling of studies was not reasonable because different antidepressants were used.

3.2 Harm: overall dropout rate during acute treatment

In these five studies, no differences in overall dropout rate were reported (Analysis 3.1).

Secondary outcomes
3.3 Remission

No data.

3.4 Change from baseline

Pooling not possible because different antidepressants were used.

3.5 Quality of life

No data.

3.6 Dropout rate due to adverse effects

No data.

Comparison 4. Antipsychotic versus antipsychotic

No data were available for any outcome for this comparison.

Comparison 5. Antidepressant versus antipsychotic

Primary outcomes
5.1 Efficacy response rates

We found one RCT that compared an antidepressant with an antipsychotic. In this trial (Spiker 1985; Analysis 4.1), no difference between perphenazine and amitriptyline was reported (RR 2.09, 95% CI 0.64 to 6.82).

5.2 Harm: overall dropout rate during acute treatment

One study (Spiker 1985; Analysis 4.2) reported no difference between perphenazine and amitriptyline (RR 1.79, 95% CI 0.18 to 18.02).

Secondary outcomes
5.3 Remission

No data.

5.4 Change from baseline

For the secondary outcome of change in symptom severity, extracting continuous data from observer depression severity scales was not possible because in the only study, we were not able to convert these data according to intention‐to ‐treat analysis (Spiker 1985).

5.5 Quality of life

No data.

5.6 Dropout rate due to adverse effects

No data.

Comparison 6. Antidepressant plus antipsychotic versus placebo

Primary outcomes
6.1 Efficacy response rates

Two identical studies (Rothschild 2004a; Rothschild 2004b) compared the combination of fluoxetine and olanzapine with placebo. Pooling of the studies (Analysis 5.1) reveals the combination to more efficacious (RR 1.86, 95% CI 1.23 to 2.82, P = 0.003).

6.2 Harm: overall dropout rate during acute treatment

Pooling of both studies (Rothschild 2004a; Rothschild 2004b; Analysis 5.2) reveals no difference (RR 0.75, 95% CI 0.48 to 1.18, P = 0.21).

In this pooling, we find heterogeneity (I2 = 76%). In these two studies, dropout rates are high—in one study higher for placebo, and in the other study higher for olanzapine.

Secondary outcomes
6.3 Remission

No data.

6.4 Change from baseline

For the secondary outcome of change in symptom severity, extracting continuous data from observer depression severity scales was not possible because in these two studies, we were not able to convert these data according to intention‐to ‐treat analysis (Rothschild 2004a; Rothschild 2004b).

6.5 Quality of life

No data.

6.6 Dropout rate due to adverse effects

No data.

Comparison 7. Antidepressant plus antipsychotic versus placebo plus antipsychotic

Primary outcomes
7.1 Efficacy response rates

Four studies compared the combination of an antidepressant plus an antipsychotic with antipsychotic monotherapy (Meyers 2009; Spiker 1985; Rothschild 2004a; Rothschild 2004b).

In the ITT analysis of Spiker et al. (Spiker 1985; Analysis 6.1), the combination of amitriptyline plus perphenazine was superior to perphenazine (RR 3.61, 95% CI 1.23 to 10.56, P = 0.02).

Pooling of both identical Rothschild studies (Analysis 6.1; Rothschild 2004a; Rothschild 2004b) gives an advantage to the combination of olanzapine plus fluoxetine over olanzapine alone (RR 1.64, 95% CI 1.10 to 2.44, P = 0.01).

In Meyers at al (Meyers 2009; Analysis 6.1), olanzapine plus sertraline was more effective than olanzapine alone (RR 0.57, 95% CI 0.39 to 0.82, P = 0.003).

Pooling the data from all four studies (Meyers 2009; Rothschild 2004a; Rothschild 2004b; Spiker 1985; Analysis 6.1) shows a difference favouring the combination (RR 1.83, 95% CI 1.40 to 2.38, P = 0.00001).

7.2 Harm: overall dropout rate during acute treatment

Pooling all four studies (Meyers 2009; Rothschild 2004a; Rothschild 2004b; Spiker 1985; Analysis 6.2) reveals no difference (RR 0.79, 95% CI 0.63 to 1.01, P = 0.06, I2 = 63%).

Heterogeneity of these pooled results is possibly a result of the Rothschild 2004a and Rothschild 2004b studies because in these two studies, dropout rates are very high, and these rates are different between the two studies.

Secondary outcomes
7.3 Remission

No data.

7.4 Change from baseline

For the secondary outcome of change in symptom severity, extracting continuous data from observer depression severity scales was not possible because in three studies, we were not able to convert these data according to ITT analysis (Rothschild 2004a; Rothschild 2004b; Spiker 1985).

7.5 Quality of Life

No data.

7.6 Dropout rate due to adverse effects

No data.

Comparison 8. Antidepressant plus antipsychotic versus placebo plus antidepressant

Primary outcomes
8.1 Efficacy response rates

Four studies (five comparisons) compared the combination of an antidepressant plus an antipsychotic with antidepressant monotherapy (Anton 1990; Mulsant 2001; Spiker 1985; Wijkstra 2010). Two of these studies compared a TCA plus an antipsychotic with TCA monotherapy (Mulsant 2001; Spiker 1985). Pooling of the ITT data of these two studies (as in our first review) did not reveal a difference (RR 1.44, 95% CI 0.86 to 2.41, P = 0.16: analysis in first review). Another study that compared the combination of venlafaxine plus an antipsychotic (quetiapine) with venlafaxine monotherapy and with imipramine monotherapy (Wijkstra 2010; Analysis 7.1) found a difference with venlafaxine (RR 0.51, 95% CI 0.31 to 0.83) but not with imipramine (RR 0.80, 95% CI 0.55 to 1.14). In the study of Anton 1990, amoxapine was used as an antidepressant with antipsychotic effects. Pooling of these five studies results in a difference favouring the combination (RR 1.44; 95% CI 1.15 to 1.80, P = 0.002; Analysis 7.1).

Three studies in this comparison used the same antidepressant in both arms, whereas two studies used a different antidepressant in each arm. Analysing just the studies where the same antidepressant was used in both arms (i.e. leaving out Anton 1990 and Wijkstra 2010) (Analysis 8.1) still reveals a difference in favour of the combination (RR 1.70, 95% CI 1.19 to 12.43, P = 0.003).

8.2 Harm: overall dropout rate during acute treatment

No differences were noted after pooling of these three studies (Analysis 7.2): RR 0.91, 95% CI 0.58 to 1.44, P = 0.69. In the subgroup analysis, no difference is noted (Analysis 8.2; RR 1.04, 95% CI 0.52 to 0.91, P = 0.91).

Secondary outcomes
8.3 Remission

No data.

8.4 Change from baseline

For the secondary outcome of change in symptom severity, extracting continuous data from observer depression severity scales was not possible because we were not able to convert these data according to intention‐to‐treat analysis.

8.5 Quality of life

No data.

8.6 Dropout rate due to adverse effects

No data.

Subgroup and sensitivity analysis

Because of lack of data, subgroup and sensitivity analyses were not possible.

Discussion

Despite our extensive search of the literature (screening more than 3600 abstracts and 829 full articles, as well as carefully reviewing 69 publications), we identified very few RCTs on pharmacological treatment with an antidepressant, an antipsychotic or the combination of an antidepressant with an AP of participants with a major depressive disorder with a current episode with psychotic features (unipolar psychotic depression).

In addition to nine studies, whose main focus was the treatment of participants with psychotic depression, we were able to find three studies that reported separately on the effects of the subgroups of participants with psychotic depression. The authors of two further studies provided us with additional information on the results for the subgroup of psychotically depressed participants in their studies on both psychotic and non‐psychotic depressed patients. In our previous review, we invited authors of several studies to provide us with subgroup data if available, so we could use these data in our review, but we did not receive any data.

In this update, we found two new studies to be added to the ten studies found in our previous review, which (again) illustrates that this most severe form of depression is highly under‐investigated. One probable reason for this is that it is very difficult to conduct RCTs in patients with psychotic depression. These patients often are not only psychotic but are very anxious or physically ill. Moreover, they are often offered ECT directly without a trial of pharmacological treatment. Finally, many patients with psychotic depression are not able or are reluctant to give informed consent, or they tend to withdraw their consent.

As a result of the paucity of RCTs, few clinically relevant conclusions can be drawn.

Summary of main results

1. Evidence was derived from two identical placebo‐controlled studies (Rothschild 2004a; Rothschild 2004b; Analysis 5.1) for the efficacy of the combination of an antidepressant plus an antipsychotic (fluoxetine plus olanzapine vs placebo; RR 1.86, 95% CI 1.23 to 2.82, P = 0.003)

2. We found evidence that the combination of an antidepressant plus an antipsychotic is more effective than antidepressant monotherapy. Pooling of four studies (five comparisons) that compared the combination of an antidepressant plus an antipsychotic with antidepressant monotherapy (Anton 1990; Mulsant 2001; Spiker 1988; Wijkstra 2010; Analysis 7.1) results in a significant difference favouring the combination (RR 1.44, 95% CI 1.15 to 1.80, P = 0.002). When the two comparisons with a different antidepressant are left out (Anton 1990; Wijkstra 2010), this difference still is statistically significant (Analysis 8.1; RR 1.70, 95% CI 1.19 to 2.43, P = 0.003). In our previous review, based on two studies (Mulsant 2001; Spiker 1988; RR 1.44, 95% CI 0.86 to 2.41), no such proof was found. Therefore, it can be concluded that the recommendation in the APA guidelines (APA 2010) that in psychotic depression the combination should be preferred over an AD alone can be considered evidence based.

3. The combination of an antidepressant plus an antipsychotic is more effective than antipsychotic monotherapy. Pooling of four studies that compared the combination of an antidepressant plus an antipsychotic with antipsychotic monotherapy (Meyers 2009; Rothschild 2004a; Rothschild 2004b; Spiker 1985; Analysis 6.1) shows a difference favouring the combination (RR 1.83, 95% CI 1.40 to 2.38, P = 0.00001). This confirms the conclusion stated in our previous review based on three of these four studies (Rothschild 2004a; Rothschild 2004b; Spiker 1985).

4. No randomised controlled data are available to lead to the conclusion that an antidepressant alone is efficacious in the treatment of psychotic depression. Only one small study compared monotherapy with an antidepressant (amitriptyline) with placebo (Spiker 1988; Analysis 1.1) and reported no difference (RR 8.40, 95% CI 0.50 to 142.27).

5. No randomised controlled data are available to lead to the conclusion that an antipsychotic alone is efficacious in the treatment of psychotic depression. Two studies compared monotherapy with an antipsychotic (olanzapine) with placebo (Rothschild 2004a; Rothschild 2004b). Pooling of these two studies does not reveal a difference (Analysis 2.1; RR 1.13, 95% CI 0.74 to 1.73).

6. We were not able to collect data on prior treatments. So we could not address the second objective of our review: to assess whether differences in response to treatment in the current episode would be related to non‐response to prior treatment(s).

7. Regarding acceptability of treatment (dropout, adverse effects) and quality of life, we were able to collect data only on overall dropout. In all studies except the study of Meyers 2009, no differences in overall dropout rates were reported between any of the treatment groups, neither in individual studies nor after pooling of studies. With this rather rough measure, we did not find overall differences in overall acceptability of treatments. In the study of Meyers 2009, fewer dropouts were reported in the group treated with the combination of olanzapine plus sertraline than in the group treated with olanzapine alone (RR 1.43, 95% CI 1.08 to 1.88). The authors suggest that higher attrition rates among participants treated with olanzapine plus sertraline may be attributable to insufficient response in the olanzapine‐treated group.

8. We only found indication of heterogeneity drop‐out data in instances where the two identical studies of Rothschild (Rothschild 2004a; Rothschild 2004b)  were included (Analysis 2.2; Analysis 5.2; Analysis 6.2). This heterogeneity probably is due to high dropout rates, together with differences in dropout rates between the two studies.

Overall completeness and applicability of evidence

Our conclusions are based on only 12 studies that fulfilled our inclusion criteria. Moreover, in the included studies, only a few different antidepressants and antipsychotics were used. Therefore, it remains unclear whether the above conclusions can be extrapolated to other antidepressants and antipsychotics. Strictly spoken, evidence that the combination of an antidepressant plus an antipsychotic is more effective than an antidepressant alone in an RCT has been obtained only for the combination of venlafaxine and quetiapine compared with venlafaxine (Wijkstra 2010), although evidence that the combination of an antidepressant plus an antipsychotic is more effective than an antipsychotic alone has been obtained in RCTs for only two antipsychotics: the combination of amitriptyline and perphenazine reported as more effective than perphenazine (Spiker 1985); all three other studies involved combinations with olanzapine: in two studies with fluoxetine (Rothschild 2004a; Rothschild 2004b) and in one study with sertraline (Meyers 2009).

Nearly all participants in these studies were inpatients. This of course is a consequence of the severity of the illness and the fact that in clinical practice, most patients with psychotic depression are treated as inpatients. There is also the problem of patients who were not included in the studies: the most severely ill patients who were not able to give informed consent or who were immediately given ECT. All these points limit the generalisability of the results to all patients with psychotic depression.

These problems with the quality of diagnostic assessment can restrict generalisation of the findings to all patients with psychotic depression, leaving aside the problem of establishing the diagnosis of psychotic depression in clinical reality.

Quality of the evidence

The strength of this review and its conclusion is that only randomised controlled studies have been included, and only ITT data have been used in the analyses.

Several factors limit our confidence in the findings of this review.

Most studies were relatively small. Only four RCTs had a more or less adequate sample size: Rothschild 2004a; Rothschild 2004b: with olanzapine 48 and 53 participants, with placebo 51 and 49 participants, but only 25 and 23 olanzapine plus fluoxetine, respectively; Meyers 2009: with olanzapine + sertraline 129 and sertraline 130 participants; and Wijkstra 2010: imipramine 42, venlafaxine 39 and venlafaxine + quetiapine 41 participants. As with all systematic reviews, publication bias is a potentially serious source of bias. Too few studies were identified to allow further investigation into the possibility of publication bias (e.g. by making funnel plots). However, the fact that in these mostly small trials, five (50%) found a significant difference between two active treatments, is suggestive of publication bias.

Allocation concealment, especially in the older studies, was not explicitly described. Although we do not assume allocation concealment to be a real bias, this is of course unsure.

Dosages of antidepressants and antipsychotics used in the different trials were considered reasonably adequate. However, differences in dosing strategies were noted, leading to possible bias. Differences in additional medication strategies and differences in treatment periods were also reported (see paragraph 'Other potential sources of bias').

We could use only one outcome measure regarding efficacy: the response rates as defined by the authors. It was impossible to recalculate these response rates into a standard response rate based on a single definition (e.g. reduction on the HRSD‐17 of at least 50% compared with baseline), as many studies used other versions of the HRSD. Moreover, some studies used response definitions that are commonly used for the definition of remission, and we could not recalculate the results to the more commonly used definition of response (i.e. reduction of 50% compared with baseline).

We cannot rule out the possibility of differences in the diagnostic assessment of participants and thus in the quality of the diagnosis across the studies. Although it was reported in all publications that participants included in the trials fulfilled criteria for a major depressive episode with psychotic features, according to a specified diagnostic classification system (RDC, DSM‐III or DSM‐IV), one could doubt the reliability of the diagnoses made in some trials. Six studies (Bruijn 1996; Meyers 2009; Mulsant 2001; Spiker 1985; van den Broek 2004; Wijkstra 2010) used a semi‐structured interview, and only three studies (Bruijn 1996; Meyers 2009; Wijkstra 2010) reported the types of psychotic features. This leaves open the possibility that for instance the judgement that a feeling or idea of guilt was a guilt delusion was drawn differently across the studies in this review. A similar diagnostic problem may have played a role in the judgement of whether a participant had a psychotic depression in the course of unipolar disorder or bipolar disorder. Finally, the study of Mulsant 2001 included a geriatric sample with a mean age of 72 years, leading to the possibility that dementia or another neurological disorder was part of the diagnosis in some participants.

Potential biases in the review process

One problem is that there does not exist a key word (Mesh Term) for psychotic depression. Therefore, we had to search all RCTs involving depression on whether included participants had depression with psychotic features, or whether such participants had been part of the group of included participants and were reported as a separate subgroup. We anticipated in the first version of this review that we might have missed one or more studies. However, we did not receive any information that we had missed any study. Therefore, in this update, we are now rather sure that we indeed have included all existing studies.

In three studies (Bruijn 1996Spiker 1988; van den Broek 2004a), the subgroup of psychotic depressed participants was part of a greater group of participants with psychotic and non‐psychotic depression, although the subgroups were not stratified before random assignment.

Another potential problem, which was not taken into account in our a priori protocol before this systematic review was performed, is that in five studies, the response definition included response with regard to psychotic symptoms (Meyers 2009Mulsant 2001Spiker 1985Zanardi 1996Zanardi 2000), although in the other studies (Anton 1990Bruijn 1996Rothschild 2004aRothschild 2004bSpiker 1988van den Broek 2004; Wijkstra 2010Zanardi 1996), response rates concerned only change in severity of depression. In our analysis, we looked only for the response of depression, leading to a possible bias favouring antidepressants over antipsychotics.

Agreements and disagreements with other studies or reviews

In a review of practice guidelines regarding the treatment of psychotic depression (Wijkstra 2007), we found different recommendations based on slightly different studies, mostly not reanalyzed. Two guidelines—Nice 2004 and the Dutch guideline 2005 (now updated with no changes regarding treatment of psychotic depression: Dutch Guideline 2009; NICE 2009)—were cautious in their recommendations. Nice 2004: Augmenting an AD with an AP should be considered; and Dutch 2005: Starting treatment with a TCA and if after 4 weeks there is still no response, adding an AP is a reasonable option; starting with the combination of a TCA and an AP is also a reasonable option. The other reviewed guidelines, including the APA guideline 2000 (now updated with no differences regarding the treatment of psychotic depression; APA 2010), recommend using the combination of an AD and an AP. None of these guidelines recommended monotherapy with an antipsychotic. These recommendations were not based on a systematic review of data from all available RCTs; they were based on a few studies—some randomised and some open non‐randomised. Another meta‐analysis on the treatment of psychotic depression has been published (Farahani 2012). This review and meta‐analysis is focused on the comparison of antidepressant or antipsychotic monotherapy with combination treatment. Five studies were included (Anton 1990; Künzel 2008; Mulsant 2001; Spiker 1985; Wijkstra 2010). In our review, for this particular comparison, we excluded Künzel 2008 because it is unclear whether less or more than 20% of included participants in the ITT group had a bipolar disorder (see table Excluded studies), and we did not use the study of Anton 1990 because we did not regard amoxapine as an antidepressant (Analysis 8.1). We included in our review the same three other studies (Mulsant 2001; Spiker 1985; Wijkstra 2010), using exactly the same extracted ITT data. The conclusion of this review is consistent with ours: Combination treatment is more effective than antidepressant monotherapy. For the comparison of an antipsychotic plus an antidepressant versus an antipsychotic, the same four studies with again exactly the same extracted ITT data were included (Meyers 2009; Rothschild 2004a; Rothschild 2004b; Spiker 1985; Analysis 5.1), leading in both reviews to the same conclusion: Combination treatment is more effective than antipsychotic monotherapy. As in our review, no differences were reported in overall dropout rates across all studies for both comparisons.

Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.
Figures and Tables -
Figure 1

Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.
Figures and Tables -
Figure 2

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Study flow diagram.
Figures and Tables -
Figure 3

Study flow diagram.

Comparison 1 Antidepressant versus placebo, Outcome 1 Clinical response.
Figures and Tables -
Analysis 1.1

Comparison 1 Antidepressant versus placebo, Outcome 1 Clinical response.

Comparison 1 Antidepressant versus placebo, Outcome 2 Dropouts.
Figures and Tables -
Analysis 1.2

Comparison 1 Antidepressant versus placebo, Outcome 2 Dropouts.

Comparison 2 Antipsychotic versus placebo, Outcome 1 Clinical response.
Figures and Tables -
Analysis 2.1

Comparison 2 Antipsychotic versus placebo, Outcome 1 Clinical response.

Comparison 2 Antipsychotic versus placebo, Outcome 2 Dropouts.
Figures and Tables -
Analysis 2.2

Comparison 2 Antipsychotic versus placebo, Outcome 2 Dropouts.

Comparison 3 Antidepressant versus antidepressant, Outcome 1 Clinical response.
Figures and Tables -
Analysis 3.1

Comparison 3 Antidepressant versus antidepressant, Outcome 1 Clinical response.

Comparison 3 Antidepressant versus antidepressant, Outcome 2 Dropouts.
Figures and Tables -
Analysis 3.2

Comparison 3 Antidepressant versus antidepressant, Outcome 2 Dropouts.

Comparison 4 Antidepressant versus antipsychotic, Outcome 1 Clinical response.
Figures and Tables -
Analysis 4.1

Comparison 4 Antidepressant versus antipsychotic, Outcome 1 Clinical response.

Comparison 4 Antidepressant versus antipsychotic, Outcome 2 Dropouts.
Figures and Tables -
Analysis 4.2

Comparison 4 Antidepressant versus antipsychotic, Outcome 2 Dropouts.

Comparison 5 Antidepressant plus antipsychotic versus placebo, Outcome 1 Clinical response.
Figures and Tables -
Analysis 5.1

Comparison 5 Antidepressant plus antipsychotic versus placebo, Outcome 1 Clinical response.

Comparison 5 Antidepressant plus antipsychotic versus placebo, Outcome 2 Dropouts.
Figures and Tables -
Analysis 5.2

Comparison 5 Antidepressant plus antipsychotic versus placebo, Outcome 2 Dropouts.

Comparison 6 Antidepressant plus antipsychotic versus placebo plus antipsychotic, Outcome 1 Clinical response.
Figures and Tables -
Analysis 6.1

Comparison 6 Antidepressant plus antipsychotic versus placebo plus antipsychotic, Outcome 1 Clinical response.

Comparison 6 Antidepressant plus antipsychotic versus placebo plus antipsychotic, Outcome 2 Dropouts.
Figures and Tables -
Analysis 6.2

Comparison 6 Antidepressant plus antipsychotic versus placebo plus antipsychotic, Outcome 2 Dropouts.

Comparison 7 Antidepressant plus antipsychotic versus placebo plus antidepressant, Outcome 1 Clinical response.
Figures and Tables -
Analysis 7.1

Comparison 7 Antidepressant plus antipsychotic versus placebo plus antidepressant, Outcome 1 Clinical response.

Comparison 7 Antidepressant plus antipsychotic versus placebo plus antidepressant, Outcome 2 Dropouts.
Figures and Tables -
Analysis 7.2

Comparison 7 Antidepressant plus antipsychotic versus placebo plus antidepressant, Outcome 2 Dropouts.

Comparison 8 Antidepressant plus antipsychotic versus placebo plus the same antidepressant, Outcome 1 Clinical response.
Figures and Tables -
Analysis 8.1

Comparison 8 Antidepressant plus antipsychotic versus placebo plus the same antidepressant, Outcome 1 Clinical response.

Comparison 8 Antidepressant plus antipsychotic versus placebo plus the same antidepressant, Outcome 2 Dropouts.
Figures and Tables -
Analysis 8.2

Comparison 8 Antidepressant plus antipsychotic versus placebo plus the same antidepressant, Outcome 2 Dropouts.

Comparison 1. Antidepressant versus placebo

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

1.1 Amitriptyline versus placebo

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 Dropouts Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

2.1 Amitriptyline versus placebo

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 1. Antidepressant versus placebo
Comparison 2. Antipsychotic versus placebo

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

2

Risk Ratio (M‐H, Fixed, 95% CI)

Subtotals only

1.1 Olanzapine versus placebo

2

201

Risk Ratio (M‐H, Fixed, 95% CI)

1.13 [0.74, 1.73]

2 Dropouts Show forest plot

2

Risk Ratio (M‐H, Fixed, 95% CI)

Subtotals only

2.1 Olanzapine versus placebo

2

201

Risk Ratio (M‐H, Fixed, 95% CI)

0.79 [0.57, 1.08]

Figures and Tables -
Comparison 2. Antipsychotic versus placebo
Comparison 3. Antidepressant versus antidepressant

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

5

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

1.1 Imipramine versus venlafaxine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.2 Imipramine versus mirtazapine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.3 Imipramine versus fluvoxamine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.4 Fluvoxamine versus venlafaxine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.5 Sertraline versus paroxetine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 Dropouts Show forest plot

5

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

2.1 Imipramine versus venlafaxine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.2 Imipramine versus mirtazapine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.3 Imipramine versus fluvoxamine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.4 Fluvoxamine versus venlafaxine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.5 Sertraline versus paroxetine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 3. Antidepressant versus antidepressant
Comparison 4. Antidepressant versus antipsychotic

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

1.1 Amitriptyline versus perphenazine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 Dropouts Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

2.1 Amitriptyline versus perphenazine

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 4. Antidepressant versus antipsychotic
Comparison 5. Antidepressant plus antipsychotic versus placebo

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

2

Risk Ratio (M‐H, Fixed, 95% CI)

Subtotals only

1.1 Fluoxetine + olanzapine versus placebo

2

148

Risk Ratio (M‐H, Fixed, 95% CI)

1.86 [1.23, 2.82]

2 Dropouts Show forest plot

2

Risk Ratio (M‐H, Fixed, 95% CI)

Subtotals only

2.1 Fluoxetine + olanzapine versus placebo

2

148

Risk Ratio (M‐H, Fixed, 95% CI)

0.75 [0.48, 1.18]

Figures and Tables -
Comparison 5. Antidepressant plus antipsychotic versus placebo
Comparison 6. Antidepressant plus antipsychotic versus placebo plus antipsychotic

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

4

447

Risk Ratio (M‐H, Fixed, 95% CI)

1.83 [1.40, 2.38]

1.1 amitriptyline + perphenazine versus perphenazine

1

39

Risk Ratio (M‐H, Fixed, 95% CI)

3.61 [1.23, 10.56]

1.2 Fluoxetine + olanzapine versus olanzapine

2

149

Risk Ratio (M‐H, Fixed, 95% CI)

1.64 [1.10, 2.44]

1.3 Olanzapine + sertraline versus olanzapine

1

259

Risk Ratio (M‐H, Fixed, 95% CI)

1.76 [1.21, 2.54]

2 Dropouts Show forest plot

4

447

Risk Ratio (M‐H, Fixed, 95% CI)

0.79 [0.63, 1.01]

2.1 Amitriptyline + perphenazine versus perphenazine

1

39

Risk Ratio (M‐H, Fixed, 95% CI)

3.09 [0.38, 25.19]

2.2 Fluoxetine + olanzapine versus olanzapine

2

149

Risk Ratio (M‐H, Fixed, 95% CI)

0.95 [0.59, 1.53]

2.3 Olanzapine + sertraline versus olanzapine

1

259

Risk Ratio (M‐H, Fixed, 95% CI)

0.70 [0.53, 0.92]

Figures and Tables -
Comparison 6. Antidepressant plus antipsychotic versus placebo plus antipsychotic
Comparison 7. Antidepressant plus antipsychotic versus placebo plus antidepressant

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

4

286

Risk Ratio (M‐H, Fixed, 95% CI)

1.44 [1.15, 1.80]

1.1 Nortriptyline + perphenazine versus nortriptyline

1

36

Risk Ratio (M‐H, Fixed, 95% CI)

1.12 [0.49, 2.53]

1.2 Venlafaxine + quetiapine versus venlafaxine

1

80

Risk Ratio (M‐H, Fixed, 95% CI)

1.98 [1.20, 3.24]

1.3 Amitriptyline + perphenazine versus amitriptyline

1

41

Risk Ratio (M‐H, Fixed, 95% CI)

1.73 [0.89, 3.37]

1.4 Amitriptyline + perphenazine versus amoxapine

1

46

Risk Ratio (M‐H, Fixed, 95% CI)

1.19 [0.75, 1.88]

1.5 Venlafaxine + quetiapine versus imipramine

1

83

Risk Ratio (M‐H, Fixed, 95% CI)

1.26 [0.87, 1.81]

2 Dropouts Show forest plot

4

286

Risk Ratio (M‐H, Fixed, 95% CI)

0.91 [0.58, 1.44]

2.1 Nortriptyline + perphenazine versus nortriptyline

1

36

Risk Ratio (M‐H, Fixed, 95% CI)

1.12 [0.26, 4.81]

2.2 Venlafaxine + quetiapine versus venlafaxine

1

80

Risk Ratio (M‐H, Fixed, 95% CI)

0.83 [0.33, 2.08]

2.3 Amitriptyline + perphenazine versus amitriptyline

1

41

Risk Ratio (M‐H, Fixed, 95% CI)

1.73 [0.35, 8.41]

2.4 Amitriptyline + perphenazine versus amoxapine

1

46

Risk Ratio (M‐H, Fixed, 95% CI)

0.65 [0.29, 1.45]

2.5 Venlafaxine + quetiapine versus imipramine

1

83

Risk Ratio (M‐H, Fixed, 95% CI)

1.02 [0.39, 2.66]

Figures and Tables -
Comparison 7. Antidepressant plus antipsychotic versus placebo plus antidepressant
Comparison 8. Antidepressant plus antipsychotic versus placebo plus the same antidepressant

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Clinical response Show forest plot

3

157

Risk Ratio (M‐H, Fixed, 95% CI)

1.70 [1.19, 2.43]

1.1 Nortriptyline + perphenazine versus nortriptyline

1

36

Risk Ratio (M‐H, Fixed, 95% CI)

1.12 [0.49, 2.53]

1.2 Venlafaxine + quetiapine versus venlafaxine

1

80

Risk Ratio (M‐H, Fixed, 95% CI)

1.98 [1.20, 3.24]

1.3 Amitriptyline + perphenazine versus amitriptyline

1

41

Risk Ratio (M‐H, Fixed, 95% CI)

1.73 [0.89, 3.37]

2 Dropouts Show forest plot

3

157

Risk Ratio (M‐H, Fixed, 95% CI)

1.04 [0.52, 2.07]

2.1 Nortriptyline + perphenazine versus nortriptyline

1

36

Risk Ratio (M‐H, Fixed, 95% CI)

1.12 [0.26, 4.81]

2.2 Venlafaxine + quetiapine versus venlafaxine

1

80

Risk Ratio (M‐H, Fixed, 95% CI)

0.83 [0.33, 2.08]

2.3 Amitriptyline + perphenazine versus amitriptyline

1

41

Risk Ratio (M‐H, Fixed, 95% CI)

1.73 [0.35, 8.41]

Figures and Tables -
Comparison 8. Antidepressant plus antipsychotic versus placebo plus the same antidepressant