Recent developments in improving signal detection and reducing placebo response in psychiatric clinical trials

doi:10.1016/j.jpsychires.2011.03.001

Journal of Psychiatric Research

Volume 45, Issue 9, September 2011, Pages 1202-1207

https://doi.org/10.1016/j.jpsychires.2011.03.001 Get rights and content

Abstract

Recent (2007–2010) empirical and theoretical literature on associations of trial design features with signal detection and placebo response were investigated, along with data and analytic considerations. Trials with greater percentages of patients randomized to placebo had larger average drug-placebo differences in two comprehensive meta-analyses (MDD and Schizophrenia). Excluding patients with large responses during double-blind placebo lead-ins resulted in small increases in drug-placebo differences. Core factor subscales of the HAMD yielded larger drug-placebo differences than the HAMD total score. Direct likelihood-based (MMRM) and similar analyses provided better control of false positive and false negative results than LOCF and BOCF. Theoretical considerations suggested that the number of sites and number of countries can influence power, depending on the correlation structure in the data and on how sites and countries are chosen. Use of centralized ratings reduced placebo response and improved drug-placebo differences. However, the number of comparisons was too small to draw conclusions. Use of patient ratings and reducing the number of study visits reduced placebo response, but their effects on signal detection were unclear. Practical experience with novel designs such as the sequential parallel approach hold promise for improvements in signal detection. Given the complexities of signal detection and placebo response, no single strategy is likely to fully solve the problem and combinations of approaches may be most useful. Utilizing appropriate analytic techniques and randomizing an adequate fraction of patients to placebo are perhaps the most broadly applicable approaches.

Introduction

Signal detection is the ability to differentiate between an effective drug and placebo; that is, to find a treatment effect when one exists (Mallinckrodt et al., 2007). Not surprisingly, poor signal detection has been linked with difficulties in discovering new therapies (Gelenberg et al., 2008). Khan et al. (2003a) reported that in studies of known effective antidepressants 21.1% of the drug-placebo contrasts were statistically significant in trials with high placebo response, compared with 74.2% significant contrasts in trials with low placebo response. Similarly, in recent schizophrenia trials placebo response has increased and signal detection has become more difficult (Kemp et al., 2010).

Placebo response is the change that occurs after administration of placebo and is caused by a study effect plus a placebo effect (Yang et al., 2005). The study effect is the tendency for a patient’s state to be modified solely from participation in a clinical trial and not to a treatment administered therein. The study effect includes such factors as spontaneous improvement, regression to the mean, superior care provided in the trial as compared to that received prior to trial participation, etc. The study effect influences all patients and could be assessed as the change in patients participating in the trial who were not administered a medication.

The placebo effect is the nonspecific, psychological, or psychophysiologic therapeutic effect often attributed, at least in part, to the expectation that improvement will follow the administration of treatment. Therefore the placebo effect is linked to the awareness and acceptance that a potentially effective treatment is received. The placebo effect influences all patients that receive study medication in a trial and could be assessed as the difference in response between placebo-treated patients and patients not administered drug in the same protocol. Placebo response is not limited to efficacy outcomes, although that is the focus here.

Placebo response and signal detection have been active areas of research. Given placebo response is the result of a study effect plus a placebo effect, trial features might be modified to reduce study and placebo effects, thereby reducing placebo response and improving signal detection. Therefore, we summarized recent empirical and theoretical literature on associations of trial design features with placebo response and trial outcome, along with data and analytic considerations.

Section snippets

Percent randomized to placebo

Earlier research noted that studies with fewer treatment arms and studies with flexible vs. fixed dosing more frequently yielded statistically significant differences from placebo (Khan et al., 2003a, Khan et al., 2003b). However, flexible dose studies typically had fewer treatment arms. Hence, the two effects were confounded and the independent contribution of each was unclear.

Recent literature has focused on the role of patient and rater expectation in placebo response (Rutherford et al., 2009

Outcome measures

The psychometric properties of the scales used to assess severity of symptoms have been debated, especially in MDD. For example, Gibbons et al. (1993) noted that the HAMD loses accuracy in assessing changes in depression severity because it does not define an uni-dimensional depressive state. As a result, uni-dimensional factors of the HAMD that focus on the core symptoms of depression have been defined (Bech et al., 2006, Maier and Philipp, 1985, Gibbons et al., 1993, McIntyre et al., 2005;

Discussion

Improving signal detection and reducing placebo response have remained active areas of investigation. This review summarized recent (2007–2010) research on these topics. Focus was on design considerations plausibly linked with patient and rater expectation, or thought to increase placebo response through an increased study effect. Data and analysis considerations potentially influencing precision or bias in estimates of treatment effects were also examined. Many of the studies in this review

Funding

Funding for this study was provided by Eli Lilly and Co. Lilly had no further role in study design; in the collection, analysis and interpretation of data; in the writing of the report; and in the decision to submit the paper for publication.

Contribution

Mallinckrodt was the primary author of the paper. However, Tamura and Tanaka each authored specific sections of the paper. All authors contributed to the review of the literature and interpretation of it, with the magnitude of that work corresponding to the authorship position. All authors have approved the final manuscript.

Conflict of interest

All authors are employees of and share holders in Eli Lilly and Co.

Acknowledgments

No acknowledgments to declare.

References (44)

R. Entsuah et al.
A critical examination of the sensitivity of unidimensional subscales derived from the Hamilton depression rating scale to antidepressant drug effects
Journal of Psychiatric Research
(2002)
D. Faries et al.
The responsiveness of the Hamilton depression rating scale
J Psychiatr Res.
(2000)
R.D. Gibbons et al.
Exactly what does the Hamilton depression rating scale measure?
Journal of Psychiatric Research
(1993)
G.I. Papakostas et al.
Does the probability of receiving placebo influence clinical trial outcome? A meta-regression of double-blind, randomized clinical trials in MDD
European Neuropsychopharmacology
(2009)
W. Rief et al.
Meta-analysis of the placebo response in antidepressant trials
Journal of Affective Disorders
(2009)
P. Bech et al.
Dose-response relationship of duloxetine in placebo-controlled clinical trials in patients with major depressive disorder
Psychopharmacology (Berl)
(2006)
Y.F. Chen
“Placebo response in major depression and schizophrenia trials: statistical consideration and design strategies”
(2010)
V. Coric et al.
A randomized, double-blind, placebo-controlled and active comparator trial of pexacerfont, a corticotropin releasing factor receptor-1 antagonist, in the treatment of generalized anxiety disorder
(December 2008)
C. Faes et al.
The e_ective sample size and a novel small sample degrees of freedom method
The American Statistician
(2009)
D.E. Faries et al.
The double-blind variable placebo lead-in period: results from two antidepressant clinical trials
J Clin Psychopharmacol
(2001)

M. Fava et al.

The problem of the placebo response in clinical trials for psychiatric disorders: culprits, possible remedies, and a novel study design approach

Psychother Psychosom

(2003)

M. Fava et al.

System and method for reducing the placebo effect in controlled clinical trials

(2010)

V.V. Federov et al.

“Enrichment design”

A.J. Gelenberg et al.

The history and current state of antidepressant clinical trial design: a call to action for proof–of–concept studies

Journal of Clinical Psychiatry

(2008)

M. Hamilton

A rating scale for depression

Journal of Neurology, Neurosurgery, and Psychiatry

(1960)

X. Huang et al.

“Comparison of test statistics for the sequential parallel design”

Statistics in Biopharmaceutical Research

(2010)

A.S. Kemp et al.

What is causing the reduced drug-placebo difference in recent schizophrenia clinical trials and what can be done about it?

Schizophrenia Bulletin

(2010)

M. Kenward et al.

Missing data in clinical studies

(2007)

A. Khan et al.

Placebo response and antidepressant clinical trial outcome

Journal of Nervous and Mental Disease

(2003)

A. Khan et al.

Frequency of positive studies among fixed and flexible dose antidepressant clinical trials: an analysis of the food and drug administration summary basis of approval reports

Neuropsychopharmacology

(2003)

K.A. Kobak et al.

Sources of unreliability in depression ratings

Journal of Clinical Psychopharmacology

(2009)

K.A. Kobak et al.

Site versus centralized raters in a clinical depression trial: impact on patient selection and placebo response

Journal of Clinical Psychopharmacology

(2010)

Cited by (32)

Statistical methods in handling placebo effect
2020, International Review of Neurobiology
Citation Excerpt :
However, any deviation from balanced allocation ratios has been shown to result in an increase in placebo response rate for a trial and introduce allocation bias (Enck et al., 2011; Enck, Junne, Klosterhalfen, Zipfel, & Martens, 2010). A balanced randomization allocation ratio permits the most optimal assessment of treatment effects and ensures maximum power for the study (Mallinckrodt, Tamura, & Tanaka, 2011). Maintenance of blind: Effective maintenance of the study blind at the participant and the site level is another factor that substantially improves the ability to detect the efficacy signal in a trial.
A critical issue facing the therapeutic area of neurological diseases is the large number of failed randomized clinical trials, especially when moving from promising Phase 2 trials to failed Phase 3 trials. A common cited reason for these failures is a high placebo response rate that thereby reduces the observed treatment effect. Explanations for this higher than anticipated placebo response include small sample sizes, inadequate study designs and/or analytic methods, baseline characteristics of the trial sample, possible investigator bias and a participant's own expectations and conditional learning. Several innovative study designs and new methodological approaches to statistical analyses have been proposed to handle placebo effects anticipated or observed in double blind, randomized clinical trials (RCT's). This chapter examines current study designs being used to reduce the observed placebo response and statistical analysis methods being employed for addressing this problem in neuroscience clinical trials.
Determinants of antidepressant response: Implications for practice and future clinical trials
2018, Journal of Affective Disorders
Citation Excerpt :
This is in contrast to the findings of Khin et al. (2011) who examined 81 studies conducted in a similar time period and found that the placebo response showed a modest increase over the observation period but the treatment effect clearly diminished, resulting in decreasing drug-placebo separation over time. Our finding that treatment response increased with the proportion of subjects on placebo is consistent with a meta-analysis of depression studies (Papakostas and Fava, 2009), an analysis of a patient registry of antipsychotic trials (Mallinckrodt et al., 2011), a review of trials across psychiatry (Weimer et al., 2015) and has been reported in other areas of medicine as well (Enck et al., 2011). This finding has been attributed to expectancy; if the proportion of subjects on placebo is low then the expectation of both subject and investigator is that a given subject is on active treatment (Enck et al., 2011).
Response to antidepressants in major depressive disorder is variable and determinants are not well understood or used to design clinical trials. We aimed to understand these determinants.
Supported by Innovative Medicines Initiative, as part of a large public-private collaboration (NEWMEDS), we assembled the largest dataset of individual patient level information from industry sponsored randomized placebo-controlled trials of antidepressant drugs in adults with MDD. We examined patient and trial-design-related determinants of outcome as measured by change on Hamilton Depression Scale or Montgomery–Asberg Depression Rating Scale in 34 placebo-controlled trials (drug, n = 8260; placebo, n = 3957).
While it is conventional for trials to be 6–8 weeks long, drug-placebo differences were nearly the same at week 4 as at week 6 and with lower dropout rates. At the multivariate level, having any of these attributes was significantly associated with greater drug vs. placebo differences on symptom improvement: female, increasing proportion of patients on placebo, centers located outside of North America, centers with low placebo response (regardless of active treatment response) and using randomized withdrawal designs.
Data on compounds that failed were not available to us. Findings may not be relevant for new mechanisms of action.
Proof of concept trials can be shorter and efficiency improved by selecting enriched populations based on clinical and demographic variables, ensuring adequate balance of placebo patients, and carefully selecting and monitoring centers. In addition to improving drug discovery, patient exposure to placebo and experimental treatments can be reduced.
A meta-analysis of randomized, placebo-controlled trials of vortioxetine for the treatment of major depressive disorder in adults
2016, European Neuropsychopharmacology
The efficacy and safety of vortioxetine, an antidepressant approved for the treatment of adults with major depressive disorder (MDD), was studied in 11 randomized, double-blind, placebo-controlled trials of 6/8 weeks׳ treatment duration. An aggregated study-level meta-analysis was conducted to estimate the magnitude and dose-relationship of the clinical effect of approved doses of vortioxetine (5–20 mg/day). The primary outcome measure was change from baseline to endpoint in Montgomery–Åsberg Depression Rating Scale (MADRS) total score. Differences from placebo were analyzed using mixed model for repeated measurements (MMRM) analysis, with a sensitivity analysis also conducted using last observation carried forward. Secondary outcomes included MADRS single-item scores, response rate (≥50% reduction in baseline MADRS), remission rate (MADRS ≤10), and Clinical Global Impressions scores. Across the 11 studies, 1824 patients were treated with placebo and 3304 with vortioxetine (5 mg/day: n=1001; 10 mg/day: n=1042; 15 mg/day: n=449; 20 mg/day: n=812). The MMRM meta-analysis demonstrated that vortioxetine 5, 10, and 20 mg/day were associated with significant reductions in MADRS total score (Δ-2.27, Δ-3.57, and Δ-4.57, respectively; p<0.01) versus placebo. The effects of 15 mg/day (Δ-2.60; p=0.105) were not significantly different from placebo. Vortioxetine 10 and 20 mg/day were associated with significant reductions in 10 of 10 MADRS single-item scores. Vortioxetine treatment was also associated with significantly higher rates of response and remission and with significant improvements in other depression-related scores versus placebo. This meta-analysis of vortioxetine (5–20 mg/day) in adults with MDD supports the efficacy demonstrated in the individual studies, with treatment effect increasing with dose.
Placebo response in antipsychotic trials of patients with acute mania. Results of an individual patient data meta-analysis
2015, European Neuropsychopharmacology
Citation Excerpt :
The presence of psychotic features at baseline was defined as a score of 3 (‘flight of ideas; tangentially; difficult to follow; rhyming; echolalia’) or 4 (incoherent; communication impossible) on question 7 of the YMRS or a score of 6 (‘Grandiose or paranoid ideas; Ideas of reference’) or 8 (‘Delusions; Hallucinations’) on question 8 (‘Content’) of the YMRS questionnaire (21). Study characteristics included study year (Agid et al., 2013; Cohen et al., 2010; Gispen-de Wied et al., 2012; Sysko and Walsh, 2007; Yildiz et al., 2011), number of visits per protocol (Cohen et al., 2010; Montgomery, 1999), number of study arms (Agid et al., 2013), number of countries (Agid et al., 2013; Keck et al., 2000; Mallinckrodt et al., 2011; Yildiz et al., 2011), region (Mallinckrodt et al., 2010), number of regions, mean change score on the YMRS in the treatment arm (Agid et al., 2013; Cohen et al., 2010; Gispen-de Wied et al., 2012; Yildiz et al., 2011), and proportion of patients assigned to receive placebo (Kemp et al., 2010; Mallinckrodt et al., 2011; Mallinckrodt et al., 2010; Sysko and Walsh, 2007; Yildiz et al., 2011). Region was classified into three areas: Europe, USA, and Other.
We examined the role of placebo response in acute mania trials. Specifically, whether placebo response: (1) predicts treatment effect, (2) can be predicted by patient and study characteristics, and (3) can be predicted by a parsimonious model. We performed a meta-analysis of individual patient data from 10 registration studies (n=1019) for the indication acute manic episode of bipolar disorder. We assessed the effect of 14 determinants on placebo response. Primary outcome measures were mean symptom change score (MCS) on the Young Mania Rating Scale (YMRS) and response rate (RR), defined as ≥50% YMRS symptom improvement from baseline to endpoint. The overall placebo response was 8.5 points improvement on the YMRS (=27.9%) with a RR of 32.8%. Placebo response was significantly associated with the overall treatment response. Five determinants significantly (p<0.05) predicted the placebo response. The multivariate prediction model, which consisted of baseline severity, psychotic features at baseline, number of geographic regions, and region, explained 10.4% and 5.5% of the variance in MSC and RR, respectively. Our findings showed that the placebo response in efficacy trials of antipsychotics for acute mania is substantial and an important determinant of treatment effect. Placebo response is influenced by patient characteristics (illness severity and presence of psychotic features) and by study characteristics (study year, number of geographic regions and region). However, the prediction model could only explain the placebo response to a limited extent. Therefore, limiting trials to certain patients in certain geographic regions seems not a viable strategy to improve assay sensitivity.
Rating depression over brief time intervals with the Hamilton Depression Rating Scale: Standard vs. abbreviated scales
2015, Journal of Psychiatric Research
Citation Excerpt :
Because these approaches led to different subscales of the HDRS, additional studies examined how well various subscales performed compared to total HDRS score. While some studies showed that shorter subscales improved the rate of response to the outcome measure (e.g. Bech et al., 2010; Faries et al., 2000; Mallinckrodt et al., 2011; Revicki et al., 2010; Santen et al., 2009; Silverstone et al., 2002), others found no noticeable difference (e.g. Ballesteros et al., 2007; McIntyre et al., 2005; Revicki et al., 2010; Ruhe et al., 2005). Boessen et al. (2013) pointed out that some of the differences across studies were likely due to the type of studies used to evaluate the scales.
Although antidepressant trials typically use weekly ratings to examine changes in symptoms over six to 12 weeks, antidepressant treatments may improve symptoms more quickly. Thus, rating scales must be adapted to capture changes over shorter intervals. We examined the use of the 17-item Hamilton Depression Rating Scale (HDRS) to evaluate more rapid changes. Data were examined from 58 patients with major depressive disorder or bipolar disorder enrolled in double-blind, placebo-controlled, crossover studies who received a single infusion of ketamine (0.5 mg/kg) or placebo over 40 min then crossed over to the other condition. HDRS subscales, a single HDRS Depressed mood item, and a visual analogue scale were used at baseline, after a brief interval (230 min), and one week post-infusion. Effect sizes for the ketamine-placebo difference were moderate (d > 0.50), but one and two-item HDRS subscales had the smallest effects. Response rates on active drug were lowest for the complete HDRS (43%); the remaining scales had higher response rates to active drug, but the shortest subscales had higher response rates to placebo. Correlations between the changes from baseline to 230 min post-ketamine across scores were similar for most subscales (r = 0.82–0.97), but correlations using the single items were lower (r < 0.74). Overall, effect sizes for drug-placebo differences and correlations between changes were lower for one- and two-item measures. Response rates were lower with the full HDRS scale. The data suggest that, to best identify rapid antidepressant effects, a scale should have more than two items, but fewer items than a full scale.
Effectiveness and acceptability of deep brain stimulation (DBS) of the subgenual cingulate cortex for treatment-resistant depression: A systematic review and exploratory meta-analysis
2014, Journal of Affective Disorders
Citation Excerpt :
First, the included studies enrolled relatively small numbers of depressed subjects. Second, because pre–post designs are limited with regard to their ability to show causality, we cannot rule out that the clinical improvement observed with DBS was related, for example, to systematic differences in individual patient care, rater bias, placebo effect and/or natural disease course (Mallinckrodt et al., 2011). There is, however, strong indirect evidence suggesting that placebo response rates and spontaneous remission are significantly lower in subjects with TRD as compared to those with uncomplicated MD (Dunner et al., 2006; Fekadu et al., 2009; Fournier et al., 2010).
Deep brain stimulation (DBS) applied to the subgenual cingulate cortex (SCC) has been recently investigated as a potential treatment for severe and chronic treatment-resistant depression (TRD). Given its invasive and experimental nature, a comprehensive evaluation of its effectiveness and acceptability is of paramount importance. Therefore, we conducted the present systematic review and exploratory meta-analysis.
We searched the literature for English language prospective clinical trials on DBS of the SCC for TRD from 1999 through December 2012 using MEDLINE, EMBASE, PsycINFO, CENTRAL and SCOPUS, and performed a random effects exploratory meta-analysis using Event Rates and Hedges׳ g effect sizes.
Data from 4 observational studies were included, totaling 66 subjects with severe and chronic TRD. Twelve-month response and remission rates following DBS treatment were 39.9% (95% CI=28.4% to 52.8%) and 26.3% (95% CI=13% to 45.9%), respectively. Also, depression scores at 12 months post-DBS were significantly reduced (i.e., pooled Hedges׳ g effect size=−1.89 [95% CI=−2.64 to −1.15, p<0.0001]). Also, there was a significant decrease in depression scores between 3 and 6 months (Hedges׳ g=−0.27, p=0.003), but no significant changes from months 6 to 12. Finally, dropout rates at 12 months were 10.8% (95% CI=4.3% to 24.4%).
Small number of included studies (most of which were open label), and limited long-term effectiveness data.
DBS applied to the SCC seems to be associated with relatively large response and remission rates in the short- and medium- to long-term in patients with severe TRD. Also, its maximal antidepressant effects are mostly observed within the first 6 months after device implantation. Nevertheless, these findings are clearly preliminary and future controlled trials should include larger and more representative samples, and focus on the identification of optimal neuroanatomical sites and stimulation parameters.

View all citing articles on Scopus

View full text

ReviewRecent developments in improving signal detection and reducing placebo response in psychiatric clinical trials

Abstract

Introduction

Section snippets

Percent randomized to placebo

Outcome measures

Discussion

Funding

Contribution

Conflict of interest

Acknowledgments

Journal of Psychiatric Research

J Psychiatr Res.

Journal of Psychiatric Research

European Neuropsychopharmacology

Journal of Affective Disorders

Dose-response relationship of duloxetine in placebo-controlled clinical trials in patients with major depressive disorder

Psychopharmacology (Berl)

“Placebo response in major depression and schizophrenia trials: statistical consideration and design strategies”

A randomized, double-blind, placebo-controlled and active comparator trial of pexacerfont, a corticotropin releasing factor receptor-1 antagonist, in the treatment of generalized anxiety disorder

The e_ective sample size and a novel small sample degrees of freedom method

The American Statistician

The double-blind variable placebo lead-in period: results from two antidepressant clinical trials

J Clin Psychopharmacol

The problem of the placebo response in clinical trials for psychiatric disorders: culprits, possible remedies, and a novel study design approach

Psychother Psychosom

System and method for reducing the placebo effect in controlled clinical trials

“Enrichment design”

The history and current state of antidepressant clinical trial design: a call to action for proof–of–concept studies

Journal of Clinical Psychiatry

A rating scale for depression

Journal of Neurology, Neurosurgery, and Psychiatry

“Comparison of test statistics for the sequential parallel design”

Statistics in Biopharmaceutical Research

What is causing the reduced drug-placebo difference in recent schizophrenia clinical trials and what can be done about it?

Schizophrenia Bulletin

Missing data in clinical studies

Placebo response and antidepressant clinical trial outcome

Journal of Nervous and Mental Disease

Frequency of positive studies among fixed and flexible dose antidepressant clinical trials: an analysis of the food and drug administration summary basis of approval reports

Neuropsychopharmacology

Sources of unreliability in depression ratings

Journal of Clinical Psychopharmacology

Site versus centralized raters in a clinical depression trial: impact on patient selection and placebo response

Journal of Clinical Psychopharmacology

Review
Recent developments in improving signal detection and reducing placebo response in psychiatric clinical trials