Scolaris Content Display Scolaris Content Display

Botulinum toxin type A therapy for blepharospasm

Collapse all Expand all

Background

This is an update of a Cochrane Review first published in 2005. Blepharospasm is the second most common form of focal dystonia. It is a disabling disorder, characterised by chronic, intermittent or persistent, involuntary eyelid closure, due to spasmodic contractions of the orbicularis oculi muscles. Currently, botulinum toxin type A (BtA) is considered the first line of therapy for this condition.

Objectives

To compare the efficacy, safety, and tolerability of BtA versus placebo in people with blepharospasm.

Search methods

We searched Cochrane Movement Disorders' Trials Register, CENTRAL, MEDLINE, Embase, reference lists of included articles, and conference proceedings. We ran all elements of the search, with no language restrictions, in July 2020.

Selection criteria

Double‐blind, parallel, randomised, placebo‐controlled trials (RCTs) of BtA versus placebo in adults with blepharospasm.

Data collection and analysis

Two review authors independently assessed records, selected included studies, extracted data using a paper pro forma, and evaluated the risk of bias. We resolved disagreements by consensus, or by consulting a third review author. We performed meta‐analyses using a random‐effects model, for the comparison of BtA versus placebo, to estimate pooled effects and corresponding 95% confidence intervals (95% CI). We did not carry out any prespecified subgroup analyses. The primary efficacy outcome was improvement on any validated symptomatic rating scale. The primary safety outcome was the proportion of participants with any adverse event.

Main results

We included three RCTs, assessed at low to moderate overall risk of bias, which randomised 313 participants with blepharospasm. Two studies excluded participants with poorer prior responses to BtA treatment, therefore, they included an enriched population with a higher probability of benefiting from this therapy. All trials were industry‐funded. All RCTs evaluated the effect of a single BtA treatment session.

BtA resulted in a moderate to large improvement in blepharospasm‐specific severity, with a reduction of 0.93 points on the Jankovic Rating Scale (JRS) severity subscale at four to six weeks after injection (95% confidence interval (CI) 0.61 to 1.25; I² = 9%) compared to placebo. BtA was also resulted in a moderate to large improvement in blepharospasm‐specific disability and blepharospasm‐specific involuntary movements at four to six weeks after injection (disability: 0.69 JRS disability subscale points, 95% CI 0.18 to 1.19; I² = 74%; blepharospasm‐specific involuntary movements: standardised mean difference (SMD) 0.79, 0.31 to 1.27; I² = 58%) compared to placebo. BtA did not show a risk of adverse events (risk ratio (RR) 1.18, 95% CI 0.87 to 1.60; I² = 0%). However, BtA increased the risk of vision complaints and eyelid ptosis (vision complaints: RR 5.73, 95% CI 1.79 to 18.36; I² = 51%; eyelid ptosis: RR 4.02, 95% CI 1.61 to 10.00; I² = 39%). There was no distinction between BtA and placebo in the number of participants who dropped out of the trial.

A single trial estimated the duration of effects to be 10.6 weeks (range 6.1 to 19.1).

We found no evidence supporting the existence of a clear dose‐response relationship with BtA. We found no data reporting the impact of BtA on health‐related quality of life, or the development of secondary non‐responsiveness.

Authors' conclusions

We are moderately certain that a single BtA treatment resulted in a clinically relevant reduction of blepharospasm‐specific severity and disability, and have low certainty that it is well tolerated, when compared with placebo. There is low‐certainty evidence that people treated with BtA are not at an increased risk of developing adverse events, though BtA treatment likely increases the risk of visual complaints and eyelid ptosis. There are no data from RCTs evaluating the effectiveness and safety of repeated BtA injection cycles.

There is no evidence from RCTs to allow us to draw definitive conclusions on the optimal treatment intervals and doses, or the impact on quality of life.

PICOs

Population
Intervention
Comparison
Outcome

The PICO model is widely used and taught in evidence-based health care as a strategy for formulating questions and search strategies and for characterizing clinical studies or meta-analyses. PICO stands for four different potential components of a clinical question: Patient, Population or Problem; Intervention; Comparison; Outcome.

See more on using PICO in the Cochrane Handbook.

Plain language summary

Botulinum toxin type A for people with involuntary eyelid closure, or blepharospasm

The review question

We reviewed the evidence about the effect of botulinum toxin type A (BtA) in people with involuntary eyelid closure, or blepharospasm. This is an update of a previous Cochrane Review and we assessed the effectiveness and safety of BtA versus placebo (a pretend medicine) in blepharospasm.

Background

Blepharospasm is a dysfunction of the eyelids that presents as involuntary eyelid closure, due to contractions of the eye muscles. Botulinum toxin type A (BtA) is a powerful, natural chemical that can cause severe paralysis (an inability to move the part of the body in which it is injected) in animals and humans. It can also be used to treat many conditions, in particular, those with involuntary muscle contractions, such as blepharospasm. Botulinum toxin is delivered by injections into the muscles that contract to produce most of the disorder‐related symptoms. There are different types of botulinum toxin, not all are available for treating health conditions. BtA is typically considered the first main treatment option for people with blepharospasm.

Study characteristics

We searched the medical literature in July 2020 and found three studies that compared treatment with BtA with placebo (injection with a liquid that will not treat the problem). These studies included a total of 313 participants, who had, on average, a moderate impairment. Most (66%) of the people in the studies were women. All trials were funded by drug manufacturers with possible interests in the results of the studies.

Key results

The results show that a single treatment session (where both eyelids were injected with BtA multiple times) improved the severity of blepharospasm symptoms, disability, and number of involuntary movements. We did not find an increased risk of any unpleasant or undesirable event, though we did find a larger risk of vision complaints and eyelid drooping in people who took BtA. Participants felt that BtA was better than placebo. The BtA effect lasted for around 10 weeks. No study examined the effect of BtA on quality of life.

Certainty in the evidence

The certainty in the evidence varies from low to high. We can draw no conclusions regarding long‐term effects of BtA for this condition.

Authors' conclusions

Implications for practice

In this updated Cochrane Review, we found that a single treatment session of botulinum toxin type A (BtA) is effective and well‐tolerated in the treatment of moderately impaired adults with blepharospasm. The clinical benefit includes moderate to large improvements across objective disorder‐related domains, such as severity and disability. The benefit is also meaningful when subjectively assessed by the participants.

There is no evidence regarding health‐related quality of life. Adverse events are frequent in both BtA (66 of 115 participants) and placebo groups (26 of 54 participants), but are not commonly associated with discontinuing treatment. Vision complaints (49 of 166 BtA‐treated participants) and eyelid ptosis (53 of 207 BtA‐treated participants) are the most frequent treatment‐related adverse events of special interest. We are moderately certain about the conclusions based on the evidence.

Implications for research

The net benefit of a single BtA injection in the treatment of blepharospasm is likely very positive overall.

Nonetheless, further studies are needed to establish the relative effectiveness of different doses of BtA, assessing efficacy, safety, duration of effect, and quality of life across regimens, with repeated BtA treatment sessions, and assessed under conditions more closely resembling clinical practice (pragmatic clinical trials). Because therapy typically requires optimising a dose for each person, rather than administering a fixed dose of botulinum toxin, such a line of research would be important to support physicians' management of doses, and allow for a more solid and safe individualisation of treatment.

Future research concerning all formulations of botulinum neurotoxin should endeavour to establish clinical effectiveness, not only based on changes from baseline, but also, preferably, based on validated measures of minimal clinically important difference or change (Brożek 2006). Research is required in order to establish such a parameter for the Jankovic Rating Scale (JRS), currently the most widely used and disseminated clinical scale in the field.

It is currently uncertain whether or not the clinical effectiveness of botulinum toxin decays over time, with repeated treatment sessions, and whether a possible loss of effectiveness occurs in all clinical domains. Future studies comparing any form of BtA should address the comparative proportion of participants who develop secondary non‐responsiveness to treatment.

Finally, in conducting this systematic review, we were faced with the fact that there is no defined core outcome set in blepharospasm research, as there is for other areas (Tugwell 2007). To promote research in this field, and to support the clinical effectiveness of botulinum toxin, it would be relevant to define a set of core outcome measures, and include it in future research, via well‐established methodology, to determine the inclusion of participant‐reported outcomes (Macefield 2014).

Summary of findings

Open in table viewer
Summary of findings 1. Botulinum toxin type A compared to placebo for blepharospasm

Botulinum toxin type A compared to placebo for blepharospasm

Patient or population: adults with blepharospasm
Setting: hospital‐based, movement disorders clinics
Intervention: botulinum toxin type A (BtA)
Comparison: placebo

Outcomes

Relative effect
(95% CI)

Anticipated absolute effects* (95% CI)

Certainty of the evidence
(GRADE)

What happens

Without BtA

With BtA

Difference (95% CI)

Blepharospasm‐specific severity

(assessed with JRS severity subscore (0 to 4; lower = better), measured between 4 to 6 weeks)

(3 RCT, 280 participants)

MD 0.93 higher
(0.61 higher to 1.25 higher)

⊕⊕⊕⊝
Moderatea

BtA likely reduces blepharospasm‐specific severity

Adverse events

(reported at any time)

(2 RCT, 169 participants)

RR 1.18
(0.87 to 1.60)

44.3%

53%
(42 to 77)

8.7% more
(6.3% fewer to 28.9% more)

⊕⊕⊝⊝
Lowa,b

BtA may result in little to no difference in adverse events

Subjective participant evaluation

(assessed with Patient Evaluation of Global Response (PEGR) (‐4 to +4; higher = better); measured between 4 to 6 weeks)

(2 RCT, 170 participants)

SMD 0.86 higher
(0.53 higher to 1.2 higher)

⊕⊕⊕⊕
Highb,c

BtA results in improved subjective patient evaluation

Frequency of blepharospasm‐specific involuntary movements

(assessed with Frequency of Involuntary Movement (FIM) scale (1 to 5; lower = better) or JRS frequency subscore (0 to 4; lower = better),; measured between 4 to 6 weeks)

(2 RCT, 229 participants)

SMD 0.79 higher
(0.31 higher to 1.27 higher)

⊕⊕⊝⊝
Lowa,b

BtA may reduce blepharospasm‐specific involuntary movements

Health‐related quality of life

None of the included trials reported this outcome

Tolerability

(number of dropouts; reported at any time)

(2 RCT, 169 participants)

RR 1.44
(0.31 to 6.79)

3.7%

5.3%
(1.1% to 25.1%)

1.6% more
(2.6% fewer to 21.4% more)

⊕⊕⊕⊝
Moderateb

BtA likely does not increase dropouts

Duration of effect

(when effects started to wane)

1 RCT (129 participants) estimated an effect duration of 10.6 weeks (range 6.1 to 19.1)

⊕⊕⊝⊝
Lowa,b

Results from BtA may last for 10.6 weeks

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; RR: Risk ratio; SMD: standardised mean difference; JRS: Jankovic Rating Scale

GRADE Working Group grades of evidence
High certainty. We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty. We are moderately confident in the effect estimate; the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty. Our confidence in the effect estimate is limited; the true effect may be substantially different from the estimate of the effect
Very low certainty. We have very little confidence in the effect estimate; the true effect is likely to be substantially different from the estimate of effect

aDowngraded one level due to serious study limitations; namely, concerns with blinding, attrition bias, and other biases, such as for‐profit bias
bDowngraded one level due to serious imprecision, due to low sample size

cUpgraded one level due to large effect size.

Background

This is an update of a Cochrane Review first published in 2005 (Costa 2005).

Description of the condition

Dystonia is the third most common movement disorder, after Parkinson’s disease and essential tremor, with an overall prevalence of 164 per million (Steeves 2012). Dystonia syndromes are a group of disabling, painful disorders characterised by involuntary sustained or intermittent muscle contractions causing abnormal, often repetitive, movements or postures of the face, neck, trunk, or limbs (Albanese 2013). Dystonic movements are typically patterned or twisting, and are often initiated or worsened by voluntary action (Albanese 2013). These neurological disorders can be classified, based on topographic distribution, including focal dystonia (one body region, e.g. cervical dystonia and blepharospasm), segmental dystonia (two or more adjacent regions), multifocal dystonia (two or more nonadjacent regions), hemidystonia (ipsilateral regions), and generalised dystonia (trunk and two or more other regions (Albanese 2013; Tarsy 2006)).

Focal dystonia is a highly disabling movement disorder, with serious functional and social impairment. Close to half of the people with it quit work by the age of forty, or retire early, and 10 years later, only 25% of people are working compared to 62% of the general population (Zoons 2012). Moreover, health‐related quality of life is significantly diminished, mainly attributable to depression and anxiety, with scores comparable to people with multiple sclerosis, Parkinson’s disease, or stroke (Zoons 2012).

Blepharospasm is a focal dystonia characterised by chronic intermittent or persistent involuntary eyelid closure due to spasmodic contractions of the orbicularis oculi muscles (Berardelli 1985; Elston 1988; Grandas 1988; Jankovic 1982a; Marsden 1976; Tolosa 1988). Sometimes, there is additional involuntary inhibition of the levator palpebrae superioris muscle (Aramideh 1994). The term essential blepharospasm is used to describe involuntary contractions involving only the orbital and periorbital muscles. However, many people also have spasms of other facial, oromandibular, pharyngeal, laryngeal, or cervical muscles (Tolosa 1979). When adjacent body regions are involved, this form of segmental dystonia is referred to as cranial cervical dystonia.

Neurophysiological studies support the hypothesis that blepharospasm is due to hyperexcitability of brainstem interneurons, as a result of organic dysfunction of the basal ganglia (Grandas 1988; Grandas 1998). The vast majority of cases are idiopathic. Exposure to neuroleptics is a known risk factor for dystonia and blepharospasm. Rarely, lesions in the basal ganglia and upper midbrain (e.g. with stroke, multiple sclerosis, hydrocephalus) have been associated with blepharospasm. Cranial cervical dystonia can also occur in association with other diseases and disorders of the central nervous system, such as Wilson's disease, Parkinson's disease, and progressive supranuclear palsy (Cardoso 1995).

The prevalence of blepharospasm is estimated to be 5 per 100,000 (Grandas 1988; Nutt 1988). It usually begins late in life, during the fifth or sixth decade, and affects women more often than men (Frueh 1976; Grandas 1988; Henderson 1956; Jankovic 1983; Marsden 1976; Tolosa 1981). Typically, it starts with an increased frequency of blinking to a variety of stimuli, such as air pollution, bright light, and stress. It progresses to chronic, involuntary spasm involving both eyes synchronously (Jankovic 1982b). Its severity can range from repeated frequent blinking to persistent spasmodic closure of the eyelids, leading to functional blindness, with severe private and professional disability (Jankovic 1982a; Tucha 2001).

To date, no curative or disorder‐modifying treatments are available for blepharospasm.

Description of the intervention

Botulinum toxin is a powerful biological toxin produced by Clostridium botulinum. The active form of botulinum toxin is a di‐chain polypeptide composed of two chains: a heavy chain (100 kDa) and a light chain (50 kDa), and by associating with certain auxiliary proteins (haemagglutinins and non‐haemagglutinins), the toxin forms a non‐covalent multimeric complex of variable size (Simpson 2004). The nontoxic proteins aid the formation of neutralising antibodies, though beyond this, their role is unclear (Frevert 2010). Botulinim toxin binds to peripheral cholinergic nerve terminals of the neuromuscular junction, as well as sympathetic ganglionic, parasympathetic ganglionic, and postganglionic terminals (Simpson 2004). After binding to an acceptor protein, botulinum toxin is endocytosed at the presynaptic membrane of acetylcholine nerve terminals (Pellizzari 1999). By action of the N‐terminal on the heavy chain, a pore is formed on the endocytic membrane, which permits the release of the light chain into the cytosol. This light chain, which is a zinc protease, performs the key action of the botulinum toxin, by cleaving soluble N‐ethylmaleimide‐sensitive factor attachment receptor proteins (SNARE proteins; (Pellizzari 1999)).

SNAREs are docking proteins for acetylcholine vesicles that allow for the release of acetylcholine into the synaptic cleft (Pellizzari 1999). The overall effect of botulinum toxin is a local chemodenervation by the temporary blockade of acetylcholine release at cholinergic synapses. Temporary synapses are consequently formed via the process of axonal sprouting (Duchen 1971; Holland 1981; Juzans 1996).

There are seven immunologically distinct botulinum toxin serotypes (labelled A to G). These different botulinum toxin serotypes cleave specific SNARE proteins. Serotype A cleaves SNARE protein SNAP 25, located on the inner membrane of nerve cells (Pellizzari 1999).

Botulinum toxin is injected into the muscles thought to be involved in dystonia. As a general rule, the number of muscles injected are tailored to the severity of the case in question, and the number of injection sites per muscle are determined by the mass of the muscle. Within roughly three months after injection of botulinum toxin into skeletal muscle, the nerve terminal resumes exocytosis, and the muscle returns to its baseline clinical function, showing a wearing‐off response from the botulinum toxin injection (Jankovic 2004). Eventually, the muscle paralysis subsides; this is associated with the formation of new sprouts that are capable of neurotransmission. Over time, synaptic activity resumes in the original nerve terminals, leading to sprout regression (de Paiva 1999).

Currently, there are two commercially available botulinum toxin serotypes – botulinum toxin type A (BtA) and botulinum toxin type B (BtB). The following products are commonly available (three BtA and one BtB): onabotulinumtoxinA (Botox, Allergan Inc., Irvine, CA, USA), abobotulinumtoxinA (Dysport or Reloxin or Azzalure, Ipsen Pharma, Boulogne Billancourt, France), incobotulinumtoxinA (Xeomin or Bocoture Merz GmbH, Frankfurt, Germany), and rimabotulinumtoxinB (Myobloc or Neurobloc, Solstice Neurosciences Inc., Louisville, KY, USA). Other BtA formulations are available in more restricted markets, and are yet to receive a generic name: Prosigne or Lantox (Lanzhou Institute of Biological Products, China), PurTox (Mentor Worldwide LLC, Santa Barbara, CA, USA), and Neuronox (Medy‐Tox Inc, South Korea; (Walker 2014)).

How the intervention might work

The therapeutic potential of all botulinum toxin serotypes derives from their ability to inhibit the release of acetylcholine from the presynaptic nerve terminal into the synaptic cleft, causing local chemodenervation (Jankovic 2004). In addition to this, recent research has also suggested that botulinum toxin is active at multiple levels, namely sensory nerve terminals, and muscle spindles, which leads to a reduction in sensory input and fewer muscle contractions (Filippi 1993; Matak 2014; Rosales 1996; Rosales 2010).

It has been further suggested that cortical reorganisation may result from changes in the spinal cord, brainstem, and central nervous pathways (Palomar 2012). Animal research has shown the presence of supra‐therapeutic levels of botulinum toxin by way of retrograde axonal transport and penetration of the CNS (Antonucci 2008; Boroff 1975). However, botulinum toxin has not been shown to penetrate the blood‐brain barrier in humans.

Until recently, SNARE proteins were considered the only target molecules of botulinum toxin. Thus, it was widely accepted that the therapeutic and toxic actions of botulinum toxin were exclusively mediated by SNARE cleavage preventing the release of synaptic neurotransmitters. However, recent studies have suggested that a number of botulinum toxin actions might not be mediated by SNARE cleavage, specifically regarding neuroexocytosis, cell cycle and apoptosis, neuritogenesis, and gene expression (Matak 2015). The existence of unknown botulinum toxin molecular targets and modulation of unknown signalling pathways is a possibility that may prove to be pharmacologically relevant.

Why it is important to do this review

BtA is the toxin serotype that has been most intensively studied and approved for the treatment of a large number of focal dystonias. Both BtA and BtB have been shown to be efficacious in cervical dystonia (Castelão 2017; Marques 2016; Duarte 2016), and for hemifacial spasm (Costa 2005a). However, even in moderate‐severity dystonia, there is evidence that people attach a considerable expectation of harm due to botulinum toxin, the so called nocebo effect (Duarte 2018).

This is an update of a Cochrane Review that aimed to assess the efficacy and safety of BtA in comparison to placebo in people with blepharospasm. The original review failed to identify evidence from randomised controlled trials to support the use of BtA for blepharospasm. Since its publication, three new trials have been published (Jankovic 2011; Mitsikostas 2018; Truong 2008). Cochrane’s criteria for evaluating studies' risk of bias and the certainty in evidence have also evolved and been updated. Therefore, the authors considered it important to update this review.

Objectives

To compare the clinical efficacy, safety, and tolerability of botulinum toxin type A (BtA) versus placebo in the treatment of adults with blepharospasm.

Methods

Criteria for considering studies for this review

Types of studies

Randomised controlled trials (RCTs), blinded, single, or multiple dose, parallel‐designed, of any duration, assessing the efficacy or safety, or both, of botulinum toxin type A (BtA) treatment versus placebo, in people with blepharospasm, were eligible for inclusion in this review. We excluded non‐parallel study designs, namely cross‐over trials, due to uncertainty about whether this type of study design was appropriate to study people with blepharospasm, as well as methodological concerns with regards to detection and performance bias.

There were no restrictions regarding the number of participants recruited to trials, or the number of recruitment centres.

Types of participants

Adults (i.e. 18 years of age or older), in any setting, with a clinical diagnosis of blepharospasm, made by a physician, specialist, or other healthcare provider. We included trials enrolling participants with any form of blepharospasm, with or without widespread dystonias. We included participants with prior exposure to botulinum toxin, or those taking concomitant medications, if they were on stable regimens.

Types of interventions

Intramuscular injections of botulinum toxin type A (BtA) compared to placebo. We allowed all administration schedules and injection techniques.

Types of outcome measures

Primary outcomes
Blapherospasm‐specific improvement

Overall improvement, measured on any validated symptomatic rating scale, such as Jankovic Rating Scale, measured between weeks three and six.

Adverse events

The proportion of participants with any adverse event, measured at any point during study follow‐up. For this outcome, we also evaluated adverse events of special interest, such as sore throat or dry mouth, neck weakness, dysphagia, injection site pain, voice change, and systemic complaints (e.g. diffuse muscle weakness, malaise, dizziness, and headache), measured at any point during study follow‐up.

Secondary outcomes
Subjective evaluation of clinical status

Evaluated by either participants, or clinicians, or both, and assessed with validated assessment tools, such as Patient Subjective Assessment of Change, Patient Global Assessment of Improvement, Patient Evaluation of Global Response (PEGR), Patient and Physician Global Assessment of Change, Investigator Global Assessment of Efficacy (IGAE), Physician Global Assessment of Change (PGAC), and visual analogue scale (VAS) for symptom severity, measured between weeks three and six.

Frequency of blepharospasm‐specific involuntary movements

Measured between weeks three and six.

Health‐related quality of life

Assessed with validated assessment tools, such as Short Form 36 (SF‐36) Quality of Life questionnaire, measured at any point during study follow‐up.

Tolerability

We defined tolerability as the number of participants who discontinued treatment (dropouts) due to adverse events, measured at any point during study follow‐up.

Duration of effect

Assessed by the number of days until the need for reinjection, or waning of effect.

Search methods for identification of studies

For this update, we expanded the search strategy to capture all the search terms for BtA formulations that were currently available. We designed the search strategy to include other botulinum toxin formulations and other dystonic disorders that are also under current revision the Movement Disorders Cochrane Review Group.

Electronic searches

We ran the final search for the original version of this review in June 2003, based on the search strategy developed for Cochrane Movement Disorders to identify all papers since 1977, the first year that botulinum toxin was used therapeutically in any condition. We ran the search for the current update for the last time in July 2020.

For the identification of studies considered for inclusion in this review, we developed detailed search strategies for each database searched. Please see Appendix 1 for the Cochrane Central Register of Controlled Trials (CENTRAL) strategy, Appendix 2 for the MEDLINE search strategy, and Appendix 3 for the Embase strategy.

We assessed non‐English language papers, translated them as necessary, and evaluated them for inclusion.

We did not search trials registries.

Databases searched

  • Cochrane Movement Disorders' Trials Register (July 2020);

  • CENTRAL (2020, Issue 6) in the Cochrane Library, (searched July 2020);

  • MEDLINE (1977 to July 2020);

  • Embase (1977 to July 2020).

Searching other resources

The search strategy also included:

  • searches of reference lists of located trials and review articles concerning botulinum toxin;

  • handsearch of abstracts of international congresses relevant to the fields of movement disorders and botulinum toxins (American Academy of Neurology, Movement Disorders Society, International Association of Parkinsonism and Related Disorders, and International Neurotoxin Association (1985 to July 2020));

  • personal communication with other researchers in the field;

  • contact with drug manufacturers;

  • whenever necessary, we contacted authors of published trials for further information and unpublished data.

Data collection and analysis

Selection of studies

Two review authors independently screened all titles and abstracts identified from searches to determine which ones met the inclusion criteria. We retrieved in full text any papers identified as potentially relevant by at least one review author, or those without an available abstract. Two review authors independently screened full‐text articles, with discrepancies resolved by discussion, and by consulting a third review author, where necessary, to reach consensus. We collated duplicate publications and presented our references by individual study. We outlined the screening and selection process in a PRISMA flow chart (Liberati 2009); see Figure 1.


Flow diagram for study selection

Flow diagram for study selection

Data extraction and management

Two review authors independently extracted data from included studies, using a piloted data extraction form. We resolved any discrepancies by discussion, until consensus was reached, or through consultation with a third review author, where necessary. Data extracted included the following items from each study.

  • Participants: inclusion and exclusion criteria, demographics and clinical baseline characteristics, number and reasons for dropping out, exclusions, and loss to follow‐up, if any

  • Interventions: full description of intervention, duration of treatment period and follow‐up, providers, and co‐interventions, if any

  • Comparisons: number of randomised participants to each arm, compliance and number of dropouts, reasons for dropping out, and ability to perform an intention‐to‐treat analysis

  • Outcomes: definition of outcomes, use of validated measurement tools, time point measurements, change from baseline or post‐interventional measures, and missing outcomes, if any

  • Study design: interventional, randomised, controlled, double‐blind.

Assessment of risk of bias in included studies

We assessed the risk of bias of included studies according to the domains described in the Cochrane tool for assessing risk of bias, and classified the risk of bias for each domain as high, unclear, or low, and the overall assessment as high or low (Higgins 2011a). We assessed two further domains, which are described below: enriched population and independent funding. We used the following definitions for each domain in the 'Risk of bias' assessment.

  • Random sequence generation (checking for possible selection bias). We assessed the method used to generate the allocation sequence as: low risk of bias (any truly random process, e.g. random number table; computer random number generator); unclear risk of bias (method used to generate sequence not clearly stated); high risk of bias (non‐random process used, e.g. allocation by birth year or by judgement).

  • Allocation concealment (checking for possible selection bias). We assessed the method used to conceal allocation to interventions prior to assignment, to determine whether intervention allocation could have been foreseen in advance of, or during recruitment, or changed after assignment. We assessed the methods as: low risk of bias (e.g. telephone or central randomisation; consecutively numbered, sealed, opaque envelopes); unclear risk of bias (method not clearly stated); high risk of bias (e.g. open list).

  • Blinding of participants and personnel (checking for possible performance bias). We assessed the methods used to blind study participants and personnel from knowledge of which intervention a participant received. We assessed methods as: low risk of bias (study stated that it was blinded and described the method used to achieve blinding, such as identical tablets matched in appearance or smell, or a double‐dummy technique); unclear risk of bias (study stated that it was blinded but did not provide an adequate description of how it was achieved). Studies that were not double‐blind were considered to have high risk of bias.

  • Blinding of outcome assessment (checking for possible detection bias). We assessed the methods used to blind outcome assessors from knowledge of which intervention a participant received. We assessed the methods as: low risk of bias (study had a clear statement that outcome assessors were unaware of treatment allocation, and ideally described how this was achieved); unclear risk of bias (study stated that outcome assessors were blind to treatment allocation but lacked a clear statement on how it was achieved). We considered studies where outcome assessment was not blinded as having a high risk of bias.

  • Selective reporting (checking for reporting bias). We assessed whether primary and secondary outcome measures were pre‐specified, and whether these were consistent with those reported. We assessed selective reporting as: low risk of bias (studies reporting results for primary and secondary outcomes); unclear risk of bias (study reporting insufficient information to permit judgement); high risk of bias (not all pre‐specified outcomes reported, or only reported for certain data collection time points).

  • Incomplete outcome data (checking for possible attrition bias due to the amount, nature, and handling of incomplete outcome data). We assessed the methods used to deal with incomplete data as: low risk (< 10% of participants did not complete the study, trial authors used ‘baseline observation carried forward’ analysis, or both); unclear risk of bias (used 'last observation carried forward' analysis); high risk of bias (used 'completer' analysis).

In addition to these criteria, we also added two more items for consideration.

  • Enriched population. Because the clinical effect of botulinum toxin treatment is easily perceived, participants not naive to botulinum toxin are likely to recognise the presence or absence of beneficial clinical effects, or frequent adverse events, or both, effectively revealing the respective allocation arm. It is also relevant that by preferentially including responders to botulinum toxin or excluding non‐responders to botulinum toxin, there is an increased likelihood that these participants would respond more favourably to botulinum toxin than a naive population would. We opted to subdivide this domain in two: preferential enrolment of known positive responders to botulinum toxin; and exclusion of known poor responders to botulinum toxin.

    • Low risk of bias: at least 70% of trial participants were naive to treatment with botulinum toxin; the trial did not exclude any particular form of blepharospasm.

    • Unclear risk of bias: the trial did not make explicit the percentage of participants who were known to be naive to botulinum toxin.

    • High risk of bias: arbitrarily defined as more than 30% of participants who were not naive to botulinum toxin; explicit exclusion of people with forms of blepharospasm associated with a poorer response to botulinum toxin.

  • For‐profit bias. In order to assess the study source of funding, we added this domain

    • Low risk of bias: the trial appeared to be free of industry sponsorship or other types of for‐profit support that may introduce bias into trial design, conduct, or trial results.

    • Unclear risk of bias: the trial may or may not be free of for‐profit bias, as the trial did not provide any information on clinical trial support or sponsorship.

    • High risk of bias: the trial was sponsored by industry or received other type of for‐profit support.

Measures of treatment effect

We compared disorder‐related symptoms at baseline to symptoms in weeks four to six post‐injection in the BtA and placebo arms. We extracted continuous outcomes whenever possible, pooled the data from the studies, where adequate, and used them for comparison.

Dichotomous data

We based analysis of these data on the number of events and the number of people assessed in the intervention and comparison groups. We used these to calculate the risk ratio (RR) and 95% confidence interval (CI).

Continuous data

We based analysis of these data on the mean, standard deviation (SD), and number of people assessed for both the intervention and comparison groups to calculate mean difference (MD) and 95% CI. Where the MD was reported without individual group data, we used this to report the study results. If more than one study measured the same outcome using different validated tools, we calculated the standardised mean difference (SMD), namely Hedges’ (adjusted) g, and 95% CI (Hedges 1985). For interpretation of effect sizes with SMDs, we used a rule of thumb to define a small effect (SMD = 0.2), a moderate effect (SMD = 0.5), or a large effect (SMD = 0.8; (Cohen 1988)). If necessary for comparison, we dichotomised rating scales using each study author's own criteria for improvement or no improvement.

Time‐to‐event data

We planned to analyse these data based on log hazard ratios (HR) and standard errors (SE) obtained from results of Cox proportional hazards regression models. We had planned to use these in order to calculate a HR and 95% CI.

Unit of analysis issues

Whenever the included studies had multiple arms with different doses of botulinum toxin, we combined all groups to create a single pair‐wise comparison, using the Review Manager 5 calculator, according to the methods suggested by Cochrane (Higgins 2011b; Review Manager 2014). We also would have opted to create a single, pair‐wise comparison in cases when multiple treatment groups, using different interventions (e.g. onabotulinumtoxinA and abobotulinumtoxinA), were compared to the same comparator.

This method combines all relevant experimental intervention groups of the study into a single group, and all relevant control intervention groups into a single control group. This approach avoids the duplication of the control group, which would happen if multiple comparisons (e.g. BtA dose 1 versus placebo; BtA dose 2 versus placebo) were included in the meta‐analysis, as well as the loss of information if one dose group is chosen over the others. If applicable, we plan to explore the effect of dose in subgroup analysis.

For dichotomous outcomes, we planned to sum both the sample sizes and the numbers of people with events across groups. For continuous outcomes, we planned to pool means and standard deviations in a meta‐analysis (Higgins 2011b; Higgins 2011c).

Dealing with missing data

For missing outcome or summary data, we used imputation methods to derive the missing data (where possible), and reported any assumptions in the review. In these cases, we carried out sensitivity analyses to investigate the effects of any imputed data on pooled effect estimates.

As the first option, we used the available information (e.g. standard error (SE), 95% CI, or exact P value) to recover the missing data algebraically (Higgins 2011b; Higgins 2011c; Wiebe 2006). When change from baseline SD data were not reported, or we were unable to extract them, we attempted to create a correlation coefficient, based on another study in the review, and then used this correlation coefficient to impute a change from baseline SD (Abrams 2005; Follmann 1992; Higgins 2011b).

If this failed, and there was at least one sufficiently large and similar study, we planned to use a method of single imputation (Furukawa 2006; Higgins 2011b).

Lastly, if there were a sufficient number of included studies with complete information, we planed to use multiple imputation methods to derive missing data (Carpenter 2013; Rubin 1991).

If none of these methods proved successful, we planned to conduct a narrative synthesis for the data in question.

Assessment of heterogeneity

We assessed whether studies were similar enough to allow pooling of data using meta‐analysis. Where data were pooled using meta‐analysis, we assessed the degree of heterogeneity by visual inspection of forest plots and by examining the Chi² test for heterogeneity (Deeks 2011). We quantified heterogeneity using I² (Higgins 2003). We considered an I² value of 50% or more to represent substantial levels of heterogeneity, but interpreted this value in light of the size and direction of effects, and the strength of the evidence for heterogeneity, based on the P value from the Chi² test.

Assessment of reporting biases

We included too few studies in this review, i.e. fewer than 10, to allow construction of a funnel plot (Sterne 2001), and formal testing of asymmetry, which may indicate publication bias (Peters 2006). Should enough studies be included in future updates of this review, we plan to undertake these analyses.

Data synthesis

We performed the analyses with Review Manager 5 (Review Manager 2014), Stata version 15 (Stata), and Trial Sequential Analysis (TSA; (Thorlund 2011; TSA 2011)).

Meta‐analysis

We based the decision of whether or not to meta‐analyse data on an assessment of whether the interventions in the included trials were similar enough in terms of participants, settings, intervention, comparison, and outcome measures to ensure meaningful conclusions from a statistically pooled result. We conducted data synthesis using a random‐effects model.

We pooled effect measures by applying the Mantel‐Haenszel method for dichotomous outcomes, and applied the inverse‐variance or generic inverse‐variance method for continuous outcomes. We had planned to pool time‐to‐event data using the generic inverse‐variance method. We presented all results with 95% CI.

We calculated the number of participants needed to treat for an additional beneficial outcome (NNTB) and for an additional harmful outcome (NNTH) from meta‐analysis estimates, rather than treating data as if they came from a single trial, as the latter approach is more prone to bias, especially when there are significant imbalances between groups within one or more trials in the meta‐analysis (Altman 2002). However, caution is needed in the interpretation of these findings, since they may be misleading because of variation in the event rates in each trial, differences in the outcomes considered, and differences in clinical settings (Smeeth 1999).

Where there were no data that could be combined in a meta‐analysis, we undertook a narrative approach to result synthesis.

Assessing the certainty in the evidence

As recommended by the GRADE Working Group methodology, two review authors independently assessed all of the outcomes in the following domains: study limitations, inconsistency, indirectness, imprecision, and publication bias (Schünemann 2011). In case of disagreement, the review authors attempted to reach consensus, consulting an independent third review author if necessary. For this purpose, we used the GRADEpro GDT software tool, which we then used to export a 'Summary of findings' table into the review text (GRADEpro GDT).

To ensure the consistency and reproducibility of GRADE judgements, we applied the following criteria to each outcome.

  • Study limitations: we downgraded once if more than 30% of participants were from studies classified as being at a high risk of bias across any domain, with the exception of for‐profit bias.

  • Inconsistency: we downgraded once if heterogeneity was statistically significant, or if the I² value was more than 40%. When we did not perform a meta‐analysis, we downgraded once if trials did not show effects in the same direction.

  • Indirectness: we downgraded once if more than 50% of the participants were outside the target group.

  • Imprecision: we downgraded once if the optimal information size was not met, or alternatively, if it was met but the 95% CI failed to exclude important benefit or important harm (Guyatt 2011).

  • Publication bias: we downgraded once if there was direct evidence of publication bias, or if estimates of effect were based on small scale, industry‐sponsored studies, which raised a high index of suspicion of publication bias.

We applied the following definitions to the certainty in the evidence (Balshem 2011):

  • high certainty: we are very confident that the true effect lies close to that of the estimate of the effect;

  • moderate certainty: we are moderately confident in the effect estimate; the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different;

  • low certainty: our confidence in the effect estimate is limited; the true effect may be substantially different from the estimate of the effect;

  • very low certainty: we have very little confidence in the effect estimate; the true effect is likely to be substantially different from the estimate of effect.

'Summary of findings' table

We included a 'Summary of findings' table to present the main findings of this review in a simple tabular format, based on the results of the GRADE analysis. Version 3 was used for ease of interpretation (Carrasco‐Labra 2016).

Subgroup analysis and investigation of heterogeneity

We planned subgroup analyses for the following areas, independently of the presence of significant heterogeneity.

  • Different BtA formulations

  • Different BtA doses, all defined arbitrarily: high (Botox or Xeomin > 200 U; Dysport = 1000 U), medium (Botox or Xeomin 100 U to 200 U; Dysport = 500 U), and low (Botox or Xeomin < 100 U; Dysport = 250 U

Sensitivity analysis

We conducted sensitivity analyses for every study for which we applied imputation methods.

Results

Description of studies

We identified three new studies for inclusion in this update (Jankovic 2011; Mitsikostas 2018; Truong 2008).

These were parallel‐designed studies comparing botulinum toxin type A (BtA; different total treatment doses) with placebo, with a total of 313 participants with blepharospasm.

See also Characteristics of included studies.

Results of the search

See: Figure 1, flow diagram of study selection.

We last ran the electronic search in July 2020. The search returned 1126 records (390 through CENTRAL; 286 though MEDLINE; 450 through Embase), resulting in 784 records after removing all duplicates. After title and abstract screening, we assessed 18 articles for full‐text screening. We included three for both the qualitative and quantitative syntheses.

We excluded one trial for including participants in which the blepharospasm was part of a more complex syndrome (Jankovic 1987),; eight for not including a placebo group; two for possibly not being randomised (Fahn 1985; Girlanda 1996); two for not being randomised (Aramideh 1995; Jankovic 1995); another two without usable study data (Frueh 1988; Park 1993); and one for being a duplicate of an included study (Pagan 2018).

Included studies

See Characteristics of included studies.

The three trials enrolled a total of 313 adult participants, with a mean age of 61.2 years (range 18 years to 82 years), 208 of whom were female (66%). Sample size varied from 61 to 129 participants. All were multicentre trials.

Participants' baseline characteristics differed between trials. The mean duration of blepharospasm was only reported in one trial; 233 months in the BtA arm and 278 months in the placebo arm (Jankovic 2011).

The overall disorder‐related impairment at baseline was moderate to severe in Jankovic 2011, with an average total Jankovic Rating Scale (JRS) score of 5.87 in the BtA arm and 5.76 in the placebo arm. The other two trials did not report any impairment information.

Only Mitsikostas 2018 exclusively enrolled participants who had never been exposed to botulinum toxin, and did not enrich the trial population by excluding participants with clinical forms of blepharospasm associated with a poorer response to botulinum toxin. We deemed both other studies to be at high risk of bias for this domain. As a result, the population characteristics across studies did not allow us to conduct a subgroup analysis for people naive and non‐naive to botulinum toxin. Overall, the number of dropouts was low in both trials that reported this outcome, with 6 dropouts from the BtA (5%) groups and two dropouts from the placebo (4%) groups (Jankovic 2011; Mitsikostas 2018). The reasons for dropping out were not adequately reported in any of the included trials.

The duration of trials ranged from 6 weeks to 20 weeks post‐injection. All trials assessed efficacy and other primary outcomes using an intent‐to‐treat (ITT) analysis, which included all participants randomised to treatment.

Excluded studies

We listed all the excluded studies in this review, together with reasons for their exclusion, in the Characteristics of excluded studies table.

Risk of bias in included studies

See Characteristics of included studies: 'Risk of bias' table.

We evaluated the studies using a modified version of the Cochrane 'Risk of bias' tool. See Figure 2 and Figure 3 for the 'Risk of bias' summary graphs. These assessments were based on the information available in the primary report data.


'Risk of bias' graph: review authors' judgements about each 'Risk of bias' item presented as percentages across all included studies

'Risk of bias' graph: review authors' judgements about each 'Risk of bias' item presented as percentages across all included studies


'Risk of bias' summary: review authors' judgements about each 'Risk of bias' item for each included study

'Risk of bias' summary: review authors' judgements about each 'Risk of bias' item for each included study

Overall, we considered none of the studies to be at low risk of bias across all domains. All trials were industry‐funded, and therefore, we judged them at a high risk of for‐profit bias. We assessed Mitsikostas 2018 largely based on the information available in the clinicaltrials.gov web page.

Allocation

We judged Jankovic 2011 and Truong 2008 to have a low risk of selection bias, as both provided an adequate description of both the randomisation and allocation concealment methods used. Mitsikostas 2018 did not describe the method of randomisation, though we judged allocation concealment to be adequate.

Blinding

Overall, we considered there was an unclear risk of performance and detection bias in Jankovic 2011 and Truong 2008, as there was insufficient information to make adequate judgements across multiple domains. We judged Mitsikostas 2018 to be at low risk of bias for both domains.

Incomplete outcome data

The number of dropouts was low in all groups in Mitsikostas 2018, and unlikely to introduce bias into the study results. The risk of attrition bias was low in Jankovic 2011. Truong 2008 reported a very high proportion of dropouts in the placebo arm (64%), which may have introduced a relevant bias in the interpretation of the trial results.

Selective reporting

All trials had a low risk of selective reporting bias.

Other potential sources of bias

For‐profit bias

All trials were fully funded by pharmaceutical companies, and therefore we judged them at a high risk of for‐profit bias.

Enriched population

Both Jankovic 2011 and Truong 2008 included an enriched population, which may limit the internal reliability of the data, the former predominantly enrolling positive responders to BtA, and the latter excluding poor responders to BtA. Mitsikostas 2018 included a treatment‐naive population, and we judged it at a low risk of bias in this domain.

Publication bias

We intended to use funnel plots to explore publication bias. However, due to the small number of included studies, the power of this analysis was considered to be inadequate (Sterne 2011).

Effects of interventions

See: Summary of findings 1 Botulinum toxin type A compared to placebo for blepharospasm

The key results of this review can be found in 'summary of findings Table 1'.

Primary outcomes

Blepharospasm‐specific improvement
1.1 Overall blepharospasm‐specific improvement

Only Jankovic 2011 reported overall blepharospasm‐specific improvement, using the total Jankovic Rating Scale (JRS) score.

BtA resulted in a 1.5 point improvement over placebo (95% confidence interval (CI) 0.8 to 2.3; 1 trial, 109 participants). We have moderate confidence that BtA may improve overall blepharospasm‐specific status.

1.2 Overall blepharospasm‐specific severity improvement

Treatment with BtA improved blepharospasm‐specific severity, as measured with the JRS severity subscore (mean difference (MD) 0.93, 95% CI 0.61 to 1.25; I² = 9%; 3 trials, 313 participants; moderate‐certainty evidence; Analysis 1.1).

1.3 Overall blepharospasm‐specific disability improvement

Treatment with BtA improved blepharospasm‐specific disability, measured with the JRS disability subscore (MD 0.69, 95% CI 0.18 to 1.19; I² = 74%; 3 trials, 313 participants; low‐certainty evidence; Analysis 1.2).

Adverse events

Adverse events related to study treatment were reported in 53% of BtA‐treated participants, compared to 45% of placebo‐treated. Treatment with BtA did not increase the risk of adverse events, when compared with placebo (risk ratio (RR) 1.18, 95% CI 0.87 to 1.60; I² = 0%; 2 trials, 169 participants; low‐certainty evidence; Analysis 1.3).

Treatment with BtA increased the risk of visual complaints (diplopia, blurred vision, and visual disturbance; RR 5.73, 95% CI 1.79 to 18.36; I² = 52%; 2 trials, 228 participants; moderate‐certainty evidence; Analysis 1.4), and eyelid ptosis (RR 4.02, 95% CI 1.61 to 10.00; I² = 39%; 3 trials, 289 participants; high‐certainty evidence; Analysis 1.5).

BtA did not increase the risk of increased lacrimation (RR 2.04, 95% CI 0.46 to 9.13; I² = 0%; 2 trials, 228 participants; moderate‐certainty evidence; Analysis 1.6) or xerophthalmia (RR 1.83, 95% CI 0.69 to 4.86; I² = 0%; 2 trials, 228 participants; moderate‐certainty evidence; Analysis 1.7).

Secondary outcomes

Subjective evaluation of clinical status

Two trials contributed data for this outcome (Jankovic 2011; Mitsikostas 2018).

We found a large improvement with BtA compared to placebo (SMD 0.86, 95% CI 0.53 to 1.20; I² = 0%; 2 trials, 170 participants; moderate‐certainty evidence; Analysis 1.9).

Frequency of blepharospasm‐specific involuntary movements

Two trials contributed data for this outcome (Jankovic 2011; Truong 2008).

We found a large improvement with BtA compared to placebo (standardised mean difference (SMD) 0.79, 95% CI 0.31 to 1.27; I² = 58%; 2 trials, 252 participants; low‐certainty evidence; Analysis 1.8).

Health‐related quality of life

None of the included trials reported data for this outcome.

Tolerability ‐ dropouts

Two trials contributed data for this outcome (Jankovic 2011; Mitsikostas 2018.

BtA did not increase the risk of dropouts (RR 1.44, 95% CI 0.31 to 6.79; I² = 0%; 2 trials, 169 participants; moderate‐certainty evidence; Analysis 1.10).

Duration of effect

Only Jankovic 2011 reported this outcome.

This trial estimated a effect duration of 10.6 weeks (range 6.1 weeks to 19.1 weeks; 1 trial, 109 participants; low‐certainty evidence).

Discussion

Summary of main results

This updated review included three randomised, parallel‐designed trials, that enrolled 313 people with blepharospasm, 61% of whom had been previously treated with botulinum toxin for their condition.

As can be seen in the summary of findings Table 1, in comparison to placebo, botulinum toxin type A (BtA) improved blepharospasm‐specific symptoms. Treatment with BtA also increased the likelihood that participants themselves would detect an improvement. Uncertanity remains over the effect of BtA on people's quality of life, as this outcome was not reported in the included trials.

Treatement with BtA did not increase the risk of experiencing an adverse event, though it did increase the risk of two specific adverse events of special interest – vision complaints (diplopia, blurred vision, and visual disturbance) and eyelid ptosis. No fatalities or serious adverse events were considered to be related to BtA treatment in any trial. Treatment with BtA did not increase the risk of dropouts from the included clinical trials. Data for special subpopulations, such as children and pregnant women, were not available. The duration of response is in accordance with previous observational data.

Overall completeness and applicability of evidence

All included trials addressed the primary research question directly, using similar and validated assessment tools. However, they did not report data for all outcomes of clinical interest. This limited the amount of data available, and consequently, the confidence in overall conclusions.

The participants included in the trials were not fully representative of the overall population of people with blepharospasm. The effects of population enrichment and the moderate overall impairment (as assessed by the baseline Jankovic Rating Scale (JRS) severity subscores) preclude definite conclusions concerning all people with this condition. The proportion of participants with any adverse event was high in both the BtA and placebo arms. A large nocebo effect, as is common in movement disorders research, may mask safety conclusions (Duarte 2018; Rato 2018; Rato 2019; Silva 2017).

Three noteworthy factors challenge the implementation of the evidence in this review. First, sample size across included trials was relatively small, and subgroup analyses addressing clinically relevant questions for the main outcomes would have been underpowered if conducted. More studies are needed to provide robust evidence for these questions. Second, the use of enriched populations in clinical trials limits applicability of results into clinical practice, as complex and potentially poorer responders are usually excluded from these trials. The fact that such individuals are common in clinical practice further complicates issues of generalisation. Third, it is common for people with blepharospasm to be taking concomitant medications for their condition, such as muscle relaxants and benzodiazepines. Reasonably, participants in trials are required to be on a stable dose of these medications for many weeks to avoid confounding factors. As a result, little is known at present about the impact of these drug regimens with regard to implementation of the evidence in this review.

Quality of the evidence

See Characteristics of included studies, 'Risk of bias' tables, 'Risk of bias' summary tables (Figure 2; Figure 3), and summary of findings Table 1.

We considered all included trials at high risk for for‐profit bias, and all but one had an enriched population. We considered all studies to be appropriately blinded in general. However, we considered all but one possibly biased regarding subjective outcome assessment, as all but one predominantly enrolled participants with previous exposure to botulinum toxin. This represents a major methodological limitation that may have resulted in a biased assessment of the intervention effect, particularly with regards to subjective outcomes, which are highly susceptible to biased estimations.

The included trials each enrolled between 61 and 129 participants, and although individually, some of these trials were underpowered, the pooling of the trials permitted an adequate sample size for the efficacy outcomes.

Taken together, as can be seen in summary of findings Table 1, we consider that there is moderate certainty in the evidence that a single treatment session of BtA improves overall blepharospasm‐specific severity. There is low certainty in the evidence that the likelihood risk of any adverse event, is low. The certainty in the evidence assessing the change in subjective evaluation of clinical status evaluated by participants is high. We have low certainty that BtA reduced the frequency of blepharospasm‐specific involuntary movements. Finally, we have moderate certainty in the evidence that treatment with BtA does not increase the likelihood of participants dropping out of clinical trials.

Potential biases in the review process

Although we followed the methods recommended by Cochrane in order to minimise bias in the review process, certain areas do deserve attention. In particular, we did not search clinical trials registries. Although this opens the current review to the potential bias of having missed trials, we consider this possibility highly unlikely because we extensively contacted other experts in this field, and USA and European trials in this area are well‐known.

Agreements and disagreements with other studies or reviews

As the previous version of this review was unable to include any trials, the results of this update are novel (Costa 2005). The current clinical practice guidelines of the American Academy of Neurology and the European Academy of Neurology are in agreement that BtA is likely safe and effective for the treatment of blepharospasm (Albanese 2011; Simpson 2016).

Flow diagram for study selection

Figures and Tables -
Figure 1

Flow diagram for study selection

'Risk of bias' graph: review authors' judgements about each 'Risk of bias' item presented as percentages across all included studies

Figures and Tables -
Figure 2

'Risk of bias' graph: review authors' judgements about each 'Risk of bias' item presented as percentages across all included studies

'Risk of bias' summary: review authors' judgements about each 'Risk of bias' item for each included study

Figures and Tables -
Figure 3

'Risk of bias' summary: review authors' judgements about each 'Risk of bias' item for each included study

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 1: Blepharospasm‐specific severity

Figures and Tables -
Analysis 1.1

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 1: Blepharospasm‐specific severity

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 2: Blepharospasm‐specific disability

Figures and Tables -
Analysis 1.2

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 2: Blepharospasm‐specific disability

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 3: Adverse events

Figures and Tables -
Analysis 1.3

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 3: Adverse events

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 4: Vision complaints (diplopia, blurred vision, visual disturbance)

Figures and Tables -
Analysis 1.4

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 4: Vision complaints (diplopia, blurred vision, visual disturbance)

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 5: Eyelid ptosis

Figures and Tables -
Analysis 1.5

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 5: Eyelid ptosis

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 6: Increased lacrimation

Figures and Tables -
Analysis 1.6

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 6: Increased lacrimation

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 7: Xerophthalmia

Figures and Tables -
Analysis 1.7

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 7: Xerophthalmia

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 8: Frequency of blepharospasm‐specific involuntary movements

Figures and Tables -
Analysis 1.8

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 8: Frequency of blepharospasm‐specific involuntary movements

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 9: Subjective participant evaluation

Figures and Tables -
Analysis 1.9

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 9: Subjective participant evaluation

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 10: Tolerability – dropouts

Figures and Tables -
Analysis 1.10

Comparison 1: Botulinum toxin type A (BtA) versus placebo, Outcome 10: Tolerability – dropouts

Summary of findings 1. Botulinum toxin type A compared to placebo for blepharospasm

Botulinum toxin type A compared to placebo for blepharospasm

Patient or population: adults with blepharospasm
Setting: hospital‐based, movement disorders clinics
Intervention: botulinum toxin type A (BtA)
Comparison: placebo

Outcomes

Relative effect
(95% CI)

Anticipated absolute effects* (95% CI)

Certainty of the evidence
(GRADE)

What happens

Without BtA

With BtA

Difference (95% CI)

Blepharospasm‐specific severity

(assessed with JRS severity subscore (0 to 4; lower = better), measured between 4 to 6 weeks)

(3 RCT, 280 participants)

MD 0.93 higher
(0.61 higher to 1.25 higher)

⊕⊕⊕⊝
Moderatea

BtA likely reduces blepharospasm‐specific severity

Adverse events

(reported at any time)

(2 RCT, 169 participants)

RR 1.18
(0.87 to 1.60)

44.3%

53%
(42 to 77)

8.7% more
(6.3% fewer to 28.9% more)

⊕⊕⊝⊝
Lowa,b

BtA may result in little to no difference in adverse events

Subjective participant evaluation

(assessed with Patient Evaluation of Global Response (PEGR) (‐4 to +4; higher = better); measured between 4 to 6 weeks)

(2 RCT, 170 participants)

SMD 0.86 higher
(0.53 higher to 1.2 higher)

⊕⊕⊕⊕
Highb,c

BtA results in improved subjective patient evaluation

Frequency of blepharospasm‐specific involuntary movements

(assessed with Frequency of Involuntary Movement (FIM) scale (1 to 5; lower = better) or JRS frequency subscore (0 to 4; lower = better),; measured between 4 to 6 weeks)

(2 RCT, 229 participants)

SMD 0.79 higher
(0.31 higher to 1.27 higher)

⊕⊕⊝⊝
Lowa,b

BtA may reduce blepharospasm‐specific involuntary movements

Health‐related quality of life

None of the included trials reported this outcome

Tolerability

(number of dropouts; reported at any time)

(2 RCT, 169 participants)

RR 1.44
(0.31 to 6.79)

3.7%

5.3%
(1.1% to 25.1%)

1.6% more
(2.6% fewer to 21.4% more)

⊕⊕⊕⊝
Moderateb

BtA likely does not increase dropouts

Duration of effect

(when effects started to wane)

1 RCT (129 participants) estimated an effect duration of 10.6 weeks (range 6.1 to 19.1)

⊕⊕⊝⊝
Lowa,b

Results from BtA may last for 10.6 weeks

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; RR: Risk ratio; SMD: standardised mean difference; JRS: Jankovic Rating Scale

GRADE Working Group grades of evidence
High certainty. We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty. We are moderately confident in the effect estimate; the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty. Our confidence in the effect estimate is limited; the true effect may be substantially different from the estimate of the effect
Very low certainty. We have very little confidence in the effect estimate; the true effect is likely to be substantially different from the estimate of effect

aDowngraded one level due to serious study limitations; namely, concerns with blinding, attrition bias, and other biases, such as for‐profit bias
bDowngraded one level due to serious imprecision, due to low sample size

cUpgraded one level due to large effect size.

Figures and Tables -
Summary of findings 1. Botulinum toxin type A compared to placebo for blepharospasm
Comparison 1. Botulinum toxin type A (BtA) versus placebo

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1.1 Blepharospasm‐specific severity Show forest plot

3

Mean Difference (IV, Random, 95% CI)

0.93 [0.61, 1.25]

1.2 Blepharospasm‐specific disability Show forest plot

3

290

Mean Difference (IV, Random, 95% CI)

0.69 [0.18, 1.19]

1.3 Adverse events Show forest plot

2

169

Risk Ratio (M‐H, Fixed, 95% CI)

1.18 [0.87, 1.60]

1.4 Vision complaints (diplopia, blurred vision, visual disturbance) Show forest plot

2

228

Risk Ratio (M‐H, Fixed, 95% CI)

5.73 [1.79, 18.36]

1.5 Eyelid ptosis Show forest plot

3

289

Risk Ratio (M‐H, Fixed, 95% CI)

4.02 [1.61, 10.00]

1.6 Increased lacrimation Show forest plot

2

228

Risk Ratio (M‐H, Fixed, 95% CI)

2.04 [0.46, 9.13]

1.7 Xerophthalmia Show forest plot

2

228

Risk Ratio (M‐H, Fixed, 95% CI)

1.83 [0.69, 4.86]

1.8 Frequency of blepharospasm‐specific involuntary movements Show forest plot

2

Std. Mean Difference (IV, Random, 95% CI)

0.79 [0.31, 1.27]

1.9 Subjective participant evaluation Show forest plot

2

170

Std. Mean Difference (IV, Fixed, 95% CI)

0.86 [0.53, 1.20]

1.10 Tolerability – dropouts Show forest plot

2

169

Risk Ratio (M‐H, Fixed, 95% CI)

1.44 [0.31, 6.79]

Figures and Tables -
Comparison 1. Botulinum toxin type A (BtA) versus placebo