FormalPara Key Points

PML risk minimisation is a consideration in many drug development programmes involving investigational or approved immunomodulatory agents.

This programme for Risk Assessment and Minimisation of PML (RAMP) demonstrated that practical and effective screening and surveillance measures can be applied successfully in clinical trials to help minimise and assess the potential risk of PML associated with new drugs.

No cases of PML were identified during the development of vedolizumab following rigorous risk minimisation and assessment via the RAMP.

The overall number of new unexplained neurological events detected, and the number of costly or invasive procedures (i.e. magnetic resonance imaging, lumbar puncture) required by the stepwise evaluation were low in an inflammatory bowel disease population.

1 Introduction

Progressive multifocal leukoencephalopathy (PML) is a rare but serious demyelinating opportunistic infection of the central nervous system caused by the JC polyomavirus. This virus is near ubiquitous, with approximately 55–60% of healthy adults harbouring a clinically latent infection. However, PML rarely develops in immunocompetent hosts [1,2,3]. The infection was originally recognised for its association with severe immunocompromise, such as occurs with haematologic malignancies, human immunodeficiency virus infection/acquired immune deficiency syndrome and solid organ transplantation [4]. More recently, PML has also been reported in association with a number of immunosuppressive agents used to treat immune-mediated diseases and cancer, [2, 5,6,7,8,9,10], particularly monoclonal antibodies directed against cell surface integrins [11,12,13]. In 2005, three cases of PML were reported in clinical trials of natalizumab, among approximately 3000 patients, with a mean latency of 18 months [14,15,16]. The findings were surprising in light of the safety profile of natalizumab up to that point, and the absence of a prior association between its mechanism of action (i.e. antibody against α4 integrin) and PML. Efalizumab, a monoclonal antibody against αLβ2 integrin, was also shown to have an association with PML, which ultimately led to its withdrawal from the market [17]. Risk estimates of integrin therapeutics appeared to be elevated, even when compared with other classes of drugs associated with PML. These facts raised the possibility that other integrin antagonists that target the immune system might also be associated with an increased risk, necessitating appropriate PML risk minimisation and assessment measures in clinical trials [18]. However, the rarity of PML, along with its varied clinical presentation, render early diagnosis challenging. Currently, no validated tools exist to systematically screen for PML; therefore, systematic approaches toward PML risk minimisation with early detection of possible cases, both in clinical trials and other clinical settings, are needed. In this study, we describe the Risk Assessment and Minimisation of PML (RAMP) programme, designed to address this need. The RAMP was created for, and implemented during, the clinical development of vedolizumab, a monoclonal antibody to α4β7 integrin for the treatment of inflammatory bowel disease (IBD), ulcerative colitis (UC) and Crohn’s disease (CD).

In creating the RAMP to support vedolizumab development, it was unclear if an approach toward proactive risk minimisation could be validated in the absence of actual PML events. The exclusive binding of vedolizumab to integrin α4β7 inhibits the adhesion of certain types of B and T lymphocytes to mucosal addressin cell adhesion molecule (MAdCAM-1), which is primarily expressed on gut vascular endothelial cells. This fact made it unlikely that vedolizumab would result in general immunosuppression or an increased PML risk [19, 20].

Although cases of PML have been initially detected radiographically, diagnosis is most often based on clinical suspicion in at-risk patients who exhibit symptoms (typically a change in neurological symptoms or function). Thereafter, a confirmatory evaluation typically includes a combination of the following: brain magnetic resonance imaging (MRI) revealing characteristic white matter lesions, and either lumbar puncture with detection of JC viral DNA in cerebrospinal fluid (CSF) by polymerase chain reaction (PCR) or biopsy evidence of JC virus in brain tissue [21]. For these reasons, the RAMP focused on patient and study site staff education about the presenting signs and symptoms of PML through standardised teaching materials. The programme also utilised screening questionnaires to monitor PML risk throughout a patient’s time on-study, and a stepwise management algorithm for potential cases. When cases of persistent, new, unexplained neurological symptoms arose, they were adjudicated by a panel of experts with extensive expertise in diagnosing PML (all co-authors of this report).

In this study, we present the cumulative experience involving approximately 3000 patients with UC or CD over the course of 7 years of vedolizumab clinical development. From these data, we propose that the RAMP constitutes a staged, pragmatic risk management tool for use in clinical trials of investigational agents for which the possibility of drug-associated PML represents an important consideration.

2 Methods

2.1 Studies and Patients

The RAMP was implemented in six vedolizumab phase II and III clinical trials in patients with UC or CD (Table 1). These studies were conducted at approximately 300 study sites in nearly 40 countries in North America, Europe, Asia, Africa, and Australia.

Table 1 RAMP programme implementation: multiple-dose studies in patients with IBDa

Prior to study enrolment, informed consent regarding a potential risk of PML associated with administration of the study drug was obtained. The text of the informed consent form can be found in Electronic Supplementary Material 2.

Protocol eligibility criteria, including PML checklists (described below), were utilised to screen for patients who were either deemed to be at increased risk of PML based on a set of predefined criteria, or had other neurological diagnoses that would render an evaluation of PML more difficult, e.g. multiple sclerosis.

2.2 Subject and Staff Education

Patients were provided with a brochure written in lay language describing the risks of PML. as well as the presenting neurological symptoms for which to remain vigilant (Electronic Supplementary Material 3). Patients were also given a wallet card, which concisely summarised the information in the brochure, for portability and easy reference (Electronic Supplementary Material 4).

Study staff (including investigators, nurses, and clinical study coordinators) were trained on the pathophysiology and natural history of PML, including signs and symptoms of disease. They were also educated on the importance of the cessation of administration of vedolizumab and the expedited evaluation of suspected PML cases when suspicion of PML was high. A brochure suitable for medical personnel containing detailed technical language was provided to each member of the study staff (Electronic Supplementary Material 5). The principal investigators were generally gastroenterologists, rather than neurologists. For that reason, investigator training included a video demonstration of how to perform a targeted neurological examination to evaluate common clinical presentations of PML (e.g. aphasia, limb weakness, etc.). The video was provided to all investigators in DVD format for quick reference.

We required a local neurologist to be included as a sub-investigator at each clinical study site; however, PML is an uncommon disease and even experienced neurologists lack first-hand familiarity with the acute clinical presentation. For this reason, we trained the local neurologist on the important clinical features that might indicate a PML diagnosis, and provided the study protocol in advance of initiating the trial.

At investigator meetings and mid-study refresher web-based conferences, study staff were also trained on the PML checklists (described below) and the PML case evaluation algorithm, which were to be used at each patient visit to systematically evaluate and triage new, unexplained neurological findings.

2.3 Progressive Multifocal Leukoencephalopathy (PML) Checklists, PML Case Evaluation Algorithm, and Serum JC Virus Assay

An Independent Adjudication Committee (IAC) was formed with the purpose of providing an independent, unbiased opinion regarding the likelihood of PML in a subject who developed new neurological symptoms or signs, or who otherwise met criteria that indicated a clinical suspicion of PML. The subjective PML checklist and objective PML checklist can be found in Electronic Supplementary Material 6, and a description of how IAC experts were identified and the charter describing roles and responsibilities, process flow, and scoring systems for case evaluation can be found in Electronic Supplementary Material 2.

The IAC, together with the sponsor, created a PML case evaluation algorithm for sequential screening and diagnostic evaluation (Fig. 1). This algorithm follows a stepwise series of measures to diagnose PML, including brain MRI and lumbar puncture. A detailed description of the algorithm and its individual steps can be found in Electronic Supplementary Material 2.

Fig. 1
figure 1

RAMP programme case evaluation algorithm. CSF cerebrospinal fluid, IAC Independent Adjudication Committee, JCV JC virus, MRI magnetic resonance imaging, PCR polymerase chain reaction

Following publication of an American Association of Neurology (AAN) consensus position on diagnostic criteria for PML in 2013, these criteria were adopted into the RAMP to define PML cases [21].

Blood-based assessment of JC viral DNA by PCR was initially performed as an exploratory biomarker for PML risk based on the most current state of understanding in 2007. Sera were collected longitudinally and stored, then assessed retrospectively in batches. However, over the course of vedolizumab clinical development, this potential biomarker was found to lack predictive value [22]; hence, following discussion with key regulatory authorities, that assessment was discontinued. The JCV serologic assay, developed subsequently and ultimately proven to have predictive value, was not available during our investigation [23].

2.4 Adjudication of Potential Cases

IAC neuroradiologists were asked to utilise a 0–5 Likert scale (5 being highest) to assign the probability of PML based on the imaging findings (see Electronic Supplementary Material 1: Table S1). In the event of a discrepancy, the neuroradiologists were to confer and assign a single score.

Within 48 h of receiving the data, IAC members were expected to record their reviews on evaluation worksheets that they forwarded to the IAC chairperson, who then collated the judgements in these reviews and provided a final assessment of the case to the sponsor, documenting key findings and assessing the likelihood of PML based on the qualitative scale below early in the programme, and the AAN consensus criteria later in the programme.

  • Not PML.

  • Possible PML. More information is needed to exclude or confirm diagnosis.

  • Probable PML. Very strong clinical suspicion in the absence of CSF PCR results and brain histopathology results.

  • Definite PML. Positive CSF PCR analysis for JC virus or confirmed brain histopathology.

If the assessment could not be completed based on incomplete data, additional data were requested from the study site. Once all IAC members had performed their analysis, the IAC Chairperson provided a signed memorandum to the sponsor with a brief synopsis of key findings and the consensus assessment, either making a determination of PML or not PML, or requesting additional information. The IAC recommendations were not considered binding given the fact that clinical care was intended to occur according to local standards.

2.5 Post-Study Telephone Follow-Up

Once patients concluded study participation, they were contacted at 6-month intervals, for a period of 2 years, by a telephone call centre, a Clinical Research Organisation (CRO) representative or the enrolling site, to be asked focused questions about their health, including occurrence of surgery, hospitalisation, serious infection, cancer and clinical features suggestive of PML (see Electronic Supplementary Material 7). A minimum of four attempts at telephone calls were made to reach each patient at each time point, even if attempts at prior time points had been unsuccessful. The sponsor received email alerts for any positive responses.

3 Results

3.1 Checklist Findings

The vedolizumab phase III clinical development programme spanned 7 years and included 2884 patients (1114 with UC; 1770 with CD). The mean duration of exposure was approximately 450 days per patient (range 0–1865). Exposure duration of > 2 years and prior and current immunosuppressive use are well-recognised risk factors for PML with natalizumab. A total of 835 vedolizumab patients had over 2 years of exposure, approximately 80% had prior immunosuppressive use (thiopurines, methotrexate), and 30% had current use.

As of March 2013, 2913 patients (UC: 1142; CD: 1771) (Table 2) completed 57,986 subjective checklists (UC: 24,919; CD: 33,067). Positive subjective findings led to completion of 167 and 342 objective checklists in 95 UC patients and 189 CD patients, respectively. Very few patients (< 1.0%) indicated a positive response at baseline for the subjective checklist questions; those who did were excluded from enrolling. All neurologic signs and symptoms that led to positive subjective findings or abnormal objective findings are shown in Fig. 2. Positive responses were most commonly sensory in origin (e.g. paraesthesias, numbness, etc.), followed by visual findings and limb weakness less commonly reported.

Table 2 Summary of PML checklist results
Fig. 2
figure 2

Positive subjective checklist findings: signs and symptoms1. 1Subjective checklist symptoms: gait/coordination (e.g. bumping into objects, difficulty writing); confusion (e.g. problems with memory/thinking); vision/ocular motility (e.g. difficulty reading); sensation (e.g. loss, numbness); speaking (e.g. dysarthria, aphasia); comprehension (e.g. inability to follow serial commands); weakness/spasticity (e.g. pronator drift, lack of muscle strength)

3.2 Algorithm Utilisation

The IAC evaluated suspected cases involving 83 patients (UC: 24; CD: 59) (Table 3). Adjudication was achieved within 7 days for approximately two-thirds of these cases, which included 62 patients with abnormal objective checklist results (UC: 17; CD: 45) (Table 2) and 21 additional patients (UC: 7; CD: 14) who were referred with complaints as a precautionary measure, but without objective findings. Although further diagnostic evaluation in the latter group was not mandated by the RAMP, these patients were included here for the sake of comprehensive patient monitoring in light of the serious nature of PML.

Table 3 Summary of RAMP algorithm results

From the 83 patients, the IAC recommended brain MRIs in 56 patients (Table 3). The MRI results were inconsistent with PML, with a limited number of patients scoring 1–2 on the radiographic scale in the charter, and no individual patient having a score of 3 or higher. Diagnostic evaluation in 5 patients included lumbar puncture with PCR analysis of CSF for JC virus (UC: 2; CD: 3), all of which were negative. Given the fact that lumbar puncture is an invasive procedure and a common penultimate step in confirming PML, case history narratives for those five patients are provided in Electronic Supplementary Material 1: Table S2. No PML cases were identified by the IAC through 14 March 2013. Vedolizumab was approved by the US FDA and the European Medicines Agency in May 2014, and received over 50 approvals in countries around the world thereafter.

3.3 Post-Study Follow-Up (Up to 2 Years Per Patient at 6-Month Intervals)

As of March 2013, 1382 of 1794 (77%) patients provided responses to at least one post-study telephone follow-up survey. For the 24-months post-study time point, data from 562 patients (67% of eligible patients) were available. There were no reports of symptoms or signs referable to a PML diagnosis during this period.

4 Discussion

Because no validated method exists to screen for PML, the RAMP was devised as a clinical trial tool to address this potential risk. It consisted of a comprehensive evaluation, which included (1) exclusion of patients at higher risk of PML; (2) exclusion of patients in whom PML diagnosis might be more difficult to ascertain; (3) monitoring all patients for their risk of PML throughout their duration of the study using screening questionnaires; (4) objective testing; (5) a sequential case evaluation algorithm; and (6) post-study telephone follow-up.

Although sequential MRI scans are helpful in the diagnosis of PML when patients are still asymptomatic, early symptoms are the most practical means for early detection of the onset of PML. Because early diagnosis has been demonstrated to improve the likelihood of survival with less permanent neurological injury, we considered the use of early and frequent screening via questionnaires an important objective [24].

Neurological symptoms are common and occur more frequently in the IBD population than in the general population [25], as do white matter lesions on MR imaging [26]. This fact could theoretically confound the diagnosis of PML in this population. Moreover, IBD patients are usually cared for by gastroenterologists rather than neurologists, a fact that could make timely diagnosis of drug-associated PML more challenging. As a result, it was not evident from the outset how to create a system that is both sensitive enough to detect PML cases early in their course, while maintaining specificity in the face of many transient neurological symptoms that are inconsistent with PML. It was also not known if non-neurologists would respond well to the trainings and gain comfort with the subjective and objective facets of testing for a neurological disease. The types of symptoms and signs indicative of PML that were most likely to be detected by the questionnaires are not routinely evaluated by the cohort of investigators involved in the vedolizumab trials. Finally, the RAMP as a previously unvalidated algorithm, necessitated a high degree of compliance with risk minimisation procedures and associated timelines to assess its potential usefulness. It was not known at the outset what level of compliance might be achievable.

The overall experience with implementation of the RAMP across the clinical trial programme for vedolizumab revealed a low percentage of positive subjective checklists. Because PML is typified by new progressive neurological dysfunction, it seemed unlikely for symptoms consistent with PML to be missed based on the questions posed. Furthermore, gastroenterologists were able to successfully administer objective testing as qualitatively assessed by trainings, queries, source data verification, interaction with investigators, and compliance metrics, resulting in minimal demand for consultation with study neurologists. MRIs were performed in few patients overall, and even fewer outside of algorithm requirements.

A very small number of patients (5, approximately 0.2% of the study population) underwent lumbar puncture, the most invasive procedure necessitated by the RAMP for establishing PML diagnosis. As for MRIs, a lumbar puncture for CSF JCV analysis was not indicated by algorithm in all instances; three of these were performed electively by the local caring physician, and, in two of those instances, CSF had been collected for other standard-of-care assessments. JC viral DNA was not detected in any CSF sample, and these negative findings (and the IAC’s judgements) were confirmed by the fact that PML did not develop in any patients during the follow-up period.

One could speculate that baseline or longitudinal MRIs might have avoided the need for lumbar puncture in at least a few subjects; however, experience from the RAMP shows that the on-study MRIs were generally interpretable, even in the absence of a baseline examination. Moreover, performing thousands of potentially unhelpful MRIs to possibly prevent a very limited number of lumbar punctures argues against their clinical utility. For vedolizumab, with the evidence for gut selective biology and the absence of any proven PML events, MRIs would have represented an extremely low yield approach toward risk minimisation. For a drug with a very different pharmacologic or benefit-to-risk profile, the relative merits of MRI monitoring could be considered as part of a tailored approach.

The data derived from this risk minimisation plan allow us to make useful inferences about the risk of PML in the vedolizumab-exposed population (the actual PML risk estimates from this dataset have been published elsewhere) [27]. The tools created and utilised are not vedolizumab or integrin antagonist specific. This fact lends strength and generalisability to other clinical development programmes where PML represents an actual or theoretical consideration. The checklists are simple and intuitive and do not require PML experts or neurologists to administer. The stepwise sequential evaluation mirrors clinical care and reserves more invasive testing for instances of higher clinical suspicion. Multiple checks and balances help ensure that new neurological symptoms are carefully followed and their evolution assessed in a timely manner. Given the rapidly progressive nature of clinical PML, it was important that the RAMP worked efficiently because delays in PML diagnosis could result in serious disability or death. In this global clinical trial programme with patients on five continents, adjudication was achieved within 7 days for approximately two-thirds of the cases; others required more time, often due to operational constraints (e.g. translation of medical documents, technical challenges with obtaining MRI scans, etc.). Although this was simply an assumption when the RAMP was devised, recent data suggest that early diagnosis of PML results in better outcomes [28]. Thus, the potential for investigators to receive expert feedback on PML likelihood and appropriate intervention measures within days is highly desirable. Patient and study staff education, training, and real-time support played an important role in clarifying the likelihood of development of this potentially devastating yet difficult-to-diagnose disease process, allowing the clinical development programme for vedolizumab to progress.

A limitation of the RAMP is the difficulty in ascertaining the predictive value of measures put in place in the absence of PML events. One option would be to utilise the RAMP or appropriate adaptation in a context with a better understood background rate of PML. Another limitation is the fact that the RAMP was performed uniquely in the clinical trial setting. The overwhelming amount of immunosuppressive use occurs in a real-world setting without case report forms, checklists or other systematic data collection methods. Some elements of the RAMP, e.g. elements of the informed consent language or educational materials, could potentially be adapted to serve real-world needs; however, pragmatic considerations could limit the utility of certain other RAMP components. While it is possible that PML cases could have gone undetected in spite of methods described herein, the authors have concluded that the extent of awareness and data collection via the RAMP, coupled with the progressive nature of PML, makes this possibility unlikely. Of note, since the approval of vedolizumab in many parts of the world in 2014, no cases of PML have been spontaneously reported, or otherwise identified, as of the submission of this manuscript [29].

5 Conclusions

The RAMP programme reported here demonstrates practical and effective screening and surveillance measures that can be used in clinical trials to manage potential PML risks of investigational or approved treatments. Implementation of the RAMP in a large global clinical trial programme was comprehensive and achievable, with high levels of compliance. No cases of PML were detected with vedolizumab in a patient population with at least some risk factors, suggesting a low likelihood of any relationship between vedolizumab mechanism of action and PML risk. Many elements of this plan could be utilised in other clinical trial programmes. Certain RAMP elements could be adapted for use in a real-world setting, however other measures are unlikely to be feasible or warranted. The RAMP cannot be validated in the absence of PML events; thus, future work should include assessing this programme in conjunction with drugs with proven PML risk. Additional work might include assessment of the RAMP by patients and healthcare professionals.