Scolaris Content Display Scolaris Content Display

Point‐of‐care viral load tests to detect high HIV viral load levels in HIV‐positive people on antiretroviral therapy

This is not the most recent version

Collapse all Expand all

Abstract

This is a protocol for a Cochrane Review (Diagnostic test accuracy). The objectives are as follows:

To estimate the diagnostic accuracy of point‐of‐care tests to detect high viral load levels in HIV‐positive people on ART.

Background

Access to antiretroviral therapy (ART) for HIV‐positive people has increased significantly over the years. It is estimated that 53% of HIV‐positive people globally were on ART in 2016 (UNAIDS 2017), up from 23% in 2010 (UNAIDS 2015). In sub‐Saharan Africa, about 12 million HIV‐positive people (47%) were on ART in 2015 (UNAIDS 2016), up from less than 100,000 in 2002 (UNAIDS 2015). In order to effectively sustain treatment for people on ART, it is essential to know the HIV viral load (VL) levels in those who are on treatment. VL (the number of HIV viral ribonucleic acid (RNA) particles per millilitre of blood) is the recommended monitoring approach to diagnose and confirm ART treatment failure (WHO 2016). VL is usually measured in plasma but some technologies use whole blood (UNITAID 2015). In Africa, it is estimated that less than 20% of people on ART received routine VL testing in Africa in 2013 (ASLM 2013). This could be partly be explained by poor access to VL testing services. Currently, VL testing is largely done on laboratory‐based platforms that involve sophisticated equipment requiring dedicated laboratory space, substantial financial resources, and trained laboratory technicians. These laboratory tests require venous blood collection, cold chain storage of collected samples, and instrument‐based sample processing techniques. With transport shortcomings being a common challenge in resource‐limited settings, delays in transporting samples to the laboratory and relaying test results back to the health centre lead to delays in changing therapy in cases of treatment failure. To overcome this challenge, point‐of‐care tests are increasingly being developed because they are potentially easy to use, cost‐effective, require minimal laboratory infrastructure, and may minimize the need for transporting samples to the laboratory. They could also potentially reduce patient waiting time and therefore reduce loss to follow‐up cases (UNITAID 2014; UNITAID 2015; WHO 2014).

Target condition being diagnosed

The target condition of this review is high HIV VL levels in blood or plasma of people infected with either HIV‐1 or HIV‐2 on ART. The main objective of ART is to reduce HIV VL to undetectable levels meaning that the concentration HIV RNA cannot be detectable by a test. In HIV‐infected people, it is therefore essential to know the VL levels after ART initiation. The higher the VL, the higher the increased risk of transmission when VL is detectable and the faster the CD4 cells and body's immune system are destroyed. Detectable VL can be a reflection of poor adherence to treatment or treatment failure once poor adherence is ruled out. Intermittent low level viraemia (50 copies/mL to 1000 copies/mL) not associated with treatment failure may also occur during effective treatment (Havlir 2001). Current World Health Organization (WHO) guidelines on ART define a high or detectable VL level to be 1000 copies/mL or greater and treatment failure as a persistently high VL concentration (1000 copies/mL or greater) in two consecutive measurements (with adherence support between measurements) (WHO 2016). Treatment failure should trigger evaluation or changing the antiretroviral drugs included in ART. Delayed detection of treatment failure may therefore lead to progression of HIV infection to AIDS or the resistance of the infection to ART or increase the risk of HIV transmission (UNITAID 2015; WHO 2013).

Index test(s)

In this Cochrane Review, we will estimate the accuracy of molecular point‐of‐care (POC) tests in detecting high VL levels (POC VL) after ART initiation. Molecular POC VL include semi‐quantitative and quantitative tests that quantify the copies of HIV virus in plasma or whole blood (UNITAID 2014; UNITAID 2015). Results are reported as HIV copies in a millilitre (copies/mL). There is no established optimal threshold for detecting VL concentration or defining virological failure (Fox 2012; Ritchie 2014; WHO 2013; WHO 2016). In 2013, the WHO lowered the threshold for detecting high VL levels from 5000 copies/mL to 1000 copies/mL based on evidence that below 1000 copies/mL intermittent low‐level viraemia (50 copies/mL to 1000 copies/mL) not associated with treatment failure can occur during effective treatment (Ritchie 2014; WHO 2013). Also, the risk of HIV transmission and progression of disease is minimal when VL concentration is less than 1000 copies/mL. Nonetheless, the lower limit of VL detection depends on the test and sample used. For example, a capillary sample from a finger prick may not accurately detect a VL level below 5000 copies/mL (ASLM 2013; UNITAID 2015).

Various definitions of POC testing have been proposed with no universally accepted definition (Drain 2014; UNITAID 2015). Some definitions consider technical characteristics of the test (rapid test with minimal infrastructure requirements) (Wu 2012), or its effect on management (linking to decision making at the same patient visit) (Pai 2012), or its location (at the patient site or near the treatment facility) (Drain 2014). A general definition of a POC test would be a diagnostic test that is administered near the patient or at a health facility, with a fast turnaround time, leading to a change in patient management (Schito 2012). In this review, we will include studies with VL tests that meet the ASSURED (Affordable, Sensitive, Specific, User‐friendly, Robust & Rapid, Equipment free, and Deliverable to end‐users) criteria developed by the WHO for the ideal rapid test for resource limited settings (Wu 2012). In resource‐limited settings, testing locations are often blurred as POC tests have been evaluated and implemented across a variety of healthcare and laboratory settings ranging from primary level next to patients (Level 1 facilities) to district (Level 2) and provincial levels (Level 3) (UNITAID 2015). Ideally, true POC tests are conducted on patient samples next to the patient or at the bedside in settings with minimaI laboratory and training requirements (Level 1 facilities). Tests otherwise referred to as near‐POC tests are conducted on patient samples away from the patient by technicians in slightly more developed laboratories (Level 2 and 3 facilities). For example, some POC tests detect VL in plasma (Ritchie 2014), and some settings may not have the infrastructure required for plasma separation from whole blood at the point of care (UNITAID 2015). To maximize the utility of our review, we will evaluate all forms of POC tests for VL regardless of the health facility setting in which the test was conducted.

Clinical pathway

The role of POC VL for monitoring response to ART will be to act as a replacement for laboratory‐based VL testing platforms in the current testing algorithms outlined in Figure 1.


Routine viral load testing algorithm. Abbreviations: ART: antiretroviral therapy.

Routine viral load testing algorithm. Abbreviations: ART: antiretroviral therapy.

In routine care, current WHO guidelines recommend that VL testing be done at six and 12 months after initiation of ART and repeated every 12 months thereafter. If the VL is detectable at any time (1000 copies/mL or greater), it is recommended that a patient undergoes intensive adherence support and repeat VL testing three to six months later. If the VL is still detectable and non‐adherence can be ruled out, a clinician may then decide to change to second‐line therapy (WHO 2013; WHO 2016).

In this review, we will focus on the accuracy of a single POC VL test done at one time point (for example, at six or 12 months after ART initiation). We believe that this evidence may be extrapolated to provide evidence that may guide the use of the test at other time points.

Alternative test(s)

Alternative HIV VL tests include non‐nucleic acid tests (non‐molecular tests) that detect HIV viral enzymes (reverse transcriptase) and HIV viral proteins (p24 antigen); markers that can be correlated to HIV RNA. These tests indirectly reflect VL concentration and are currently not commonly used (UNITAID 2015).

Alternative methods for monitoring response to ART include immunological monitoring through CD4 testing and clinical monitoring through WHO clinical staging. For example, in adults, a persistent CD4 count less than 100 cells/mm3 or a new or recurrent clinical condition indicative of WHO clinical stage 4 after six months of treatment is regarded as treatment failure. However, these methods are less sensitive and specific than VL testing and are not recommended as the first‐line approach for monitoring response to ART (Rutherford 2014). This may lead to delayed detection of treatment failure or lead to unnecessary therapy switches. In addition, the WHO revised its guidelines in 2013 to recommend that all HIV‐positive people be started on ART regardless of CD4 count and clinical status (WHO 2013). In this regard, using these criteria to monitor response to therapy will not be an accurate measure of treatment failure. Nonetheless, these alternative tests may still be used in areas that do not have access to VL testing (WHO 2013).

Rationale

In 2014, the Joint United Nations Programme on HIV/AIDS (UNAIDS) declared the 90‐90‐90 target; it aimed to have at least 90% of HIV‐positive people diagnosed, at least 90% of those diagnosed receiving ART, and at least 90% of those receiving ART having suppressed viral replication by 2020 (WHO 2016). POC VL tests being developed to detect HIV RNA and treatment failure in HIV‐positive people on ART in resource‐limited settings will be instrumental in checking if the third target will be met effectively. If these POC VL tests have a high level of accuracy, they can replace or complement laboratory‐based testing platforms because they are quicker to use and may minimize delays in initiating therapy or changing therapy in cases of treatment failure (UNITAID 2014; UNITAID 2015). A high sensitivity is required because false‐negative results will lead to a delay in detecting treatment failure or adherence concerns related to treatment. This will ultimately lead to progression to AIDS and mortality. A high specificity is also required because false‐positive results will lead to unnecessary switching to costly second‐line therapy. A test with an optimal combination of sensitivity and specificity is thus needed.

Objectives

To estimate the diagnostic accuracy of point‐of‐care tests to detect high viral load levels in HIV‐positive people on ART.

Secondary objectives

To investigate sources of heterogeneity in test accuracy estimates including age (children versus adults), test type (commercially available versus in‐house assays), sample type (whole blood versus plasma), test threshold (1000 copies/mL or greater versus other thresholds), location of testing (true‐POCs versus near‐POCs), geographical location (sub‐Saharan Africa versus other regions), and methodological quality (high versus low risk of bias).

Methods

Criteria for considering studies for this review

Types of studies

We will include any primary study that compares the results of the index test to that of a reference standard (cross‐sectional, prospective, and retrospective study designs or diagnostic accuracy studies performed within randomized trials), and those that provide sufficient data to create the 2 × 2 table to calculate sensitivity, specificity, and negative and positive predictive values.

We will exclude ecological studies and diagnostic case‐control studies in which the test performance was compared in participants with the target condition versus healthy controls, as specificity will be overestimated (Macaskill 2010).

We will exclude studies without a reference standard, case reports and case‐series studies, animal or laboratory studies, reviews, discussion papers, non‐research letters, commentaries, or editorials.

Participants

People infected with either HIV‐1 or HIV‐2 on ART irrespective of age and gender, from any healthcare or geographical setting. Though POC VL tests are mainly applicable to resource‐limited settings where the burden of HIV is high, we will not exclude evaluations conducted in resource‐rich settings in order to maximize utility of our review.

Index tests

We will include studies evaluating the accuracy of molecular POC VL. We will consider the current WHO recommended threshold (1000 copies/mL or greater) as the main threshold to define test positivity (WHO 2013; WHO 2016). We will also consider the previous WHO recommended threshold (5000 copies/mL or greater) (WHO 2010), and other thresholds that may have been used for test evaluations in the subgroup analyses.

Examples of POC VL include (but not limited to):

  • SAMBA I HIV‐1 Semi‐Quantitative Test;

  • SAMBA II HIV‐1 Semi‐Quantitative Test;

  • Alere q Analyser and Alere q HIV‐1/2 assay (quantitative whole blood assay);

  • Savanna RealTime HIV‐1 Viral Load assay (Quidel);

  • Cobas Liat analyser (Roche);

  • Xpert HIV‐1 Viral Load (Cepheid);

  • ZIVA (Cavidi);

  • Liat Analyser (IQuum Inc);

  • EOSCAPE HIV Rapid RNA Assay System;

  • True lab Real Time Micro PCR system (Molbio);

  • RT CPA HIV‐1 viral load.

Most of these tests are still in the pipeline except the SAMBA VL assay, which is available. The SAMBA‐Semiquantitative HIV tests use 200 µL of plasma or 120 µL of whole blood and have a total assay time of 90 minutes. They have been designed to detect VL concentrations of 1000 copies/mL or greater. SAMBA I is a semi‐automated test with a daily throughput of 16 to 48 samples whereas SAMBA II is fully automated with a daily throughput of four to 32 samples. SAMBA II is better suited for facilities with technicians and electricity (UNITAID 2014; UNITAID 2015).

Target conditions

A high HIV VL level in people infected with HIV‐1‐ or HIV‐2 on ART.

Reference standards

Laboratory‐based testing platforms to detect high VL levels taken at the same time (within 24 hours) as the sample for POC VL tests. Most laboratory‐based VL platforms are designed to detect the HIV virus in plasma that is extracted from a venous blood sample though centrifugation. Typical laboratories for VL technologies involve sophisticated equipment and have three rooms for sample extraction, reagent preparation, and amplification (and detection) of the HIV virus (UNITAID 2015). Examples of laboratory‐based platforms for VL include: nucleic acid‐based tests (NAT) including five commercially available reverse transcriptase polymerase chain reaction (RT‐PCR)‐based VL assays:

  • COBAS AmpliPrep/ COBAS TaqMan v2.0 (Roche);

  • RealTime HIV‐1 (Abbott);

  • VERSANT HIV RNA 1.0 (kPCR) (Siemens);

  • Artus HIV‐1 QS‐RGQ (QIAGEN);

  • RT‐TMA technology for Panther system (Hologic).

Current and previous WHO recommended thresholds to detect high HIV VL levels in plasma and classify a patient as having treatment failure include 1000 copies/mL or greater (WHO 2013; WHO 2016), and 5000 copies/mL or greater (WHO 2010). We will include data where the threshold of 1000 copies/mL were presented but also collect data of the 5000 copies/mL threshold.

Where studies have used a tie‐breaker approach (where a second test/polymerase chain reaction (PCR) is used for discordant results, we will include results for the first test/PCR only in the 2 × 2 tables to avoid inflation of sensitivity and specificity (Ritchie 2014).

Search methods for identification of studies

Electronic searches

We will conduct the search in the electronic databases; Cochrane Central Register of Controlled Trials (CENTRAL), MEDLINE, Embase, LILACS, the WHO International Clinical Trials Registry Platform (WHO ICTRP), Science Citation Index Expanded (SCI‐EXPANDED)/Conference Proceedings Citation Index‐ Science (CPCI‐S), and WHO Global Index Medicus from March 2015 without language, document type, or publication status limitations (Appendix 1).

Searching other resources

We will track reference lists of included studies, relevant systematic reviews, and conference proceedings (Conference on Retroviruses and Opportunistic Infections, International AIDS Society Conference and African Society for Laboratory Medicine). We will consult experts in the field such as the WHO HIV Department for potentially relevant studies.

Data collection and analysis

Selection of studies

We will deduplicate search results in EndNote X7. Two review authors (EO and AK) will independently screen the titles and abstracts of the search results to identify eligible articles. They will initially remove reports that are obviously not relevant based on title and abstract and remove duplicates as well. The two review authors (EO and AK) will then independently assess full texts of journal articles or conference proceedings for eligibility based on our a priori inclusion criteria. We will resolve any disagreements by consensus or by consulting a third review author (SM or JD). Justifications for excluding articles from the review will be documented in the 'Characteristics of excluded studies' table. We will present details of included studies in the 'Characteristics of included studies' table. We will present the study selection process in a PRISMA flow diagram.

Data extraction and management

We will extract information on study characteristics including: study design; demographic and participant characteristics; methods of collecting and preparing blood specimen; time point at which VL testing is done after ART initiation; index test and reference standard characteristics; test cut‐off and performance; main outcome data or results; number of true‐positive, false‐positive, false‐negative, and true‐negative results (Appendix 2).

Two review authors (EO and AK) will independently extract data. We will resolve any disagreements by discussion and will document all decisions. If we cannot reach a consensus, a third review author (SM or JD) will make the final decision on inclusion.

If we identify more than one publication for the same included study, we will consider the main publication as the one with more information; all others will be considered companion publications and we will only collect data if they had not been provided in the main publication to avoid double‐counting. We will collate the companion publications of the same study, so that each study rather than each report, is the unit of interest in the review.

Assessment of methodological quality

We will use the QUADAS‐2 (Quality Assessment of Diagnostic Accuracy Studies) tool to assess the risk of bias and applicability concerns of the included studies (Whiting 2011). We will tailor the tool in line with the context of our review question (Appendix 3). Two review authors (EO and AK) will independently assess included studies using the outlined tool in Appendix 3. We will resolve any disagreements by consensus or by consulting a third review author (SM or JD).

Statistical analysis and data synthesis

Our unit of analysis will be individual participants. For each study, we will identify the threshold(s) used to define test positivity and construct 2 × 2 tables (true positive, false positive, false negative, true negative) at the presented thresholds. We will perform the main analysis with study data using the current WHO recommended threshold (1000 copies/mL or greater) definition of test positivity (WHO 2016). We will undertake subgroup analyses separately at other commonly presented thresholds. Preliminary exploratory analyses on diagnostic accuracy will be conducted by plotting estimates of sensitivity and specificity from each study on Forest plots and in receiver operating characteristic (ROC) space. These analyses will enable visual assessment of the variation between studies, and will also facilitate investigations of heterogeneity for exploring the effect of certain characteristics on test performance.

If there are sufficient data (for example, four or more studies), we will use the bivariate model to estimate the summary sensitivity and specificity at the current WHO threshold (1000 copies/mL) for the main meta‐analysis and compare the accuracy of two or more tests. The bivariate model with random effects accounts for within‐study variability and correlation of sensitivity and specificity. This method models sensitivity and specificity directly at a common threshold (Macaskill 2010; Reitsma 2005). If bivariate models do not converge and so cannot give a model estimate, we will fit simplified univariable models for sensitivity and specificity separately, using a random‐effects model (Takwoingi 2017). If there are no false positives across all studies in a meta‐analysis (i.e. specificity is estimated at 100% in all studies), we will undertake a univariate random‐effects meta‐analysis of sensitivity, and compute the exact 95% confidence interval for the 100% specificity estimate using the total number of participants without disease across all studies as the denominator.

For comparisons between tests (if there is sufficient data), we will initially include all studies in the analysis (indirect comparison). Subsequently, if data are available, we will restrict the analyses to only studies that have compared tests in the same population, either within participants or between randomized groups (direct comparison). Such analyses are likely to produce results not confounded by differences in study or participant characteristics.

We will perform analyses using Review Manager 5 (RevMan 5) (RevMan 2014), and the meta‐analysis using STATA (STATA 2017), or SAS (SAS 2017).

Investigations of heterogeneity

If there are sufficient data, we will investigate sources of heterogeneity in estimates of test accuracy. We will add the following covariates to the bivariate model to assess its influence on test performance: age (children versus adults), test type (commercially available versus in‐house assays), sample (whole blood versus plasma), and location of testing (true‐POC versus near‐POC). We will flag studies that report mixed‐age populations (unclear data on data per age group) as age 'not reported' in the statistical models. If there are sufficient data we will estimate accuracy for individual tests as per manufacturer type. We will estimate sensitivity and specificity at other commonly used thresholds separately in the subgroup analyses as well.

Sensitivity analyses

If there are sufficient data, we will use sensitivity analyses to explore the effect of geographical setting and study quality. We will restrict analysis to studies conducted in sub‐Saharan Africa and to studies at low risk of bias for participant selection and high applicability for index test conduct.

Assessment of reporting bias

We will not assess reporting bias.

Assessment of strength of the evidence

We will summarize the main findings from the review, reporting the numbers of true positives, true negatives, false positive and false negatives per 1000 people tested in the 'Summary of findings' table. GRADE for diagnostic test accuracy reviews is still under development (Gopalakrishna 2014; Gopalakrishna 2016), and rather than following any formal process for downgrading evidence, we will fully describe the following concepts which constitute an assessment of strength of evidence.

  • Precision of the study estimates.

  • Heterogeneity in study findings.

  • Risk of bias.

  • Concerns about applicability.

  • Indirect comparisons between tests.

These issues cover the key domains of GRADE (GRADE 2013), except publication bias which cannot be assessed, and would allow the evidence to be included in a GRADE assessment should a guideline developer wish to do so.

Routine viral load testing algorithm. Abbreviations: ART: antiretroviral therapy.
Figures and Tables -
Figure 1

Routine viral load testing algorithm. Abbreviations: ART: antiretroviral therapy.