Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter November 12, 2020

Comparison of machine learning methods for predicting viral failure: a case study using electronic health record data

  • Allan Kimaina , Jonathan Dick , Allison DeLong , Stavroula A. Chrysanthopoulou , Rami Kantor and Joseph W. Hogan ORCID logo EMAIL logo

Abstract

Background

Human immunodeficiency virus (HIV) viral failure occurs when antiretroviral therapy fails to suppress and sustain a person’s viral load count below 1,000 copies of viral ribonucleic acid per milliliter. For those newly diagnosed with HIV and living in a setting where healthcare resources are limited, such as a low- and middle-income country, the World Health Organization recommends viral load monitoring six months after initiation of antiretroviral treatment and yearly thereafter. Deviations from this schedule are made in cases where viral failure occurs or at the discretion of the clinician. Failure to detect viral failure in a timely fashion can lead to delayed administration of essential interventions. Clinical prediction models based on information available in the patient medical record are increasingly being developed and deployed for decision support in clinical medicine and public health. This raises the possibility that prediction models can be used to detect potential for viral failure in advance of viral measurements, particularly when those measurements occur infrequently.

Objective

Our goal is to use electronic health record data from a large HIV care program in Kenya to characterize and compare the predictive accuracy of several statistical machine learning methods for predicting viral failure at the first and second measurements following initiation of antiretroviral therapy. Predictive accuracy is measured in terms of sensitivity, specificity and area under the receiver-operator characteristic curve.

Methods

We trained and cross-validated 10 statistical machine learning models and algorithms on data from over 10,000 patients in the Academic Model Providing Access to Healthcare care program in western Kenya. These included parametric, non-parametric, ensemble, and Bayesian methods. The input variables included 50 items from the clinical record, hand picked in consultation with clinician experts. Predictive accuracy measures were calculated using 10-fold cross validation.

Results

Viral load failure rate is about 20% in this patient cohort at both the first and second measurements. Ensemble techniques generally outperformed other methods. For predicting viral failure at the first follow up measure, specificity was over 90% for these methods, but sensitivity was typically in the 50–60% range. Predictive accuracy was greater for the second follow up measure, with sensitivities over 80%. Super Learner, gradient boosting and Bayesian additive regression trees consistently outperformed other methods. For a viral failure rate of 20%, the positive predictive value for the top-performing methods is between 75 and 85%, while the negative predictive value is over 95%.

Conclusion

Evidence from this study suggests that machine learning techniques have potential to identify patients at risk for viral failure prior to their scheduled measurements. Ultimately, prognostic virologic assessment can help guide the administration of earlier targeted intervention such as enhanced drug resistance monitoring, rigorous adherence counseling, or appropriate next-line therapy switching. External validation studies should be used to confirm the results found here.


Corresponding author: Joseph W. Hogan, Brown University, Providence, Rhode Island, USA; and Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya, E-mail: .

Funding source: National Institute of Allergy and Infectious Diseases

Award Identifier / Grant number: R01 AI 108441

Award Identifier / Grant number: P30 AI 42853

Award Identifier / Grant number: K24 AI 134359

Funding source: Fogarty International Center

Award Identifier / Grant number: D43 TW010050

  1. Research funding: This work was funded by National Institute of Allergy and Infectious Diseases (R01 AI 108441, P30 AI 42853, K24 AI 134359) and Fogarty International Center (D43 TW010050).

  2. Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.

  3. Competing interests: Authors state no conflict of interest.

Appendix

A. Additional tables

Table 2:

Detailed summary statistics.

First virologic failureSecond virologic failure
Total (N=10,180)No (N=8,002)Yes (N=2,178)No (N=4,900)Yes (N=1,173)
Baseline age
 Median (Q1, Q3)39 (32, 47)39 (32, 47)39 (31, 46)40 (33, 49)40 (33, 48)
BMI
 Median (Q1, Q3)23 (21, 25)23 (21, 25)23 (21, 25)23 (21, 25)23 (21, 25)
 Missing4,413 (43.3%)3,278 (41.0%)1,135 (52.1%)1,949 (39.8%)523 (44.6%)
Days since baseline
 Median (Q1, Q3)240 (160, 370)200 (27, 340)
Days since first VL
 Median (Q1, Q3)230 (140, 370)220 (70, 350)
Prop days on ARVs
 Median (Q1, Q3)0 (0, 0.84)0 (0, 0.85)0 (0, 0.76)0.37 (0, 0.86)0 (0, 0.84)
Prop defaulted visits
 Median (Q1, Q3)0 (0, 0.10)0 (0, 0.10)0 (0, 0.11)0 (0, 0.10)0 (0, 0.091)
Num encounters
 Median (Q1, Q3)6.0 (2.0, 9.0)6.0 (3.0, 9.0)5.0 (1.0, 8.0)7.0 (3.0, 9.0)6.0 (2.0, 9.0)
Baseline WHO stage
 Median (Q1, Q3)1.0 (1.0, 3.0)1.0 (1.0, 2.0)2.0 (1.0, 3.0)1.0 (1.0, 3.0)2.0 (1.0, 3.0)
 Missing6,330 (62.2%)4,926 (61.6%)1,404 (64.5%)3,027 (61.8%)746 (63.6%)
Baseline ARV line
 First5,282 (52%)4,202 (53%)1,080 (50%)2,092 (43%)553 (47%)
 Second125 (1%)88 (1%)37 (2%)50 (1%)18 (2%)
 Third13 (0%)12 (0%)1 (0%)1 (0%)0 (0%)
 Missing4,760 (46.8%)3,700 (46.2%)1,060 (48.7%)2,757 (56.3%)602 (51.3%)
Gender
 Female6,810 (67%)5,405 (68%)1,405 (65%)3,237 (66%)741 (63%)
 Male3,370 (33%)2,597 (32%)773 (35%)1,663 (34%)432 (37%)
Baseline weight
 Median (Q1, Q3)62 (56,70)62 (56,70)61 (56,68)62 (57,70)63 (56,69)
 Missing4,414 (43.4%)3,279 (41.0%)1,135 (52.1%)1,950 (39.8%)523 (44.6%)
Height
 Median (Q1, Q3)170 (160, 170)170 (160, 170)170 (160, 170)170 (160, 170)170 (160, 170)
 Missing4,604 (45.2%)3,431 (42.9%)1,173 (53.9%)2,062 (42.1%)557 (47.5%)
HIV status disclosed774 (8%)618 (8%)156 (7%)167 (3%)57 (5%)
On contraceptive7,479 (73%)5,996 (75%)1,483 (68%)4,151 (85%)956 (82%)
On health cover3,067 (30%)2,428 (30%)639 (29%)1,757 (36%)455 (39%)
On TB IPT regimen4,123 (41%)3,387 (42%)736 (34%)2,503 (51%)526 (45%)
STI symptoms255 (3%)196 (2%)59 (3%)125 (3%)42 (4%)
TB symptoms973 (10%)681 (9%)292 (13%)452 (9%)169 (14%)
Referral ordered6,055 (59%)4,754 (59%)1,301 (60%)3,399 (69%)819 (70%)
Referred to PHDP7,777 (76%)6,300 (79%)1,477 (68%)4,395 (90%)982 (84%)
Needs family TX support1,603 (16%)1,277 (16%)326 (15%)909 (19%)179 (15%)
Been hospitalized220 (2%)159 (2%)61 (3%)119 (2%)33 (3%)
Alcohol consumer1,367 (13%)1,074 (13%)293 (13%)726 (15%)163 (14%)
Cigarette smoker463 (5%)360 (4%)103 (5%)232 (5%)57 (5%)
Chest X-ray357 (4%)261 (3%)96 (4%)220 (4%)60 (5%)
General exam: abnormal1,127 (11%)855 (11%)272 (12%)636 (13%)178 (15%)
Skin exam: abnormal598 (6%)420 (5%)178 (8%)312 (6%)96 (8%)
Lymph nodes exam: abnormal122 (1%)83 (1%)39 (2%)54 (1%)13 (1%)
Respiratory exam: abnormal266 (3%)203 (3%)63 (3%)139 (3%)29 (2%)
Abdominal exam: abnormal73 (1%)62 (1%)11 (1%)38 (1%)6 (1%)
Urogenital exam: abnormal1,938 (19%)1,504 (19%)434 (20%)1,071 (22%)244 (21%)
Underweight1,655 (16%)1,153 (14%)502 (23%)756 (15%)225 (19%)
High BP4,301 (42%)3,585 (45%)716 (33%)2,187 (45%)416 (35%)
Low BP130 (1%)88 (1%)42 (2%)41 (1%)22 (2%)
Oxygen saturation: abnormal208 (2%)153 (2%)55 (3%)94 (2%)24 (2%)
Fever156 (2%)107 (1%)49 (2%)59 (1%)35 (3%)
MTRH clinic989 (10%)742 (9%)247 (11%)521 (11%)104 (9%)
Second VL count
 Median (Q1, Q3)0 (0.460)0 (0.180)730 (0, 21,000)0 (0, 83)9,900 (2,600, 61,000)
 Missing4,107 (40.3%)3,262 (40.8%)845 (38.8%)0 (0%)0 (0%)
First VL count
 Median (Q1, Q3)0 (0.580)0 (0,60)16,000 (3,100, 95,000)0 (0.170)1,900 (0, 39,000)
 Missing0 (0%)0 (0%)0 (0%)0 (0%)0 (0%)
  1. TB, tuberculosis; ARV, antiretroviral; IPT, isoniazid preventive therapy; BP, blood pressure; PHDP, positive health, dignity, and prevention; TX, treatment; MTRH, Moi Teaching and Referral Hospital. The definition of each variable can be found in Appendix Table 3.

Table 3:

Variable description.

VariableTypeDescription
Baseline ageIntegerAge at enrollment
Baseline arv lineCategoricalAntiretroviral (ARV) therapy line at enrollment
Baseline WHO stageOrdinalWorld Health Organization (WHO) staging at enrollment into care
Abnormal oxy satBinaryPresence of abnormal oxygen saturation: <90
Been hospitalizedBinaryIndicator for patient hospitalized recently: <1 month
FeverBinaryBaseline indicator for fever: temperature > 38 °C
High BPBinarySystolic > 130 , or diastolic > 80, or isolated systolic hypertension (age > 55)
Low BPBinaryLow blood pressure (BP): >90 mmHg for systolic or >60 mmHg for diastolic
Referred to PHDPBinaryReferral to Positive health, dignity, and prevention (PHDP)
Referral orderedBinaryIndicator for any clinical referral e.g., nutrition clinic
STI symptoms statusBinaryPresence of sexually transmitted infection (STI) symptoms
TB symptomsBinaryPresence of tuberculosis (TB) symptoms
Abdominal clinical examBinaryAbnormal abdominal physical clinical examination
Chest X-rayBinaryPresence of abnormal chest X-ray radiograph
General clinical examBinaryBaseline indicator for abnormal general physical examination
Lymph nodes clinical examBinaryPhysical assessment for positive swollen lymph nodes
MaleBinaryBinary indicator for male gender: 1 is male else 0
vl2 failureBinaryBinary indicator for second virologic failure
On contraceptiveBinaryPatient is on contraceptive (condom, pills, etc.)
On health coverBinaryPatient has health cover/insurance
On TB IPT regimenBinaryPatient is on/started TB isoniazid preventive therapy (IPT)
Respiratory clinical examBinaryBaseline physical examination/reported abnormal respiratory organs
Skin clinical examBinaryBaseline physical examination/reported abnormal skin lesion
HIV status disclosedBinaryHIV disclosure status at baseline
UnderweightBinaryBody mass index (BMI) < 18.5
Urogenital clinical examBinaryBaseline physical examination for abnormal urinary and genital organs
Needs family TX supportBinaryPatients needs family treatment (TX) support
Num encountersNumericNumber of HIV clinical encounters before first virologic measurements
BMINumericBody mass index – measure of body fat using height and weight
Prop poor adherenceNumericProportion of clinical encounters with reported poor adherence
Prop days on arvsNumericProportion of days on ARVs medication
Days since first VLNumericNumber of days since first VL
VL count 2 logNumericLogarithmic transformation of second VL count: log(1 + vl count)
Prop defaulted visitsNumericProportion of missed appointments
VL count 1 logNumericLogarithmic transformation of first VL count: log(1 + vl count)
Cig smokerBinaryPatient reported cigarette smoking
Alcohol consumerBinaryPatient reported alcohol consumption
MTRH clinicBinaryBinary indicator whether patient’s primary clinic is Moi teaching and referral Hospital
VL1 failureBinaryBinary indicator for first virologic failure

References

Aggarwal, C. C. 2014. Data Classification: Algorithms and Applications, 1st ed. Chapman & Hall/CRC.10.1201/b17320Search in Google Scholar

Akanbi, M. O., A. N. Ocheke, P. A. Agaba, C. A. Daniyam, I. A. Emmanuel, E. N. Okeke, and O. U. Christiana. 2012. “Use of Electronic Health Records in Sub-Saharan Africa: Progress and Challenges.” Journal of Medicine in the Tropics 14: 1–6. Also available at: https://www.bmj.com/content/338/bmj.b375.Search in Google Scholar

Balzer, L. B., V. H. Diane, M. R. Kamya, C. Gabriel, E. D. Charlebois, T. D. Clark, C. A. Koss, D. Kwarisiima, A. James, N. Sang, J. Kabami, M. Atukunda, V. Jain, S. C. Carol, R. C. Craig, A. B. Elizabeth, M. van der Laan, and M. L. Petersen. 2019. “Machine Learning to Identify Persons at High-Risk of Human Immunodeficiency Virus Acquisition in Rural Kenya and Uganda.” Clinical Infectious Diseases, https://doi.org/10.1093/cid/ciz1096.Search in Google Scholar

Breiman, L. 1996. “Bagging Predictors.” Machine Learning 24 (2): 123–40, https://doi.org/10.1023/A:1018054314350.10.1007/BF00058655Search in Google Scholar

Chen, T., and C. Guestrin. 2016. “XGBoost: A Scalable Tree Boosting System.” In KDD ’16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–94.10.1145/2939672.2939785Search in Google Scholar

Chipman, H. A., E. I. George, and R. E. McCulloch. 2010. “BART: Bayesian Additive Regression Trees.” Annals of Appled Statistics 4 (1): 266–98, https://doi.org/10.1214/09-AOAS285.Search in Google Scholar

Feller, D. J., J. Zucker, M. T. Yin, P. Gordon, and N. Elhadad. 2018. “Using Clinical Notes and Natural Language Processing for Automated HIV Risk Assessment.” Journal of Acquired Immune Deficiency Syndromes 77 (2): 160–6https://doi.org/10.1097/qai.0000000000001580.10.1097/QAI.0000000000001580Search in Google Scholar

Friedman, J. H. 2001. “Greedy Function Approximation: A Gradient Boosting Machine.” Annals of Statistics 29 (5): 1189–232, https://doi.org/10.1214/aos/1013203451.Search in Google Scholar

Hamers, R. L., C. Kityo, M. A. L. Joep, T. F. R. de Wit, and M. Peter. 2012. “Global Threat from Drug Resistant HIV in Sub-Saharan Africa.” British Medical Journal 344, https://doi.org/10.1136/bmj.e4159.Search in Google Scholar

Hastie, T., R. Tibshirani, and J. Friedman. 2009. The Elements of Statistical Learning, 2nd ed. Springer.10.1007/978-0-387-84858-7Search in Google Scholar

Krakower, D. S., S. Gruber, K. Hsu, J. T. Menchaca, J. C. Maro, B. A. Kruskal, B. W. Ira, K. H. Mayer, and M. Klompas. 2019. “Development and Validation of an Automated HIV Prediction Algorithm to Identify Candidates for Pre-exposure Prophylaxis: A Modelling Study.” The Lancet HIV 6 (10): e696-704, https://doi.org/10.1016/s2352-3018(19)30139-0.Search in Google Scholar

Lundberg, S. M., and S.-I. Lee. 2017. “A Unified Approach to Interpreting Model Predictions.” In Advances in Neural Information Processing Systems, Vol. 30, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, 4765–74. Curran Associates, Inc. Also available at: http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf.Search in Google Scholar

Marcus, J. L., L. B. Hurley, S. K. Douglas, S. Alexeeff, M. J. Silverberg, and J. E. Volk. 2019. “Use of Electronic Health Record Data and Machine Learning to Identify Candidates for HIV Pre-exposure Prophylaxis: A Modelling Study.” The Lancet HIV 6 (10): e688-95, https://doi.org/10.1016/s2352-3018(19)30137-7.Search in Google Scholar

McMahon, J. H., J. H. Elliott, S. Bertagnolio, R. Kubiak, and M. R. Jordan. 2013. “Viral Suppression after 12 Months of Antiretroviral Therapy in Low- and Middle-Income Countries: A Systematic Review.” Bulletin of the World Health Organization 91 (5): 377E–85E, https://doi.org/10.2471/blt.12.112946.Search in Google Scholar

Miller, W. C., K. A. Powers, M. K. Smith, and M. S. Cohen. 2013. “Community Viral Load as a Measure for Assessment of HIV Treatment as Prevention.” The Lancet Infectious Diseases 13 (5): 459–64, https://doi.org/10.1016/s1473-3099(12)70314-6.Search in Google Scholar

Moons, K. G. M., P. Royston, Y. Vergouwe, D. E. Grobbee, and D. G. Altman. 2009. “Prognosis and Prognostic Research: What, Why, and How?” British Medical Journal 338: b375, https://doi.org/10.1136/bmj.b375.Search in Google Scholar PubMed

Naimi, A. I., and L. B. Balzer. 2018. “Stacked Generalization: An Introduction to Super Learning.” European Journal of Epidemiology 33: 459–64, https://doi.org/10.1007/s10654-018-0390-z.Search in Google Scholar PubMed PubMed Central

Ndembi, N., R. L. Goodall, D. T. Dunn, A. McCormick, A. Burke, F. Lyagoba, P. Munderi, P. Katundu, C. Kityo, V. Robertson, D. L. Yirrell, A. S. Walker, D. M. Gibb, Charles F. Gilks, P. Kaleebu, and D. Pillay, Development of Antiretroviral Treatment. 2010. “In Africa Virology Group and Trial Team. Viral Rebound and Emergence of Drug Resistance in the Absence of Viral Load Testing: A Randomized Comparison between Zidovudine-Lamivudine Plus Nevirapine and Zidovudine-Lamivudine Plus Abacavir.” The Journal of Infectious Diseases 201 (1): 106–13, https://doi.org/10.1086/648590.Search in Google Scholar PubMed

Nguyen, P., T. Tran, N. Wickramasinghe, and S. Venkatesh. 2016. “A Convolutional Net for Medical Records.” IEEE Journal of Biomedical and Health Informatics 21 (1): 22–30, https://doi.org/10.1109/jbhi.2016.2633963.Search in Google Scholar PubMed

Petersen, M. L., E. LeDell, J. Schwab, V. Sarovar, R. Gross, N. Reynolds, E. H. Jessica, K. Goggin, C. Golin, J. Arnsten, M. I. Rosen, R. H. Remien, D. Etoori, B. W. Ira, J. M. Simoni, J. A. Erlen, M. J. van der Laan, H. Liu, and D. R. Bangsberg. 2015. “Super Learner Analysis of Electronic Adherence Data Improves Viral Prediction and May Provide Strategies for Selective HIV RNA Monitoring.” Journal of Acquired Immune Deficiency Syndromes 69 (1): 109–18, https://doi.org/10.1097/qai.0000000000000548.Search in Google Scholar

Raghupathi, W., and V. Raghupathi. 2014. “Big Data Analytics in Healthcare: Promise and Potential.” Health Information Science and Systems 2 (3), https://doi.org/10.1186/2047-2501-2-3.Search in Google Scholar PubMed PubMed Central

Ramesh, A. N., C. Kambhampati, J. R. T. Monson, and P. J. Drew. 2004. “Artificial Intelligence in Medicine.” Annals of the Royal College of Surgeons of England 86 (5): 334–8, https://doi.org/10.1308/147870804290.Search in Google Scholar PubMed PubMed Central

UNAIDS. 2018. Fact Sheet - World AIDS Day 2018: 2017 Global HIV Statistics. Also available at: http://www.unaids.org/sites/default/files/media_asset/UNAIDS_FactSheet_en.pdf.Search in Google Scholar

van der Laan, M. 2007. “Super Learner.” Statistical Applications in Genetics and Molecular Biology 6: 25, https://doi.org/10.2202/1544-6115.1309.Search in Google Scholar PubMed

Xu, Y., T. Liu, M. J. Daniels, R. Kantor, A. Mwangi, and J. W. Hogan. 2019. “Classification Using Ensemble Learning under Weighted Misclassification Loss.” Statistics in Medicine 38: 2002–12, https://doi.org/10.1002/sim.8082.Search in Google Scholar PubMed PubMed Central

Zheng, W., L. Balzer, M. van der Laan, and M. Petersen. 2017. “Constrained Binary Classification Using Ensemble Learning: an Application to Cost-Efficient Targeted PrEP Strategies.” Statistics in Medicine 37 (2): 261–79, https://doi.org/10.1002/sim.7296.Search in Google Scholar PubMed PubMed Central

Zhou, Z.-H. 2015. “Ensemble Learning.” In Encyclopedia of Biometrics, edited by S. Z. Li, and A. K. Jain, 411–16. Boston, MA: Springer US.10.1007/978-1-4899-7488-4_293Search in Google Scholar

Received: 2019-10-18
Accepted: 2020-10-16
Published Online: 2020-11-12

© 2020 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 12.6.2024 from https://www.degruyter.com/document/doi/10.1515/scid-2019-0017/html
Scroll to top button