Evaluation of the diagnostic accuracy of Computer-Aided Detection of tuberculosis on Chest radiography among private sector patients in Pakistan

Zaidi, Syed Mohammad Asad; Habib, Shifa Salman; Van Ginneken, Bram; Ferrand, Rashida Abbas; Creswell, Jacob; Khowaja, Saira; Khan, Aamir

doi:10.1038/s41598-018-30810-1

Download PDF

Article
Open access
Published: 17 August 2018

Evaluation of the diagnostic accuracy of Computer-Aided Detection of tuberculosis on Chest radiography among private sector patients in Pakistan

Syed Mohammad Asad Zaidi¹,
Shifa Salman Habib¹,
Bram Van Ginneken²,
Rashida Abbas Ferrand³,
Jacob Creswell⁴,
Saira Khowaja⁵ &
…
Aamir Khan⁵

Scientific Reports volume 8, Article number: 12339 (2018) Cite this article

4206 Accesses
38 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The introduction of digital CXR with automated computer-aided interpretation, has given impetus to the role of CXR in TB screening, particularly in low resource, high-burden settings. The aim of this study was to evaluate the diagnostic accuracy of CAD4TB as a screening tool, implemented in the private sector in Karachi, Pakistan. This study analyzed retrospective data from CAD4TB and Xpert MTB/RIF testing carried out at two private TB treatment and diagnostic centers in Karachi. Sensitivity, specificity, potential Xperts saved, were computed and the receiver operator characteristic curves were constructed for four different models of CAD4TB. A total of 6,845 individuals with presumptive TB were enrolled in the study, 15.2% of which had MTB + ve result on Xpert. A high sensitivity (range 65.8–97.3%) and NPV (range 93.1–98.4%) were recorded for CAD4TB. The Area under the ROC curve (AUC) for CAD4TB was 0.79. CAD4TB with patient demographics (age and gender) gave an AUC of 0.83. CAD4TB offered high diagnostic accuracy. In low resource settings, CAD4TB, as a triage tool could minimize use of Xpert. Using CAD4TB in combination with age and gender data enhanced the performance of the software. Variations in demographic information generate different individual risk probabilities for the same CAD4TB scores.

An overview of clinical decision support systems: benefits, risks, and strategies for success

Article Open access 06 February 2020

Reed T. Sutton, David Pincock, … Karen I. Kroeker

Pneumonia

Article 08 April 2021

Antoni Torres, Catia Cilloniz, … Tom van der Poll

The global burden of lung cancer: current status and future trends

Article 21 July 2023

Amanda Leiter, Rajwanth R. Veluswamy & Juan P. Wisnivesky

Introduction

Tuberculosis (TB) remains a major cause of morbidity and mortality globally. In 2015, there were an estimated 10.4 million incident cases of TB and 1.8 million TB deaths¹. Active case finding programs are being increasingly utilized to reduce the case-detection gap^2,3.

In recent years, there has been growing interest in the use of chest x-rays (CXR) as a screening tool for TB within active and enhanced case finding programs⁴. Recent TB prevalence surveys have shown that CXR has higher sensitivity than verbal screening for identifying pulmonary TB^5,6,7. Previously, costs, limited access to x-ray facilities, maintenance of equipment, availability of trained personnel, poor specificity and inter-observer variation meant that the role of CXR within diagnostic algorithms was limited⁸.

The advent of digital chest radiography along with software capable of automated interpretation such as the “Computer Assisted Diagnosis for TB” (CAD4TB) software developed by the Diagnostic Image Analysis Group of the Radboud University Medical Centre has prompted reconsideration of the role of CXR in TB screening, particularly in low resource, high-burden settings⁹. Long-term use of digital radiography is cost-efficient compared to conventional radiography as it eliminates recurring costs related to reagent use and radiologists¹⁰. Currently, CAD4TB is the only scoring software that has been evaluated and is being implemented in programmatic settings. Encouraging findings on the diagnostic accuracy of CAD4TB has been reported from sub-Saharan Africa, and most recently from Bangladesh^{11,12,13,14,15}.

The need for improved approaches for screening has acquired greater pertinence following the introduction of sensitive rapid molecular diagnostics for TB such as Xpert MTB/RIF (Xpert) testing^16,17,18. However, the scale-up of Xpert testing is limited in resource-constrained countries by high costs of test cartridges^19,20,21,22.

An increasing body of evidence from high burden countries suggests that the use of digital CXR equipment and the automated reading of CXR with Computer Aided Detection (CAD), as a pre-screening tool, in conjunction with an expensive molecular test such as Xpert can improve case finding efforts²³.

The use of CAD4TB is still in development phase, and the World Health Organization (WHO) has not developed any formal guidelines or recommendations for its use due to limited evidence. The aim of this study was to evaluate the diagnostic accuracy of CAD4TB as a screening tool, in Karachi, Pakistan, a megacity with a high TB prevalence and a substantial burden of undiagnosed TB. Similar studies, reporting diagnostic accuracy using Xpert MTB/RIF as the reference standard have been reported from Zambia in 2013 and Bangladesh in 2017^13,14. Other studies from Zambia, Tanzania, South Africa and England have evaluated CAD4TB against the reference standard of culture^15,24,25,26. Our current study is another data point in the series of studies, carried out in Pakistan. In addition, we also investigated whether different models of CAD4TB implementation that included routinely collected programmatic data such as age and gender can potentially enhance the diagnostic accuracy of the software and yield of TB case-detection.

Methods

Study design and setting

Pakistan has the fifth highest burden of tuberculosis in the world and the third largest number of undiagnosed TB cases¹. Of the estimated 510,000 new TB cases, only 331 809 (65%) were notified to the National Tuberculosis Program (NTP) in 2015, making increased case-detection and notification a key priority²⁷. Currently, smear microscopy is predominantly used as a diagnostic test in a majority of facilities in Pakistan²⁸.

The study was conducted at two purpose built TB treatment and diagnostic centers, called “Sehatmand Zindagi” (Healthy Life) centers, in Karachi, Pakistan, from October 2013 to September 2015. These centers are located in low-middle income neighborhoods of Karachi, Nazimabad and Korangi. In addition to digital CXR equipment with CAD4TB, Xpert testing was carried out at both centers, with initiation of treatment among those diagnosed with TB.

The study was embedded within a broader programme implementing enhanced case finding, whereby community-health workers screened all individuals attending private health providers’ clinics, in the vicinity of the centers, using the WHO TB symptom screen²⁹, that is screening for the presence of either of the following: cough of any duration, fever, hemoptysis, night sweats, weight loss. Following a clinical evaluation by the health providers, those identified with presumptive TB were referred to the centers for further investigation. The target population for this study included individuals with presumptive TB referred by the private providers from the catchment area of the centres, as well as individuals with symptoms who self-referred for investigation for TB. All participants underwent a paid digital CXR (USD 3–5) and were requested to provide a sputum sample for free of cost Xpert testing.

Chest X-Ray scoring procedures

The CXRs were scored for abnormalities suggestive of pulmonary TB by a software system CAD4TB (version 3.07, Diagnostic Image Analysis Group, The Netherlands). CAD4TB was developed utilizing machine learning methods and was trained using labeled samples to differentiate between normal and abnormal x-ray images. The software has two abnormality detection systems that is textural abnormality and shape abnormality systems, which analyze the abnormalities in the unobscured lung fields that have been segmented automatically. The software then uses outputs from its detection systems as image descriptive features to train a k-NN classifier to compute a cumulative abnormality score (Range 0–95) for each CXR^13,30. A higher score is indicative of more serious abnormality suggestive of TB. A CAD4TB threshold score of 50 was used for this population determined using previously collected CXR data in a similar population. All individuals with high CAD4TB scores (50 or greater) were referred back to their consulting physicians for further clinical evaluation.

Data management and analysis

All individuals attending the TB centers were registered online using an open-source platform (Open MRS), by allocation of a unique patient ID, against which baseline information and history of presenting symptoms were recorded. Distribution of CAD4TB scores was compared for various patient characteristics such as age, gender, symptoms and Xpert result. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated for each of the TB symptoms using Xpert result as the standard. Univariate and multivariate associations of CAD4TB score, age, gender and symptoms (as explanatory variables) with TB infection (defined as a positive Xpert result) were computed. Logistic regressions were performed with MTB detection as the outcome variable and CAD4TB score, age and gender as the explanatory variables (Model 1 and 3). Adjusted analyses were subsequently performed through backward step-wise multivariate logistic regression using Akaike’s Information Criteria (AIC) to select the final, parsimonious model where symptoms where included as predictors of TB (Model 2 and 4). The AIC is an estimator that provides the relative quality of various statistical models and allowed for the selection of the most suitable set of predictor variables for the final model. Inclusion of the full set of symptoms screened was selected through the AIC for Models 2 and 4. Receiver Operator Characteristic (ROC) curves were constructed for four prediction models for TB, namely: Model 1 (CAD4TB score only), Model 2: (CAD4TB score, symptoms), Model 3: (CAD4TB score, Age, Gender) and Model 4 (CAD4TB score, age, gender, symptoms). Area-Under the Curve (AUC) statistics were obtained for each ROC curve and confidence intervals were calculated to investigate statistical differences in discriminatory accuracy of the prediction models. Sensitivity, Specificity, PPV and NPV for CAD4TB cutoff thresholds at scores of 50, 80 and 90 were obtained for the four prediction models by determining their predicted probabilities for TB detection. These cut offs were selected based on the CAD4TB score distribution of the study population, with score 50, 80 and 90 being at the 25^th, 50^th percentile 75^th percentile approximately.

A range of predicted probabilities for each CAD4TB score were obtained from the two models that included CAD4TB with demographic information (age and gender) and symptoms. Locally weighted regressions were carried out for the range of predicted probabilities for both models against CAD4TB scores and were used to determine the corresponding predicted probability for MTB detection at the four CAD4TB cut-offs. Predicted probabilities of TB were computed at each CAD4TB cutoff threshold. These estimated the risk of TB detection at each CAD4TB score. These were used to estimate the number of TB cases missed, Xpert cartridges reduced (due to reduced number of individuals with a CAD4TB score above the threshold) and yield (number of MTB positive results out of all those tested) on Xpert test for the four models. All data analysis was carried out using STATA Statistical Software (Stata Corporation Version 11. College Station, TX, USA).

Ethical Approval and informed consent

Ethical approval for the study obtained from the Institutional Review Board (IRB) of Interactive Research & Development that is registered with the Department of Health and Human Services, USA. The methods were carried out in accordance with the relevant guidelines and regulations. Verbal informed consent was obtained from the participants before carrying out screening activities under the project. De-identified data was provided for analysis to the study researchers, whereas all patient screening and diagnostic information was secured on a password-protected server.

Results

A total of 6,845 individuals with presumptive TB were enrolled in the study between October 2013 to September 2015. Out of these, 755 individuals, with invalid, error, no result were excluded from the analysis. The median age of participants was 38.9 (IQR 17.2) years and 3,018 (49.6%) were male. The majority of individuals included in the study reported symptoms of cough (87.5%) and fever (76.1%) (Table 1). Hemoptysis and nightsweats were reported in 13.2% and 30.5% of the study participants respectively. A total of 925 individuals enrolled in the study (15.2%) had MTB + ve results on Xpert (Fig. 1). The majority of (90.2%) people with a MTB + ve result on Xpert had a CAD4TB score >80. However, a high proportion of individuals (74.2%) that tested as MTB-ve also had scores >80 (Table 1).

Table 1 Baseline characteristics of individuals with presumptive TB by Computer-Aided Detection of TB (CAD4TB) scores, visiting TB diagnostic and treatment centers in Karachi, Pakistan (Q3- 2013 to Q2- 2015).

Full size table

Cough <2 weeks (OR 2.05, CI 1.51–2.81) was the strongest predictor of TB disease in the final adjusted models for MTB detection (Table 2). Increasing age (OR 0.96, 95% CI: 0.96–0.97) and female gender were inversely associated with a positive Xpert result (OR 0.79, 95% CI: 0.68–0.93).

Table 2 Predictors for TB detection among individuals tested using Xpert MTB/RIF, visiting TB diagnostic and treatment centers in Karachi, Pakistan (Q3- 2013 to Q2- 2015).

Full size table

A high sensitivity (range 65.8–95.3%) and NPV (range 93.1–98.4%) were recorded for CAD4TB (Table 3). For each model, increasing CAD4TB score thresholds, improved yield of TB case detection, with corresponding increase in specificity and decrease in sensitivity. Using the symptom screen alone, cough of <2 weeks and fever, had higher sensitivities (93.8% and 85.7% respectively) and lower specificities (14.5% and 25.6% respectively) compared to other symptoms (Fig. 2). All symptoms had high negative predictive values and low positive predictive values (Fig. 2).

Table 3 Sensitivity, Specificity, Positive predictive Value, Negative Predictive Value at different CAD4TB score thresholds among individuals tested using Xpert MTB/RIF, visiting TB diagnostic and treatment centers in Karachi, Pakistan (Q3–2013 to Q2–2015).

Full size table

For each of the models, at higher CAD4TB scores the number of Xpert tests carried out was reduced, however, it led to more patients being classified as false-negatives (TB cases missed). At a CAD4TB score of 90, a total of 3,539 Xpert tests will be saved using Model 1 (CAD4TB scores only), 4163 with Model 2 (CAD4TB scores and symptoms), 4,577 will be saved in Model 3 (CAD4TB scores, age and gender), and 4,465 in Model 4 (CAD4TB scores with age, gender and symptoms). The TB cases missed were lowest for a CAD4TB score of 50, 2.7%, 3.2%, 3.7% and 4.2% respectively for the four models. The MTB yield at a score of 90 using the four models was 30.8%, 35%, 40.2% and 39.3% respectively.

The Area under the ROC curve (AUC) for the model with only CAD4TB scores as predictor for MTB detection (Model 1) was 0.79 (95% CI: 0.78–0.81) (Fig. 3) and for Model 2 using CAD4TB scores and symptoms was 0.81. Inclusion of patient demographics (age and gender) to CAD4TB scores (Model 3) increased the AUC to 0.83 (95% CI: 0.82–0.85). A combined model of CAD4TB scores, symptoms, age and gender (Model 4) further increased the AUC to 0.84 (95% CI: 0.82–0.85), however this was not significantly different compared to Model 3.

Table 4 describes a sample of the predicted probabilities for various combination of age and gender for the same selected CAD4TB scores.

Table 4 Sample of probabilities and risk for TB from a prediction model utilizing Computer Aided Detection for TB (CAD4TB) and demographic data from individuals visiting TB diagnostic and treatment centers, in Karachi (Q3 2013–Q2 2015).

Full size table

Discussion

Our study evaluated the performance of CAD4TB software as a screening tool for the detection of tuberculosis in a low-resource, high burden, non-HIV setting, using Xpert as the reference test. This study is one of the largest such evaluations of CAD4TB from a programmatic setting. In our study, CAD4TB was able to correctly identify a high proportion of people who were diagnosed with TB on Xpert and hence could potentially reduce the number of expensive molecular tests needed to detect TB in our sample of patients.

While the use of Xpert in programmatic settings has expanded in recent years, the WHO has also recommended use of more cost effective diagnostic algorithms through screening tools such as CXR^25,29,31. Development of software that offer automated interpretation of CXRs, represents an important milestone that can link technological innovations to mass-screening programs for tuberculosis. The utilization of CAD4TB as a triage tool, to pre-screen individuals for Xpert cannot only, improve case-detection in screening programs but also possibly reduce program costs³².

The findings from this study indicate that CAD4TB offers high diagnostic accuracy. CAD4TB scoring can be utilized to triage individuals for Xpert testing as individuals with a low CAD4TB score had a low probability of being tested positive for TB. In resource constrained settings such as Pakistan, with limited funds to support Xpert testing for all people with presumptive TB, using a triage tool like CAD4TB could promote more rational use of Xpert by minimizing the number of cartridges used. This is also relevant for facilities where an onsite radiologist may not always be available to evaluate the CXR.f It is important to note that the savings offered through reduced Xpert tests need to be offset with the cost of acquiring and maintaining digital X-ray systems. However, a detailed discussion on the costing and policy implications for mass-screening using CXR is beyond the scope of this study. High sensitivity (range 85–97.3%) and NPV (range 96.1–98.4%) were recorded for CAD4TB at the score cut-offs utilized in the analysis, which is similar to what has been reported for CXR in other study settings^18,33,34. The relatively lower specificity (range 30.3–65.7%) and PPV (20–30.8%) were also consistent with findings from another study evaluating CAD4TB¹³.

A high AUC (0.79) was recorded from the model using CAD4TB alone as a screening tool (Model 1). Other studies from Zambia and Bangladesh that also used Xpert as the reference test reported AUCs of 0.71 and 0.74 respectively^13,14. Studies from Africa, using culture as the reference test reported AUC in the range 0.71–0.84³⁵. Our results therefore support investigations elsewhere suggesting that CAD4TB performs well in detecting radiological abnormalities^11,12,13,14. To date, the highest AUC has been reported with the version 3.07 of CAD4TB (compared to older versions)³⁵. With newer versions available and being increasingly utilized by programmes, it is expected that a superior performance of CAD4TB software will be found in future evaluations using newer versions, with improved machine learning capacity. While the combined use of CAD4TB and symptoms has been evaluated in a previous study¹², this is one of the first studies that have evaluated CAD4TB in combination with symptoms as well as demographic information (age and gender). Using CAD4TB in combination with demographic data enhanced the performance of the software, generating a higher AUC (0.83), while such information such as age and gender are routinely captured in screening programmes. However, including clinical symptoms to the model with demographics and CAD4TB did not significantly increase accuracy as was hypothesized by a previous study¹³. Another study from South Africa, reported a superior performance of a combination framework using both CAD4TB scores and symptoms (AUC 0.84)¹². Symptoms may not have contributed to improved performance in our setting as the study population included individuals that were referred for investigations (including self-referrals). This may have led to pre-screening of individuals thereby limiting the added discrimination offered by symptoms. Addition of symptoms improved specificity but decreased sensitivity as a lower number of individuals would have been screened positive under Model 4, and a larger number of TB cases were missed. In order to obtain a precise estimate of the AUC and to detect differences in the AUC between the models, a large sample size was included in the study. Since the data was obtained from a programmatic setting rather than a controlled investigation, a higher proportion of MTB-ve individuals were enrolled reflecting the prevalence of the disease in this population.

The increased diagnostic accuracy offered through demographic data can be utilized to further enhance the yield for Xpert testing than through CAD4TB alone. In this study, we used the dataset to generate a range of predicted probabilities for TB detection using a combination of CAD4TB scores, age and gender, like those shown in Table 4, that can be used to devise risk categories for patients identified through screening, further refining the triage process. Our study demonstrates that for the same CAD4TB scores, variations in demographic information such as age and gender can generate different individual risk probabilities. For example, at a CAD4TB score of 80, a male aged 56 years may have a low probability (5.1%) of being identified as MTB + ve on Xpert compared to a female aged 22 years who may have a higher probability (19.8%) (Table 4). Individualized risk scores could, therefore, assist frontline healthcare workers make informed decisions about whom to test. Sputum samples for Xpert testing may be collected for those with high risk for TB, and repeat tests or clinical evaluations may be carried out for those with medium to high risk, that can potentially save Xpert cartridges, improve testing yields and make programs more cost-effective. In addition to demographic data, routinely collected programmatic information such as history of TB contact, diabetes status and smoking history can be further utilized by future programs to create personalized risk scores. It must be noted that symptoms, while not offering improved accuracy in this study, may be useful in community-settings in active case finding programs where a large number of asymptomatic individuals are also among those screened and may further help improve yield on Xpert.

Our study findings also demonstrate that for increasing CAD4TB score thresholds, the sensitivity decreased, with corresponding increase in specificity, resulting in more TB cases but providing a higher yield (Table 3). Similar findings have been reported from a study in South Africa where 11% of TB cases would have been missed using a threshold score that would have triaged 40% of suspects for Xpert testing²⁵. However, individualized risk assessment, may diminish the need to set CAD4TB thresholds for programs broadly with greater reliance on testing based on personalized assessment.

An additional benefit of utilizing digital X-rays is increased capacity for clinical diagnosis of TB. Images can be archived online using cloud-based software allowing radiologists or clinical officers at TB facilities high quality images for diagnostic evaluation. In addition, mass-screening programs with X-rays are more likely to generate community interest and support mobilization than conventional screening camps with health workers. However, additional operational considerations continue to be relevant regardless of the modality of screening used. Improvements in processes such as health communication activities to promote screening among asymptomatic individuals, adequate resources for sputum induction, increased diagnostic capacity for testing, additional clinical staff for examining bacteriologically negative cases and engineers for providing equipment and software maintenance, will all be required to make screening and community referrals more effective. Since CAD4TB does not differentiate CXR abnormalities that may be observed in other conditions, such as pneumonia, lung cancer, etc., a significant number of people without TB are likely to be referred for diagnostic testing¹⁴. Algorithms and pathways to care will need to be developed for managing the diagnostic workup and treatment for these individuals. This is especially pertinent for developing countries with donor supported TB programs as diagnostics and treatment for other pulmonary pathologies are not funded.

Our study has certain limitations. The major limitation was that Xpert, and not mycobacterial culture was used as the reference standard, whereby Xpert negative, culture positive TB cases may have been missed. Individuals that were unable to expectorate sputum and cases with invalid or error results on xpert (for which additional sputum samples could not be obtained to re-run the test), were excluded from the study. These factors may have decreased the number of patients classified as MTB + ve and affected the accuracy of the results. An evaluation of the performance of CAD4TB compared with human readers was beyond the scope of this analysis as this has been conducted extensively in a number of studies. These evaluations utilized a combination of readings by clinical officers and radiologists and the performance of CAD4TB was found to have been comparable to those of human readers and also has the potential to reduce inter-reader and intra-reader variability and detection errors^11,34,35,36. While these early studies have demonstrated the effectiveness of CAD4TB in place of medical staff, further studies such as ours that utilize a biological reference can further support the use of CAD4TB in screening programs. Finally, the external validity of our study may be limited for active-case finding programs as the participant enrollment was carried out at a facility-based setting, and the results may not be generalizable to the community setting where a large number of asymptomatic people with TB may also be present. We therefore recommend further studies to evaluate CAD4TB in the community setting such as through mobile X-ray units.

Conclusion

This study described the first use of CXRs supported with computer-aided detection as part of enhanced case-finding intervention in the private sector in Pakistan. It demonstrated CAD4TB has the potential to be used as a triage tool to carry out screening of symptomatic individuals who could be excluded from further testing to make screening programs more cost effective by saving the number of Xpert tests. With the large scale roll-outs of Xpert and CAD4TB in local programmatic settings, its use within different case finding approaches should be evaluated and compared. A follow-up study comparing different versions of CAD4TB is also recommended. Screening algorithms need to be tailored to local contexts taking into account priorities for increased case-detection and resources required for testing additional individuals with presumptive TB.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request

References

World Health Organization. Global Tuberculosis Report 2016 (2016). Available at: http://apps.who.int/iris/bitstream/10665/250441/1/9789241565394-eng.pdf (Accessed: 1st June, 2017).
Khan, A. J. et al. Engaging the private sector to increase tuberculosis case detection: an impact evaluation study. Lancet Infect Dis. 12.8, 608–616 (2012).
Article Google Scholar
Fatima, R. et al. Success of active tuberculosis case detection among high-risk groups in urban slums in Pakistan. Int J Tuberc Lung Dis. 18.9, 1099–1104 (2014).
Article Google Scholar
Kranzer, K. et al. The benefits to communities and individuals of screening for active tuberculosis disease: a systematic review [State of the art series. Case finding/screening. Number 2 in the series]. Int J Tuberc Lung Dis. 17.4, 432–446 (2013).
Article Google Scholar
Den Boon, S. et al. An evaluation of symptom and chest radiographic screening in tuberculosis prevalence surveys. Int J Tuberc Lung Dis. 10.8, 876–882 (2006).
Google Scholar
Qadeer, E. et al. Population based national tuberculosis prevalence survey among adults (>15 years) in Pakistan, 2010–2011. PloS one. 11.2, e0148293 (2016).
Article CAS Google Scholar
Ellis, S. M. & Flower, C. The wHo manual of diagnostic imaging: radiographic anatomy and interpretation of the chest and the pulmonary system. World Health Organization; (2006).
Onozaki, I. et al. National tuberculosis prevalence surveys in Asia, 1990–2012: an overview of results and lessons learned. Trop Med Int Health. 20.0, 1128–1145 (2015).
Article Google Scholar
Pinto, L. M. et al. Scoring systems using chest radiographic features for the diagnosis of pulmonary tuberculosis in adults: a systematic review. Eur Respir J. 42, 480–494 (2013).
Article ADS PubMed Google Scholar
Ghiasi, M., Pande, T. & Pai, M. Advances in tuberculosis diagnostics. Curr Trop Med Rep. 2.2, 54–61 (2015).
Article Google Scholar
Breuninger, M. et al. Diagnostic accuracy of computer-aided detection of pulmonary tuberculosis in chest radiographs: a validation study from sub-Saharan Africa. PLoS One 9.9, e106381 (2015).
ADS Google Scholar
Melendez, J. et al. An automated tuberculosis screening strategy combining X-ray-based computer-aided detection and clinical information. Sci Rep. 6, 25265 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Muyoyeta, M. et al. The sensitivity and specificity of using a computer aided diagnosis program for automatically scoring chest X-rays of presumptive TB patients compared with Xpert MTB/RIF in Lusaka and. PloS one 9.4, e93757 (2014).
Article ADS CAS Google Scholar
Rahman, M. T. et al. An evaluation of automated chest radiography reading software for tuberculosis screening among public-and private-sector patients. Eur Respir J. 49.5, 1602159 (2017).
Article Google Scholar
Melendez, J. et al. Automatic versus human reading of chest X-rays in the Zambia National Tuberculosis Prevalence Survey. Int J Tuberc Lung Dis. 21.8, 880–886 (2017).
Article Google Scholar
Durovni, B. et al. Correction: Impact of Replacing Smear Microscopy with Xpert MTB/RIF for Diagnosing Tuberculosis in Brazil: A Stepped-Wedge Cluster-Randomized Trial. PLoS Med. 12.12, e1001928 (2015).
Article Google Scholar
Mbonze, N. B. et al. Xpert® MTB/RIF for smear-negative presumptive TB: impact on case notification in DR Congo. Int J Tuberc Lung Dis. 20.2, 240–246 (2016).
Article Google Scholar
Theron, G. et al. Evaluation of the Xpert MTB/RIF assay for the diagnosis of pulmonary tuberculosis in a high HIV prevalence setting. Am J Respir Crit Care Medicine. 184.1, 132–140 (2011).
Article Google Scholar
Creswell, J. et al. Results from early programmatic implementation of Xpert MTB/RIF testing in nine countries. BMC Infect. Dis. 14.1, 2 (2014).
Article Google Scholar
World Health Organization. Pakistan Country Profile. Available at https://extranet.who.int/sree/Reports?op=Replet&name=/WHO_HQ_Reports/G2/PROD/EXT/TBCountryProfile&ISO2=PK&outtype=pdf (Accessed: 21st July, 2017).
Pai, M. & Palamountain, K. M. New tuberculosis technologies: challenges for retooling and scale-up. Int J Tuberc Lung Dis 16, 1281–1290 (2012).
Article PubMed CAS Google Scholar
Theron, G. et al. Do adjunct tuberculosis tests, when combined with Xpert MTB/RIF, improve accuracy and the cost of diagnosis in a resource-poor setting?. Eur Respir J 4, 161–168 (2012).
Article Google Scholar
van’t Hoog, A. H., Onozaki, I. & Lonnroth, K. Choosing algorithms for TB screening: a modelling study to compare yield, predictive value and diagnostic burden. BMC Infect Dis 14(532), 11 (2014).
Google Scholar
Steiner, A. et al. Screening for pulmonary tuberculosis in a Tanzanian prison and computer-aided interpretation of chest X-rays. Public health action 5(4), 249–254 (2015).
Article PubMed PubMed Central CAS Google Scholar
Philipsen, R. H. H. M. et al. Automated chest-radiography as a triage for Xpert testing in resource-constrained settings: a prospective study of diagnostic accuracy and costs. Scientific Rep. 5, 12215 (2015).
Article ADS CAS Google Scholar
Hogeweg, L. et al. Automatic detection of tuberculosis in chest radiographs using a combination of Computer-aided detection of TB on digital CXR 1229 textural, focal, and shape abnormality analysis. IEEE Trans Med Imaging 34, 2429–2442 (2015).
Article PubMed Google Scholar
Muzaffar, R., Batool, S., Aziz, F., Naqvi, A. & Rizvi, A. Evaluation of the FAST PlaqueTB assay for direct detection of Mycobacterium tuberculosis in sputum specimens. Int J Tuberc Lung Dis. 6.7, 635–40 (2002).
Google Scholar
World Health Organization. Improving the diagnosis and treatment of smear-negative pulmonary and extrapulmonary tuberculosis among adults and adolescents: recommendations for HIV-prevalent and resource-constrained settings. Available at http://apps.who.int/iris/bitstream/10665/69463/1/WHO_HTM_TB_2007.379_eng.pdf (Accessed: 1st June, 2017).
World Health Organization. Systematic screening for active tuberculosis: principles and recommendations. Available at: https://books.google.com.pk/books?hl=en&lr=&id=g7EXDAAAQBAJ&oi=fnd&pg=PP1&dq=Systematic+screening+for+active+tuberculosis:+principles+and+recommendations.+&ots=4BnNSCcvlv&sig=N5mkmTU0Ke24X9C5Y9Dxw1V83_M#v=onepage&q=Systematic%20screening%20for%20active%20tuberculosis%3A%20principles%20and%20recommendations.&f=false.
van Ginneken, B., Stegmann, M. B. & Loog, M. Segmentation of anatomical structures in chest radiographs using supervised methods: a comparative study on a public database. Med Image Anal 10, 19–40 (2006).
Article PubMed Google Scholar
Qin, Z. Z. et al. How is Xpert MTB/RIF being implemented in 22 high tuberculosis burden countries? Eur Respir J. 45.2, 549–54 (2015).
Article Google Scholar
Story, A. et al. Active case finding for pulmonary tuberculosis using mobile digital chest radiography: an observational study. Int J Tuberc Lung Dis 16.11, 1461–1467 (2012).
Article Google Scholar
Van’t Hoog, A. H. et al. High sensitivity of chest radiograph reading by clinical officers in a tuberculosis prevalence survey. Int J Tuberc Lung Dis. 15.10, 1308–1314 (2011).
Article Google Scholar
Maduskar, P. et al. Detection of tuberculosis using digital chest radiography: automated reading vs. interpretation by clinical officers. Int J Tuberc Lung Dis. 17.12, 1613–20 (2013).
Article Google Scholar
Pande, T., Cohen, C., Pai, M. & Ahmad, K. F. Computer-aided detection of pulmonary tuberculosis on digital chest radiographs: a systematic review. Int J Tuberc Lung Dis. 20.9, 1226–1230 (2016).
Article Google Scholar
Jaeger, S., Karargyris, A., Antani, S. & Thoma, G. Detecting tuberculosis in radiographs using combined lung masks. IEEE Eng Med Biol Soc (EMBC), Annual International Conference of the IEEE, 4978–4981 (2012).

Download references

Acknowledgements

Authors would like to acknowledge the support provided by Dr. Sheikh Mohammad Ayub, Community Health Solutions for his contributions to the data management for this study. The implementation of the project was supported by the Stop TB Partnership’s TB REACH initiative. Co-author Rashida Abbas Ferrand is funded by the Wellcome Trust (Grant no 206316/Z/17/Z).

Author information

Authors and Affiliations

Community Health Solutions, Karachi, 74000, Pakistan
Syed Mohammad Asad Zaidi & Shifa Salman Habib
Radboud University Medical Center, 6525 GA, Nijmegen, Netherlands
Bram Van Ginneken
London School of Hygiene and Tropical Medicine, London, WC1E 7HT, United Kingdom
Rashida Abbas Ferrand
StopTB Partnership, 1214 Geneva, 1214, Vernier, Switzerland
Jacob Creswell
Interactive Research & Development, Karachi, 75190, Pakistan
Saira Khowaja & Aamir Khan

Authors

Syed Mohammad Asad Zaidi
View author publications
You can also search for this author in PubMed Google Scholar
Shifa Salman Habib
View author publications
You can also search for this author in PubMed Google Scholar
Bram Van Ginneken
View author publications
You can also search for this author in PubMed Google Scholar
Rashida Abbas Ferrand
View author publications
You can also search for this author in PubMed Google Scholar
Jacob Creswell
View author publications
You can also search for this author in PubMed Google Scholar
Saira Khowaja
View author publications
You can also search for this author in PubMed Google Scholar
Aamir Khan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.M.A.Z., B.V.G., J.C., S.K. and A.K. were involved in conception of the study, finalizing the study design. S.S.H. conducted the literature review and data collection. S.M.A.Z. and S.S.H. were involved in data analysis, data interpretation and drafting the manuscript. B.V.G., R.A.F., and J.C. reviewed the drafts critically and finalized the manuscript. All authors reviewed and approved the final version to be published.

Corresponding author

Correspondence to Shifa Salman Habib.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zaidi, S.M.A., Habib, S.S., Van Ginneken, B. et al. Evaluation of the diagnostic accuracy of Computer-Aided Detection of tuberculosis on Chest radiography among private sector patients in Pakistan. Sci Rep 8, 12339 (2018). https://doi.org/10.1038/s41598-018-30810-1

Download citation

Received: 02 November 2017
Accepted: 31 July 2018
Published: 17 August 2018
DOI: https://doi.org/10.1038/s41598-018-30810-1

This article is cited by

Comparing tuberculosis symptom screening to chest X-ray with artificial intelligence in an active case finding campaign in Northeast Nigeria
- Stephen John
- Suraj Abdulkarim
- Jacob Creswell
BMC Global and Public Health (2023)
Early user perspectives on using computer-aided detection software for interpreting chest X-ray images to enhance access and quality of care for persons with tuberculosis
- Jacob Creswell
- Luan Nguyen Quang Vo
- Andrew James Codlin
BMC Global and Public Health (2023)
A spatial analysis of TB cases and abnormal X-rays detected through active case-finding in Karachi, Pakistan
- Syed Mohammad Asad Zaidi
- Wafa Zehra Jamal
- Shifa Salman Habib
Scientific Reports (2023)
Computer-aided interpretation of chest radiography reveals the spectrum of tuberculosis in rural South Africa
- Jana Fehr
- Stefan Konigorski
- Zizile Sikhosana
npj Digital Medicine (2021)
Computer aided detection of tuberculosis on chest radiographs: An evaluation of the CAD4TB v6 system
- Keelin Murphy
- Shifa Salman Habib
- Bram van Ginneken
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.