Machine learning approach for differentiating cytomegalovirus esophagitis from herpes simplex virus esophagitis

Lee, Jung Su; Yun, Jihye; Ham, Sungwon; Park, Hyunjung; Lee, Hyunsu; Kim, Jeongseok; Byeon, Jeong-Sik; Jung, Hwoon-Yong; Kim, Namkug; Kim, Do Hoon

doi:10.1038/s41598-020-78556-z

Download PDF

Article
Open access
Published: 11 February 2021

Machine learning approach for differentiating cytomegalovirus esophagitis from herpes simplex virus esophagitis

Jung Su Lee^1,2^na1,
Jihye Yun³^na1,
Sungwon Ham⁴,
Hyunjung Park⁴,
Hyunsu Lee⁵,
Jeongseok Kim⁶,
Jeong-Sik Byeon¹,
Hwoon-Yong Jung¹,
Namkug Kim^3,4 &
…
Do Hoon Kim¹

Scientific Reports volume 11, Article number: 3672 (2021) Cite this article

4738 Accesses
8 Citations
3 Altmetric
Metrics details

Subjects

Abstract

The endoscopic features between herpes simplex virus (HSV) and cytomegalovirus (CMV) esophagitis overlap significantly, and hence the differential diagnosis between HSV and CMV esophagitis is sometimes difficult. Therefore, we developed a machine-learning-based classifier to discriminate between CMV and HSV esophagitis. We analyzed 87 patients with HSV esophagitis and 63 patients with CMV esophagitis and developed a machine-learning-based artificial intelligence (AI) system using a total of 666 endoscopic images with HSV esophagitis and 416 endoscopic images with CMV esophagitis. In the five repeated five-fold cross-validations based on the hue–saturation–brightness color model, logistic regression with a least absolute shrinkage and selection operation showed the best performance (sensitivity, specificity, positive predictive value, negative predictive value, accuracy, and area under the receiver operating characteristic curve: 100%, 100%, 100%, 100%, 100%, and 1.0, respectively). Previous history of transplantation was included in classifiers as a clinical factor; the lower the performance of these classifiers, the greater the effect of including this clinical factor. Our machine-learning-based AI system for differential diagnosis between HSV and CMV esophagitis showed high accuracy, which could help clinicians with diagnoses.

An artificial intelligence algorithm is highly accurate for detecting endoscopic features of eosinophilic esophagitis

Article Open access 01 July 2022

Christoph Römmele, Robert Mendel, … Alanna Ebigbo

Application of artificial intelligence in endoscopic image analysis for the diagnosis of a gastric cancer pathogen-Helicobacter pylori infection

Article Open access 17 August 2023

Chih-Hsueh Lin, Ping-I Hsu, … Tsair-Fwu Lee

Prediction of future gastric cancer risk using a machine learning algorithm and comprehensive medical check-up data: A case-control study

Article Open access 27 August 2019

Junichi Taninaga, Yu Nishiyama, … Toshio Naito

Introduction

Viral esophagitis is most commonly caused by herpes simplex virus (HSV) and cytomegalovirus (CMV) in immunocompromised patients and occasionally in immunocompetent patients¹. The diagnosis of viral esophagitis is based on clinical history, endoscopic features, and histopathologic features. Clinically, the most common symptoms of HSV and CMV esophagitis are odynophagia, dysphagia, and chest pain^2,3. The most important risk factor of HSV and CMV esophagitis is an immunocompromised status, including human immunodeficiency virus infection, organ transplantation, or malignancies^4,5. Histopathology with specific immunohistochemical stains (IHC) or deoxyribonucleic acid (DNA) polymerase chain reaction (PCR) using tissues are required for definitive diagnosis of HSV and CMV esophagitis^4,6. However, since tissue-based diagnostic evaluation takes several days for a result, immunocompromised patients with poor general conditions that require rapid treatment after rapid diagnosis often undergo empirical treatment before histological diagnosis.

When considering empirical antiviral agents, particularly in immunocompromised patients, endoscopic features are important for differentiating between HSV and CMV esophagitis until a specific diagnosis is made. According to several studies, the endoscopic features of HSV esophagitis include typically multiple, small, discrete, shallow ulcers with bullae or vesicles; yellowish exudate; and coalescence. The involvement of the middle to distal esophagus is most common^3,7. The specific endoscopic features of CMV esophagitis are solitary, large, deep, punch-out, or demarcated serpiginous ulcers^2,4,8. However, the endoscopic features of CMV esophagitis are variable. CMV esophagitis commonly involves multiple ulcers varying in size in the middle to distal esophagus. The depths of CMV esophageal ulcers are more commonly shallow or intermediate than deep and healed-up⁹. The endoscopic features between HSV and CMV esophagitis significantly overlap^1,8. Therefore, the differential diagnosis between HSV and CMV esophagitis using endoscopic features can sometimes be confusing.

Recently, many studies have reported impressive performances of artificial intelligence (AI) systems for medical imaging^10,11. Using a large dataset, an AI system can compensate for the experience of experts and identify microstructures and quantitative pixel-level features which are undetectable by the human eye ¹². In gastrointestinal (GI) endoscopy, several studies have shown favorable performance for detecting and classifying GI neoplasms¹³. Also, AI algorithms for benign, chronic inflammatory disease with diffuse involvement, such as Helicobacter pylori gastritis, have reported high accuracy in diagnosis using endoscopic images^14,15. Nevertheless, a shortcoming of deep learning is that a large amount of data is needed to minimize overfitting and improve learning¹⁶. Therefore, image feature-based classifiers could be a better classification strategy for small datasets^17,18.

In this study, we aimed to develop a machine-learning-based AI system for differential diagnosis between HSV and CMV esophagitis using endoscopic images. The classification task can be greatly affected by the extraction and classification of different features. To capture better endoscopic features of HSV and CMV esophagitis, we manually annotated the regions of interest (ROIs). Subsequently, the image features were extracted from the annotated ROIs of the endoscopic color images, which were represented by the hue–saturation–brightness (HSB) color model. After channel-wise feature filtering based on each channel of color model, the final features were selected by a least absolute shrinkage and selection operation (LASSO), and then machine learning classifiers were trained. In order to achieve robust performance, ROI-based classifiers were designed instead of image-based classifiers, and image-based and patient-based accuracies were then obtained by ensembling the results of the ROIs.

Results

Baseline and endoscopic characteristics of patients

The clinical and endoscopic characteristics of the 150 patients are summarized in Table 1. Out of 150 patients, 87 were diagnosed with HSV esophagitis and 63 with CMV esophagitis. The median age was 61 years (interquartile range 51–70 years) and 119 patients (79.3%) were immunocompromised. There were no significant differences in age, sex, or comorbidities except for solid organ transplantation. Solid organ transplantation was significantly more common in patients with CMV esophagitis than in those with HSV esophagitis (36.5% vs. 12.6%, p < 0.001).

Table 1 Baseline and endoscopic characteristics of 150 patients with HSV and CMV esophagitis.

Full size table

The distribution of HSV and CMV esophagitis commonly involved two or more segments of the esophagus. In cases of esophagitis involving two or more segments, 53.1% (25/47) of patients with HSV esophagitis and 52% (13/25) of patients with CMV esophagitis had involvement of the middle to distal esophagus. Therefore, the middle and/or distal esophagus were the most involved regions in HSV and CMV esophagitis (94.3% and 95.2%, respectively).

The initial endoscopic diagnosis based on the morphologic findings at the time of endoscopy varied among HSV or CMV esophagitis, reflux esophagitis, and esophageal cancer. Compared with the definite diagnosis, only 57.5% (50/87) of HSV esophagitis cases and 46% (29/63) of CMV esophagitis cases were initially diagnosed by endoscopic features at the time of endoscopy. The overall diagnostic accuracy of endoscopists was 52.7% (79/150). There was no significant difference between the diagnostic accuracy of endoscopists (p = 0.166) for HSV and CMV esophagitis.

Development and performance of the AI system for differential diagnosis between HSV and CMV esophagitis

The classifiers were trained using five repeated five-fold cross-validations in a stratified manner over patients, and they evaluated per-ROI, per-image, and per-patient performances using datasets divided according to the patients. We obtained the image-based and patient-based accuracies from the designed ROI-based classifier through an averaged probability. The probabilities of all ROIs in one image or one patient were averaged and considered the representative probability of the image or patient, respectively. Using these representative probabilities, final diagnoses were made. Classifiers based on an HSB color model surpassed classifiers based on an RGB color model in all classification metrics (Tables 2, 3 and 4). In the case of the HSB color model with superior performance, per-patient accuracies were 100% in all models; therefore, it was difficult to compare the performances between models. For performance comparison between models, the per-image accuracies in the HSB color model were summarized as follows. Logistic regression with LASSO showed the best performance; the sensitivity, specificity, PPV, NPV, accuracy, and AUC were 100%, 100%, 100%, 100%, 100%, and 1.0, respectively. It is recommended to perform random forest classification with LASSO; the sensitivity, specificity, PPV, NPV, accuracy, and AUC were 99.8%, 99.4%, 99.1%, 99.8%, 99.6%, and 1.0, respectively, using LASSO. Previous history of transplantation was included in the features as a clinical factor, and the lower the performance of classifiers, the greater the effect of including this clinical factor. As a result of evaluating the differences in diagnostic performance between models using the Wilcoxon signed-rank test¹⁹, significant differences (p value < 0.05) were observed among three models (logistic regression with LASSO, random forest with LASSO, and random forest) in the case of the HSB color model, but no significant difference was noted in the case of the RGB color model (Supplementary Table S5).

Table 2 Diagnostic performance of logistic regression with LASSO for discriminating cytomegalovirus esophagitis from herpes simplex virus esophagitis.

Full size table

Table 3 Diagnostic performance of random forest with LASSO for discriminating cytomegalovirus esophagitis from herpes simplex virus esophagitis.

Full size table

Table 4 Diagnostic performance of random forest for discriminating cytomegalovirus esophagitis from herpes simplex virus esophagitis.

Full size table

Discussion

We established an AI system with good performance based on endoscopic images for differential diagnosis between HSV and CMV esophagitis. The AI system was trained and validated using 1082 endoscopic images from 150 patients. Our machine-learning-based AI system, which used logistic regression with LASSO for discriminating CMV esophagitis from HSV esophagitis, showed a sensitivity, specificity, PPV, NPV, accuracy, and AUC of 100%, 100%, 100%, 100%, 100%, and 1.0, respectively. To the best of our knowledge, this is the first AI system using endoscopic images with a clinical factor for differential diagnosis between HSV and CMV esophagitis.

Although histopathology with specific IHC stains is the gold standard for the diagnosis of HSV and CMV esophagitis, endoscopic features are important for empirical treatment prior to histopathologic diagnosis because tissue-based diagnostic evaluation takes several days¹. It is very important to start proper treatment as quickly as possible and within a few days, especially for immunocompromised patients. Several studies have reported endoscopic features for HSV or CMV esophagitis^2,7,9,20. However, these features significantly overlap in site involvement as they both feature mainly multiple small-sized and shallow ulcers¹. In our study, the overall diagnostic accuracy of endoscopic features was only 52.7%, which means that nearly 50% of patients may receive erroneous empirical treatment until histopathology results are obtained. The differential diagnosis between HSV and CMV esophagitis based on endoscopic features will be the most important prognostic parameter for immunocompromised patients, in whom rapid treatment can determine prognosis.

Recently, our group investigated the implications of using endoscopic findings for the diagnosis of HSV and CMV esophagitis²¹. The average diagnostic accuracy of eight highly experienced endoscopists was 74.3%, and about a quarter of the patients diagnosed as HSV or CMV esophagitis based on endoscopic features were misdiagnosed regardless of the endoscopists’ expertise. Therefore, we developed a predictive model based on the categorization of endoscopic features and history of transplantation with a high accuracy (92.6%) in discriminating CMV esophagitis from HSV esophagitis. Training through categorizing endoscopic features can help endoscopists make accurate diagnoses, but sufficient training is difficult because of the rarity of CMV and HSV esophagitis. Machine learning approaches using retrospective data can overcome dependency on experience and the rarity of the disease.

The classification task can be greatly affected by different feature extraction and classification methods. To capture better endoscopic features of HSV and CMV esophagitis, we manually annotated ROIs with the assistance of an expert endoscopist and then extracted image features using an HSB color model. The accuracy of the HSB color model was significantly better than that of the RGB color model, because the HSB color model is designed to approximate the way humans perceive and interpret color and could be a device-independent color representation format²². The robust performance was achieved by averaging the results of the ROI-based classifiers. In our study, the diagnostic accuracy of the developed classifier (logistic regression with LASSO) in discriminating CMV esophagitis from HSV esophagitis was 100%, which is better than that of the initial diagnoses by endoscopists (100% vs. 52.7%) as well as that of experienced endoscopists (100% vs. 74.3%) reported previously²¹. The developed AI system has potential for clinical application in differential diagnosis between HSV and CMV esophagitis.

Some methodological limitations of this study should be noted. First of all, our study design was retrospective in nature and had a small sample size. However, viral esophagitis is rare in immunocompetent patients and is an opportunistic disease in immunocompromised patients. Additionally, to the best of our knowledge, this study is the largest study of HSV and CMV esophagitis, respectively. The development of an AI system using images is needed for a large dataset of high-quality images. Therefore, considering the rarity of HSV and CMV esophagitis, our study enrolled the largest number of HSV and CMV esophagitis cases and developed an AI system for differential diagnosis between HSV and CMV esophagitis. Second, we did not perform comparisons between endoscopists and our AI system for validation. We previously reported differential diagnosis between HSV and CMV esophagitis using categorization of endoscopic features²¹. In that study, the diagnostic accuracy of endoscopists in randomly selected cases of esophagitis was 74.3% in the experienced group and 74.7% in the less experienced group. A highly experienced endoscopist categorized the endoscopic features and the diagnostic accuracy improved to 92.6%. Therefore, the categorization of endoscopic features is dependent on the experience of endoscopists. Our AI system can compensate for expert experience and can support less experienced endoscopists. Finally, ROI annotation is required for the developed AI system. We have already assigned ROIs with the help of an expert, and this dataset can be used for training an AI system for ROI annotation, enabling an end-to-end system.

In conclusion, our machine-learning-based AI system using logistic regression with LASSO for differential diagnosis between HSV and CMV esophagitis showed high accuracy. The improvement of the diagnostic accuracy of clinicians through this AI system will contribute to improving the prognosis of patients by providing rapid treatment based on a quick prediction.

Materials and methods

Patients and date collection

We retrospectively reviewed the medical records and endoscopic images of all patients diagnosed with HSV or CMV esophagitis between April 2008 and December 2016 at Asan Medical Center (Seoul, Korea). The diagnosis of HSV or CMV esophagitis was confirmed with clinical symptoms, endoscopic findings, and histopathologic review with IHC and/or PCR. Patients were excluded according to the following criteria: co-infection with HSV and CMV, final pathologic diagnosis of malignancy, recurrent infection, or missing information on endoscopic findings. The institutional review board of Asan Medical Center approved the study (IRB No. 2020-0495). Due to the retrospective study design, written informed consent was not obtained from participants. The IRB of our institution waived the need for informed consent based on the non-invasive and anonymized nature of this study. This study was conducted in accordance with institutional ethical guidelines and the Declaration of Helsinki.

Lesion segmentation and feature extraction

In order to extract imaging features to differentiate between the two types of esophagitis, one board-certified expert (more than 15 years of experience in endoscopy) reviewed the quality of the collected endoscopic images and manually annotated the regions of interest (ROIs). Cases of shaky images or lesions far away from the endoscope light source were excluded because the shapes of the lesions were not clearly visible. ROIs were drawn as close to the margins of the lesions as possible so as to not include the normal esophageal mucosa (Fig. 1).

The hue–saturation–brightness (HSB) color model was employed to extract image features from endoscopic color images. In color image processing, there are various color models designed for specific purposes, such as red–green–blue (RGB), cyan–magenta–yellow–black (CMYK), and HSB. The HSB color model, which was designed to approximate the way humans perceive and interpret color, is often used in computer vision for feature detection or image segmentation since it is a device-independent color representation format²². Our esophagitis classifier was compared with one based on the RGB color model, which is the most widely used. Since the characteristics of each ROI in the image are expected to be different, ROI-based classifiers were designed instead of image-based classifiers, and then image-based accuracy was obtained by averaging the results of the ROIs. We collected 1082 endoscopic images from 150 patients, obtaining a total of 3444 ROIs (HSV: 87 patients, 666 endoscopic images, 2628 ROIs; CMV: 63 patients, 416 endoscopic images, 816 ROIs).

There were 520 image features extracted from each channel of the HSB and RGB color models, resulting in a total of 1,560 image features extracted from each ROI, including first-order (N = 17), texture (N = 87) and wavelet analyses (N = 416) (Supplementary Appendix I). The first-order features were derived from intensity histograms using first-order statistics, including intensity range, energy, entropy, kurtosis/skewness, maximum/minimum, mean, median, uniformity, and variance. Texture features were obtained with a gray-level co-occurrence matrix (GLCM) and a gray-level run length matrix (GLRLM) in four directions in two-dimensional (2D) space²³; GLCM texture features were computed for varying distances of 1, 2, and 3 pixels in four directions. The wavelet transformation was applied with a single-level directional discrete wavelet transformation of high-pass and low-pass filters²⁴. In total, four wavelet-decomposition images were generated from each ROI: LL, LH, HL, and HH images, where ‘L’ means ‘low-pass filter’ and ‘H’ means ‘high-pass filter.’ Then, the first-order and texture features were applied to the wavelet-transformed images, yielding 416 wavelet features (17 first-order and 87 texture features per wavelet-transformed image). All image features were standardized by z-transformation before applying classification metrics.

Classification metrics

Effective feature selection is a crucial step because image features are multiple collinear and correlated predictors that could produce unstable estimates and might overfit predictions. The feature selection methods can be divided by how they are coupled to the classification or learning algorithms as follows: (1) filter method, (2) wrapper method, (3) embedded method²⁵. Filter methods reduce the number of features independently. Wrapper methods wrap the feature selection around the classification method and use the prediction accuracy of the model to iteratively select or eliminate a set of features. In embedded methods, the feature selection process is an integral part of the classification model. We made feature selection more efficient by combining the filter method (i.e., feature filtering using univariate feature selection) and the embedded method (i.e., LASSO). First, we filtered the extracted features using univariate feature selection in terms of each channel of the HSB and RGB color models. Based on the p value (< 0.05) of ANOVA tests, 124 features of HSB color models were filtered out, and the remaining features included 478 H-channel features, 481 S-channel features, and 477 B-channel features. For the RGB color model, 420 features were filtered out, and the remaining features included 341 R-channel features, 410 G-channel features, and 389 B-channel features. After channel-wise feature filtering, the remaining features were combined according to color model (HSB color model: 1436 features, RGB color model: 1140 features). A LASSO was then employed for feature selection of combined features. A total of 25 LASSOs were performed by five repeated five-fold cross-validations, and 11–18 features and 11–20 features were selected from the HSB and RGB color models, respectively (Supplementary Appendix II). Using selected image features, two different machine learning classifiers were trained: logistic regression and random forest. The random forest is a classifier that derives and ensembles several decision tree classifiers on various sub-samples of the dataset to improve the predictive accuracy and control overfitting. In other words, random forest does not require additional feature selection. However, we tried to improve the performance of random forest by combining LASSO since our dataset has many features compared with the number of datasets. While performing five repeated five-fold cross-validations, the hyperparameters of logistic regression and random forest were obtained by nested cross-validation in each fold. To maximize the probabilities of correct decisions, we found an optimal cutoff value using the true-positive and false-positive rates forming the receiver operating characteristic (ROC) curve²⁶. Univariate feature selection, LASSO, logistic regression, and random forest classification were implemented using the Scikit-learn package (https://github.com/scikit-learn/scikit-learn)²⁷.

Statistics

Categorical data were analyzed using the chi-squared test or Fisher’s exact test as appropriate. Numerical data were analyzed using Student’s t-test. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and area under the curve (AUC) were calculated by standard definitions to evaluate the performance of the developed AI system. To evaluate the differences in performance between models, we performed the Wilcoxon signed-rank test¹⁹. All statistical analyses were performed using SPSS Statistics for Windows, version 18.0 (IBM; Armonk, NY). p values < 0.05 were considered statistically significant.

References

Hoversten, P., Kamboj, A. & Katzka, D. A. Infections of the esophagus: an update on risk factors, diagnosis, and management. Dis. Esophagus 31, doy094 (2018).
Google Scholar
Wilcox, C. M., Diehl, D. L., Cello, J. P., Margaretten, W. & Jacobson, M. A. Cytomegalovirus esophagitis in patients with AIDS: a clinical, endoscopic, and pathologic correlation. Ann. Intern. Med. 113, 589–593 (1990).
Article CAS Google Scholar
McBane, R. D. & Gross, J. B. Herpes esophagitis: clinical syndrome, endoscopic appearance, and diagnosis in 23 patients. Gastrointest. Endosc. 37, 600–603 (1991).
Article CAS Google Scholar
You, D. M. & Johnson, M. D. Cytomegalovirus infection and the gastrointestinal tract. Curr. Gastroenterol. Rep. 14, 334–342 (2012).
Article Google Scholar
Hoversten, P., Kamboj, A. K., Wu, T.-T. & Katzka, D. A. Variations in the clinical course of patients with herpes simplex virus esophagitis based on immunocompetence and presence of underlying esophageal disease. Dig. Dis. Sci. 64, 1893–1900 (2019).
Article Google Scholar
Jazeron, J. F. et al. Virological diagnosis of herpes simplex virus 1 esophagitis by quantitative real-time PCR assay. J. Clin. Microbiol. 50, 948–952. https://doi.org/10.1128/jcm.05748-11 (2012).
Article PubMed PubMed Central Google Scholar
Ramanathan, J., Rammouni, M., Baran, J. Jr. & Khatib, R. Herpes simplex virus esophagitis in the immunocompetent host: an overview. Am. J. Gastroenterol. 95, 2171–2176 (2000).
Article CAS Google Scholar
Werneck-Silva, A. L. & Prado, I. B. Role of upper endoscopy in diagnosing opportunistic infections in human immunodeficiency virus-infected patients. World J. Gastroenterol. 15, 1050–1056. https://doi.org/10.3748/wjg.15.1050 (2009).
Article PubMed PubMed Central Google Scholar
Wilcox, C. M., Straub, R. F. & Schwartz, D. A. Prospective endoscopic characterization of cytomegalovirus esophagitis in AIDS. Gastrointest. Endosc. 40, 481–484 (1994).
Article CAS Google Scholar
Litjens, G. et al. A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017).
Article Google Scholar
Sahiner, B. et al. Deep learning in medical imaging and radiation therapy. Med. Phys. 46, e1–e36 (2019).
Article Google Scholar
Erickson, B. J., Korfiatis, P., Akkus, Z. & Kline, T. L. Machine learning for medical imaging. Radiographics 37, 505–515 (2017).
Article Google Scholar
Min, J. K., Kwak, M. S. & Cha, J. M. Overview of deep learning in gastrointestinal endoscopy. Gut Liver 13, 388 (2019).
Article Google Scholar
Itoh, T., Kawahira, H., Nakashima, H. & Yata, N. Deep learning analyzes Helicobacter pylori infection by upper gastrointestinal endoscopy images. Endosc. Int. Open 6, E139–E144 (2018).
Article Google Scholar
Shichijo, S. et al. Application of convolutional neural networks in the diagnosis of Helicobacter pylori infection based on endoscopic images. EBioMed. 25, 106–111 (2017).
Article Google Scholar
Arpit, D. et al. A closer look at memorization in deep networks. arXiv preprint https://arxiv.org/abs/1706.05394 (2017).
Chen, C.-H. et al. Radiomic features analysis in computed tomography images of lung nodule classification. PLoS ONE 13, e0192002 (2018).
Article Google Scholar
Yun, J. et al. Radiomic features and multilayer perceptron network classifier: a robust MRI classification strategy for distinguishing glioblastoma from primary central nervous system lymphoma. Sci. Rep. 9, 1–10 (2019).
Article Google Scholar
Woolson, R. Wilcoxon signed-rank test. In Wiley Encyclopedia of Clinical Trials, 1–3 (2007).
Wang, H.-W. et al. Clinical characteristics and manifestation of herpes esophagitis: one single-center experience in Taiwan. Medicine 95, e3187 (2016).
Article Google Scholar
Jung, K. H. et al. Can endoscopists differentiate cytomegalovirus esophagitis from herpes simplex virus esophagitis based on gross endoscopic findings?. Medicine 98, e15845 (2019).
Article Google Scholar
Cheng, H.-D., Jiang, X. H., Sun, Y. & Wang, J. Color image segmentation: advances and prospects. Pattern Recognit. 34, 2259–2281 (2001).
Article Google Scholar
Materka, A. & Strzelecki, M. Texture analysis methods–a review. Technical university of lodz, institute of electronics, COST B11 report, Brussels vol 10, 4968 (1998).
Wang, J. Z. Wavelets and imaging informatics: a review of the literature. J. Biomed. Inform. 34, 129–141 (2001).
Article CAS Google Scholar
Saeys, Y., Inza, I. & Larrañaga, P. A review of feature selection techniques in bioinformatics. Bioinformatics 23, 2507–2517 (2007).
Article CAS Google Scholar
Youden, W. J. Index for rating diagnostic tests. Cancer 3, 32–35 (1950).
Article CAS Google Scholar
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar

Download references

Acknowledgements

This study was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (HI18C0022, and HI18C2383).

Author information

These authors contributed equally: Jung Su Lee and Jihye Yun.

Authors and Affiliations

Department of Gastroenterology, Asan Medical Center, University of Ulsan College of Medicine, 88, Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea
Jung Su Lee, Jeong-Sik Byeon, Hwoon-Yong Jung & Do Hoon Kim
Department of Gastroenterology, Ilsan Paik Hospital, Inje University College of Medicine, Goyang, Republic of Korea
Jung Su Lee
Department of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Jihye Yun & Namkug Kim
Department of Convergence Medicine, Asan Medical Center, Asan Medical Institute of Convergence Science and Technology, University of Ulsan College of Medicine, 88, Olympic-Ro 43-Gil, Songpa-gu, Seoul, 05505, Republic of Korea
Sungwon Ham, Hyunjung Park & Namkug Kim
Department of Anatomy, Keimyung University School of Medicine, Daegu, Republic of Korea
Hyunsu Lee
Department of Internal Medicine, Keimyung University School of Medicine, Daegu, Republic of Korea
Jeongseok Kim

Authors

Jung Su Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jihye Yun
View author publications
You can also search for this author in PubMed Google Scholar
Sungwon Ham
View author publications
You can also search for this author in PubMed Google Scholar
Hyunjung Park
View author publications
You can also search for this author in PubMed Google Scholar
Hyunsu Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jeongseok Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jeong-Sik Byeon
View author publications
You can also search for this author in PubMed Google Scholar
Hwoon-Yong Jung
View author publications
You can also search for this author in PubMed Google Scholar
Namkug Kim
View author publications
You can also search for this author in PubMed Google Scholar
Do Hoon Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.S.L.: analysis and interpretation of data, and drafting of the article. J.H.Y.: analysis and interpretation of data, and drafting of the article. N.K.K.: study conception and design, critical revision of the article for important intellectual content, and final approval of the article. D.H.K.: study conception and design, critical revision of the article for important intellectual content, and final approval of the article. S.W.H.: final approval of the article. H.J.P.: final approval of the article. H.S.L.: final approval of the article. J.S.K.: final approval of the article. J.S.B.: final approval of the article. H.Y.J.: final approval of the article.

Corresponding authors

Correspondence to Namkug Kim or Do Hoon Kim.

Ethics declarations

Competing interests

The authors except Namkug Kim have no conflicts of interest to disclose. Namkug Kim is a stakeholder of Promedius Inc.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, J.S., Yun, J., Ham, S. et al. Machine learning approach for differentiating cytomegalovirus esophagitis from herpes simplex virus esophagitis. Sci Rep 11, 3672 (2021). https://doi.org/10.1038/s41598-020-78556-z

Download citation

Received: 19 May 2020
Accepted: 17 November 2020
Published: 11 February 2021
DOI: https://doi.org/10.1038/s41598-020-78556-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.