Abstract

Introduction. Considering the narrow window of surgery, early diagnosis of liver cancer is still a fundamental issue to explore. Hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICCA) are considered as two different types of liver cancer because of their distinct pathogenesis, pathological features, prognosis, and responses to adjuvant therapies. Qualitative analysis of image is not enough to make a discrimination of liver cancer, especially early-stage HCC or ICCA. Methods. This retrospective study developed a radiomic-based model in a training cohort of 122 patients. Radiomic features were extracted from computed tomography (CT) scans. Feature selection was operated with the least absolute shrinkage and operator (LASSO) logistic method. The support vector machine (SVM) was selected to build a model. An internal validation was conducted in 89 patients. Results. In the training set, the AUC of the evaluation of the radiomics was 0.855 higher than for radiologists at 0.689. In the valuation cohorts, the AUC of the evaluation was 0.847 and the validation was 0.659, which indicated that the established model has a significantly better performance in distinguishing the HCC from ICCA. Conclusion. We developed a radiomic diagnosis model based on CT image that can quickly distinguish HCC from ICCA, which may facilitate the differential diagnosis of HCC and ICCA in the future.

1. Introduction

According to the latest report of the International Agency for Research on Cancer, liver cancer is one of the most common digestive cancers. Primary liver cancer is the sixth most commonly occurring cancer and the third leading cause of cancer-related deaths worldwide, ranking fifth in incidence and fourth in mortality [1]. Despite the available treatment options, the incidence and mortality rates are nearly equal [2]. Surgery, with a narrow therapeutic window, remains the mainstay of liver cancer therapy for patients at early stage [3]. Thus, early diagnosis is still a fundamental issue to explore.

The dominant histological types of primary liver cancer are hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICCA), accounting for over 99% of primary liver cancer cases [4, 5]. Considering the clinical stages, patients with HCC and ICCA may be assigned to similar clinical managements [6]. However, HCC and ICCA are considered as completely different two types of liver cancer because of their distinct pathogenesis, pathological features, prognosis, and responses to adjuvant therapies [7]. Therefore, early discrimination of these two types of liver cancer contributes to designing personalized treatment strategies.

Computed tomography (CT), as a common type of imaging tool, plays a major part in diagnosis, staging, treatment, and follow-up of oncologic patients. During routine preoperative evaluation in a clinical setting, three-dimensional (3D) reconstruction of CT images helps formulate more reasonable surgical planning [8]. The contrast-enhanced CT provides higher resolution images and defines the nature of the lesion [9, 10]. However, the diagnostic accuracy is dependent on variations in radiologists’ level of experience, resulting in frequent misdiagnosis [11]. Despite the development of modern imaging techniques, qualitative analysis of image is not enough to make a discrimination of liver cancer, especially early-stage HCC or ICCA, which have puzzled researchers for several years.

Radiomics, as a novel image processing technology, can automatically provide a large number of quantitative image features from medical images, which may be impossible for naked eyes to recognize [12, 13] These image features can also be combined with machine learning algorithms to make a prediction for diagnosis. Several studies have shown the outlook of prediction for cancer outcome [14]. The radiomic-based classifiers using routine magnetic resonance imaging (MRI) sequences in differentiation of peripheral schwannomas and neurofibromas showed higher area under the curve (AUC) values on the receiver operator characteristic (ROC) curve than expert human evaluators [15] and so was the random forest model based on CT radiomics [16]. Radiomics can significantly improve the accuracy and consistency of diagnosis. Unfortunately, very few studies have investigated the CT radiomic-based model to distinguish HCC and ICCA.

In this study, we established a support vector machine (SVM) based on radiomic features at noncontrast CT to train a discriminative model for HCC and ICCA at early stage. The diagnostic performance was also compared with experienced radiologists.

2. Materials and Methods

2.1. Patients and Liver Pathological Diagnosis

The workflow is schematically depicted in Figure 1. All patients with pathologic results of liver cancer underwent noncontrast CT at our institution between August 2018 and November 2019. Here are the exclusion criteria for patient screening: (1) lack of abdominal noncontrast CT image at 1.5 mm thickness (), (2) poor image quality (), (3) an interval between pathological results and CT examination of more than 3 months (), (4) coinfection with virus such as HBV or HCV (), and (5) incomplete clinical data ().

We partitioned the whole cohort into two parts, 122 patients for training (from August 2018 to March 2019) and 89 patients for validation (from April 2019 to November 2019).

The clinical characteristics and the data of CT scan were obtained from medical records. Clinical data included age, sex, blood routine tests (red blood cell (RBC), white blood cell (WBC), platelet (PLT) count, and hemoglobin (Hb)), liver function examinations (ALT, aspartame aminotransferase (AST), alkaline phosphate (ALP), gluttony transpiration (GGT), lactate dehydrogenase (LDH), total bilirubin (TB), conjugated bilirubin (CB), albumin (ALB), globulin (GLOB), total bile acid (TBA), and leucine acrylamide (LAP)), lipid metabolism tests (total cholesterol (TC), high density lipoprotein cholesterol (HDL-C), low density lipoprotein cholesterol (LDL-C), apolipoprotein A1 (Apo A1), and apolipoprotein B (Apo B)), C-reactive protein (CRP), and blood coagulation function (prothrombin time (PT) and international normalized ratio (INR)).

The histopathological analysis of the liver was made by two pathologists with over 5-year working experience. And the pathologists were blinded to the clinical information.

2.2. CT Image Acquisition and Evaluation

All patients received the examination with the same CT scanner in supine position (LightSpeed, VCT, or Discovery HD 750, GE Healthcare, US). The CT scanner parameters are listed as follows: tube voltage 120 kVp, tube current 250–350 mA, collimating slice thickness 5 mm, reconstruction slice thickness 1.25 mm, slice interval 5 mm, rotation time 0.6 s, helical pitch 1.375, the field of view between 35 and 40 cm, and matrix . The image was reconstructed using a standard algorithm. The CT images were reviewed by two independent radiologists. The radiologists were aware of the diagnostic criteria and blinded to the clinical radiological details. Any differences were resolved through discussion.

2.3. The Establishment of Radiomic Model in the Training Cohort

Regions of interest (ROIs) were selected in the liver of all patients by two radiologists using 3D slicer (version 4.8.0; http://www.slicer.org) [17]. The ROIs were manually segmented along the tumor contour on each transverse section. Image preprocessing and feature extraction were performed by Pyradiomics package (http://www.radiomics.io/pyradiomics.html). The voxel spacing was standardized with the size of  mm. The voxel intensity values were discretized with a bin width of 25 HU to reduce the interference of image noise and normalize intensities [18]. Eight hundred forty-one radiomic features (13 shape statistics, 18 first-order statistics, 74 textural features, and 736 wavelet-based transformations) were extracted from each ROI (Table 1).

The intra- and interobserver reliability for each radiomic feature was calculated by using intraclass correlation coefficient (ICC). Radiomic features with both intra- and interobserver ICC greater than 0.8 were selected for subsequent analysis. The least absolute shrinkage and selection operator (LASSO) logistic regression algorithm [19], along with penalty parameter tuning conducted by 10-fold cross-validation, was performed to select cirrhosis-related features (with nonzero coefficients).

2.4. Statistical Analysis

Categorical and continuous variables were compared by test and Student test, respectively. The R package “e1071” was used to perform the SVM, and “glmnet” was used for LASSO regression on R software (version 3.6.1, http://www.r-project.org). The diagnostic performance of established models was evaluated by the ROC curve and area under the curve (AUC) value. The DeLong test was used to compare AUC values. Calibration curves were plotted via bootstrapping with 1000 resamples, accompanied by the Hosmer-Lemeshow test, to evaluate the calibration of the established model. The decision curve analysis (DCA) was used to calculate the net benefit from the use of established models. was considered statistically significant.

3. Results

3.1. Baseline Characteristics

The clinical characteristics are shown in Table 2. There was no significant difference in age, sex, AFP, CEA, CA199 between the training, and validation cohorts.

3.2. Radiomic Analysis

Of 841 extracted features, 76 features (8 first-order statistics, 21 textural features, and 56 wavelet-based transformations) with high reproducibility were selected for subsequent analysis. 43 independent significant features were identified by the LASSO logistic regression model (Figure 2). A radiomic model was constructed using SVM algorithm, of which the type is eps-regression. The kernel function is radial-based and the number of support vectors is 73.

3.3. Model Establishment and Validation

The ROC analysis is shown in Figure 3 and the summary of the model is shown in Table 3. In the training set, the AUC of the evaluation of the radiomics was 0.855 higher than for radiologists at 0.689. In the validation cohorts, the AUC of the radiomics model was 0.847 and the radiologic evaluation was 0.659, which indicated that the radiomics model have a significant benefit in distinguish the HCC form ICCA. The calibration of the training set model shows that the models established have a great agreement with the actual result and preside results. Besides, the curve of the validation cohorts shows the same results. The Hosmer-Lemeshow test yielded values of 0.056 and 0.217 in the training and validation cohort, indicating no departure from the good fit.

The DCA is shown in Figure 4. Compared with scenarios in which no prediction model would be used (i.e., treat-all or treat-none scheme), the radiomic model can provide better net benefit in distinguishing the HCC from ICCA than radiologic evaluation for threshold probabilities of more than 20% in the training and validation cohorts.

4. Discussion

This is a brand new study for making a discrimination analysis for HCC and ICCA by establishing a radiomic-based classification model at noncontrast CT which showed higher efficacy than experienced radiologists. There are 43 selected radiomic features integrated in the model, and the great diagnostic performance was achieved.

HCC and ICCA are classified into liver cancer in the clinical classification but completely different in initiation and progression. First, viral infections, alcoholism, and fatty liver are leading risk factors for HCC, as primary sclerosing cholangitis, bile duct cyst, and hepatolithiasis are for ICCA. Second, HCC and ICCA originate from completely different cell population. It is generally believed that HCC is a highly aggressive epithelial tumor originating both from mature hepatocytes and stem cells [20]. However, ICCA is likely to arise from cholangiocytes or the epithelial cells lining the biliary tree [21]. Third, although surgical resection is the only preferred therapeutic option for both two cancers at early stage, the nonoperative treatment is significantly divergent [22]. Thus, early discrimination is of great value to cancer patients.

Unfortunately, the discrimination owes a major debt to image. The ability of radiologists plays major roles in that. Imaging examination mainly distinguishes them according to the difference of blood supply. The typical hepatocellular carcinoma has an abundant blood supply, which can facilitate the presence of nonrim-like enhancement of an observation in the arterial phase and the wash out in the venous phase [23]. Meanwhile, the hypovascular intrahepatic cholangiocarcinoma frequently results in insufficient enhancement of arterial phase [24]. But when it comes to nontypical HCC, it is difficult to identify HCC from ICCA [25]. The gold standard diagnostic test for liver cancer is the pathology, which need paracentesis or surgery. The selection of cancer diagnostics and treatments may delay for quite a long time. Therefore, it is necessary to find a more reliable, efficient, and user-friendly method to distinguish them.

The contrast-enhanced CT or MRI, which can provide more information than noncontrast CT, was suggested by guidelines for patients with liver mass. However, many patients in China only accept noncontrast CT examinations because of limited cost-effectiveness (mainly attributed to price and waiting time). CT scan is one of the most economical, noninvasive, and convenient examination approaches along the presurgical evaluation and postoperative follow-up [26]. The radiomic approach relies on highly informative image data. Indeed, this is exactly why we selected radiomic analysis based on CT data. Researchers have determined the ability of MR perfusion-based radiomics to discriminate pseudoprogression from progressive disease in glioblastoma patients [27] and the normal tissue (glandular) from benign and malignant tumors in patients with breast mass [28].

There have been many liver cancer studies based on radiomics. However, these studies focused on the distinguishment of microvasculature and prediction of prognosis while may not meet the requirements for clinical application [29]. In this study, we established the model by analyzing the image features of patients and selected the features related to the pathological types of patients through logistic regression analysis. By analyzing the CT image features of the training cohort, we selected the image features with high correlation with patient diagnosis as factors of the radiomic model. To further verify the diagnostic validity of the data model, we validated this in the validation cohort. It is promising that the model we established has higher accuracy than experienced clinical radiologist. A stable radiomic-based model may play an important role in a regional hospital lack of experienced radiologists.

Meanwhile, some limitations pertain to our study. First, it is a retrospective study with some considerable risk of bias in the data-driven procedure. Second, the established model was based on a single center. Larger, multicenter cohort study analyses are needed to validate such model. Third, biochemical markers significantly contirbute to liver cancer-related studies, and this study did not involve biochemical markers into the established model. The analysis combining biochemical indicators and image features should be considered in the subsequent studies.

5. Conclusion

In summary, we developed a radiomic diagnosis model based on CT image that can quickly distinguish HCC from ICCA, which may facilitate the differential diagnosis of HCC and ICCA in the future.

Data Availability

The datasets analyzed during the current study are available from the corresponding authors on reasonable request.

Ethical Approval

This retrospective study was approved by the institutional review board of the Affiliated Cancer Hospital of Nanjing Medical University and Drum Tower Clinical Medical College of Nanjing Medical University. The requirement for written informed consent was waived by the institutional review board due to its retrospective nature.

Conflicts of Interest

On behalf of all authors, the corresponding authors state that there is no conflict of interest.

Authors’ Contributions

All authors gave final approval of the version to be published and agreed to be accountable for all aspects of the work. Xiaoliang Xu, Yingfan Mao, and Yanqiu Tang share co-first authorship.

Acknowledgments

We would like to express our great appreciation to the R Development Core Team and contributors for R packages used in our study. This study was supported by the National Natural Science Youth Foundation (Grant Number 81902415 to Y.Y., Grant Number 82103135 to X.X., and Grant Number 82101850 to Y.L.), Natural Science Youth Foundation of Jiangsu Province (Grant Number BK20190116 to Y.Y.), and Key Laboratory of Imaging Diagnosis and Minimally Invasive Interventional Research of Zhejiang Province (No. YXJR202002 to J.W.).