CT-based radiomics with various classifiers for histological differentiation of parotid gland tumors

Lu, Yang; Liu, Haifeng; Liu, Qi; Wang, Siqi; Zhu, Zuhui; Qiu, Jianguo; Xing, Wei

doi:10.3389/fonc.2023.1118351

ORIGINAL RESEARCH article

Front. Oncol., 10 March 2023

Sec. Cancer Imaging and Image-directed Interventions

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1118351


Yang Lu

Haifeng Liu

Qi Liu

Siqi Wang

Zuhui Zhu

Jianguo Qiu

Wei Xing^*

Radiology, Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu, China

Objective: This study assessed whether radiomics features could stratify parotid gland tumours accurately based on only noncontrast CT images and validated the best classifier of different radiomics models.

Methods: In this single-centre study, we retrospectively recruited 249 patients with a diagnosis of pleomorphic adenoma (PA), Warthin tumour (WT), basal cell adenoma (BCA) or malignant parotid gland tumours (MPGTs) from June 2020 to August 2022. Each patient was randomly assigned to the training or testing cohort at a ratio of 7:3, and pairwise comparisons between the different parotid tumour groups were then performed. CT images were transferred to 3D-Slicer software, and the region of interest was manually drawn for feature extraction. Feature selection was performed using the intraclass correlation coefficient, t test and least absolute shrinkage and selection operator. Five common classifiers, namely, random forest (RF), support vector machine (SVM), logistic regression (LR), K-nearest neighbours (KNN) and Gaussian naive Bayes (Gnb), were selected to build different radiomics models. The receiver operating characteristic curve, area under the curve (AUC), accuracy, sensitivity, specificity and F-1 score were used to assess the prediction performances of these models. The calibration of the model was assessed by the Hosmer–Lemeshow test. DeLong’s test was utilized for comparing the AUCs.

Results: The radiomics model based on the RF, SVM, Gnb, LR, LR and RF classifiers obtained the highest AUC in differentiating PA from MPGTs, WT from MPGTs, BCA from MPGTs, PA from WT, PA from BCA, and WT from BCA, respectively. Accordingly, the AUC and the accuracy of the model for each classifier were 0.834 and 0.71, 0.893 and 0.79, 0.844 and 0.79, 0.902 and 0.88, 0.602 and 0.68, and 0.861 and 0.94, respectively.

Conclusion: Our study demonstrated that noncontrast CT-based radiomics could stratify refined pathological types of parotid tumours well but could not sufficiently differentiate PA from BCA. Different classifiers had the best diagnostic performance for different parotid tumours. Our study findings add to the current knowledge on the differential diagnosis of parotid tumours.

Introduction

Parotid gland tumours are the main tumours of the salivary glands, and more than 80% are benign. However, an early accurate diagnosis is still needed to define the proper surgical treatment (1). For patients with malignant parotid gland tumours (MPGTs), total parotidectomy is necessary, and postoperative chemoradiation is considered if patients have high-risk factors (2). Among benign parotid gland tumours (BPGTs), the major types are Warthin tumour (WT), pleomorphic adenoma (PA) and basal cell adenoma (BCA), and the operation types also differ. Due to its higher malignancy and recurrence rates, PA is treated by partial parotidectomy (3), while WT and BCA are treated only by local surgical excision of the tumour or by conservative treatment, given that malignant transformation is rare (4).

Thus, a simple and effective diagnostic method is crucial and necessary for the differential diagnosis of parotid tumours before surgical treatment. Routine fine needle aspiration is largely dependent on the experience of the clinical operators, as the diagnostic accuracy is sometimes poor due to insufficient or nonrepresentative aspiration (5). In addition, the conventional radiological features of different parotid tumour types may considerably overlap (6). Some studies have reported that changes in parotid tumour margins may not indicate malignancy, and heterogeneously enhanced features cannot be used to distinguish benign from malignant parotid tumours (7, 8). Some BPGTs resemble MPGTs with a heterogeneous appearance due to the existence of the area of cystoid variation and necrosis (9). All of these results present significant diagnostic challenges in the preoperative diagnosis of parotid gland tumours.

Radiomics is a fast-growing research field that is widely used in tumour imaging. The radiomics approach can automatically extract comprehensive data present in imaging modalities and uncover much more quantitative tumour information than our eyes can detect. In recent years, multiple studies have reported that radiomics may be applied to parotid gland tumours with promising preoperative diagnostic results (10). Li et al. confirmed that radiomics analysis of ultrasound images may help improve the discrimination of BPGTs from MPGTs (11). Zheng et al. developed a computed tomography (CT)-based radiomics nomogram to distinguish benign lymphoepithelial lesions from mucosa-associated lymphoid tissue lymphoma, which has promising predictive efficacy (12). In addition, the magnetic resonance (MR) radiomics model has yielded excellent diagnostic performance in differentiating BPGTs from MPGTs and PA from WT (13–18).

Many studies have explored radiomics for the differential diagnosis of parotid tumours based on multiphasic CT or multisequence MR radiomics features; however, it is still necessary to further explore the diagnostic performance of radiomics models based on noncontrast CT. Contrast-enhanced CT or MR studies have superior diagnostic results. However, they often have downsides, and MR may require long acquisition times and have absolute and relative MR contraindications. Contrast-enhanced CT may often burden the patient with more radiation exposure and have contrast agent contraindications. These factors could make noncontrast CT-based radiomics an attractive choice, at least in selected patients. Another potential advantage of CT-based radiomics is the possibility of detecting and characterizing incidental parotid masses in patients undergoing CT for other unrelated reasons. Furthermore, previous CT radiomics studies focused on distinguishing benign from malignant parotid tumours, but there is little research addressing the possibility of distinguishing among the detailed pathological types of parotid tumour. Typically, only a single machine learning classifier was used in previous research, and different classifiers may lead to different diagnostic performances. Hence, it would be beneficial to evaluate whether noncontrast CT-based radiomics can perform well in stratifying different pathological types of parotid tumours and whether there are differences in the diagnostic value of various machine learning classifiers in the diagnosis of parotid gland tumours. This may help distinguish different parotid tumours accurately and conveniently and guide the selection of the best model for future multicentre research of large datasets.

The purpose of this study was to construct different radiomics models based on noncontrast CT images with five mainstream classifiers to compare the predictive ability of various radiomics models for different parotid tumours, such as MPGTs, PA, WT and BCA, and to determine the classifier with the best diagnostic performance for each parotid tumour.

Materials and methods

Patients

In this single-centre retrospective study, a total of 415 patients with definite pathological results indicating a parotid gland tumour in the Third Affiliated Hospital of Soochow University were registered from June 2020 to August 2022. The exclusion criteria were as follows: (1) parotid tumour recurrence or previous treatment (n=51); (2) no CT examination of the parotid gland before treatment (n=35); (3) maximum tumour diameter less than 0.5 cm (n=25); (4) unsatisfactory image quality due to the existence of metallic or beam hardening artefacts (n=41); or (5) simple cystic lesions (n=14). Thus, a total of 249 patients were included in our study. The baseline clinical characteristics were collected by retrieving the patients’ hospital records. CT was performed with four CT scanners: a double source scanner (SOMATOM Definition Flash, Siemens Healthcare, Forchheim, Germany), a 64-slice CT scanner (Discovery 750 HD, GE Healthcare, Milwaukee, Wisconsin), a 320-slice CT scanner (Aquilion ONE, Toshiba Medical Systems, Otawara, Japan), and a 256-slice CT scanner (Brilliance iCT; Philips Healthcare, Cleveland, OH, USA). According to the pathological results of their parotid gland tumours, the patients were divided into the MPGT, PA, WT and BCA groups. The flowchart for selecting the study population is shown in Figure 1. Our study was approved by the ethics committee of the Third Affiliated Hospital of Soochow University, Jiangsu, China, and exempted from informed consent requirements due to the retrospective nature of the study.


Figure 1 Flowchart for selecting the study population. PA, pleomorphic adenoma; WT, Warthin tumour; BCA, basal cell adenoma; MPGTs, malignant parotid gland tumours.

CT image acquisition

Each patient underwent noncontrast imaging with a multislice spiral CT scanner. The CT scanners and parameters were as follows: (1) Discovery 750 HD: tube voltage, 120 kV; tube current, smart mA (100-450 mAs); section thickness, 2.5 mm; section interval, 2.5 mm; gantry rotation time, 0.6 seconds; detector collimation, 64 mm × 0.625 mm; matrix, 512 × 512; (2) SOMATOM Definition Flash: tube voltage, 120 kV; tube current with dose modulation (Care Dose 4D); section thickness, 3 mm; section interval, 3 mm; gantry rotation time, 0.5 seconds; detector collimation, 128 mm × 0.6 mm; matrix, 512 × 512; (3) Aquilion ONE: tube voltage, 120 kV; tube current, 250 mAs; section thickness, 3 mm; section interval, 3 mm; gantry rotation time, 0.35 seconds; detector collimation, 320 mm × 0.5 mm; matrix, 512 × 512; (4) Brilliance iCT: tube voltage, 120 kV; tube current, 250 mAs; section thickness, 3 mm; section interval, 3 mm; gantry rotation time, 0.27 seconds; detector collimation, 256 mm × 0.625 mm; matrix, 512 × 512. All scans were performed from 1 cm below the aortic arch to the top of the head.

ROI segmentation

All noncontrast CT images were stored in the Digital Imaging and Communications in Medicine format and imported to 3D-Slicer software for manual segmentation of the regions of interest (ROIs) by two radiologists who were blinded to the pathological results. Contours were drawn slice-by-slice within the borders of the tumours on axial CT images, excluding adjacent bone and vessels. The intraclass correlation coefficients (ICCs) were used to evaluate the stability and agreement of the features, and an ICC greater than 0.75 indicated good agreement.
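The inter-reader agreement check described above can be illustrated with a minimal sketch. The text does not state which ICC form was used, so the two-way random-effects, absolute-agreement, single-rater form, ICC(2,1), is assumed here purely for illustration; the function name and the toy data are hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    ratings: list of rows, one per lesion; each row holds the value of one
    feature as measured from each reader's segmentation.
    The ICC form is an assumption; the study does not name it.
    """
    n = len(ratings)       # number of lesions
    k = len(ratings[0])    # number of readers
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Sums of squares for lesions (rows), readers (columns) and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


# Toy example (not study data): one feature measured by two readers on four lesions.
feature_by_reader = [[10.1, 10.3], [12.0, 11.8], [15.2, 15.1], [9.7, 9.9]]
is_stable = icc_2_1(feature_by_reader) > 0.75  # threshold taken from the text
```

In a full pipeline this check would be repeated for every extracted feature, and only features passing the 0.75 threshold would move on to the later selection steps.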

Imaging feature extraction

Image preprocessing and feature extraction were performed using the open-source package PyRadiomics 3.0 in Python (version 3.7.6; http://www.radiomics.io/pyradiomics.html). To reduce the potential impact of the different CT devices on the extracted features, the images were resampled to a voxel spacing of 1 × 1 × 1 mm³, and a fixed bin width of 25 was used to discretize image intensity (19). Then, 1323 features were extracted from each volume of interest (VOI) as follows: (a) shape-based features; (b) first-order statistics features; (c) grey-level co-occurrence matrix-based features (GLCM); (d) grey-level run-length matrix-based features (GLRLM); (e) grey-level size zone matrix (GLSZM); (f) neighbouring grey tone difference matrix (NGTDM); (g) grey-level dependence matrix (GLDM) and (h) transform-filtered features (including square, square root, logarithm, exponential, gradient, Laplacian of Gaussian [LOG], wavelet). Finally, z score normalization was performed for all features to reduce the influence of different scales among features (20).
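Two of the preprocessing steps above, fixed-bin-width discretization and z score normalization, can be sketched in plain Python. This is an illustration only and not the PyRadiomics implementation; the bin-index formula follows the fixed-bin-width rule as we understand it from the PyRadiomics documentation, and the function names and sample values are hypothetical.

```python
import math
import statistics


def discretize_fixed_bin_width(intensities, bin_width=25):
    """Map grey levels to bin indices using a fixed bin width W.

    Bin index = floor(x / W) - floor(min / W) + 1, so the lowest bin is 1.
    """
    offset = math.floor(min(intensities) / bin_width)
    return [math.floor(x / bin_width) - offset + 1 for x in intensities]


def z_score(values):
    """Standardise one feature column to zero mean and unit standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)  # population SD (assumption; sample SD is also common)
    return [(v - mean) / sd for v in values]


# Toy Hounsfield-unit values inside a VOI (not study data).
bins = discretize_fixed_bin_width([-30, 0, 24, 25, 60], bin_width=25)

# Toy values of one feature across eight patients (not study data).
scaled = z_score([2, 4, 4, 4, 5, 5, 7, 9])
```

A fixed bin width keeps the grey-level resolution comparable across scanners, whereas a fixed bin count would rescale each lesion to its own intensity range.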

Feature selection

In this study, patients were divided into four different groups (PA, WT, BCA and MPGT) according to pathological type. Each patient was randomly assigned to the training or test cohort at a ratio of 7:3, and pairwise comparisons between the groups were then performed according to the following pipeline. Three steps were performed for feature selection. First, the features with ICCs >0.75 were selected due to their stability. Second, the t test was performed to select features that differed significantly between groups. Finally, a least absolute shrinkage and selection operator (LASSO) regression model with 10-fold cross-validation was used to select features with nonzero coefficients.
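The final LASSO step can be illustrated with a minimal coordinate-descent sketch. It assumes standardised features and a fixed regularization parameter; the 10-fold cross-validation used to tune lambda is not shown, and the function names and toy data are hypothetical rather than the authors' implementation.

```python
def soft_threshold(rho, lam):
    """Soft-thresholding operator used in the LASSO coordinate update."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=100):
    """Minimise (1/2n) * ||y - Xb||^2 + lam * ||b||_1 by cyclic coordinate descent.

    X: list of sample rows with standardised features; y: centred targets.
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0  # correlation of feature j with the partial residual
            z = 0.0    # squared norm of feature j
            for i in range(n):
                pred_without_j = sum(X[i][m] * beta[m] for m in range(p) if m != j)
                rho += X[i][j] * (y[i] - pred_without_j)
                z += X[i][j] ** 2
            beta[j] = soft_threshold(rho / n, lam) / (z / n)
    return beta


def selected_features(beta, tol=1e-12):
    """Indices of features whose coefficients are nonzero."""
    return [j for j, b in enumerate(beta) if abs(b) > tol]


# Toy data (not study data): feature 0 tracks the label, feature 1 does not.
X = [[-1.0, -1.0], [1.0, -1.0], [-1.0, 1.0], [1.0, 1.0]]
y = [-1.0, 1.0, -1.0, 1.0]
beta = lasso_coordinate_descent(X, y, lam=0.1)
kept = selected_features(beta)
```

The L1 penalty shrinks the coefficient of the uninformative feature to exactly zero, which is why the surviving nonzero coefficients can serve directly as the selected feature set.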

Statistical analysis

The final selected features were utilized for modelling with five mainstream classifiers, including logistic regression (LR), K-nearest neighbours (KNN), support vector machine (SVM), random forest (RF) and GaussianNB (Gnb). The diagnostic performance of each model for the differential diagnosis of parotid gland tumours (PA and MPGTs, PA and WT, PA and BCA, WT and MPGTs, WT and BCA, and BCA and MPGTs) was quantitatively evaluated by means of the area under the curve (AUC) of the receiver operating characteristic (ROC), accuracy, sensitivity, specificity and F-1 score. The calibration of the radiomics model was assessed by the Hosmer–Lemeshow test. DeLong’s test was utilized for comparisons of AUCs. A p value < 0.05 indicated a significant difference. The distributions of radiomics scores for each validation cohort patient in the different models are presented as a waterfall plot. All the above processes were implemented in Python (version 3.7.6), except DeLong’s test, which was implemented with MedCalc 19.8 software (MedCalc, Ostend, Belgium). A flow diagram describing the radiomics analysis process is shown in Figure 2.
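For readers less familiar with the evaluation metrics listed above, the following sketch computes them from scratch. The AUC uses the rank-based (Mann-Whitney) formulation; the 0.5 decision threshold, the function names and the toy scores are assumptions for illustration and are not taken from the study.

```python
def auc_rank(scores_pos, scores_neg):
    """AUC as the probability that a positive case outscores a negative case.

    Tied pairs count as one half (Mann-Whitney formulation).
    """
    wins = 0.0
    for sp in scores_pos:
        for sn in scores_neg:
            if sp > sn:
                wins += 1.0
            elif sp == sn:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity, specificity and F1 score for labels coded 1/0.

    Assumes both classes occur in y_true and at least one case is predicted positive.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Toy model scores (not study data): three positive and three negative cases.
pos_scores = [0.9, 0.8, 0.4]
neg_scores = [0.7, 0.3, 0.2]
auc = auc_rank(pos_scores, neg_scores)

y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1 if s >= 0.5 else 0 for s in pos_scores + neg_scores]  # assumed threshold
metrics = binary_metrics(y_true, y_pred)
```

The AUC is independent of any threshold, whereas accuracy, sensitivity, specificity and the F-1 score all depend on the cut-off chosen, which is one reason a model can rank well by AUC yet show modest accuracy.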


Figure 2 Workflow of the radiomics analysis. LASSO, least absolute shrinkage and selection operator; MPGTs, malignant parotid gland tumors; PA, pleomorphic adenoma; WT, Warthin tumor; BCA, basal cell adenoma; SVM, support vector machine; RF, random forest; KNN, k-Nearest Neighbor; LR, logistic regression; Gnb, GaussianNB.

Results

Study cohort

Among the 249 patients included in this study, 154 (61.85%) were men, and 95 (38.15%) were women. The average age of the patients was 52.72 ± 15.22 years. Among the 180 BPGT cases, the most common subtype was PA (71, 39.44%), followed by WT (68, 37.78%) and BCA (41, 22.78%). The other 69 lesions were MPGTs. The numbers of cases evaluated with the Discovery 750 HD, SOMATOM Definition Flash, Aquilion ONE and Brilliance iCT scanners were 61, 75, 71 and 42, respectively.

MPGTs vs. PA

In the comparisons of MPGTs and PAs, a total of 503 radiomics features were selected after being screened by the ICC and t test. Then, 16 features were finally selected by LASSO for building the radiomics models, and the best tuned regularization parameter lambda was 0.0569. There were 1 first-order statistics feature, 1 GLCM feature, 1 gradient feature and 13 wavelet features among the final selected features.

The radiomics model of the RF classifier obtained the best diagnostic performance in differentiating PA from MPGTs compared with the other four classifiers. The AUC and accuracy were 0.834 and 0.71, with sensitivity, specificity and F-1 scores of 0.87, 0.62 and 0.82, respectively. The p value of the RF model in the Hosmer–Lemeshow test was 0.139 (>0.05), so the calibration of the RF model was reliable. Analysis by DeLong’s test showed that the AUC of the RF model was the highest but was significantly higher than that of the Gnb model only (p=0.021), with no significant differences compared to those of the other three models (p>0.05). The waterfall plot of the RF model in differentiating PA from MPGTs in the validation cohort is presented in Figure 3A. The ROC curve is shown in Figure 4A.


Figure 3 Waterfall plots for distribution of scores based on different radiomics models for each patient in the validation cohort. (A) MPGTs vs. PA; (B) MPGTs vs. WT; (C) MPGTs vs. BCA; (D) PA vs. WT; (E) WT vs. BCA; (F) PA vs. BCA. MPGTs, malignant parotid gland tumors; PA, pleomorphic adenoma; WT, Warthin tumor; BCA, basal cell adenoma.


Figure 4 The ROC curves of the different radiomics models: (A) MPGTs vs. PA; (B) MPGTs vs. WT; (C) MPGTs vs. BCA; (D) PA vs. WT; (E) WT vs. BCA; (F) PA vs. BCA. MPGTs, malignant parotid gland tumors; PA, pleomorphic adenoma; WT, Warthin tumor; BCA, basal cell adenoma; SVM, support vector machine; RF, random forest; KNN, k-Nearest Neighbor; LR, logistic regression; Gnb, GaussianNB.

MPGTs vs. WT

In the differentiation between MPGTs and WTs, a total of 456 radiomics features were selected according to the ICC and t test. Then, 14 features were finally selected by LASSO for building the radiomics models, and the best tuned regularization parameter lambda was 0.0281. There were 1 shape-based feature, 1 exponential feature, 1 logarithm feature and 11 wavelet features among the final selected features.

The radiomics model of the SVM classifier had the best diagnostic performance in differentiating WT from MPGTs compared with the other four classifiers. The AUC and accuracy were 0.893 and 0.79, with sensitivity, specificity and F-1 values of 0.79, 0.78 and 0.84, respectively. The p value of the SVM model in the Hosmer–Lemeshow test was 0.911 (>0.05), so the calibration of the SVM model was reliable. Analysis by DeLong’s test showed that the AUC of the SVM model was significantly better than that of the LR model (p=0.022) or Gnb model (p=0.010), but there was no significant difference compared to the AUCs of the RF and KNN models (p>0.05). The waterfall plot of the SVM model in differentiating WT from MPGTs in the validation cohort is presented in Figure 3B. The ROC curve is shown in Figure 4B.

MPGTs vs. BCA

In the differential diagnosis between MPGTs and BCAs, a total of 503 radiomics features were selected after being screened by the ICC and t test. Then, 16 features were finally selected by LASSO for building the radiomics models, and the best tuned regularization parameter lambda was 0.036. There were 1 shape-based feature, 1 GLCM feature, 1 GLRLM feature, 2 exponential features and 11 wavelet features among the final selected features.

The radiomics model of the Gnb classifier obtained the best diagnostic performance in differentiating BCA from MPGTs compared with the other four classifiers. The AUC and accuracy were 0.844 and 0.79, with sensitivity, specificity and F-1 values of 0.84, 0.79 and 0.84, respectively. The p value of the Gnb model in the Hosmer–Lemeshow test was 0.908 (>0.05), so the calibration of the Gnb model was reliable. Analysis by DeLong’s test showed that the Gnb model achieved the highest AUC, but there were no significant differences between the AUC of the Gnb model and those of the other four models (p>0.05). The waterfall plot of the Gnb model in differentiating BCA from MPGTs in the validation cohort is presented in Figure 3C. The ROC curve is shown in Figure 4C.

PA vs. WT

In the comparisons of PAs and WTs, a total of 336 radiomics features were selected after being screened by the ICC and t test. Then, 18 features were finally selected by LASSO for building the radiomics models, and the best tuned regularization parameter lambda was 0.022. There were 2 shape-based features, 1 first-order statistics feature, 1 GLCM feature, 1 gradient feature, 3 logarithm features, 2 square root features and 8 wavelet features among the final selected features.

Compared with the other four classifiers, the radiomics model of the LR classifier obtained the best diagnostic performance in differentiating PA from WT. The AUC and accuracy were 0.902 and 0.88, with sensitivity, specificity and F-1 values of 0.84, 0.83 and 0.86, respectively. The p value of the LR model in the Hosmer–Lemeshow test was 0.243 (>0.05), so the calibration of the LR model was reliable. Analysis by DeLong’s test showed that the LR model achieved the highest AUC but that the AUC was significantly higher than that of the Gnb model only (p=0.019), with no significant differences compared to those of the other models (p>0.05). The waterfall plot of the LR model in differentiating PA from WT in the validation cohort is presented in Figure 3D. The ROC curve is shown in Figure 4D.

WT vs. BCA

In the differential diagnosis between WTs and BCAs, a total of 193 radiomics features were selected after being screened by the ICC and t test. Then, 15 features were finally selected by LASSO for building the radiomics models, and the best tuned regularization parameter lambda was 0.028. There were 1 shape-based feature, 2 first-order statistics features, 1 gradient feature, 1 logarithmic feature, 1 square root feature and 9 wavelet features among the final selected features.

The radiomics model of the RF classifier obtained the best diagnostic performance in differentiating WT from BCA compared with the other four classifiers. The AUC and accuracy were 0.861 and 0.94, with sensitivity, specificity and F-1 scores of 0.83, 0.90 and 0.91, respectively. The p value of the RF model in the Hosmer–Lemeshow test was 0.412 (>0.05), so the calibration of the RF model was reliable. Analysis by DeLong’s test showed that the RF model had the highest AUC but that this value was not significantly different from those of the other models (p>0.05). The waterfall plot of the RF model in differentiating WT from BCA in the validation cohort is presented in Figure 3E. The ROC curve is shown in Figure 4E.

PA vs. BCA

In the differentiation between PA and BCA, a total of 93 radiomics features were selected after being screened by the ICC and t test. Then, 10 features were finally selected by LASSO for building the radiomics models, and the best tuned regularization parameter lambda was 0.018. There were 2 first-order statistics features, 1 GLDM feature, 1 GLSZM feature and 6 wavelet features among the final selected features.

The radiomics model of the LR classifier obtained the best diagnostic performance in differentiating PA from BCA compared with the other four classifiers. However, the AUC and accuracy were only 0.602 and 0.68, with sensitivity, specificity and F-1 values of 0.66, 0.68 and 0.59, respectively. The p value of the LR model in the Hosmer–Lemeshow test was 0.357 (>0.05), so the calibration of the LR model was reliable. Analysis by DeLong’s test showed that the AUC of the LR model was not significantly different from those of the other four models (p>0.05). The waterfall plot of the LR model in differentiating PA from BCA in the validation cohort is presented in Figure 3F. The ROC curve is shown in Figure 4F.

The detailed selected features and coefficients of different radiomics models are shown in the Supplementary Table. The detailed diagnostic performance of all models is displayed in Table 1. The detailed results of DeLong’s test of the AUCs among the different models are shown in Table 2.


Table 1 Predictive performance of different models.


Table 2 Comparison of the performance of the different models with DeLong’s test.

Discussion

In this study, we provided a detailed analysis of the radiomics model based on noncontrast CT scans and advantageous machine learning classifiers that differentiate MPGTs, PA, WT and BCA. Our results revealed that noncontrast CT-based radiomics might help distinguish all parotid tumours with promising diagnostic results, except for the differentiation between PA and BCA. The classifier with the best diagnostic performance for each parotid tumour was different.

Radiomics uses mathematical calculations to identify invisible imaging features and then quantifies the different characteristics that parotid tumour tissues exhibit in radiological data to distinguish different parotid gland tumours (21). In our study, the highest AUCs in the comparisons of PA and MPGTs, WT and MPGTs, BCA and MPGTs, PA and WT, and BCA and WT were 0.834, 0.893, 0.844, 0.902 and 0.861, respectively. The diagnostic efficiency was promising and similar to that in previous studies. Zheng et al. extracted radiomics features from nonenhanced, arterial, and venous phase CT images and constructed LR-, SVM-, and RF-based radiomics models to differentiate between benign and malignant parotid tumours (22). They demonstrated that the model using SVM exhibited the best predictive accuracy, with an AUC of 0.844. Xu et al. extracted imaging features from noncontrast and contrast-enhanced CT images for differentiating between benign and malignant parotid gland tumours via multicentre cohorts (23). In their report, the accuracy of the SVM-based radiomics model reached 0.854. Xu et al. established a machine learning predictive model based on CT radiomics to improve the accuracy of differentiation among PA, WT and parotid carcinoma, with a total accuracy of 80.5% (24). All these studies used CT-based radiomics models to differentiate various parotid tumour types with promising performance. Unlike the abovementioned literature, we not only performed differentiation between benign and malignant tumours but also classified parotid tumours according to differences in pathological results, and various classifiers were used. Our study demonstrated that in addition to benign and malignant tumours, refined pathological types of parotid tumours could be stratified well by CT radiomics.

However, not all radiomics results are ideal. In our study, the nonenhanced CT-based radiomics model did not achieve good diagnostic performance in differentiating PA from BCA, and the highest AUC was only 0.602. It seems that PA and BCA may not be effectively differentiated based on the noncontrast CT-based radiomics model alone. This result is similar to that in previous studies. Zheng et al. constructed radiomics models based on noncontrast CT for differentiating PA from BCA, and the AUCs of the models in the testing cohort with classifiers based on SVM, KNN, and LR were only 0.691, 0.612 and 0.652, respectively (25). This may be due to the pathological components of PA and BCA. The pathological structure of PA is complex and contains mixed components, such as glandular cells, myoepithelial cells, the parotid duct, mucus and cartilage-like tissue (26). On CT images, the density of the tumour is heterogeneous, and cystic change and necrosis may be present. For BCA, there are four histological subtypes, namely, solid, trabecular, tubular, and membranous (27). Pathological composition varies by BCA histological subtype, which makes the radiomics features of BCA more complex. Because of the limited number of BCA cases, we did not divide the BCA patients into different histological subtype groups. The mixed subtypes of BCA and high pathological heterogeneity of PA make it more difficult to differentiate them on noncontrast CT. Future radiomics models may need to incorporate additional contrast-enhanced CT phases to refine model performance.

In addition, it should also be noted that among the selected radiomics features for predicting different tumours, most were transform-filtered features. The higher-order statistics performed by transform-filtered features can extract areas with increasingly coarse texture patterns more flexibly and thus have the potential to highlight more details in the original images (28). Among the transform-filtered features, wavelets were more valuable in our data analysis. The frequencies of wavelet features in the final selected features in the comparison of PA and MPGTs, WT and MPGTs, BCA and MPGTs, PA and WT, PA and BCA, and WT and BCA were 13/16, 11/14, 11/16, 8/18, 6/10 and 9/15, respectively. Wavelet transforms can decompose image signals by using low- and high-pass filters and may amplify the heterogeneity information of texture features in radiological imaging, which is consistent with previous studies. Jiang et al. reported that wavelet transformation can enhance CT texture features and may be used to effectively assess the grade of pulmonary lesions caused by COVID-19 (29). Regarding the best performance in discriminating an expansive from an infiltrative front in tumour growth, Granata et al. reported that wavelet transformation had the best performance in identifying tumour recurrence (30). This study suggests that in distinguishing different parotid gland tumours, the transform-filtered features, especially the wavelet transform-filtered features, may be more indicative of parotid tumour heterogeneity than other features (31).
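The low- and high-pass decomposition mentioned above can be illustrated with one level of a one-dimensional Haar transform. The Haar wavelet is chosen here only because it is the simplest case; the text does not state which wavelet family was used, and the function name and sample signals are hypothetical.

```python
import math


def haar_step(signal):
    """One level of the 1D Haar transform on an even-length signal.

    Returns (approx, detail): the low-pass output (scaled pairwise sums) and
    the high-pass output (scaled pairwise differences).
    """
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


# Toy intensity profiles along one image row (not study data).
flat_approx, flat_detail = haar_step([5.0, 5.0, 5.0, 5.0])      # homogeneous region
edge_approx, edge_detail = haar_step([4.0, 6.0, 10.0, 12.0])    # varying region
```

A homogeneous region yields zero detail coefficients, whereas local intensity changes appear in the high-pass output, which is one way to see why wavelet-filtered features can emphasise tumour heterogeneity.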

In radiomics analysis, selecting valid and appropriate modelling classifiers is crucial for developing robust predictive models. Different classifiers mean different model algorithms and may lead to different diagnostic performances. Therefore, five frequently utilized machine learning classifiers were investigated in this study, namely, LR, KNN, RF, Gnb and SVM. LR is one of the most commonly used binary classification algorithms. The principle of KNN is that if most of the k-nearest samples near a sample belong to a certain category, the sample also belongs to this category. Its advantage is that it is insensitive to outliers. RF is an ensemble algorithm with multiple decision trees. Its advantages include high accuracy and resistance to overfitting, and its disadvantage is its high computational cost. The mechanism of SVM is to build a decision boundary between two classes to predict labels from one or more feature vectors. SVM is powerful in analysing complex datasets, but its complexity can make overfitting difficult to prevent. Finally, Gnb is a relatively simple algorithm but performs well on small-scale data. In our results, the classifier with the best diagnostic performance for each group was different. The classifiers with the highest AUCs in the comparisons of PA and MPGTs, WT and MPGTs, BCA and MPGTs, PA and WT, PA and BCA, and BCA and WT were RF, SVM, Gnb, LR, LR and RF, respectively. In addition, after analysis by DeLong’s test, in the comparisons of BCA and MPGTs, PA and BCA, and BCA and WT, there were no significant differences in AUC between the different classifiers. In the comparisons of MPGTs and PAs, MPGTs and WTs, and PAs and WTs, the AUC of the best classifier differed significantly from those of only some of the other classifiers. This was different from previous studies, which suggested that the performance of SVM was superior to that of other machine learning classifiers for total diagnostic accuracy (23, 25).

We suggest that different classification models have their own advantages for different tasks. The performance of the radiomics model may depend more on the characteristics of the classifier algorithm and how well the classifiers match the model target tumour. Moreover, the different methods in radiomics feature extraction and selection would influence the final selected features and affect the diagnostic efficiency of models constructed with different classifiers. In our study, the results indicated that the key radiomics features among the different parotid tumours varied, so the selected classifier in the model with the best diagnostic efficacy was different. However, for the prediction efficiency of some parotid tumours, there seems to be no significant difference in the selection of classifiers. The results of our study could be a good reference in guiding the selection of the most appropriate classifiers for constructing different parotid gland tumour radiomics models.
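As a concrete illustration of one of the classifiers discussed above, the KNN majority-vote principle can be written in a few lines. This is a sketch rather than the implementation used in the study; the Euclidean distance, the value k = 3, the function name and the toy feature vectors are all assumptions.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Assign the query to the majority class among its k nearest training samples."""
    distances = []
    for features, label in zip(train_X, train_y):
        # Euclidean distance between the query and one training sample.
        d = math.sqrt(sum((a - b) ** 2 for a, b in zip(features, query)))
        distances.append((d, label))
    distances.sort()
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


# Toy two-feature data (not study data): two well-separated groups.
train_X = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [5.0, 5.0], [5.0, 6.0], [6.0, 5.0]]
train_y = ["PA", "PA", "PA", "WT", "WT", "WT"]
label_near_first_group = knn_predict(train_X, train_y, [0.5, 0.5])
label_near_second_group = knn_predict(train_X, train_y, [5.5, 5.5])
```

Because the prediction depends only on distances between feature vectors, the z score normalization applied to the features matters for KNN more than for tree-based models such as RF.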

There were several limitations in this study. First, potential selection bias may have occurred due to the retrospective nature of our study design. Second, the patients were enrolled from a single centre; thus, multicentre studies with much larger patient cohorts are necessary. Third, although our study included a large number of patients, the more detailed patient classification resulted in small numbers of cases in each group, especially the BCA group, so our study is still limited by the small number of samples in our dataset. Follow-up studies with larger sample sizes are needed. Fourth, to ensure that our results encompass different CT manufacturers, the CT-based radiomics features were from four different CT scanners. However, different scanning protocols, especially the fixed mA protocol, might affect the diagnostic performances of the radiomics features. Finally, we used the PyRadiomics package for feature extraction and image preprocessing in this study. Therefore, our results apply only to this package. Since other radiomics software packages may use different preprocessing filters, it is unclear whether our conclusions could apply to these radiomics packages. Regarding future research prospects, many machine learning radiomics studies have tried to predict early recurrence in different carcinomas after resection (32, 33), offering the possibility that radiomics models may also be used to predict recurrence in malignant parotid tumours after resection. Moreover, whether radiomics models could differentiate the inflammatory pathology of the parotid gland from neoplasms has rarely been discussed, and further studies are needed to research these topics.

Conclusion

Based on this study, we propose using noncontrast CT-based radiomics features for the differential diagnosis of PA, WT, BCA and MPGT, as they show good predictive performance for all comparisons except for that of PA and BCA. Our findings suggest that noncontrast CT radiomics analysis can be used as an additional tool to support radiologists in their decision-making in distinguishing different parotid gland tumours.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

All authors contributed to the study conception and design. Study concepts and design were performed by YL and WX. Material preparation, data collection and analysis were performed by YL, QL, ZZ, SW and HL. JQ made substantial contributions to data acquisition and interpretation. The first draft of the manuscript was written by YL, and all authors commented on previous versions of the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1118351/full#supplementary-material

References

1. Moore MG, Yueh B, Lin DT, Bradford CR, Smith RV, Khariwala SS. Controversies in the workup and surgical management of parotid neoplasms. Otolaryngol Head Neck Surg (2021) 164(1):27–36. doi: 10.1177/0194599820932512

2. Gökce E. Multiparametric magnetic resonance imaging for the diagnosis and differential diagnosis of parotid gland tumors. J Magn Reson Imaging (2020) 52(1):11–32. doi: 10.1002/jmri.27061

3. Comoglu S, Ozturk E, Celik M, Avci H, Sonmez S, Basaran B, et al. Comprehensive analysis of parotid mass: A retrospective study of 369 cases. Auris Nasus Larynx (2018) 45(2):320–7. doi: 10.1016/j.anl.2017.04.003

4. Quer M, Vander Poorten V, Takes RP, Silver CE, Boedeker CC, de Bree R, et al. Surgical options in benign parotid tumors: A proposal for classification. Eur Arch Otorhinolaryngol (2017) 274(11):3825–36. doi: 10.1007/s00405-017-4650-4

5. Hurry KJ, Karunaratne D, Westley S, Booth A, Ramesar K, Zhang TT, et al. Ultrasound-guided core biopsy in the diagnosis of parotid neoplasia: An overview and update with a review of the literature. Br J Radiol (2022) 95(1130):20210972. doi: 10.1259/bjr.20210972

6. Teresi LM, Lufkin RB, Wortham DC, Abemayor E, Hanafee WN. Parotid masses: MR imaging. Radiology (1987) 163(2):405–9. doi: 10.1148/radiology.163.2.3562818

7. Kessler AT, Bhatt AA. Review of the major and minor salivary glands, part 2: Neoplasms and tumor-like lesions. J Clin Imaging Sci (2018) 8:48. doi: 10.4103/jcis.JCIS_46_18

8. Yuan Y, Tang W, Tao X. Parotid gland lesions: Separate and combined diagnostic value of conventional MRI, diffusion-weighted imaging and dynamic contrast-enhanced MRI. Br J Radiol (2016) 89(1060):20150912. doi: 10.1259/bjr.20150912

9. Yabuuchi H, Matsuo Y, Kamitani T, Setoguchi T, Okafuji T, Soeda H, et al. Parotid gland tumors: can addition of diffusion-weighted MR imaging to dynamic contrast-enhanced MR imaging improve diagnostic accuracy in characterization? Radiology (2008) 249(3):909–16. doi: 10.1148/radiol.2493072045

10. Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images are more than pictures, they are data. Radiology (2016) 278(2):563–77. doi: 10.1148/radiol.2015151169

11. Li Q, Jiang T, Zhang C, Zhang Y, Huang Z, Zhou H. A nomogram based on clinical information, conventional ultrasound and radiomics improves prediction of malignant parotid gland lesions. Cancer Lett (2022) 527:107–14. doi: 10.1016/j.canlet.2021.12.015

12. Zheng YM, Xu WJ, Hao DP, Liu XJ, Gao CP, Tang GZ, et al. A CT-based radiomics nomogram for differentiation of lympho-associated benign and malignant lesions of the parotid gland. Eur Radiol (2021) 31(5):2886–95. doi: 10.1007/s00330-020-07421-4

13. Song LL, Chen SJ, Chen W, Shi Z, Wang XD, Song LN, et al. Radiomic model for differentiating parotid pleomorphic adenoma from parotid adenolymphoma based on MRI images. BMC Med Imaging (2021) 21(1):54. doi: 10.1186/s12880-021-00581-9

14. Vernuccio F, Arnone F, Cannella R, Verro B, Comelli A, Agnello F, et al. Diagnostic performance of qualitative and radiomics approach to parotid gland tumors: which is the added benefit of texture analysis? Br J Radiol (2021) 94(1128):20210340. doi: 10.1259/bjr.20210340

15. Gabelloni M, Faggioni L, Attanasio S, Vani V, Goddi A, Colantonio S, et al. Can magnetic resonance radiomics analysis discriminate parotid gland tumors? A pilot study. Diagnostics (Basel) (2020) 10(11):900. doi: 10.3390/diagnostics10110900

16. Piludu F, Marzi S, Ravanelli M, Pellini R, Covello R, Terrenato I, et al. MRI-Based radiomics to differentiate between benign and malignant parotid tumors with external validation. Front Oncol (2021) 11:656918. doi: 10.3389/fonc.2021.656918

17. Wen BH, Zhang ZX, Zhu J, Liu L, Li YH, Huang HY, et al. Apparent diffusion coefficient map-based radiomics features for differential diagnosis of pleomorphic adenomas and warthin tumors from malignant tumors. Front Oncol (2022) 12:830496. doi: 10.3389/fonc.2022.830496

18. Qi J, Gao A, Ma X, Song Y, Zhao G, Bai J, et al. Differentiation of benign from malignant parotid gland tumors using conventional MRI based on radiomics nomogram. Front Oncol (2022) 12:937050. doi: 10.3389/fonc.2022.937050

19. Zhan PC, Lyu PJ, Li Z, Liu X, Wang HX, Liu NN, et al. CT-based radiomics analysis for noninvasive prediction of perineural invasion of perihilar cholangiocarcinoma. Front Oncol (2022) 12:900478. doi: 10.3389/fonc.2022.900478

20. Wang J, Xiong X, Ye J, Yang Y, He J, Liu J, et al. A radiomics nomogram for classifying hematoma entities in acute spontaneous intracerebral hemorrhage on non-contrast-Enhanced computed tomography. Front Neurosci (2022) 16:837041. doi: 10.3389/fnins.2022.837041

21. Tomaszewski MR, Gillies RJ. The biological meaning of radiomic features. Radiology (2021) 298(3):505–16. doi: 10.1148/radiol.2021202553

22. Zheng YL, Zhou D, Liu H, Wen M. CT-based radiomics analysis of different machine learning models for differentiating benign and malignant parotid tumors. Eur Radiol (2022) 32(10):6953–64. doi: 10.1007/s00330-022-08830-3

23. Xu Y, Shu Z, Song G, Liu Y, Pang P, Wen X, et al. The role of preoperative computed tomography radiomics in distinguishing benign and malignant tumors of the parotid gland. Front Oncol (2021) 11:634452. doi: 10.3389/fonc.2021.634452

24. Xu Z, Jin Y, Wu W, Wu J, Luo B, Zeng C, et al. Machine learning-based multiparametric traditional multislice computed tomography radiomics for improving the discrimination of parotid neoplasms. Mol Clin Oncol (2021) 15(5):245. doi: 10.3892/mco.2021.2407

25. Zheng YL, Zheng YN, Li CF, Gao JN, Zhang XY, Li XY, et al. Comparison of different machine models based on multi-phase computed tomography radiomic analysis to differentiate parotid basal cell adenoma from pleomorphic adenoma. Front Oncol (2022) 12:889833. doi: 10.3389/fonc.2022.889833

26. Sood S, McGurk M, Vaz F. Management of salivary gland tumours: United Kingdom national multidisciplinary guidelines. J Laryngol Otol (2016) 130(S2):S142–9. doi: 10.1017/S0022215116000566

27. Jang M, Park D, Lee SR, Hahm CK, Kim Y, Kim Y, et al. Basal cell adenoma in the parotid gland: CT and MR findings. AJNR Am J Neuroradiol (2004) 25(4):631–5.

28. Florez E, Fatemi A, Claudio PP, Howard CM. Emergence of radiomics: Novel methodology identifying imaging biomarkers of disease in diagnosis, response, and progression. SM J Clin Med Imaging (2018) 4(1):1019.

29. Jiang Z, Yin J, Han P, Chen N, Kang Q, Qiu Y, et al. Wavelet transformation can enhance computed tomography texture features: A multicenter radiomics study for grade assessment of COVID-19 pulmonary lesions. Quant Imaging Med Surg (2022) 12(10):4758–70. doi: 10.21037/qims-22-252

30. Granata V, Fusco R, De Muzio F, Cutolo C, Mattace Raso M, Gabelloni M, et al. Radiomics and machine learning analysis based on magnetic resonance imaging in the assessment of colorectal liver metastases growth pattern. Diagnostics (Basel) (2022) 12(5):1115. doi: 10.3390/diagnostics12051115

31. Wang Y, Shen L, Jin J, Wang G. Application and clinical value of machine learning-based cervical cancer diagnosis and prediction model in adjuvant chemotherapy for cervical cancer: A single-center, controlled, non-arbitrary size case-control study. Contrast Media Mol Imaging (2022) 2022:2432291. doi: 10.1155/2022/2432291

32. Ji GW, Zhu FP, Xu Q, Wang K, Wu MY, Tang WW, et al. Machine-learning analysis of contrast-enhanced CT radiomics predicts recurrence of hepatocellular carcinoma after resection: A multi-institutional study. EBioMedicine (2019) 50:156–65. doi: 10.1016/j.ebiom.2019.10.057

33. Xu X, Wang H, Guo Y, Zhang X, Li B, Du P, et al. Study progress of noninvasive imaging and radiomics for decoding the phenotypes and recurrence risk of bladder cancer. Front Oncol (2021) 11:704039. doi: 10.3389/fonc.2021.704039

Keywords: radiomics, computed tomography, tumor, differentiation, parotid gland (PG)

Citation: Lu Y, Liu H, Liu Q, Wang S, Zhu Z, Qiu J and Xing W (2023) CT-based radiomics with various classifiers for histological differentiation of parotid gland tumors. Front. Oncol. 13:1118351. doi: 10.3389/fonc.2023.1118351

Received: 07 December 2022; Accepted: 23 February 2023;
Published: 10 March 2023.

Edited by:

Francesco Ricchetti, Sacro Cuore Don Calabria Hospital (IRCCS), Italy

Reviewed by:

Shreya Shukla, Tata Memorial Hospital, India
Lorenzo Faggioni, University of Pisa, Italy

Copyright © 2023 Lu, Liu, Liu, Wang, Zhu, Qiu and Xing. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wei Xing, suzhxingwei@suda.edu.cn
