Development and external validation of the multichannel deep learning model based on unenhanced CT for differentiating fat-poor angiomyolipoma from renal cell carcinoma: a two-center retrospective study

Yao, Haohua; Tian, Li; Liu, Xi; Li, Shurong; Chen, Yuhang; Cao, Jiazheng; Zhang, Zhiling; Chen, Zhenhua; Feng, Zihao; Xu, Quanhui; Zhu, Jiangquan; Wang, Yinghan; Guo, Yan; Chen, Wei; Li, Caixia; Li, Peixing; Wang, Huanjun; Luo, Junhang

doi:10.1007/s00432-023-05339-0

Development and external validation of the multichannel deep learning model based on unenhanced CT for differentiating fat-poor angiomyolipoma from renal cell carcinoma: a two-center retrospective study

Research
Open access
Published: 06 September 2023

Volume 149, pages 15827–15838, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Cancer Research and Clinical Oncology Aims and scope Submit manuscript

Development and external validation of the multichannel deep learning model based on unenhanced CT for differentiating fat-poor angiomyolipoma from renal cell carcinoma: a two-center retrospective study

Download PDF

Haohua Yao^1,2^na1,
Li Tian³^na1,
Xi Liu¹^na1,
Shurong Li⁴^na1,
Yuhang Chen¹,
Jiazheng Cao⁵,
Zhiling Zhang⁶,
Zhenhua Chen¹,
Zihao Feng¹,
Quanhui Xu¹,
Jiangquan Zhu¹,
Yinghan Wang¹,
Yan Guo⁴,
Wei Chen¹,
Caixia Li⁷,
Peixing Li⁷,
Huanjun Wang⁴ &
…
Junhang Luo¹

1005 Accesses
Explore all metrics

Abstract

Purpose

There are undetectable levels of fat in fat-poor angiomyolipoma. Thus, it is often misdiagnosed as renal cell carcinoma. We aimed to develop and evaluate a multichannel deep learning model for differentiating fat-poor angiomyolipoma (fp-AML) from renal cell carcinoma (RCC).

Methods

This two-center retrospective study included 320 patients from the First Affiliated Hospital of Sun Yat-Sen University (FAHSYSU) and 132 patients from the Sun Yat-Sen University Cancer Center (SYSUCC). Data from patients at FAHSYSU were divided into a development dataset (n = 267) and a hold-out dataset (n = 53). The development dataset was used to obtain the optimal combination of CT modality and input channel. The hold-out dataset and SYSUCC dataset were used for independent internal and external validation, respectively.

Results

In the development phase, models trained on unenhanced CT images performed significantly better than those trained on enhanced CT images based on the fivefold cross-validation. The best patient-level performance, with an average area under the receiver operating characteristic curve (AUC) of 0.951 ± 0.026 (mean ± SD), was achieved using the “unenhanced CT and 7-channel” model, which was finally selected as the optimal model. In the independent internal and external validation, AUCs of 0.966 (95% CI 0.919–1.000) and 0.898 (95% CI 0.824–0.972), respectively, were obtained using the optimal model. In addition, the performance of this model was better on large tumors (≥ 40 mm) in both internal and external validation.

Conclusion

The promising results suggest that our multichannel deep learning classifier based on unenhanced whole-tumor CT images is a highly useful tool for differentiating fp-AML from RCC.

Multi-modality radiomics model predicts axillary lymph node metastasis of breast cancer using MRI and mammography

Article 10 February 2024

Discrimination between HER2-overexpressing, -low-expressing, and -zero-expressing statuses in breast cancer using multiparametric MRI-based radiomics

Article 16 February 2024

Longitudinal ultrasound-based AI model predicts axillary lymph node response to neoadjuvant chemotherapy in breast cancer: a multicenter study

Article Open access 10 May 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Renal angiomyolipoma (AML) is a form of benign solid tumor that is composed of fat, smooth muscle, and abnormal blood vessels in varying proportions. Because fat can show negative attenuation values on unenhanced computed tomography (CT) images, AML can be accurately diagnosed by detecting fat within tumors (Nelson and Sanda 2002; Jinzaki et al. 2014). However, fat-poor angiomyolipoma (fp-AML), a special type of AML, contains undetectable levels of fat or may be devoid of fat altogether, and is often misdiagnosed as renal cell carcinoma (RCC) (Fujii et al. 2008; Takahashi and Kawashima 2012; Jinzaki et al. 2014; Park 2017).

In practice, patients with fp-AML are always treated as RCC. Some patients with fp-AML that did not require nephrectomy were misdiagnosed as RCC and underwent radical nephrectomy; some patients with small fp-AML that did not require surgery underwent partial nephrectomy. A report from the Cleveland Clinic suggested that 55% of 219 patients with AML who underwent surgery were suspected to have RCC by the preoperative imaging examination (Lane et al. 2008). Patients with fp-AML could avoid unnecessary surgery, especially radical nephrectomy, if an accurate diagnosis can be obtained prior to surgery (Schachter et al. 2007; Campbell et al. 2021). Consequently, there is an urgent need to develop a novel strategy to accurately identify fp-AML before surgery.

Artificial intelligence (AI), including machine learning and deep learning, has become the focus of the medical field to assist diagnosis and provide clinical decision support (Oh and Jung 2004; Schmidhuber 2015; Tandel et al. 2019; Rezaeijo et al. 2022; Taghizadeh et al. 2022). Currently, AI has been increasingly applied in the analysis of medical images, and their potential has been demonstrated not only for disease screening (Long et al. 2017; Xia et al. 2018; Fu et al. 2020; Jahangirimehr et al. 2022), but also for the diagnosis and treatment of difficult cases (Anthimopoulos et al. 2016; Lu et al. 2018; Kavur et al. 2020; Castillo et al. 2022; Cui et al. 2022; Salmanpour et al. 2023). However, there are few studies using AI to assist in the identification of fp-AML and RCC (Hodgdon et al. 2015; Lee et al. 2017, 2018; Feng et al. 2018; Cui et al. 2019; Yang et al. 2020). Reviewing these previous studies, they have limitations such as small sample size, low accuracy, or lack of external validation.

The purpose of this study was to develop a multichannel deep learning model, which is trained using CT images of whole tumors, to classify fp-AML and three common pathological subtypes of RCC: clear cell renal cell carcinoma (ccRCC), papillary renal cell carcinoma (pRCC), and chromophobe renal cell carcinoma (chRCC).

Methods

Patient cohort

We reviewed the medical records of patients with solid renal masses that were histologically diagnosed as AML, ccRCC, pRCC and chRCC at the First Affiliated Hospital of Sun Yat-Sen University (FAHSYSU) from January 2014 to August 2021 and at the Sun Yat-Sen University Cancer Center (SYSUCC) from January 2017 to June 2020. The data imbalance between patients with ccRCC and patients with the other pathological categories might affect the training of the model. Thus, data from the patients with ccRCC at both institutions were randomly downsampled by approximately 20%. Patients were excluded using the following criteria: (1) incorrect anatomic specimen location; (2) no available preoperative CT; (3) incomplete 4-phase CT scanning (unenhanced, corticomedullary, nephrographic, and excretory phases); (4) CT slice thickness > 1 mm; (5) poor image quality (such as motion artifacts or metal artifacts); and (6) more than 2 primary tumors. For the fp-AML group, patients with macroscopic fat within the tumor were excluded. For the RCC group, patients were excluded if the target lesion was primarily cystic. Finally, 60 fp-AML patients and 260 RCC patients (ccRCC, 135; pRCC, 62; chRCC, 63) from FAHSYSU and 31 fp-AML patients and 101 RCC patients (ccRCC, 58; pRCC, 24; chRCC, 19) from SYSUCC were enrolled in this study (Fig. 1a, b). The study was approved by the Ethics Committee of the First Affiliated Hospital of Sun Yat-Sen University (IIT-2022-678), and the requirement for individual consent for this retrospective analysis was waived.

CT image preprocessing and labeling

All patients underwent CT examination using multi-slice spiral CT scanners (FAHSYSU: Aquilion 64, Toshiba, Tokyo, Japan; SYSUCC: Somatom Force, Siemens Healthineers, Forchheim, Germany). The unenhanced and corticomedullary enhanced CT slices from each patient were downloaded in digital imaging and communications in medicine (DICOM) format and converted to joint photographic experts group (JPEG) data with a resolution of 512 × 512 and a window of 40 × 300 (level × width) before labeling. CT slices without a target renal mass were excluded.

We cropped out the region of interest (ROI), a rectangular box, at the tumor location in each CT image, ensuring that the border of the rectangular box was close to the tumor. The boundary of the rectangular box was determined by two experienced radiologists, and a third radiologist was consulted in the case of disagreement. Once the boundary of the rectangle was determined, the image was assigned an ID, and the coordinates of the rectangular box and the pathological type of the tumor were recorded.

Deep learning model development

The FAHSYSU dataset was divided into a development dataset (n = 267) and a hold-out dataset (n = 53). The development dataset was used to evaluate the model performance with different numbers of input channels and different CT modalities. The hold-out dataset was used for independent internal validation (Fig. 1c).

Our deep learning model was an end-to-end multichannel convolutional neural network (CNN) based on Xception architecture (Chollet 2017). We established 10 models with different numbers of input channels, from 1 to 10 channels. These models with different numbers of input channels were trained on enhanced and unenhanced whole-tumor CT images. Hence, there were 20 combinations of input channels and CT modalities. Fivefold cross-validation was used for training and validation for each combination. Here, the development dataset was split into five partitions before training, keeping the fp-AML and RCC labels balanced between partitions. In each iteration of cross-validation, one partition was utilized for validation, while the other four partitions were used for model training. Training and validation sets were always split on the patient level so that no CT slices from the same patient were ever part of a training set and a validation set.

The input to the CNN model was composed of single or multiple consecutive CT slices from whole tumors depending on the number of input channels. A patient’s whole-tumor CT slices were processed by the model to obtain multiple image-level predictions. Using these image-level predictions, we calculated a patient-level score for each patient, as shown in Fig. 2. The patient-level scores helped us to divide patients into two defined clinical categories: fp-AML and RCC. The optimal classification threshold was determined by receiver operating characteristic (ROC) curve analysis in a manner that maximized the Youden index.

The performance of the model was evaluated at the image level and the patient level. The categorical accuracy (ACC) metric was used to evaluate the model at the image level. The area under the ROC curve (AUC), sensitivity and specificity were used to evaluate the patient-level performance of the model. In fivefold cross-validation, the average of the evaluation metrics across all five folds represents the overall performance.

All models were trained on a GeForce RTX 2080 Ti (NVIDIA) graphics processing unit and built using Python 3.7 and PyTorch 1.7. The input resolution of Xception was reduced from a matrix size of 299 × 299 to 171 × 171. Following the application of Xception, the softmax function was used to create a probability distribution over two classes; the class with the higher probability was selected as the output. The cross-entropy function was selected as the loss function. For data augmentation, we augmented the training samples by randomly flipping and scaling the images. Based on numerous preliminary experiments, training was performed using the Adam optimizer for 50 epochs, with a batch size of 120 and a learning rate of 0.01. The dropout probability before the final fully connected layer was set as 0.2. In each batch of training, CT images of fp-AML were dynamically oversampled to strike a balance between the RCC samples and the fp-AML samples. In each fold of training, the weights were saved as the best-performing weights if the ACC performed best in the validation set.

Independent internal validation and external validation

In the model development stage, the optimal combination of input channel number and CT modality was obtained according to the performance of the model at the patient level in cross-validation. Keeping the other hyperparameters the same as in the development phase, the model was trained using all patients in the development dataset without any tuning. Then, the model was validated with 53 unseen patients from the hold-out dataset. Furthermore, we externally validated the model using subjects from the SYSUCC cohort to evaluate the generalizability of the model. Again, we evaluated the performance of the model using ACC at the image level and the ROC curves, AUC, sensitivity, and specificity at the patient level.

Statistical analysis

All statistical analyses were performed with SPSS software (version 22.0, SPSS, Inc., Chicago, IL, U.S.A.) and R statistical software (version 3.5.6, The R Foundation for Statistical Computing). The Pearson chi-square test or Fisher’s exact test was used to assess the distribution of categorical variables, and the independent T test was used for continuous variables. With the number of input channel as the pairing factor, the paired T test was conducted to compare the performance of the model based on enhanced CT images and unenhanced CT images in the fivefold cross-validation. The threshold for statistical significance was set as p < 0.05. In addition, 95% confidence intervals for AUC values were obtained via bootstrapping with 1000 iterations.

Results

Patient characteristics

This retrospective study included a total of 320 individuals from FAHSYSU and 132 individuals from SYSUCC. Patient characteristics are shown in Table 1. The proportion of patients with fp-AML, approximately 20%, was similar in both cohorts (FAHSYSU: 19%; SYSUCC: 23%). The RCC group included a mixture of ccRCC (42% vs. 44%), pRCC (19% vs. 18%) and chRCC (20% vs. 15%), with proportions well balanced amongst the two cohorts. In terms of age, sex, tumor size, location, or appearance, there were no significant differences between the two cohorts.

Table 1 Patient characteristics of the FAHSYSU cohort and SYSUCC cohort

Full size table

Development phase and cross-validation

Table 2 shows the fivefold cross-validation results of the 20 combinations in the model development phase. As shown in Fig. 3a, b, scatter plots were used to show the model's performance at the image-level and patient-level in each fold. The paired T test was used to compare the mean values of image-level ACC and patient-level AUC of models trained on different CT modalities in the fivefold cross-validation (Fig. 3c, d). Overall, the models trained on unenhanced CT images performed better than those trained on enhanced CT images, both at the image level (p < 0.001) and at the patient level (p < 0.001).

Table 2 The cross-validation results of 20 models with different combinations

Full size table

In terms of image-level performance, the worst ACC of the unenhanced CT models was 0.886 ± 0.021, while the best ACC of the enhanced CT models was only 0.858 ± 0.025. In the unenhanced CT models, ACC increased with the increase in the number of input channels, while ACC was unstable in the enhanced CT models.

In terms of patient-level performance, the unenhanced CT models also outperformed the enhanced CT models, especially in terms of AUC and specificity. The lowest AUC of the unenhanced CT models was 0.897 ± 0.046, while the highest AUC of the enhanced CT models was 0.806 ± 0.071. Among all model combinations, the “unenhanced CT and 7-channel” model achieved the best AUC of 0.951 ± 0.026 with a sensitivity of 0.903 ± 0.026 and a specificity of 0.960 ± 0.049. Based on the AUC performance, we finally selected the “unenhanced CT and 7-channel” model as the optimal combination of our multichannel deep learning model for independent internal and external validation.

Independent internal validation and external validation

In independent internal validation (hold-out dataset), the image-level ACC of the “unenhanced CT and 7-channel” model was 0.921. At the patient level, the AUC reached 0.966 [95% confidence interval (CI) 0.919–1.000] with a sensitivity of 0.930 and a specificity of 1.000. In external validation (SYSUCC dataset), the image-level ACC was 0.865. At the patient level, the AUC was 0.898 (95% CI 0.824–0.972), with a sensitivity of 0.802 and a specificity of 0.903 (Fig. 4a). Compared with internal validation, both the image-level performance and the patient-level performance of the model decreased in external validation.

Furthermore, we compared the performance of the model when the maximum tumor diameter was < 40 mm and ≥ 40 mm. As shown in Table 3, the “unenhanced CT and 7-channel” model performed better when the maximum tumor diameter was ≥ 40 mm in both internal validation (AUC 1.000 [95% CI 1.000–1.000] vs. 0.942 [95% CI 0.863–1.000]) and external validation (AUC, 0.973 [95% CI 0.932–1.000] vs. 0.873 [95% CI 0.776–0.970]). The confusion matrix for the independent internal validation and external validation is shown in Fig. 4b.

Table 3 Performance of the “unenhanced CT and 7-channel” model on the hold-out and SYSUCC datasets

Full size table

In addition, we applied t-distributed stochastic neighbor embedding (t-SNE) to visualize the basis of classification. The t-SNE visualization showed that the dots in the SYSUCC dataset were significantly more scattered than those in the hold-out dataset, which was consistent with the performance of the model on the image-level ACC (Fig. 5a, b). Furthermore, one case of fp-AML (Fig. 6a) and one case of RCC (Fig. 6b) from the SYSUCC dataset were selected to demonstrate the diagnostic performance of the model. According to the class activation mapping (CAM) images of the ROI, the region of high predictive value was located in the center of the tumor, whether fp-AML or RCC. This indicates that information about the central region of the tumor was critical for the model to discriminate between fp-AML and RCC.

Discussion

In this study, we collected CT data from 91 patients with fp-AML and 361 patients with RCC from two centers. The multichannel deep learning model based on whole-tumor unenhanced CT images developed in our study achieved an AUC of 0.966 (95% CI 0.919–1.000) in independent internal validation. We further evaluated the generalization performance of the model with an external dataset and obtained an AUC of 0.898 (95% CI 0.824–0.972). Moreover, our model performed better with large tumors (≥ 40 mm) than with small tumors (< 40 mm) in both internal and external validation. This indicates that larger tumors can provide more CT slices and thus provide more useful information to our multichannel deep learning model.

To our best knowledge, our study enrolled a larger number of patients and achieved higher accuracy than any previous study (Hodgdon et al. 2015; Lee et al. 2017, 2018; Feng et al. 2018; Cui et al. 2019; Yang et al. 2020). Also, the present study is the first to evaluate the generalizability of the model using a dataset from an external center. In terms of algorithms, deep learning was used in this study, while machine learning was used in previous studies. The use of end-to-end training and prediction removes the need for deep learning algorithms, such as CNNs, to involve burdensome feature engineering. These engineering features are mainly hidden in numerous layers of a CNN, and learned from data using a general-purpose learning procedure. Another advantage of deep learning is its ability to better fit large datasets. Therefore, facing the growing amount of medical data, deep learning has great potential.

It is noteworthy that unenhanced CT images were more suitable for distinguishing fp-AML from RCC than enhanced CT images, according to the results of fivefold cross-validation in the development phase. Compared with tri-phase enhanced scans, the measurements acquired from unenhanced scans are more stable. The quality of enhancement CT imaging is often affected by a variety of factors, including renal function, the concentration of the contrast medium, and the scanning protocol used. Collectively, these factors make it difficult to reproduce results derived from enhancement measurements and have even led to different conclusions in the earlier literature (Kim et al. 2004; Zhang et al. 2007; Yang et al. 2013).

In practice, it is noticed that radiologists always need to consider the continuity between adjacent slices when analyzing CT images. The multichannel CNN was designed to simulate the behavior of radiologists reading CT images. By increasing the number of input channels of the CNN, we can input multiple continuous CT images into the model at the same time. Several studies have confirmed that deep learning models based on multi-slice CT images perform better than those based on single-slice CT images (Zhang et al. 2018; La Greca Saint-Esteven et al. 2022; Takao et al. 2022). Our study also confirms the correctness of the above view that appropriately increasing the number of input channels can improve the performance of the model. This may benefit from the structure of the multichannel model, which allows us to deliver more effective information to the CNN in one input. However, as the number of input channels increases, the computational load of the model also increases. Therefore, the number of input channels should be within a reasonable range to avoid excessive computational load.

Furthermore, whole-tumor CT images were used to train the model in this study. In some early CT image-based AI studies (Yan et al. 2015; Feng et al. 2018), only one or several representative CT slices of each tumor were used for feature extraction and model training. The process of CT slice selection is clearly subjective and could lead to instability. Critical information may be missed in the selected slices, thus affecting the performance of the model. Here, the use of whole-tumor CT slices allowed us to avoid such problems while also making our dataset suitable for the training needs of multichannel CNN.

The performance of our model decreased in the external validation compared to the internal validation in this study. In our opinion, both the hardware and software of the CT scanner have an impact on model performance. Model performance may also be affected by different CT scanning protocols. What's more, the number of samples may not be sufficiently representative of the whole population, especially in the case of rare categories. As there may be other reasons for the degradation of the model performance in the external dataset, this is a further interesting and worthwhile topic to be investigated.

While promising results have been obtained from our multichannel deep learning model, there do exist several limitations. Firstly, CT images of the nephrographic and excretory phase were not used for model training in this study. This is mainly due to the long-time span of cases that we reviewed. In most of the early cases, the image quality of nephrographic and excretory phase is far inferior to that of the corticomedullary phase in terms of the thickness of the reconstruction, tumor scan integrity, and scan timing. Secondly, the use of whole-tumor CT slices can make the labeling of ROIs burdensome. Developing an accurate and stable automatic segmentation method for tumor ROIs will help to improve the practicability of our classification model. Moreover, a larger multi-center validation study will be needed to further assess the robustness of the model across populations. Also, given the retrospective nature of this study, a prospective study may be required to evaluate the clinical value of our model.

In conclusion, the present study demonstrated that a multichannel deep learning model based on whole-tumor unenhanced CT images represents a highly useful tool for differentiating fp-AML from RCC. This tool may improve the accuracy of preoperative diagnosis for patients with renal masses and therefore facilitate the clinical decision-making process.

Data availability

The CT imaging data and clinical information in the current study are not publicly available due to patient privacy obligations but are available from the corresponding authors on reasonable request.

References

Anthimopoulos M, Christodoulidis S, Ebner L, Christe A, Mougiakakou S (2016) Lung pattern classification for interstitial lung diseases using a deep convolutional neural network. IEEE Trans Med Imaging 35(5):1207–1216. https://doi.org/10.1109/Tmi.2016.2535865
Article PubMed Google Scholar
Campbell SC, Clark PE, Chang SS, Karam JA, Souter L, Uzzo RG (2021) Renal mass and localized renal cancer: evaluation, management, and follow-up: AUA guideline: part I. J Urol 206(2):199–208. https://doi.org/10.1097/JU.0000000000001911
Article PubMed Google Scholar
Castillo TJMM, Arif M, Starmans MPA, Niessen WJ, Bangma CH, Schoots IG et al (2022) Classification of clinically significant prostate cancer on multi-parametric MRI: a validation study comparing deep learning and radiomics. Cancers. https://doi.org/10.3390/cancers14010012
Article PubMed PubMed Central Google Scholar
Chollet F (2017) Xception: deep learning with depthwise separable convolutions. In: 30th IEEE conference on computer vision and pattern recognition (Cvpr 2017), pp 1800–1807. https://doi.org/10.1109/Cvpr.2017.195
Cui EM, Lin F, Li Q, Li RG, Chen XM, Liu ZS et al (2019) Differentiation of renal angiomyolipoma without visible fat from renal cell carcinoma by machine learning based on whole-tumor computed tomography texture features. Acta Radiol 60(11):1543–1552. https://doi.org/10.1177/0284185119830282
Article PubMed Google Scholar
Cui Y, Zhang J, Li Z, Wei K, Lei Y, Ren J et al (2022) A CT-based deep learning radiomics nomogram for predicting the response to neoadjuvant chemotherapy in patients with locally advanced gastric cancer: a multicenter cohort study. Eclin Med 46:101348. https://doi.org/10.1016/j.eclinm.2022.101348
Article Google Scholar
Feng Z, Rong P, Cao P, Zhou Q, Zhu W, Yan Z et al (2018) Machine learning-based quantitative texture analysis of CT images of small renal masses: differentiation of angiomyolipoma without visible fat from renal cell carcinoma. Eur Radiol 28(4):1625–1633. https://doi.org/10.1007/s00330-017-5118-z
Article PubMed Google Scholar
Fu Q, Chen Y, Li Z, Jing Q, Hu C, Liu H et al (2020) A deep learning algorithm for detection of oral cavity squamous cell carcinoma from photographic images: a retrospective study. Eclin Med 27:100558. https://doi.org/10.1016/j.eclinm.2020.100558
Article Google Scholar
Fujii Y, Komai Y, Saito K, Iimura Y, Yonese J, Kawakami S et al (2008) Incidence of benign pathologic lesions at partial nephrectomy for presumed RCC renal masses: Japanese dual-center experience with 176 consecutive patients. Urology 72(3):598–602. https://doi.org/10.1016/j.urology.2008.04.054
Article PubMed Google Scholar
Hodgdon T, McInnes MDF, Schieda N, Flood TA, Lamb L, Thornhill RE (2015) Can quantitative CT texture analysis be used to differentiate fat-poor renal angiomyolipoma from renal cell carcinoma on unenhanced CT images? Radiology 276(3):787–796. https://doi.org/10.1148/radiol.2015142215
Article PubMed Google Scholar
Jahangirimehr A, AbdolahiShahvali E, Rezaeijo SM, Khalighi A, Honarmandpour A, Honarmandpour F et al (2022) Machine learning approach for automated predicting of COVID-19 severity based on clinical and paraclinical characteristics: serum levels of zinc, calcium, and vitamin D. Clin Nutr ESPEN 51:404–411. https://doi.org/10.1016/j.clnesp.2022.07.011
Article PubMed PubMed Central Google Scholar
Jinzaki M, Silverman SG, Akita H, Nagashima Y, Mikami S, Oya M (2014) Renal angiomyolipoma: a radiological classification and update on recent developments in diagnosis and management. Abdom Imaging 39(3):588–604. https://doi.org/10.1007/s00261-014-0083-3
Article PubMed PubMed Central Google Scholar
Kavur AE, Gezer NS, Baris M, Sahin Y, Ozkan S, Baydar B et al (2020) Comparison of semi-automatic and deep learning-based automatic methods for liver segmentation in living liver transplant donors. Diagn Interv Radiol 26(1):11–21. https://doi.org/10.5152/dir.2019.19025
Article PubMed Google Scholar
Kim JK, Park SY, Shon JH, Cho KS (2004) Angiomyolipoma with minimal fat: differentiation from renal cell carcinoma at biphasic helical CT. Radiology 230(3):677–684. https://doi.org/10.1148/radiol.2303030003
Article PubMed Google Scholar
La Greca Saint-Esteven A, Bogowicz M, Konukoglu E, Riesterer O, Balermpas P, Guckenberger M et al (2022) A 2.5D convolutional neural network for HPV prediction in advanced oropharyngeal cancer. Comput Biol Med 142:105215. https://doi.org/10.1016/j.compbiomed.2022.105215
Article PubMed Google Scholar
Lane BR, Aydin H, Danforth TL, Zhou M, Remer EM, Novick AC et al (2008) Clinical correlates of renal angiomyolipoma subtypes in 209 patients: classic, fat poor, tuberous sclerosis associated and epithelioid. J Urol 180(3):836–843. https://doi.org/10.1016/j.juro.2008.05.041
Article PubMed Google Scholar
Lee HS, Hong H, Jung DC, Park S, Kim J (2017) Differentiation of fat-poor angiomyolipoma from clear cell renal cell carcinoma in contrast-enhanced MDCT images using quantitative feature classification. Med Phys 44(7):3604–3614. https://doi.org/10.1002/mp.12258
Article CAS PubMed Google Scholar
Lee H, Hong H, Kim J, Jung DC (2018) Deep feature classification of angiomyolipoma without visible fat and renal cell carcinoma in abdominal contrast-enhanced CT images with texture image patches and hand-crafted feature concatenation. Med Phys 45(4):1550–1561. https://doi.org/10.1002/mp.12828
Article PubMed Google Scholar
Long EP, Lin HT, Liu ZZ, Wu XH, Wang LM, Jiang JW et al (2017) An artificial intelligence platform for the multihospital collaborative management of congenital cataracts. Nat Biomed Eng. https://doi.org/10.1038/s41551-016-0024
Article Google Scholar
Lu Y, Yu QY, Gao YX, Zhou YP, Liu GW, Dong Q et al (2018) Identification of metastatic lymph nodes in MR imaging with faster region-based convolutional neural networks. Can Res 78(17):5135–5143. https://doi.org/10.1158/0008-5472.Can-18-0494
Article CAS Google Scholar
Nelson CP, Sanda MG (2002) Contemporary diagnosis and management of renal angiomyolipoma. J Urol 168(4):1315–1325. https://doi.org/10.1016/S0022-5347(05)64440-0
Article PubMed Google Scholar
Oh KS, Jung K (2004) GPU implementation of neural networks. Pattern Recogn 37(6):1311–1314. https://doi.org/10.1016/j.patcog.2004.01.013
Article Google Scholar
Park BK (2017) Renal angiomyolipoma: radiologic classification and imaging features according to the amount of fat. AJR Am J Roentgenol 209(4):826–835. https://doi.org/10.2214/AJR.17.17973
Article PubMed Google Scholar
Rezaeijo SM, JafarpoorNesheli S, Fatan Serj M, Tahmasebi Birgani MJ (2022) Segmentation of the prostate, its zones, anterior fibromuscular stroma, and urethra on the MRIs and multimodality image fusion using U-Net model. Quant Imaging Med Surg 12(10):4786–4804. https://doi.org/10.21037/qims-22-115
Article PubMed PubMed Central Google Scholar
Salmanpour MR, Rezaeijo SM, Hosseinzadeh M, Rahmim A (2023) Deep versus handcrafted tensor radiomics features: prediction of survival in head and neck cancer using machine learning and fusion techniques. Diagnostics (basel). https://doi.org/10.3390/diagnostics13101696
Article PubMed Google Scholar
Schachter LR, Cookson MS, Chang SS, Smith JA, Dietrich MS, Jayaram G et al (2007) Second prize: 2006 endourological society essay competition—frequency of benign renal cortical tumors and histologic subtypes based on size in a contemporary series: what to tell our patients. J Endourol 21(8):819–823. https://doi.org/10.1089/end.2006.9937
Article PubMed Google Scholar
Schmidhuber J (2015) Deep learning in neural networks: an overview. Neural Netw 61:85–117. https://doi.org/10.1016/j.neunet.2014.09.003
Article PubMed Google Scholar
Taghizadeh E, Heydarheydari S, Saberi A, JafarpoorNesheli S, Rezaeijo SM (2022) Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods. BMC Bioinform 23(1):410. https://doi.org/10.1186/s12859-022-04965-8
Article CAS Google Scholar
Takahashi N, Kawashima A (2012) Fat-poor angiomyolipoma and renal cell carcinoma: differentiation with MR imaging and accuracy of histopathologic evaluation response. Radiology 265(3):980–981
Google Scholar
Takao H, Amemiya S, Kato S, Yamashita H, Sakamoto N, Abe O (2022) Deep-learning 2.5-dimensional single-shot detector improves the performance of automated detection of brain metastases on contrast-enhanced CT. Neuroradiology 64(8):1511–1518. https://doi.org/10.1007/s00234-022-02902-3
Article PubMed Google Scholar
Tandel GS, Biswas M, Kakde OG, Tiwari A, Suri HS, Turk M et al (2019) A review on a deep learning perspective in brain cancer classification. Cancers. https://doi.org/10.3390/cancers11010111
Article PubMed Central Google Scholar
Xia Y, Wulan N, Wang KQ, Zhang HG (2018) Detecting atrial fibrillation by deep convolutional neural networks. Comput Biol Med 93:84–92. https://doi.org/10.1016/j.compbiomed.2017.12.007
Article PubMed Google Scholar
Yan L, Liu Z, Wang G, Huang Y, Liu Y, Yu Y et al (2015) Angiomyolipoma with minimal fat: differentiation from clear cell renal cell carcinoma and papillary renal cell carcinoma by texture analysis on CT images. Acad Radiol 22(9):1115–1121. https://doi.org/10.1016/j.acra.2015.04.004
Article PubMed Google Scholar
Yang CW, Shen SH, Chang YH, Chung HJ, Wang JH, Lin AT et al (2013) Are there useful CT features to differentiate renal cell carcinoma from lipid-poor renal angiomyolipoma? Am J Roentgenol 201(5):1017–1028. https://doi.org/10.2214/Ajr.12.10204
Article Google Scholar
Yang R, Wu J, Sun L, Lai S, Xu Y, Liu X et al (2020) Radiomics of small renal masses on multiphasic CT: accuracy of machine learning-based classification models for the differentiation of renal cell carcinoma and angiomyolipoma without visible fat. Eur Radiol 30(2):1254–1263. https://doi.org/10.1007/s00330-019-06384-5
Article PubMed Google Scholar
Zhang J, Lefkowitz RA, Ishill NM, Wang L, Moskowitz CS, Russo P et al (2007) Solid renal cortical tumors: differentiation with CT. Radiology 244(2):494–504. https://doi.org/10.1148/radiol.2442060927
Article PubMed Google Scholar
Zhang Y, Zhang J, Zhao L, Wei X, Zhang Q (2018). Classification of benign and malignant pulmonary nodules based on deep learning. In: 2018 5th international conference on information science and control engineering (ICISCE)

Download references

Funding

This work was supported in part by the National Natural Science Foundation of China (award number: 82373433, 81725016, 81872094, 82272862, 81902576), the National Key Research and Development Program of China (award number: 2016YFC0902600) and the Guangzhou Science and Technology Projects (award number: 202201010910).

Author information

Haohua Yao, Li Tian, Xi Liu and Shurong Li contributed equally to this work.

Authors and Affiliations

Department of Urology, The First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China
Haohua Yao, Xi Liu, Yuhang Chen, Zhenhua Chen, Zihao Feng, Quanhui Xu, Jiangquan Zhu, Yinghan Wang, Wei Chen & Junhang Luo
Department of Urology, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
Haohua Yao
Department of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
Li Tian
Department of Radiology, The First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, China
Shurong Li, Yan Guo & Huanjun Wang
Department of Urology, Jiangmen Central Hospital, Jiangmen, China
Jiazheng Cao
Department of Urology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, China
Zhiling Zhang
School of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou, China
Caixia Li & Peixing Li

Authors

Haohua Yao
View author publications
You can also search for this author in PubMed Google Scholar
Li Tian
View author publications
You can also search for this author in PubMed Google Scholar
Xi Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shurong Li
View author publications
You can also search for this author in PubMed Google Scholar
Yuhang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jiazheng Cao
View author publications
You can also search for this author in PubMed Google Scholar
Zhiling Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenhua Chen
View author publications
You can also search for this author in PubMed Google Scholar
Zihao Feng
View author publications
You can also search for this author in PubMed Google Scholar
Quanhui Xu
View author publications
You can also search for this author in PubMed Google Scholar
Jiangquan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yinghan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yan Guo
View author publications
You can also search for this author in PubMed Google Scholar
Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Caixia Li
View author publications
You can also search for this author in PubMed Google Scholar
Peixing Li
View author publications
You can also search for this author in PubMed Google Scholar
Huanjun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Junhang Luo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JL, HW, HY and LT designed the study. XL, SL and LT retrieved pathological reports and collected the imaging data. JC, ZZ, JZ and YW collected the clinical data. SL, LT, HW and YG labeled the imaging data. HY, YC, ZF, CL and PL performed the deep learning. XL, ZC, QX and WC did the statistical analysis. HY and XL wrote the first draft of the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Huanjun Wang or Junhang Luo.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Ethical approval

The study was approved by the Ethics Committee of the First Affiliated Hospital of Sun Yat-Sen University (IIT-2022-678), and the requirement for individual consent for this retrospective analysis was waived.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yao, H., Tian, L., Liu, X. et al. Development and external validation of the multichannel deep learning model based on unenhanced CT for differentiating fat-poor angiomyolipoma from renal cell carcinoma: a two-center retrospective study. J Cancer Res Clin Oncol 149, 15827–15838 (2023). https://doi.org/10.1007/s00432-023-05339-0

Download citation

Received: 24 July 2023
Accepted: 24 August 2023
Published: 06 September 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s00432-023-05339-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Development and external validation of the multichannel deep learning model based on unenhanced CT for differentiating fat-poor angiomyolipoma from renal cell carcinoma: a two-center retrospective study