The Impact of Edema on MRI Radiomics for the Prediction of Lung Metastasis in Soft Tissue Sarcoma

Casale, Roberto; De Angelis, Riccardo; Coquelet, Nicolas; Mokhtari, Ayoub; Bali, Maria Antonietta

doi:10.3390/diagnostics13193134


Institut Jules Bordet Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, 1070 Brussels, Belgium

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

Diagnostics 2023, 13(19), 3134; https://doi.org/10.3390/diagnostics13193134

Submission received: 11 August 2023 / Revised: 3 September 2023 / Accepted: 25 September 2023 / Published: 5 October 2023

(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)



Highlights

What are the main findings?

Both the model utilizing edema-related features and the model utilizing mass-related features demonstrated promising results in predicting the occurrence of lung metastases, with similar performances.

What is the implication of the main finding?

The findings suggest that the analysis of radiomic features extracted exclusively from edema can offer valuable insights into the prediction of lung metastases.

Abstract

Introduction: This study aimed to evaluate whether radiomic features extracted solely from the edema of soft tissue sarcomas (STS) could predict the occurrence of lung metastasis in comparison with features extracted solely from the tumoral mass. Materials and Methods: We retrospectively analyzed magnetic resonance imaging (MRI) scans of 32 STSs, including 14 with lung metastasis and 18 without. A segmentation of the tumor mass and edema was assessed for each MRI examination. A total of 107 radiomic features were extracted for each mass segmentation and 107 radiomic features for each edema segmentation. A two-step feature selection process was applied. Two predictive features for the development of lung metastasis were selected from the mass-related features, as well as two predictive features from the edema-related features. Two Random Forest models were created based on these selected features; 100 random subsampling runs were performed. Key performance metrics, including accuracy and area under the ROC curve (AUC), were calculated, and the resulting accuracies were compared. Results: The model based on mass-related features achieved a median accuracy of 0.83 and a median AUC of 0.88, while the model based on edema-related features achieved a median accuracy of 0.75 and a median AUC of 0.79. A statistical analysis comparing the accuracies of the two models revealed no significant difference. Conclusion: Both models showed promise in predicting the occurrence of lung metastasis in soft tissue sarcomas. These findings suggest that radiomic analysis of edema features can provide valuable insights into the prediction of lung metastasis in soft tissue sarcomas.

Keywords:

radiomics; magnetic resonance imaging (MRI); soft tissue sarcoma; lung metastasis; edema

1. Introduction

Soft tissue sarcomas (STSs) encompass a diverse range of malignancies originating from mesenchymal cells. The World Health Organization recognizes more than 50 distinct subtypes within this category. STSs are rare tumors, accounting for approximately 1% of all cancer cases [1]. Despite their low incidence, they pose significant concerns due to their potential for distant metastases, which occur in about 25% of cases and contribute to the majority of deaths; high-grade STSs can exhibit a metastatic rate of up to 50% [2,3,4]. The lungs are the most common site of metastasis, accounting for around 80% of lesions [5].

The prognosis for patients who develop metastases is generally poor. Those who undergo surgical metastasectomy have a 3-year survival rate of less than 50%, while patients who are not eligible for surgery have a survival rate below 20%. The median survival time following the diagnosis of distant metastasis is approximately 11.6 months [2]. The identification of patients with a heightened susceptibility to developing distant metastases holds the potential to enhance the efficacy of therapeutic interventions [6,7].

In a study conducted by White [8], the presence of satellite tumor cells was observed in 10 out of 15 patients with STSs. In 9 out of 15 cases, tumor cells were identified beyond the sarcoma margin within regions exhibiting peritumoral edema and reactive changes as observed on preoperative MRI scans.

By thoroughly investigating the edema, researchers can gain valuable insights into the intricate interactions between the tumor and its surrounding tissue. The edema is closely interconnected with the tumor microenvironment, which encompasses factors such as inflammation, angiogenesis, and tissue remodeling. This comprehensive analysis of the edema can provide additional prognostic information beyond relying solely on the tumor volume. The incorporation of edema analysis in the evaluation of STSs has the potential to aid in patient risk stratification and facilitate personalized treatment decisions.

Despite the significance of the edema in the tumor microenvironment, there is a notable gap in the existing literature. Our literature search on PubMed using the keywords [(“soft tissue sarcoma” OR “soft tissue sarcoma”) AND edema] revealed a lack of studies specifically focused on radiomic features extracted solely from the edema. Therefore, the primary objective of our study is to fill this gap by investigating the potential correlations between radiomic features derived from the edema of STSs and the occurrence of lung metastases.

Through this exploration, our study aims to uncover the prognostic value and clinical significance of these radiomic features in relation to lung metastases in STS patients. By elucidating the role of edema-related radiomic features, we can advance our understanding of STSs and improve patient management strategies. This investigation may also lead to the identification of biomarkers associated with tumor behavior and response. Ultimately, our study seeks to contribute valuable knowledge to the field and enhance the care provided to STS patients.

2. Materials and Methods

2.1. Dataset

For our study, we employed an open-source anonymized database as the principal data repository (available online: http://doi.org/10.7937/K9/TCIA.2015.7GO2GSKS; accessed on 2 September 2023); this comprehensive dataset consisted of 51 cases of STSs affecting the extremities, which were histologically confirmed [7,9]. Each patient in the dataset had undergone fluoro-D-glucose positron emission tomography and MRI scans as part of their evaluation, conducted between November 2004 and November 2011.

It is important to note that the MRI protocols employed were not standardized across all patients. To ensure consistency in our analysis, we specifically selected T2-weighted fat-saturated (T2FS) or short tau inversion recovery (STIR) MRI scans. The patients were categorized into two groups based on clinical outcomes: “no lung metastases” (group A) and “lung metastases” (group B).

The inclusion criteria required that the selected examinations exhibit distinct segmentations for both the tumor mass and the tumor mass plus the associated edema, while excluding cases where the two segmentations were identical (e.g., cases with no observable edema). In other studies, T2FS and STIR images are deemed comparable in terms of texture analysis; therefore, we grouped them together as a single category [7,10].

Following these criteria, a total of 32 patients were included in our analysis.

2.2. Segmentation and Feature Extraction

The segmentations for the examinations were acquired from the aforementioned publicly available database. Each individual segmentation underwent visual evaluation by a radiologist with eight years of experience, and modifications were made as deemed necessary. The 3D Slicer software, version 4.13, was employed for this process [11].

For every exam, the following segmentations were considered:

Gross Tumoral Volume (GTV): a segmentation that encompassed only the tumor mass;
Edema Tumoral Volume (EDV): this segmentation was derived by subtracting the tumor mass segmentation alone (GTV) from the segmentation that encompasses both the tumor mass (GTV) and the associated edema (see Figure 1).
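The subtraction that defines the EDV can be sketched as a set difference between two voxel masks. This is only an illustration: representing a mask as a set of voxel indices and the helper names `edema_only_mask` and `has_distinct_edema` are assumptions made for the example, not the authors' implementation.

```python
from typing import Set, Tuple

Voxel = Tuple[int, int, int]  # (x, y, z) index of a voxel inside a mask


def edema_only_mask(gtv: Set[Voxel], gtv_plus_edema: Set[Voxel]) -> Set[Voxel]:
    """Return the voxels of the combined mask that are not part of the GTV."""
    return gtv_plus_edema - gtv


def has_distinct_edema(gtv: Set[Voxel], gtv_plus_edema: Set[Voxel]) -> bool:
    """Inclusion check: the two segmentations must differ (non-empty EDV)."""
    return len(edema_only_mask(gtv, gtv_plus_edema)) > 0


# Toy example (hypothetical voxels): a 2-voxel mass flanked by 2 edema voxels.
gtv = {(1, 1, 0), (2, 1, 0)}
gtv_plus_edema = gtv | {(0, 1, 0), (3, 1, 0)}
edv = edema_only_mask(gtv, gtv_plus_edema)
```

An examination whose two masks are identical yields an empty EDV and would fail the inclusion check described in Section 2.1.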

Feature extraction was performed using Pyradiomics 3.0.1 (https://pyradiomics.readthedocs.io; accessed on 2 September 2023), a software library designed for the extraction of radiomic features from medical imaging data [12]. Additionally, a Python script developed by the authors was utilized, ensuring compliance with the Image Biomarker Standardization Initiative (IBSI) standard [13].

The hyperparameters for feature extraction were set as follows: normalize = True; removeOutliers = 3; binCount = 50; resampledPixelSpacing = 0.8, 0.8, 5.5; interpolator = sitk.sitkBSpline; correctMask = True. All other parameters were kept at their default values. Features were extracted separately for each examination.
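As a sketch, the reported settings can be collected in a Python dictionary of the kind Pyradiomics accepts as keyword arguments. The interpolator is given by its string name, which Pyradiomics accepts in place of the SimpleITK constant. The commented lines show one possible way to use the dictionary; the file paths there are hypothetical and the authors' own script may differ.

```python
# Extraction settings as reported in the text; every other Pyradiomics
# parameter is left at its default value.
settings = {
    "normalize": True,
    "removeOutliers": 3,
    "binCount": 50,
    "resampledPixelSpacing": [0.8, 0.8, 5.5],  # x, y, z spacing
    "interpolator": "sitkBSpline",  # string name of sitk.sitkBSpline
    "correctMask": True,
}

# With Pyradiomics installed, the dictionary could be used like this
# (not executed here; image_path and mask_path are hypothetical):
#   from radiomics import featureextractor
#   extractor = featureextractor.RadiomicsFeatureExtractor(**settings)
#   features = extractor.execute(image_path, mask_path)
```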

The radiomic features extracted in this study were categorized into seven main groups: First Order (FOF) Features; Shape Features (SHAPE); Gray Level Co-occurrence Matrix (GLCM) Features; Gray Level Run Length Matrix (GLRLM) Features; Gray Level Size Zone Matrix (GLSZM) Features; Gray Level Dependence Matrix (GLDM) Features; Neighboring Gray Tone Difference Matrix (NGTDM) Features. The definitions and a detailed list of these features can be found in the Pyradiomics feature documentation, available at https://pyradiomics.readthedocs.io (accessed on 2 September 2023).

2.3. Feature Selection

The feature selection process aimed to identify the most informative features for incorporation into our models. As a first step, highly correlated features were identified and eliminated.

This was achieved using the Spearman correlation coefficient: features with a correlation value exceeding 0.8 were discarded.
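A minimal sketch of such a correlation filter is given below, using only the standard library. Two points are assumptions rather than details from the text: the threshold is applied to the absolute value of the coefficient, and features are scanned in their given order so that the later member of a correlated pair is the one dropped. The feature names and values are toy data.

```python
from typing import Dict, List


def _ranks(values: List[float]) -> List[float]:
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def spearman(x: List[float], y: List[float]) -> float:
    """Spearman rho, computed as the Pearson correlation of the rank vectors."""
    rx, ry = _ranks(x), _ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    if sx == 0 or sy == 0:
        return 0.0  # constant feature: treated as uncorrelated
    return cov / (sx * sy)


def drop_correlated(features: Dict[str, List[float]], threshold: float = 0.8) -> List[str]:
    """Greedily keep a feature only if |rho| <= threshold with all kept ones."""
    kept: List[str] = []
    for name, values in features.items():
        if all(abs(spearman(values, features[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


toy = {
    "feature_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "feature_B": [2.0, 4.1, 6.3, 8.0, 9.9],  # monotone in feature_A
    "feature_C": [3.0, 1.0, 4.0, 5.0, 2.0],
}
selected = drop_correlated(toy)
```

In this toy example `feature_B` is discarded because it is perfectly rank-correlated with `feature_A`, while `feature_C` is retained.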

Following this initial step, we evaluated candidate feature combinations, starting with individual features and progressively increasing the combination size up to a maximum of five features. For each combination size, the Exhaustive Feature Selection algorithm [14] evaluated all possible combinations; for each combination, we computed the average area under the receiver operating characteristic curve (AUC) using 5-fold cross-validation and a Random Forest (RF) classifier. For each combination size, the combination with the highest average AUC was designated the optimal combination for that number of features.
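The search over combination sizes can be sketched with `itertools.combinations`. The scoring function is deliberately left pluggable: in the study it would correspond to the mean 5-fold cross-validated AUC of an RF classifier, whereas `toy_score` below is a hypothetical stand-in so that the example runs without any machine learning library. The function names are illustrative and do not come from the library cited in [14].

```python
from itertools import combinations
from math import comb
from typing import Callable, Dict, Sequence, Tuple

Subset = Tuple[str, ...]


def best_subset_per_size(
    features: Sequence[str],
    score: Callable[[Subset], float],
    max_size: int = 5,
) -> Dict[int, Tuple[Subset, float]]:
    """For each size k, return the best-scoring subset and its score."""
    best: Dict[int, Tuple[Subset, float]] = {}
    for k in range(1, min(max_size, len(features)) + 1):
        scored = ((subset, score(subset)) for subset in combinations(features, k))
        best[k] = max(scored, key=lambda item: item[1])
    return best


def n_candidates(n_features: int, max_size: int = 5) -> int:
    """Number of subsets an exhaustive search has to score."""
    return sum(comb(n_features, k) for k in range(1, max_size + 1))


# Hypothetical scoring rule standing in for a cross-validated AUC.
weights = {"feature_A": 0.10, "feature_B": 0.05, "feature_C": 0.15}


def toy_score(subset: Subset) -> float:
    return 0.5 + sum(weights[f] for f in subset) - 0.08 * (len(subset) - 1)


best = best_subset_per_size(list(weights), toy_score, max_size=3)
```

The cost of an exhaustive search grows quickly: with the 31 GTV features retained after the correlation filter (Section 3.3), sizes one to five already give `n_candidates(31)` = 206,367 subsets, each requiring its own cross-validation.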

Finally, the number and names of the ultimately selected features were determined by identifying the first peak value in the average AUC score (avg_score). This selection process was conducted across the best combinations ranging from 1 to 5 features.

To illustrate with an example, we systematically explored various feature combinations of different sizes to identify the optimal set of features for our analysis:

Single Feature Evaluation: When considering single features in isolation, we observed that ‘feature_C’ exhibited the highest AUC of 0.65, outperforming all other individual features.
Two-Feature Combinations: Expanding our investigation to pairs of features, we found that the combination of ‘feature_D’ and ‘feature_H’ produced the most favorable result, with an AUC of 0.77. This combination surpassed all other two-feature combinations.
Three-Feature Combinations: Continuing our analysis, we explored combinations of three features. Among these, ‘feature_A’ + ‘feature_C’ + ‘feature_F’ yielded the highest AUC of 0.75, demonstrating superior performance when compared to other three-feature combinations.
Four-Feature Combinations: Extending our search to combinations of four features, ‘feature_B’ + ‘feature_D’ + ‘feature_F’ + ‘feature_H’ achieved an AUC of 0.71. This particular combination displayed notable predictive power within the set of four-feature combinations.
Five-Feature Combinations: Finally, in the context of five-feature combinations, ‘feature_A’ + ‘feature_C’ + ‘feature_F’ + ‘feature_H’ + ‘feature_G’ exhibited the highest AUC of 0.81, outperforming all other five-feature combinations.

After these five steps, we opted for a two-feature combination, ‘feature_D’ + ‘feature_H’, which achieved an AUC of 0.77. This decision was based on the observation that it represented the first peak of the AUC values among the feature combinations, ranging from one to five features.
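The "first peak" rule can be written down in a few lines. The sketch below assumes that a peak is the first combination size whose average AUC is followed by a strictly lower value, and that the largest size is returned if the scores never decrease; the text does not spell out these edge cases. The scores are those of the illustrative example above.

```python
from typing import Dict


def first_peak(avg_score_by_size: Dict[int, float]) -> int:
    """Return the first combination size whose score drops at the next size."""
    sizes = sorted(avg_score_by_size)
    for current, following in zip(sizes, sizes[1:]):
        if avg_score_by_size[following] < avg_score_by_size[current]:
            return current
    return sizes[-1]  # scores never decrease: fall back to the largest size


# Best average AUC per combination size, taken from the worked example.
example = {1: 0.65, 2: 0.77, 3: 0.75, 4: 0.71, 5: 0.81}
chosen_size = first_peak(example)
```

Note that the rule stops at size two even though the five-feature combination scores higher (0.81), which favors a smaller model.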

More details regarding the Exhaustive Feature Selection algorithm and the curves obtained in our analysis are elaborated in the Supplementary Materials (Exhaustive Feature Selection algorithm section, Figures S1 and S2).

2.4. Modeling and Statistical Analysis

An RF model based on the selected GTV features (RF-GTV) and an RF model based on the selected EDV features (RF-EDV) were compared.

In particular, we performed 100 random subsampling iterations to evaluate the performances of the two models. For each iteration, we randomly split the dataset into training and testing sets; as suggested by Nadeau and Bengio [15], the training set was five times larger than the testing set.
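A minimal sketch of the repeated random subsampling is shown below. The test fraction of one sixth follows from the stated 5:1 ratio; rounding the test-set size to the nearest integer, the fixed seed, and the absence of stratification are assumptions made for the example.

```python
import random
from typing import Iterator, List, Tuple


def random_subsampling(
    n_samples: int,
    n_runs: int = 100,
    train_to_test: int = 5,
    seed: int = 0,
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_indices, test_indices) for each random split."""
    rng = random.Random(seed)
    n_test = round(n_samples / (train_to_test + 1))
    indices = list(range(n_samples))
    for _ in range(n_runs):
        shuffled = indices[:]
        rng.shuffle(shuffled)
        yield shuffled[n_test:], shuffled[:n_test]


splits = list(random_subsampling(n_samples=32))
```

With 32 patients this gives 27 training and 5 testing cases per run, so the 5:1 ratio holds only approximately.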

The RF models were trained on the training sets and evaluated on the corresponding testing sets. Performance metrics, such as accuracy, sensitivity, specificity, and AUC, were computed for both algorithms.

The median and interquartile range (IQR) of accuracy, sensitivity, specificity, and AUC were calculated across the 100 iterations for both the RF-GTV and the RF-EDV models.
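The per-run metrics and their summary across runs can be sketched as follows. The label convention (1 = lung metastases, 0 = no lung metastases) and the "inclusive" quantile method are assumptions; the toy labels and accuracies are not the study's data.

```python
from statistics import median, quantiles
from typing import Dict, Sequence, Tuple


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity and specificity from true and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn) if tp + fn else float("nan"),
        "specificity": tn / (tn + fp) if tn + fp else float("nan"),
    }


def median_iqr(values: Sequence[float]) -> Tuple[float, float, float]:
    """Return (median, first quartile, third quartile) of the values."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return median(values), q1, q3


# One toy test split, then a toy list of per-run accuracies.
run_metrics = binary_metrics([1, 1, 0, 0, 0], [1, 0, 0, 0, 1])
summary = median_iqr([0.6, 0.8, 0.8, 1.0])
```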

To compare the two models, we used the Nadeau and Bengio corrected resampled t-test for the obtained accuracies. According to [15,16], performing 100 randomized subsampling iterations and the Nadeau and Bengio corrected resampled t-test guarantee a close alignment of Type I error with the significance level. Importantly, in contrast to McNemar’s test and the 5 × 2 cross-validation test, this method does not exhibit a heightened Type II error rate. Moreover, when employing a total of 100 runs, the level of replicability reaches a satisfactory threshold, thereby enabling reliable comparisons among diverse algorithms.
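As an illustration, the corrected resampled t statistic of Nadeau and Bengio replaces the usual 1/J variance factor with (1/J + n_test/n_train), where J is the number of runs. The sketch below computes only the statistic. The p-value would then come from a Student t distribution with J − 1 degrees of freedom, for instance via `scipy.stats.t.sf`, which is left out so that the example stays dependency-free. The per-run differences are toy values.

```python
import math
from statistics import mean, variance
from typing import Sequence


def corrected_resampled_t(diffs: Sequence[float], n_train: int, n_test: int) -> float:
    """t statistic for per-run accuracy differences between two models.

    diffs[j] is the accuracy of model 1 minus that of model 2 on run j.
    """
    n_runs = len(diffs)
    sample_var = variance(diffs)  # unbiased sample variance (n_runs - 1)
    if sample_var == 0:
        raise ValueError("all differences are identical; t is undefined")
    correction = 1.0 / n_runs + n_test / n_train
    return mean(diffs) / math.sqrt(correction * sample_var)


# Toy differences over four runs with a 27/5 train/test split.
t_stat = corrected_resampled_t([0.1, 0.0, 0.2, 0.1], n_train=27, n_test=5)
```

Because the training sets of different runs overlap heavily, the uncorrected statistic underestimates the variance; the extra n_test/n_train term widens it and makes the test more conservative.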

The Spearman correlation coefficient was employed to calculate the intercorrelation among the selected features. Additionally, the Mann-Whitney test was utilized to assess statistically significant differences in selected feature values between group A/group B.

The correlation between the clinical features and the clinical outcomes (the “no lung metastases” group and the “lung metastases” group) was subjected to statistical analysis. This analysis employed the Mann-Whitney U test and Fisher’s exact test [17].

To enhance the generalizability of our findings to a wider population, we utilized 10000 stratified bootstrap iterations to calculate 95% confidence intervals (CI) [18], with a particular focus on accuracy and AUC.
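One way such a stratified bootstrap could be set up is sketched below for accuracy. Several choices are assumptions, since the text does not detail them: predictions are resampled with replacement within each true class, so every replicate keeps the original class sizes, and the interval is taken with the percentile method. The labels and predictions are toy data.

```python
import random
from statistics import median
from typing import Dict, List, Sequence, Tuple


def stratified_bootstrap_ci(
    y_true: Sequence[int],
    y_pred: Sequence[int],
    n_boot: int = 10000,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float, float]:
    """Return (median, lower, upper) of bootstrapped accuracy."""
    rng = random.Random(seed)
    # Group sample indices by true class so each replicate keeps class sizes.
    by_class: Dict[int, List[int]] = {}
    for i, label in enumerate(y_true):
        by_class.setdefault(label, []).append(i)

    accuracies = []
    for _ in range(n_boot):
        sample = [
            rng.choice(members)
            for members in by_class.values()
            for _member in members
        ]
        hits = sum(y_true[i] == y_pred[i] for i in sample)
        accuracies.append(hits / len(sample))

    accuracies.sort()
    lower = accuracies[int(n_boot * alpha / 2)]
    upper = accuracies[int(n_boot * (1 - alpha / 2)) - 1]
    return median(accuracies), lower, upper


# Toy labels (1 = lung metastases) and predictions; fewer replicates for speed.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0]
med, lo, hi = stratified_bootstrap_ci(y_true, y_pred, n_boot=2000)
```

With small samples the resulting intervals are wide, which is consistent with the broad confidence intervals reported in Section 3.4.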

The described pipeline was performed using Python version 3.8. For RF, max_depth (the longest path from the root node to the leaf node) was set to 10; the default values were retained for all the remaining parameters.

3. Results

3.1. Dataset

Our study comprised a cohort of 32 patients, consisting of 14 males and 18 females, with a median age of 60 years (range: 16–83 years). Throughout the follow-up period, 18 patients remained free from lung metastases (group A), while 14 patients experienced lung metastases (group B).

The median duration from the diagnosis to the last follow-up was 684.5 days (range: 377–1329 days) for group A, whereas the median duration from the diagnosis to the onset of metastases or local recurrence was 162 days (range: 29–731 days) for group B.

Regarding histological grades, 18 patients had high-grade sarcomas (8 patients in group A and 10 patients in group B), 13 patients had intermediate-grade sarcomas (10 patients in group A and 3 patients in group B), and 1 patient had a low-grade sarcoma (in group A). Further details on relevant clinical parameters and treatment modalities can be found in Table 1, along with the supplementary information provided in the Supplementary Table S1 section under “Clinical data”.

The MRI protocols were heterogeneous; T2FS or STIR sequences were used. Additional details regarding the MRI acquisition protocols can be found in the Supplementary Table S1 section under “MRI data”. Not all individual patients had both STIR and T2FS sequences available. Consequently, we selected the only fluid-sensitive sequence that was accessible for each patient during the analysis [7,10].

3.2. Features Extraction

After conducting a visual evaluation, it was determined that the segmentations of 31 exams were suitable for both GTV and EDV; however, in one exam, manual adjustments were made to improve the delineation of the EDV segmentation.

A comprehensive set of 214 radiomics features was extracted, specifically comprising 107 features from the EDV segmentation and 107 features from the GTV segmentation.

3.3. Features Selection

In relation to the features extracted from GTV and EDV, after the removal of the highly correlated features, a total of 31 features were retained for GTV and 33 for EDV. Subsequently, the Exhaustive Feature Selection algorithm was iteratively applied, considering the range of one to five features, and identified the first peak in the average AUC score (more details regarding the curves obtained are elaborated in the Supplementary Materials Figures S1 and S2). As a result, two features were selected for both GTV and EDV, as shown in Table 2.

3.4. Classification Performance

The performance metrics obtained over 100 random subsampling iterations for the RF-GTV and the RF-EDV models are shown in Table 3; in particular, the median accuracy was 0.83 for the RF-GTV and 0.75 for the RF-EDV.

Based on the results of the Nadeau and Bengio corrected resampled t-test, there was no statistically significant difference observed between the accuracies of the two models (p-value = 0.433).

Figure 2 presents the ROC curves and the corresponding AUC values for both models. The ROC curves show each model’s trade-off between sensitivity and specificity across threshold settings when distinguishing group A (“no lung metastases”) from group B (“lung metastases”), while the AUC values summarize the overall discriminatory power of each model. In particular, the RF-GTV model obtained an AUC of 0.88 and the RF-EDV model achieved an AUC of 0.79.
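For readers less familiar with the metric, the AUC can be computed directly as the probability that a randomly chosen positive case (group B) receives a higher predicted score than a randomly chosen negative case (group A), with ties counted as one half. The sketch below uses this pairwise definition on toy scores; it is not the code used to produce Figure 2.

```python
from typing import Sequence


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of positive/negative pairs ranked correctly."""
    positives = [s for t, s in zip(y_true, scores) if t == 1]
    negatives = [s for t, s in zip(y_true, scores) if t == 0]
    if not positives or not negatives:
        raise ValueError("both classes are needed to compute the AUC")
    wins = sum((p > n) + 0.5 * (p == n) for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))


# Toy labels (1 = lung metastases) and predicted probabilities.
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

In the toy example three of the four positive/negative pairs are ordered correctly, giving an AUC of 0.75; a value of 0.5 corresponds to random ranking.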

The bootstrap evaluation, which went through 10000 iterations for calculating the median and 95% CI, revealed the following values:

For the RF-GTV: a median accuracy of 0.71 [95% CI: 0.46–0.92], a median AUC of 0.79 [95% CI: 0.50–1.00];
For the RF-EDV: a median accuracy of 0.69 [95% CI: 0.43–0.91], a median AUC of 0.73 [95% CI: 0.45–0.94].

Further details regarding the bootstrap results can be found in the Supplementary Materials (Figures S3–S7).

Figure 3 displays the intercorrelation patterns observed among the features selected for both the RF-GTV and the RF-EDV models. These intercorrelations were computed through the application of the Spearman correlation coefficient. The examination of the correlation coefficients revealed values consistently below the value of 0.3, demonstrating a lack of substantial correlation. Such findings emphasize the relative independence of these features within the context of our models.

Figure 4 and Figure 5 illustrate the boxplots representing the selected features in group A and group B; the statistical analysis conducted using the Mann-Whitney test revealed no significant differences among the selected features in terms of the comparison between group A and group B.

4. Discussion

This study aimed to compare the predictive ability of radiomic features extracted from the edemas and tumoral masses of STSs in predicting lung metastases. MRI scans of 32 STSs were retrospectively analyzed, of which 18 cases were without lung metastases and 14 cases had lung metastases. A total of 107 radiomic features were extracted from each GTV and EDV segmentation. After feature selection, the feature vectors contained two features for the mass model (original_glcm_Correlation and original_glszm_SmallAreaLowGrayLevelEmphasis) and two features for the edema model (original_firstorder_Kurtosis and original_glszm_SizeZoneNonUniformityNormalized). Random Forest models were created using the selected features, and key performance metrics were calculated. The model based on the mass-related features (RF-GTV) achieved a median accuracy of 0.83 and a median AUC of 0.88, while the model based on the edema-related features (RF-EDV) achieved a median accuracy of 0.75 and a median AUC of 0.79. According to the Nadeau and Bengio corrected t-test, the statistical analysis showed no significant difference between the accuracies of the two models.

The Spearman correlation coefficient was used to assess the independence of feature vectors; the results revealed an absence of substantial correlation (Spearman correlation coefficient < 0.3).

In relation to the statistical differences in the distribution of clinical parameters between Group A and Group B, no statistically significant differences were observed for age, gender, grade, and MSKCC type (p-value > 0.05). It is noteworthy that, despite references [19,20], which assert that the risk of distant metastases in STSs can range from 20% to nearly 100% based on grading and histological type, our study did not identify any significant correlations between these clinical parameters and the risk of pulmonary metastases. Consequently, the radiomic model demonstrated its capacity to predict outcomes, in contrast to the clinical parameters examined in this study.

When comparing the EDV segmentations of individuals without lung metastases (group A) and those with lung metastases (group B), some differences were observed, even though they were not statistically significant. Specifically, group A exhibited a lower median value of Original FirstOrder Kurtosis, indicating a more peaked distribution of pixel intensities around the mean. Furthermore, group A exhibited a higher median value of Original GLSZM Size-Zone Non-Uniformity Normalized, indicating the presence of regions within the image with varying sizes or distinct patterns.

In other terms, group A’s images showed a relatively consistent overall texture with localized variations or structures that contribute to the general heterogeneity of the images. In contrast, group B had pronounced variations and irregularities in pixel intensities throughout the region, with regions that exhibited relatively consistent size zones, indicating the presence of distinct histological structures or patterns (e.g., tumor cells arranged in well-defined nests).

In support of these findings, the presence of satellite tumor cells within edema and the association between edema and high-grade STSs have been investigated in [8]. Moreover, several previous studies [21,22,23,24,25,26,27] have examined the impact of various factors on MRI (including edema), and have highlighted the prognostic significance of baseline size, heterogeneous signal intensities on pre-treatment conventional MRI sequences, necrotic signals, peritumoral edema and enhancement, and the presence of a tail sign; these studies have also investigated the associations between these features and the histological grading according to the “Fédération Nationale des Centres de Lutte Contre le Cancer” grading system.

Other studies have examined edema [7,26,28,29,30,31,32,33,34,35], but none of these studies have specifically extracted radiomic features solely from the edema region of STSs. In particular, Crombé et al. [26] examined the changes in semantic features before, during, and after neoadjuvant therapy and surgery using MRI. It was found that changes in edema enhancement were associated with the presence of tumor cells beyond the lesion borders, while variations in edema were associated with disease-free survival; however, none of the studied outcomes were associated with the assessment of edema on baseline MRI. Fadli et al. [28] examined the changes in semantic and radiomic features in a cohort of two consecutive pre-therapy MRIs; the findings revealed a significant association between the presence or increase in edemas (assessed semantically) and the occurrence of local recurrence. In another study [30], selected semantic and radiomic features were analyzed in a cohort of patients who underwent two consecutive MRI scans before and after two cycles of neoadjuvant chemotherapy; the analysis revealed that variations in the surrounding edema (measured semantically) were associated with a positive treatment response, defined as a threshold of less than 10% viable cells on surgical specimens.

Our results have been compared with other studies that utilized the same dataset [7,9]. Specifically, Vallières et al. [7], using a model based on four features extracted from FDG-PET/T1 and FDG-PET/T2FS to predict the onset of lung metastases, achieved an AUC of 0.984 using a bootstrap evaluation. Zhao et al. [10], employing a signature based on T2-weighted MRI features to predict the development of metastasis or recurrence, obtained AUC values of 0.8481 and 0.7351, respectively, in the training and validation datasets, using a four-fold cross-validation. Escobar et al. [36] developed a model to predict the onset of lung metastases using MRI sequences, and achieved an AUC of 0.840 using BootstrapOutOfBag. In [37], a methodology utilizing formal logic and radiomics models to predict the risk of metastasis or recurrence yielded an accuracy of 0.74. However, unlike our study, the first three works employed segmentations that included only the GTV, while the fourth study used segmentations that included both the GTV and edema together.

The aforementioned literature has highlighted the scarcity of studies dedicated to the analysis of radiomic features extracted exclusively from edema. In contrast to previous studies, our findings underscore the importance of investigating radiomic features derived solely from edema. This is attributed to the potential they hold in providing valuable insights into the prediction of lung metastases.

The current study has several limitations that need to be acknowledged. Firstly, the sample size used in this study is relatively small, which can reduce the statistical power of the classification outcomes. To mitigate this limitation, we employed 100 random subsampling iterations to assess the performance of the two models. Each iteration involved randomly splitting the dataset into training and testing sets, following the recommendation of Nadeau and Bengio [15], with the training set being five times larger than the testing set. Secondly, the study faced limitations associated with variations in MRI scanning parameters. These differences could potentially introduce batch effects, but they also presented an opportunity to examine the robustness of the methods across diverse image acquisition parameters. Thirdly, we did not employ the DeLong test to compare the AUC values of the models. We made this decision based on concerns raised about the DeLong method, primarily due to its misuse when training and testing the models using the same dataset [38,39]. Instead, we opted to use the Nadeau and Bengio corrected t-test to compare the accuracies of the models [16].

In summary, both the RF-GTV and the RF-EDV models exhibited promising potential for predicting the occurrence of lung metastasis in soft tissue sarcomas. Specifically, the model incorporating radiomic features solely extracted from the edema region (RF-EDV) demonstrated the capability to predict lung metastases, although its performance was slightly inferior to the model based on mass-related features (RF-GTV). However, the disparity between the two models did not reach statistical significance.

5. Conclusions

These findings suggest that the utilization of radiomic analysis focusing on edema features holds promise in predicting lung metastases in STSs, providing results that are comparable to those obtained from mass-related features. Further investigations involving larger cohorts are warranted to validate the clinical utility of these models.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics13193134/s1, Figures S1–S7; Table S1: Clinical data, MRI data; Algorithm S1: Exhaustive Feature Selection algorithm. References [12,14,18,40,41] are cited in the Supplementary Materials.

Author Contributions

R.C.: Conceptualization, Methodology, Software, Formal analysis, Resources, and Writing—original draft. R.D.A. and N.C.: Methodology, Formal analysis, Resources, and Investigation. A.M.: Methodology, Visualization, Formal analysis, and Writing—review and editing. M.A.B.: Supervision and Methodology. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset, the code and the segmentations used in this article can be provided upon contact with the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

MRI	Magnetic Resonance Imaging
AUC	Area Under the ROC Curve
STS	Soft-Tissue Sarcoma
T2FS	T2-weighted Fat-Saturated
STIR	Short Tau Inversion Recovery
GTV	Gross Tumoral Volume
EDV	Edema Tumoral Volume
FOF	First Order Features
SHAPE	Shape Features
GLCM	Gray Level Co-occurrence Matrix Features
GLRLM	Gray Level Run Length Matrix Features
GLSZM	Gray Level Size Zone Matrix Features
GLDM	Gray Level Dependence Matrix Features
NGTDM	Neighboring Gray Tone Difference Matrix Features
RF	Random Forest
RF-GTV	Random Forest model based on selected GTV features
RF-EDV	Random Forest model based on selected EDV features
IQR	interquartile range
CI	confidence intervals

Figure 1. GTV and EDV segmentations; EDV segmentations were obtained by subtracting the GTV segmentation from the segmentation that encompasses both the tumor mass (GTV) and the associated edema.
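
The subtraction described in the Figure 1 caption can be sketched with boolean masks. This is a minimal illustration, assuming both segmentations are available as arrays of identical shape; the function name `edema_mask` and the toy 2D arrays are hypothetical and are not taken from the authors' code.

```python
import numpy as np

def edema_mask(combined_mask, gtv_mask):
    """Return the edema-only mask: voxels in the combined mask but not in the GTV mask."""
    combined = np.asarray(combined_mask, dtype=bool)
    gtv = np.asarray(gtv_mask, dtype=bool)
    if combined.shape != gtv.shape:
        raise ValueError("masks must share the same shape")
    # Keep voxels labelled as mass-plus-edema that are not labelled as mass
    return combined & ~gtv

# Toy 2D slice (hypothetical values, for illustration only)
combined = np.array([[0, 1, 1, 1, 0],
                     [1, 1, 1, 1, 1],
                     [0, 1, 1, 1, 0]])
gtv = np.array([[0, 0, 1, 0, 0],
                [0, 1, 1, 1, 0],
                [0, 0, 1, 0, 0]])

edv = edema_mask(combined, gtv)
print(edv.astype(int))
```

By construction, the resulting EDV mask shares no voxel with the GTV mask, so features extracted from it describe the edema alone.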

Figure 2. ROCs and AUCs for the RF-GTV (GTV model) and RF-EDV (edema model).

Figure 3. Intercorrelation among the selected features for both Gross Tumor Volume (GTV) and Edema Tumor Volume (EDV).

Figure 4. Boxplots of the selected features for the Gross Tumor Volume (GTV) model, with Mann–Whitney p-values for differences in value distribution. The horizontal orange line represents the median.

Figure 5. Boxplots of the selected features for the Edema Tumor Volume (EDV) model, with Mann–Whitney p-values for differences in value distribution. The horizontal orange line represents the median.

Table 1. Clinical parameters. (* p-value for differences in value distribution between Group A and Group B; age: Mann–Whitney test; gender ratio, grade ratio, and MSKCC type: Fisher’s exact test).

	Group A (No Lung Metastases)	Group B (Lung Metastases)	p-Value *
Number of patients	18	14	-
Gender ratio (M/F)	5/13	9/5	0.072
Age, y, median (range)	53.5 (16–83)	62.5 (44–74)	0.106
Grade ratio (Low/Intermediate/High)	1/9/8	0/4/10	0.216
MSKCC type (Fibrosarcoma/Leiomyosarcoma/Liposarcoma/MFH/Synovial sarcoma/Other)	1/6/3/3/3/2	0/3/2/8/1/0	0.238
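
As a worked check of the gender-ratio row in Table 1, the two-sided Fisher’s exact test for a 2 × 2 table can be written out directly from the hypergeometric distribution. The sketch below is a standard-library illustration, not the authors' implementation; the function name `fisher_exact_two_sided` is hypothetical, and the grade and MSKCC rows would need the extension of the test to larger tables, which is not shown here.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    p_obs = prob(a)
    # Sum the probabilities of all tables at most as likely as the observed one
    # (small relative tolerance guards against floating-point ties)
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-7))

# Gender counts from Table 1: Group A = 5 M / 13 F, Group B = 9 M / 5 F
p_gender = fisher_exact_two_sided(5, 13, 9, 5)
print(f"p = {p_gender:.3f}")  # prints p = 0.072
```

The result agrees with the value of 0.072 reported in the table for the gender ratio.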

Table 2. Selected features for Gross Tumor Volume (GTV) and Edema Tumor Volume (EDV).

Selected Features
Gross Tumor Volume (GTV)	Edema Tumor Volume (EDV)
original_glcm_Correlation	original_firstorder_Kurtosis
original_glszm_SmallAreaLowGrayLevelEmphasis	original_glszm_SizeZoneNonUniformityNormalized

Table 3. Classification performance on 100 random subsampling iterations.

	RF-GTV Median [Interquartile Range]	RF-EDV Median [Interquartile Range]
Accuracy	0.83 [0.17]	0.75 [0.17]
Sensitivity	0.67 [0.50]	0.67 [0.50]
Specificity	1.00 [0.33]	0.80 [0.33]
AUC	0.88 [0.23]	0.79 [0.38]
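
Each cell of Table 3 has the form median [interquartile range] over the 100 subsampling runs. The sketch below shows one way such a summary can be computed, assuming the IQR is taken as Q3 − Q1 with inclusive quantiles; the quantile convention used by the authors is not stated here, and the input values are hypothetical placeholders rather than the study's per-run results.

```python
from statistics import median, quantiles

def median_iqr(values):
    """Return (median, IQR) with IQR = Q3 - Q1, using inclusive quantiles."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return median(values), q3 - q1

# Hypothetical per-run metric values (NOT the study's data)
example_runs = [0.5, 0.6, 0.7, 0.8, 0.9]
med, iqr = median_iqr(example_runs)
print(f"{med:.2f} [{iqr:.2f}]")
```

Reporting the median with the IQR rather than the mean with the standard deviation keeps the summary robust to the occasional very poor or perfect run that small test splits can produce.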

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

