GOBO: Gene Expression-Based Outcome for Breast Cancer Online
Figure 5
Sample Prediction analysis of the CSR gene signature in the 1881-sample breast cancer data set.
(A) Composition of CSR classification groups for different clinical variables and Hu et al. gene expression subtypes in the 1881-sample set. (B) Association of the CSR activated fibroblast classification group with clinical variables and gene expression subtypes (Hu et al.). Y-axis display log10(P-value) from Fisher tests for each category. E.g., for the basal-like subtype a 2×2 table is generated containing number of basal-like tumors in the CSR activated fibroblast class, number of non-basal-like tumors in the CSR activated fibroblast class, number of basal-like tumors in the CSR non-activated class and number of non-basal-like tumors in the CSR non-activated class. Fisher P-values from tests with odds ratios <1 (negative association) are depicted as log10(P) (negative values on y-axis), whereas odds ratios >1 (positive association) are depicted as –log10(P) (positive values on y-axis). Results can be interpreted such that the CSR activated fibroblast class is associated with ER-negative tumors, tumors with histological grade 3, and tumors classified as basal-like or luminal B. (C) Association with outcome for CSR classification in subgroups of breast cancer using DMFS as endpoint and 10-year censoring. Samples in the 1881-sample set were stratified into two groups based on correlation to the CSR activated fibroblast gene signature, followed by Kaplan-Meier survival analysis in 21 subgroups using 1379 cases with DMFS follow-up. Logrank P-values are shown as −log10(P-value). (D) Corresponding multivariate analysis for ER-positive tumors (n = 554) using lymph node status and stratified histological grade (histological grade 1 and 2 vs. 3) as covariates, DMFS as endpoint and 10-year censoring. ER status is omitted from the multivariate analysis since all investigated cases are ER-positive.