Introduction

Acute lymphoblastic leukemia (ALL) is a hematologic disease characterized by uncontrolled proliferation of immature lymphoid cells. Its incidence varies between countries. In the USA, an incidence of 1.7 cases per 100,000 person-years [1] has been reported; in Latin America, the incidence is higher with up to 2.1 cases per 100,000 person-years [2]. In the last decade, survival rates in children have improved, with a 5-year overall survival (OS) approaching 90% [2, 3]; however, this has not been reproduced in adults. According to age group, 5-year OS for adolescents and young adults (AYA) is 42–63%, 24% in patients between 40 and 59 years, and 17% in those older than 60 years [4, 5].

It is known that bone marrow (BM) fibrosis is commonly found in patients with acute leukemia [6] and that this fibrosis reduces significantly after treatment [7,8,9,10,11]. Previous studies have described the role of BM microenvironment and fibrosis as a prognostic marker in ALL with contradictory results. Some clinical studies propose that fibrosis serves as a sanctuary for blasts, reducing response to chemotherapy [12, 13], and others have associated BM fibrosis with lower white blood cell and peripheral blast counts, characteristics classically associated with better prognosis [14, 15].

Norén-Nyström et al. in Sweden is the only group that has consistently reported the prognostic role of BM fibrosis and vascular density in pediatric patients with ALL. In high-risk patients, they described more BM fibrosis and vascular density at diagnosis, and this was related to higher rates of post-induction positive minimal residual disease and shorter progression-free survival (PFS). In addition, a rapid and greater than 20% reduction in post-induction BM fibrosis was associated with longer relapse-free survival (RFS). They did not find any association between BM fibrosis and specific demographic characteristics, immunophenotype, karyotype, or a prognostic role in standard-risk patients. [16,17,18]

To our knowledge, there are no studies describing the prognostic role of BM fibrosis on the survival outcomes of adult patients with ALL. The aim of this study is to describe the impact of BM fibrosis at diagnosis and its reduction post-induction treatment on induction failure, RFS, and OS in adult patients with ALL.

Materials and methods

Patients

From February 2011 to February 2016, we retrospectively collected demographic information, clinical data, and outcomes from clinical records of patients. We included patients aged ≥ 18 years, consecutively diagnosed with ALL, treated, and followed at our institution, with available and good-quality BM biopsies on diagnosis and post-induction treatment, and who had no previous cancer or disease known to cause BM fibrosis. This protocol was approved by our local ethical committee.

ALL diagnosis was established according to the 2016 World Health Organization (WHO) criteria [19, 20]. Risk was stratified according to classic risk factors (> 35 years of age, > 30 × 106 leukocytes/μL, early T cell leukemia, Philadelphia chromosome, complex karyotype, and failure to induction treatment) [4]. Patients received either HyperCVAD [21] or an institutional protocol (1095 protocol) [22] with or without rituximab. Complete remission (CR) was defined as leukemia cells not detectable by light microscopy in peripheral blood or cerebrospinal fluid and < 5% blasts in a BM with trilineage hematopoiesis. To consider CR, absolute neutrophil count (ANC) greater than 1.0 × 109/L, platelet counts greater than 100 × 109/L, and no recurrence should be observed for at least 4 weeks [23, 24]. Induction failure was defined as failure to achieve CR after first induction therapy; relapsed disease as the reappearance of blasts in blood or BM (> 5%). RFS was defined as the time elapsed from diagnosis to relapse and OS as the time elapsed from diagnosis to death by any cause or last follow up.

Bone marrow fibrosis

BM biopsies were evaluated in a blinded manner by one hematopathologist. Biopsy samples were fixed in formaldehyde, decalcified, and embedded in paraffin. Pathology material was discarded when samples were crushed, had extensive necrosis, where subcortical marrows and/or the evaluation area was less than three marrow spaces. Hematoxylin-eosin, reticulin, and Masson trichrome stains were performed. BM cellularity and blast count were estimated in all samples. Fibrosis was described using both modified Bauermeister scale as described by Bain et al. [25], and volume of reticulin/collagen fibers adapted from Nóren-Nyström et.al. [17]

Modified Bauermeister scale included four grades: grade 0 with no reticulin fibers demonstrable, grade 1 with occasional fine individual fibers and foci of a fine network, grade 2 with diffuse fiber network of fine fibers without coarse fibers, grade 3 with diffuse fiber network with scattered thick coarse fibers but no mature collagen, and grade 4 with diffuse coarse fiber network with areas of collagenization. Significant fibrosis was considered a modified Bauermeister scale ≥ grade 2.

The volume of reticulin/collagen fibers for each sample was estimated using a 63-point graticule representing 25 microns [2] at × 40 magnification in 10 randomly selected fields within the marrow area and finally expressed as the average of this 10 selected fields per sample. The volume was determined by the percentage of the area occupied by graticule-crossing points or hits overlaying fibers versus hits over reference space (Fig. 1). A Lumera microscope camera and Infinity Analyze software version 6.4.1 was used. Significant or deep fibrosis reduction was considered ≥ 50% reduction of the volume of reticulin/collagen fibers after induction treatment.

Fig. 1
figure 1

Bone marrow biopsy stained with Masson trichrome stain. Twenty-five microns [2], × 40. Sixty-three-point graticule. In this figure, there are 17 (of 63 total) graticule points crossed by collagen fibers representing 27% volume of collagen fibers of the field

Survival analyses were performed regarding (1) the presence or not of significant BM fibrosis at diagnosis, (2) the significant reduction of BM fibrosis post-induction treatment, and (3) the amount of BM fibrosis at diagnosis combined with the depth of BM fibrosis reduction post-induction treatment. For this third analysis, patients were classified into three groups: without significant fibrosis at diagnosis regardless of fibrosis reduction post-induction treatment (group 1), with significant fibrosis at diagnosis and a reduction of volume of reticulin/collagen fibers post-induction treatment of < 50% (group 2), and with significant fibrosis at diagnosis and reduction of volume of reticulin/collagen fibers post-induction treatment of ≥ 50% (group 3).

Statistical analysis

Continuous variables were described as medians and were compared with the Mann-Whitney U test. Categorical variables were described as frequencies and proportions and were compared with X2 and Fisher’s exact test. Correlation analysis was performed using the Spearman correlation test. Survival analysis was performed with Kaplan-Meier curves and differences determined by log-rank test. Statistical significance was established at p < 0.05 (two-tailed). All statistical analyses were conducted using SPSS 20.0 software program (Statistical Package for Social Sciences, SPSS Inc. Chicago, IL., USA).

Results

Baseline characteristics

A total of 44 patients were included in the study. Median age at diagnosis was 27 years (range, 17–67 years), 23 (52%) were male, and 38 (86%) patients had ECOG performance score of 0–1. Regarding cytogenetic profile, 20 (46%) patients had a normal karyotype, 1 (2%) patient had complex karyotype (≥ 5 chromosomal abnormalities), and Philadelphia chromosome was identified by FISH or karyotype in 10 (23%) patients. There were no T or mature B cell leukemias, 41 (93.2%) patients had pre-B cell leukemia and 28 (64%) were considered high-risk. Additional baseline characteristics are listed in Table 1.

Table 1 Demographics and characteristic of evaluable patients at diagnosis

Therapy and response

A total of 27 (61%) patients received HyperCVAD regimen with or without rituximab. Of the 10 patients with Philadelphia chromosome, 8 (80%) received imatinib. During induction, 57% used granulocyte colony stimulating factor and 77% developed an infection. In terms of response assessment, 31 (71%) patients achieved CR with induction therapy and 13 (29%) failed induction. After 24 months follow up, 29 (59%) patients relapsed; for the whole group, RFS was 10.6 months (IC 95%, 7.08–14.07) and OS was 17.1 months (IC 95%, 10.33–23.96).

Bone marrow fibrosis assessment

Bone marrow core biopsies had a median size of 1.3 cm (range, 0.7–3.4 cm) at diagnosis and 1.5 cm (range, 0.6–2.5 cm) post-induction treatment. Median evaluable BM spaces at diagnosis were 11.5 (range, 5–40) and post induction, 10 (range, 4–21).

We correlated BM fibrosis description using the volume of reticulin/collagen fibers with the modified Bauermeister scale, both methods had a good correlation at diagnosis (p ≤ .001) and post-induction treatment (p ≤ .001) (Fig. 2). Since the modified Bauermeister scale is widely used for fibrosis description, we compared patients based on the presence (≥ grade 2) or absence (grade 0–1) of significant fibrosis at diagnosis (Table 1). Twenty-nine (66%) patients had significant BM fibrosis at diagnosis, and there were no significant differences in baseline characteristics at diagnosis between patients with or without fibrosis. There were more high-risk patients (55% vs 80%, p = .185) in the group without significant fibrosis. Median white blood cell count (7.4 × 103/μL vs 20.7 × 103/μL, p = .154) and peripheral blast count (25% vs 66%, p = .063) were lower in patients with significant fibrosis at diagnosis. These differences were not statistically significant.

Fig. 2
figure 2

Correlation of fibrosis description by volume of reticulin/collagen fibers and fibrosis by modified Bauermeister scale at diagnosis (p ≤ .001). We found the same correlation with p ≤ .001 after induction treatment

The presence of significant BM fibrosis at diagnosis was not correlated with induction failure (6 patients vs 7 patients, p = .313) or OS (20.0 months vs 12.3 months, p = .061). RFS was longer in patients with significant fibrosis at diagnosis (12.3 months vs 8.8 months, p = .044), but the significance was lost when patients were classified according to risk (high-risk patients 9.4 months vs 8.2 months, p = .291; standard-risk patients 12.4 months vs 10.6 months, p = .094).

When Philadelphia positive patients were analyzed, the presence of significant BM fibrosis at diagnosis was not correlated with longer RFS (3.4 months vs 8.2 months, p = .438) or OS (5.6 months vs 10.8 months, p = .600). In Philadelphia negative patients, RFS (12.4 months vs 10.5 months, p = .015) and OS (28.3 months vs 17.2 months, p = .052) were longer in those with significant BM fibrosis at diagnosis.

Twenty (45%) patients achieved a significant reduction in BM fibrosis post-induction treatment measured by volume of reticulin/collagen fibers, this was correlated to less induction failure (2 patients vs 11 patients, p = .018), longer RFS (12.1 months vs 10.5 months, p = .037), and longer OS (28.3 months vs 13.1 months, p = .028). When standard and high-risk patients were individually analyzed, the impact of BM fibrosis reduction post-induction treatment was lost for high-risk patients (RFS: 8.7 months vs 8.2 months, p = .336. OS: 14.1 months vs 12.1 months, p = .600) (Fig. 3). The significance was maintained in standard-risk patients (RFS: 19.1 months vs 10.6 months, p = .022. OS: 54.4 months vs 16.4 months, p = .033) (Fig. 4).

Fig. 3
figure 3

Survival in high-risk patients a RFS: 8.8 months for patients with ≥ 50% reduction vs 8.2 months for those without, p = .336. b OS: 14.1 months for patients with ≥ 50% reduction vs 12.1 months for those without, p = .600

Fig. 4
figure 4

Survival in standard-risk patients a RFS: 19.1 months for patients with ≥50% reduction vs 10.6 months for those without, p = .022. b OS: 54.4 months for patients with ≥50% reduction vs 16.4 months for those without, p = .033

When Philadelphia positive patients were analyzed, a significant reduction in BM fibrosis after induction treatment was not correlated with longer RFS (8.4 months vs 8.2 months, p = .702) or OS (10.8 months vs 5.6 months, p = .935). In Philadelphia negative patients, the reduction in BM fibrosis was positively correlated with RFS (19.1 months vs 10.5 months, p = .003) and OS (39.3 months vs 16.4 months, p = .023).

We did survival analysis regarding the amount of BM fibrosis at diagnosis combined with the depth BM fibrosis reduction post-induction treatment. Patients were classified into three groups: without significant fibrosis at diagnosis regardless of fibrosis reduction post-induction treatment (group 1), with significant fibrosis at diagnosis and a reduction of volume of reticulin/collagen fibers post-induction treatment of < 50% (group 2), and with significant fibrosis at diagnosis and reduction of volume of reticulin/collagen fibers post-induction treatment of ≥ 50% (group 3). Patients in group 3 had a significantly better RFS (18 months) vs patients in group 1 (9 months) and group 2 (7 months); p = .024. OS was also better in patients in group 3 (39 months) vs groups 1 and 2 (12 months); p = .01 (Fig. 5). The 2-year RFS for patients in group 3 was 37% vs 12% for group 2 and 22% for group 1, and 2-year OS for patients in group 3 was 60% vs 17% for group 2 and 9% for group 1.

Fig. 5
figure 5

Survival in patients classified in three group categories for fibrosis reduction. a RFS 18.4 months—group 3 vs 8.8 months—group 1 and 6.9 months—group 2, p = .024. b OS 39 months—group 3 vs 12 months for groups 1 and 2, p = .01

Treatment choice between HyperCVAD or an institutional protocol or the use of imatinib did not impact on BM fibrosis reduction post-induction.

The differences between diagnosis and post-induction BM fibrosis are shown in Table 2. Median volume of reticulin/collagen fibers was 15.3% (range, 0.32%–44.6%) at diagnosis and 9.5% (range, 0.16%–38.6%) post-induction. After induction, none of the patients had grade 3 or 4 fibrosis, 19 (43%) biopsies had grade 1 fibrosis and 22 (50%) biopsies had grade 2 fibrosis.

Table 2 Bone marrow fibrosis described by volume of reticulin/collagen fibers and modified Bauermeister scale at diagnosis and post-induction

When analyzed by a median change in volume of reticulin/collagen fibers, we found that patients with normal karyotype had an incremental increase of 19.4% in BM fibrosis post-induction instead of the reduction seen in patients with other karyotypes.

Discussion

Bone marrow vascular and endosteal niches provide support for normal hematopoietic stem cells and control proliferation, differentiation, migration, and death [26, 27]. Leukemia cells (LCs) share the same interaction mechanisms as normal hematopoietic stem cells with marrow microenvironment, but somehow overcome apoptosis and replace normal cells [25, 28]. Previous reports on ALL confirm that at diagnosis, there is increased BM fibrosis and abnormal angiogenesis.

LCs interact through VLA-4 with VCAM-1 and fibronectin on stromal cells which then secrete TGFβ1 and BMP-6, cytokines associated with fibrosis. They also interact through CXCL12 with its receptor CXCR4 producing an elevation of IL-8 that promotes angiogenesis. This interaction with stromal cells inhibits LC apoptosis through the activation of Bcl-2 and Bcl-xL. [12, 13, 29, 30] LCs compete with normal hematopoietic stem cells for space in the marrow niche by secreting stem cell factor which inhibits normal cell interaction [30]. At diagnosis, serum concentration cytokines associated with fibrosis such as TNFα, FGFβ, HGF, and MMP-9 are high in patients with ALL. [31, 32] VEGF concentration is low at diagnosis and returns to normal after treatment suggesting that it may be consumed because of increased angiogenesis [33]. All these pathways ensure LC survival and proliferation.

Since Hann et al. in 1978 described the presence of BM fibrosis in 57% of children with ALL, every report in acute leukemia confirms that the stromal microenvironment is part of the pathophysiology of the disease [7, 16,17,18].

Wallis et al. described BM fibrosis in 63 children with ALL. [14] They did not find any association between the presence of BM fibrosis at diagnosis and karyotype, hemoglobin or platelet count. Patients with fibrosis had lower leukocyte counts at diagnosis and, after induction treatment, all but two BMs returned to normal.

Nath et al. also found an inverse correlation between BM fibrosis and leukocyte and blast counts, patients with significant fibrosis had lower leukocyte and blast counts in peripheral blood at diagnosis [30]. They described higher cellularity in patients with significant fibrosis and suggest this was because of trapped blasts in the fibrotic BM. They proposed that LCs create an environment that protects them from apoptosis, but this might be overcome by intensive chemotherapy.

In a letter to the editor of Leukemia Journal in 2005, Nóren-Nyström et al. reported for the first time the negative prognostic impact of vascular density and BM fibrosis evaluated by reticulin fiber density (RFD) in 81 children with ALL. [16] They found that patients with high-risk ALL had higher BM fibrosis at diagnosis and those patients relapsed more. In 2008, they published a complete report on BM fibrosis in 166 children [17]. Patients with B cell ALL had higher RFD compared with T cell ALL patients. They also found a negative correlation between RFD and leukocyte count. They did not find any other baseline characteristic (sex, risk, central nervous system disease, or karyotype) to be associated with higher RFD. In this second report, they only found a prognostic impact of RFD at diagnosis in low-risk patients, those with higher RFD had shorter RFS. Patients that achieved more than the median RFD reduction (> 8.4%) had better outcomes but it was only significant in those with low RFD at diagnosis.

In 2010, Bharos A et al. published contradicting results. They did not find any association between the presence of BM fibrosis at diagnosis and RFS or OS [15]. They described BM fibrosis with the modified Bauermeister scale and using RFD with good correlation between both methods (p < .001). They found that 2.5% of the patients had grade 0 fibrosis, 33.3% had grade 1, 37% had grade 2, 19.7% had grade 3, and 7.5% had grade 4 fibrosis. They evidenced a negative correlation between RFD and leukocyte and blast counts in peripheral blood at diagnosis as documented by previous authors.

In our study, 66% of the patients had significant fibrosis at diagnosis. We did not find any association between the presence or absence of significant fibrosis at diagnosis and any of the baseline characteristics such as sex, age, ECOG performance status, karyotype, immunophenotype, leukemia risk, induction treatment given, hemoglobin, or platelet count. Like Wallis et al. and Nath et al., we found that patients with significant fibrosis had lower leukocyte counts at diagnosis (7.4 × 10 [3]/μL vs 20.7 × 10 [3]/μL, p = .125) and lower blast counts in peripheral blood (35% vs 66%, p = .063), but these did not reach statistical significance. We did not find any association between the presence of fibrosis at diagnosis and BM cellularity.

The presence of significant BM fibrosis at diagnosis was associated with longer RFS, but the significance was lost when patients were classified according to risk. Philadelphia negative patients with significant fibrosis at diagnosis had longer RFS. Most of the previous studies excluded these patients.

Patients who achieved a significant reduction in fibrosis post-induction treatment measured by the volume of reticulin/collagen fibers had less induction failure, longer RFS and OS in our study. The significance was only maintained in standard-risk and Philadelphia negative patients as described by the second report by Nóren-Nyström.

When classified by groups regarding the amount of BM fibrosis at diagnosis combined with the depth of BM fibrosis reduction post-induction treatment, we found that patients with significant fibrosis at diagnosis and reduction of volume of reticulin/collagen fibers post-induction treatment of ≥ 50% (group 3) had longer RFS and OS compared with those who did not have fibrosis at diagnosis regardless of its reduction (group 1) or patients who had fibrosis at diagnosis and did not achieve such a reduction (group 2).

In our study, 2.3% of patients had grade 0 fibrosis, 31.8% had grade 1, 31.8% had grade 2, 31.8% had grade 3, and 1% had grade 4 fibrosis. After induction, fibrosis by volume of reticulin/collagen fibers reduced from 15.3% at diagnosis to 9.48% post-induction treatment. Thomas X et al. [26] reported more patients with grade 0 fibrosis (36%) at diagnosis.

Patients with normal karyotype had an incremental increase in BM fibrosis post-induction. These might point to the fact that patients with normal karyotype might have other high-risk molecular abnormalities that we are not able to detect such as Philadelphia-like mutations and that these abnormalities could cause an increment in fibrosis [34, 35].

Our hypothesis is that the leukemia-stimulated stromal environment keeps the malignant cells in the BM initially and protects them from apoptosis through the interactions previously described. In vitro, these interactions make the LCs survive longer after exposition to chemotherapy [7, 13, 14]. This hypothesis would explain the lower WBC and peripheral blood blast counts in patients with significant BM fibrosis at diagnosis. Our findings suggest that in vivo, successful and intensive chemotherapy reduces fibrosis and overcomes the protection that the microenvironment brought to LCs in standard-risk and Philadelphia negative patients, but not in high-risk patients, including Philadelphia positive patients. Our results should be taken with care when evaluating the latter.

Although the Thiele et al. grading scale [36] is recommended by the WHO for the evaluation of BM fibrosis, this was reviewed mainly in the context of myeloproliferative neoplasms. Bauermeister scale modified by Bain et al. in 2001 has been previously used for fibrosis description in patients with ALL and was found to have good correlation with fibrosis description by volume of reticulin/collagen fibers [10, 15]. Both the modified Bauermeister and the Thiele et al. scale take into account reticulin and collagen fibers [25], and we consider they could both be used for future description of fibrosis in ALL patients.

As a limitation of this study, we did not measure microvessel density (MVD) systematically using immunohistochemistry stain for von-Willebrand factor, CD34 or CD31. This has been described by some authors as a prognostic marker for survival in children with ALL and it is also an important way to differentiate BM fibrosis per se from perivascular fibrosis. Norén-Nyström et al. and Perez-Atayde et al. reported that MVD was higher in T cell ALL, and when reticulin/collagen fiber description was combined with MVD, the survival curves dissected better, those with low reticulin/collagen fibrosis and low MVD had longer disease-free survival. [8, 18] Others have not found any correlation between higher MVD at diagnosis and survival. [9, 15] In all of these studies, MVD was increased at diagnosis at the expense of small microvessels, after induction treatment, MVD reduced and the vessels became larger.

Other limitations include the retrospective nature of this study, the lack of minimal residual disease measurement and the number of patients included. At our institution, minimal residual disease measurement was started in 2017.

The WHO recommends that BM biopsies be at least 1.5 cm in length so that 10 BM spaces can be evaluated. In the interest of increasing patient sample, we used the inclusion criteria described by Norén-Nyström et al. were BM biopsies were accepted if they had more than three evaluable BM spaces.

We successfully reproduced the method of fibrosis estimation using the volume of reticulin/collagen fibers. Since this method is more time consuming and it has good correlation with the modified Bauermeister scale, we encourage the use of the latter or other widely accepted BM fibrosis description scales like the one described by Thiele et al. [36]

Conclusions

This study demonstrates that there is a role for BM fibrosis reduction as a prognostic factor in adult patients with ALL. That although having significant fibrosis at diagnosis does not clearly impact on outcomes, its reduction post-induction treatment has an impact on the survival of standard-risk patients. When we analyzed the amount of fibrosis at diagnosis combined with the depth of the reduction post-treatment, we found that the importance of fibrosis reduction lies on its presence at diagnosis and the capacity to reduce it with induction chemotherapy. Patients without fibrosis at diagnosis regardless of its reduction and patients who did not achieve a deep reduction of fibrosis post-treatment had worse survival. These results must be correlated with other prognostic factors like minimal residual disease and Philadelphia-like mutations.