Introduction

TAFRO syndrome is a rare inflammatory disorder of unknown etiology characterized by thrombocytopenia, anasarca (pleural effusion, ascites, and edema), fever, reticulin myelofibrosis, renal dysfunction, and organomegaly (hepatosplenomegaly and/or lymphadenopathy) [1, 2]. A large proportion of patients with this syndrome present with generalized lymphadenopathy, and histology of their lymph nodes exhibits features consistent with idiopathic multicentric Castleman disease (iMCD) [3]. Therefore, some researchers assume that this syndrome is a subtype of iMCD (iMCD-TAFRO) [4]. However, the clinical features of this syndrome differ considerably from those of typical iMCD-not otherwise specified (iMCD-NOS). For example, TAFRO syndrome often develops acutely or subacutely, progresses rapidly, and presents with thrombocytopenia without hyper-γ-globulinemia, while iMCD-NOS usually develops gradually, and presents with thrombocytosis and hyper-γ-globulinemia [5, 6]. Therefore, others consider this syndrome a distinct clinical entity [6]; thus, the association between TAFRO syndrome and iMCD is still controversial, especially due to the absence of specific biomarkers for these conditions.

The treatment of TAFRO syndrome encompasses the use of corticosteroids, including methylprednisolone pulse therapy, and other immunosuppressive agents such as tocilizumab, siltuximab, cyclosporine, tacrolimus, rituximab, and rapamycin [7,8,9,10,11,12,13,14,15]. Despite such immunosuppressive treatments, a third of patients with TAFRO syndrome die within 2 years [6]. International consensus treatment guidelines recommend interleukin 6 (IL-6) blockade (siltuximab or tocilizumab) with or without high-dose steroids as the first-line treatment for iMCD-TAFRO [16], but this recommendation was primarily based on a clinical trial that was conducted for entire iMCD patients; the number of enrolled iMCD-TAFRO patients is undescribed [17]. To find an optimal treatment strategy for TAFRO syndrome, we recently analyzed the efficacies of immunosuppressive agents by utilizing a patient registry in Japan [18]. Among 81 patients with TAFRO syndrome, 68 patients received corticosteroids as the first-line treatment, and 47 patients received additional immunosuppressive agents as the second-line treatments. We divided these patients into groups according to the second-line treatments (rituximab, tocilizumab, and cyclosporine A groups) and compared their efficacies based on the primary endpoint set as the time-to-next-treatment or death (TTNT). In this previous study, we found that the TTNT of patients who received rituximab was significantly longer than that of patients who received tocilizumab, although the inter-group differences in overall survival (OS) were not significant [18]. In this cohort, 15 patients who received corticosteroids alone survived beyond 1 year without receiving other immunosuppressive agents. Furthermore, 21 deaths were reported in this cohort, and infections were the most common causes of death. Although serious infections were not common in patients with iMCD treated with siltuximab or tocilizumab [17, 19], intensive immunosuppressive therapies theoretically increase risks of serious infections. Therefore, to avoid excessive immunosuppression, risk stratification is crucial to improve the prognosis of patients with this syndrome. In this study, we investigated the prognostic factors for TAFRO syndrome at initial presentation using a retrospective patient database in Japan.

Methods

Patient cohort

Since Oct 2013, patients with suspected multicentric Castleman disease (MCD) and TAFRO syndrome have been registered in the Multicenter Collaborative Retrospective Study for Establishing the Concept of TAFRO Syndrome registry (UMIN000011809) from 89 collaborating centers in Japan (Supplementary Table S1). The database includes clinical data at the onset or diagnosis of the condition, histopathological reports, treatments, and outcomes of these patients. We extracted patients with TAFRO syndrome from this database by applying our diagnostic criteria [20]. Briefly, a diagnosis of TAFRO syndrome required all three major categories (anasarca, thrombocytopenia, and systemic inflammation) and at least two of four minor categories (Castleman disease-like features on lymph node biopsy, reticulin myelofibrosis and/or an increased number of megakaryocytes, organomegaly, and progressive renal insufficiency). Patients with human herpesvirus (HHV)-8-associated MCD, POEMS syndrome, common collagen diseases such as systemic lupus erythematosus and Sjögren’s syndrome, infectious diseases, and malignancies, including malignant lymphomas were carefully excluded. This process was performed by reviewing the provided data, including autoantibody tests, a minimum of whole body computed tomography scan, and bone marrow biopsy. This study was performed in accordance with the Declaration of Helsinki, and its protocol was approved by the Ethics Committee of Kanazawa Medical University (E183 and I540) as well as the ethics committee of each participating institution.

Statistical analyses

We identified the following clinical and laboratory parameters as potential prognostic factors: age, sex, presence or absence of iMCD, white blood cell counts (WBC), hemoglobin (Hb), platelet counts (Plt), C-reactive protein (CRP), IL-6, soluble IL-2 receptor (sIL2R), blood urea nitrogen (BUN), creatinine (Cre), lactate dehydrogenase (LDH), aspartate aminotransferase (AST), alanine aminotransferase (ALT), alkaline phosphatase (ALP), γ-glutamyl transpeptidase (γ-GTP), total bilirubin (T-bil), albumin, immunoglobulin G (IgG), fibrinogen, fibrin/fibrinogen degradation products (FDP), and d-dimer. To determine the cut-off values of these laboratory parameters, we performed receiver operating characteristic (ROC) analysis by setting the dependent variable (outcome) as 1-year survival after initial presentation. Preliminary cut-off points of these laboratory parameters were determined based on the maximum of the Youden index in the ROC curves [21]. We divided our cohort into groups based on these preliminary cut-off values and the median values (round figures were used where appropriate). Final cut-off values were determined based on clinical meaningfulness. The Kaplan–Meier method was used for survival analysis, and log-rank tests were used for comparison between groups (univariable analysis). Multivariable analysis of the potential prognostic factors for TAFRO syndrome was performed using the logistic regression model by setting the outcome as 1-year survival. To confirm independent risk factors for OS, multivariable Cox-proportional hazards model was also performed. Cases with missing data were excluded from the analysis for the related variable. Results with P values less than 0.05 were considered statistically significant. All analyses were performed using EZR, version 1.54 [22].

Results

Patient characteristics

In total, 243 patients were registered in the database from Oct 2013 to Jun 2020. Among them, 95 patients were diagnosed with TAFRO syndrome, and 83 patients had follow-up data. We restricted this study to these 83 patients. Forty-seven patients were male, and the median age was 50 years (range, 20 − 85 years). According to the disease severity classification system for TAFRO syndrome wherein scores were calculated by assessing the severities of anasarca, thrombocytopenia, systemic inflammation, and renal damage (Supplementary Table S2) [20], 8, 24, 34, and 12 patients were classified as grades 2, 3, 4, and 5, respectively, at the initial presentation, with no patient classified as grade 1 (Table 1). Lymph node biopsy was performed in 66 patients and histology consistent with iMCD was confirmed in 63 patients; these patients fulfilled the diagnostic criteria for HHV-8-negative iMCD (histology of the other 3 patients was nonspecific) [3] (Table 1). In the other 17 patients, lymph node biopsy was not performed due to either the absence of lymphadenopathy or severe hemorrhagic tendency. The median follow-up period for survivors was 53 months. During the follow-up periods, all patients received corticosteroids, including methylprednisolone pulse therapies (n = 43). Concerning immunosuppressive treatment, 33 patients received tocilizumab, an anti-IL-6 receptor antibody; 32 patients received cyclosporine; 19 patients received rituximab; 9 patients received cyclophosphamide or combination chemotherapies containing cyclophosphamide; and 1 patient received tacrolimus (some patients sequentially received multiple immunosuppressive therapies). In contrast, non-immunosuppressive treatments encompassed hemodialysis performed in 26 patients, plasma exchange in 11 patients, thrombopoietin receptor agonists in 10 patients, and intravenous immunoglobulin therapy in 2 patients. The 1- and 2-year OS ratios of the entire cohort were 72.0% and 67.8%, respectively (Fig. 1, panel a).

Table 1 Univariable analysis for overall survival
Fig. 1
figure 1

Overall survival after the initial presentation in patients with TAFRO syndrome according to their clinical parameters. a The survival curve of the entire cohort. bg Comparison of survival curves between groups according to their b age, c hemoglobin (Hb), d aspartate aminotransferase (AST), e fibrinogen, f d-dimer, and g, h disease severity determined by the method shown in Supplementary Table S2. Survival of the patients was compared using the Kaplan–Meier method with log-rank tests

Univariable analysis

Univariable analysis was performed for OS with each clinical parameter. There were no significant differences in OS between men and women and between those with iMCD and those without proven iMCD (Table 1). To determine an optimal cut-off point of age, we first divided the patients into 5 age groups: < 40, 40–49, 50–59, 60–69, and ≥ 70 years and compared their OS using Kaplan–Meier analysis. OS curves of these groups were significantly different (P = 2.1e-06), and those of age groups < 40, 40–49, and 50–59 years were superior to those of age groups 60–69 and ≥ 70 years (Supplementary Fig. S1). Therefore, we set the cut-off age as 60 years. The 1-year OS of the < 60 years-old group (n = 57) was 89%, while that of the ≥ 60 years-old group (n = 26) was 33% (P = 3.1e-08, Fig. 1, panel b).

Regarding the laboratory parameters, we explored the optimal cut-off points to use to divide the groups using ROC analysis, as described in the method section (Supplementary Table S3 and Supplementary Fig. S2). Round figures around these values or median values were set as preliminary cut-off points. We determined the final cut-off points considering the clinical meaningfulness (Supplementary Table S4). For example, for AST, ALT, LDH, T-bil, and IgG, both the preliminary cut-off value calculated by the ROC analysis and the median value were within reference ranges; hence, round figures just on or above the upper limits of these reference ranges were used as the final cut-off points. Patients were divided into two groups according to these points, and survival was compared using the Kaplan–Meier method. Among laboratory parameters, WBC, CRP, sIL2R, ALT, LDH, ALP, and γ-GTP did not significantly correlate with OS (Table 1). In contrast, Hb < 8 g/dL, platelet counts < 30 × 103/μL, BUN ≥ 40 mg/dL, creatinine ≥ 2 mg/dL, AST ≥ 35 U/L, albumin < 2.5 g/dL, IgG ≥ 1700 mg/dL, fibrinogen < 450 mg/dL, FDP ≥ 10 μg/dL, D-dimer ≥ 18 μg/dL, and IL-6 ≥ 21 pg/mL were significantly associated with inferior OS in univariable analysis (Table 1; the results of the representative variables are also shown in Fig. 1, panels c − f). We tested whether the disease severity classification system could predict patient’s prognosis using Kaplan–Meier analysis with the log-rank tests. The OS curves for patients with grades 2, 3, and 4 were not distinct, but the OS curve of the patients with grade 5 was significantly inferior to that of the other grades when combined (Fig. 1, panels g and h).

Multivariable analyses

Next, we explored independent prognostic factors using the multivariable logistic regression model by setting the outcome as 1-year survival. Factors that showed significant association with survival in univariable analysis were candidates to incorporate into this analysis. However, considering the low number of events (n = 26), to avoid overfitting, only a couple of variables could be incorporated. We considered that five variables with the most significant inter-group differences in the univariable analysis shown in Table 1 (P < 0.01, i.e., age, Hb, AST, fibrinogen, and d-dimer) were candidates. Among them, we decided to select three variables by considering their clinical relevance. d-dimer is a degenerative product of fibrinogen, and both are related to coagulation status. Because d-dimer showed a smaller P value than fibrinogen, we chose d-dimer as a representative candidate of coagulation markers and discarded fibrinogen in the multivariable analysis. All of four parameters that could be associated with coagulopathy (PLT, fibrinogen, FDP, and d-dimer) showed significant association with OS in the univariable analysis (Table 1), indicative of the importance of coagulation status on patient prognosis. Next, we questioned whether AST is a reasonable variable to incorporate into the multivariable analysis, because both its preliminary cut-off value calculated by ROC analysis (27 U/L) and median value (23.5 U/L) were within its reference range, AST > 35 U/L can be nonspecifically observed in apparently healthy individuals, its area under the curve in ROC analysis was relatively small (0.6254, Supplementary Table S3 and Supplementary Fig. S2), and, moreover, none of the other markers of liver (and/or tissue) damage, i.e., LDH, ALT, γ-GTP, ALP, and T-bil, were significantly associated with survival in univariable analysis (Table 1). Therefore, we decided not to incorporate AST into the multivariable analysis, and chose the remaining 3 variables, i.e., age, d-dimer, and Hb. Our variable selection was supported by the fact that, when we preliminarily included all these 5 candidate variables into the multivariable logistic regression model, stepwise variable selection based on Bayesian information criterion also eliminated AST and fibrinogen. Accordingly, age, Hb, and d-dimer were identified as significant prognostic factors associated with 1-year survival in the final model (Table 2). When we included these 3 factors into the Cox-proportional hazards model for OS, only age ≥ 60 years and D-dimer ≥ 18 μg/mL remained significant prognostic factors (Table 3).

Table 2 Multivariable analysis for 1-year survival
Table 3 Multivariable analysis for overall survival

Development of a prognostic scoring system for TAFRO syndrome

Based on the multivariable analysis shown in Table 3, a simple clinical scoring system was developed by assigning one point each to age ≥ 60 years and d-dimer ≥ 18 μg/dL. The values of d-dimer at initial presentation were available in only 64 patients; we classified these patients into 3 groups based on the total scores of 0, 1, and 2 for low, intermediate and high-risk groups, respectively, and analyzed their OS using the Kaplan–Meier method. The OS curves for the low (n = 35), intermediate (n = 22), and high-risk (n = 7) groups were clearly distinct, with the 1-year survival ratios of 97.1, 54.5, and 21.4%, respectively (P = 1.0 × 10–7; Fig. 2).

Fig. 2
figure 2

Overall survival after the initial presentation in patients with TAFRO syndrome according to risk groups stratified by the TS-PSS system. Survival of the patients was compared using the Kaplan–Meier method with log-rank tests

Discussion

In this study, we investigated potential prognostic factors for patients with TAFRO syndrome using a Japanese patient database and found a number of factors that were associated with patient survival in univariable analysis. Among them, age ≥ 60 years, Hb < 8 g/dL, AST ≥ 35 U/L, fibrinogen < 450 mg/dL, and d-dimer ≥ 18 μg/dL were significantly associated with poor OS at P values less than 0.01. Among these variables, ages, Hb, and d-dimer were chosen as clinically relevant variables to incorporate into multivariable analyses. All of these 3 were significantly associated with 1-year survival in the multivariable logistic regression model. Furthermore, age ≥ 60 years and d-dimer ≥ 18 μg/dL remained as statistically significant predictors of poor OS in the Cox-proportional hazards model. Applying the latter results, we developed a simple prognostic scoring system for TAFRO syndrome (TS-PSS). Using this system, patients in our cohort were stratified into low, intermediate, and high-risk groups. Most patients were classified into the low-risk group with a favorable prognosis. In contrast, the prognosis of those in the high-risk group was dismal, hence the need to develop novel therapeutic strategies for them. The prognosis of patients in the intermediate-risk group was variable; it may be changeable depending on therapeutic interventions.

Of the prognostic factors identified in this study, age ≥ 60 years was the strongest predictor of poor OS; this observation was reasonable because it probably reflects the reduction in reserve capacities of various organs and vascular aging. Notably, only one-third of patients aged ≥ 60 years survived beyond 12 months in our cohort.

Elevation of plasma d-dimer levels was another predictor for the survival of patients with TAFRO syndrome in multivariable analysis. We previously reported significantly higher plasma FDP and d-dimer levels in patients with TAFRO syndrome than in those with iMCD-NOS [6]. d-dimer is a component of FDP, and its elevation in plasma is usually associated with coagulopathies such as disseminated intravascular coagulation (DIC) and other thrombotic or microthrombotic events. Because simultaneous measurements of d-dimer and FDP were not approved under the Japanese health insurance system, only one of these values was available in some patients. Despite the missing data, d-dimer was identified as statistically significant prognostic factors for poor OS, indicating the significant involvement of coagulopathy in TAFRO syndrome. Significantly poor OS in patients with relatively low fibrinogen levels (< 450 μg/dL) notwithstanding systemic inflammation, might also reflect the activation of the coagulation system in which fibrinogen was consumed. Critically ill patients with TAFRO syndrome may have coagulopathy with microangiopathy, similar to those with severe COVID-19 infection [23]. Indeed, thrombotic microangiopathy was demonstrated in renal biopsy specimens in patients with TAFRO syndrome [24,25,26,27,28].

Anemia (Hb < 8 g/dL) was also selected as a significant risk factor for 1-year mortality in the logistic regression model, though it failed to show statistical significance in Cox-proportional hazards model for OS. Anemia observed in TAFRO syndrome may be multifactorial, but it is probably associated with inflammation. The major mechanism of anemia of inflammation is based on the restriction of available iron through overproduction of hepcidin, the master regulator of iron homeostasis, induced by IL-6 [29]. Relatively decreased production of erythropoietin caused by renal damage, and shortening of erythrocyte life-span, may also be involved [29].

Elevation of AST ≥ 35 U/L was also found to be a predictor of poor OS with a small P value in the univariable analysis. This result is rather incomprehensible because the cut-off value for AST (35 U/L) was just above the reference range, and none of the other parameters for hepatic damage were significantly associated with OS in the univariable analysis. Based on clinical considerations, we assumed that the small P value observed for AST in the univariable analysis might be incidental, and we eliminated AST from the multivariable analyses. Notably, the elevation of AST is less specific than that of ALT for liver diseases; as it can be observed in cardiac, skeletal muscle, kidney, erythrocytes, and other tissue damages [30]. Given the nature of the registry, the elevated AST levels in patients with poor OS might have reflected unreported comorbidities. Thus, the clinical significance of AST ≥ 35 U/L in this syndrome is still uncertain.

We previously proposed the disease severity classification system wherein disease severity was classified based on the severity of anasarca, fever/CRP, thrombocytopenia, and renal damage (Supplementary Table S2) [20]. Anasarca is closely related to hypoalbuminemia, and renal damage is associated with serum BUN and Cre levels. Therefore, from its definition, the disease severity score should be correlated with CRP and BUN/Cre, and inversely correlated with albumin and PLT. We did not incorporate disease severity score into the multivariable analyses because it is closely related to these laboratory parameters from its definition. However, among these parameters, albumin, PLT, BUN, and Cre were significantly associated with OS in univariable analysis with P values between 0.05 and 0.01 (Table 1). In fact, the OS of patients with disease severity grade 5 was significantly inferior to that of those with grades 4 or less, when combined, as shown in new Fig. 1, panel h. Therefore, the disease severity score at presentation may still be useful to identify those with very poor prognosis.

Our study has several limitations. First, the patients were not prospectively registered but registered in many institutes, which might have caused selection bias and possible inaccuracy in data. Second, because TAFRO syndrome usually progresses rapidly and the patient’s condition deteriorates quickly, their laboratory data change dynamically. Though we used patient’s data at their initial presentation, the timings of data collection in their clinical courses might have been diverse, and these might have affected the results of this study. Third, the numbers of enrolled patients and events were small; therefore, we could include only a few parameters in the multivariable analyses. In addition, some of the clinical parameters such as FDP and d-dimer values were missing at presentation in a considerable number of cases, which further reduced the statistical power. Fourth, although the diagnosis of TAFRO syndrome was centrally reviewed, other diseases, such as infectious, autoimmune, and malignant diseases, could be misdiagnosed for this syndrome due to the lack of specific diagnostic markers. Fifth, treatment strategies were not standardized; instead, they were chosen by attending physicians in each institute. However, because of the rarity of TAFRO syndrome [31], conducting a large-scale prospective study on this syndrome is unrealistic at this moment, unfortunately.

In conclusion, although there were several limitations, we revealed that older age and raised d-dimer levels at initial presentation were predictors of poor OS based on limited resources. Utilizing these results, we have, for the first time, developed a prognostic scoring system, TS-PSS, which could stratify patients with TAFRO syndrome into 3 risk groups. This system may allow physicians to assess prognosis earlier, when they first suspect this intractable disorder. Prognostic prediction is quite important in identifying those who require intensive treatments and closer monitoring. With independent patient cohorts, this system should be verified in future studies. Further, the development of novel therapeutic strategies for high-risk patients requires urgent attention.