Introduction

Gastroesophageal variceal hemorrhage is one of the most serious decompensation events that directly affects patient survival rate [1]. Within the last few years, international guidelines [2] arrived at the consensus that “all cirrhotic patients should be screened for varices at diagnosis.” However, the reported prevalence of high-risk varices (HRV) is as low as 16% in patients with well-compensated cirrhosis [3]. Considering the invasiveness and low acceptance of esophagogastroduodenoscopy (EGD) in patients, it is necessary to develop non-invasive methods to evaluate the presence of varices. The 2015 Baveno VI Conference consensus recommended that platelet count (PLT) and liver stiffness (LS) be assessed to identify patients with HRV (Baveno VI criteria, B6C) [4]. Patients with compensated liver cirrhosis (CLC) who meet the criteria (LS < 20 kPa and PLT > 150 × 109/L) avoid undergoing EGD assessment as risk of developing HRV is considered acceptably low (risk < 5%). Although B6C has been shown to be effective in preventing 16–33% of EGDs [5,6,7], it remains perceived as conservative.

The expanded Baveno VI criteria (EB6C) (PLT < 110 × 109/L and LS > 25 kPa) helped to avoid up to 40% of EGDs [8]. However, among the validations of B6C and EB6C, inconsistent conclusions were drawn owing to the etiological differences of cohorts [9,10,11] and a limited Asian cohort with hepatitis B. Therefore, verification and optimization of B6C are needed in large-scale cohorts. Additionally, a substitution for LS is required as transient elastography (TE) is not widely accessible in all medical institutions. Meanwhile, a growing number of early cirrhosis diagnoses have been confirmed by the progressive development of non-invasive methods. In light of these reasons, it is necessary to optimize the B6C to rule-out more patients from undergoing EGD.

In this study, we aimed to identify the risk factors that contribute to HRV and to optimize B6C (modified Baveno VI criteria, MB6C). Consequently, “easy-to-use” criteria (alternative Baveno VI criteria, AB6C) with more accuracy were developed using other laboratory variables as alternatives to LS. Furthermore, we directly compared the performance of MB6C and AB6C with B6C and EB6C.

Patients and methods

Patient selection

Demographic, laboratory, and clinical data were collected for all patients. Clinical data of inpatients with CLC were analyzed between January 2016 and December 2019 (clinical trial registration number: ChiCTR-DDD-17013845).

The inclusion criteria were: (i) 18–80 years of age; (ii) patients with an irregular and nodular liver, along with impaired liver synthetic function, who were diagnosed with cirrhosis using imaging (ultrasonography, computed tomography, or magnetic resonance imaging) [12]; (iii) the absence of previous decompensated complications of cirrhosis including ascites, hepatic encephalopathy, or gastroesophageal variceal hemorrhage; and (iv) patients that had their data record updated (EGD, imaging, and laboratory examinations) within 3 months of this study.

Exclusion criteria were: (i) decompensated cirrhosis; (ii) primary prevention using non-selective β-blockers or endoscopic ligation; (iii) portal vein thrombosis; (iv) non-cirrhotic portal hypertension; (v) splenectomy or congenital absence of the spleen; (vi) transjugular intrahepatic portosystemic shunt procedure performed on the patient; (vii) hepatocellular carcinoma; (viii) liver transplantation; and (ix) incomplete clinical data.

Patient evaluation

EGD

EGD was performed by two experienced operators using the Olympus CV-260SL processor (Olympus medical imaging, Osaka, Japan). The varices were described according to the expert guidelines [5, 13], and HRV was defined as gastroesophageal varices with a diameter ≥ 5 mm or were red-sign positive, which required non-selective beta-blockers (NSBB), ligation, or sclerotherapy to prevent bleeding.

Transient elastography

The procedure of LS measurement was performed using Fibroscan 520 (Echosens, Paris, France) by two senior trained professionals. The patients were in a fasting state on the morning of assessment and were placed in a supine position with the right arm in maximum abduction to make the right lobe of the liver accessible during operation. Ten measurements were obtained with a success rate of 60% and an interquartile range of less than 30%.

Spleen size

Patients routinely underwent abdominal ultrasound examinations to measure spleen size. The spleen diameter and thickness (measured across from the spleen hilum) were reported and recorded to calculate spleen area (SA = diameter × thickness) according to the published guidelines [14]. The patients fasted overnight. Ultrasound was then performed by professional operators who were blinded from the clinical details of the patients.

Statistical analysis

Statistical analyses were performed using SPSS 20.0 package (IBM SPSS, Chicago, IL). Continuous data were represented by the median (interquartile range), and categorical variables by proportions. Chi-square tests or Mann–Whitney U tests were used for group comparisons of categorical and continuous variables. The entire cohort was divided into a training cohort (TC) (n = 583) and a validation cohort (VC) (n = 605) by the split-sample method. Variables found to be significant in univariate analyses (p < 0.05) were included in multivariable logistic regressions (forward selection approach for stepwise regressions) in the TC. The predicting performance of non-invasive indicators for HRV was assessed by receiver operating curve (ROC) analyses. Sensitivity (SE), specificity (SP), and negative predictive value (NPV) were calculated for each criterion under different age stratifications.

Results

General characteristics

A total of 1,188 participants were included in the study and were divided into 583 and 605 participants in TC and VC groups, respectively. The flow chart for participant inclusion/exclusion is provided in Fig. 1. The clinical characteristics of patients in the TC and VC are listed in Table 1. The median age of patients in the entire cohort was 52 (25th to 75th percentiles, 42–59), among whom 701 (59.0%) were male, 1068 (89.9%) were Child–Pugh A class, 576 (48.5%) had varices, and 146 (12.3%) had HRV. The most common cause of cirrhosis was hepatitis B viral infection, which was found in 415 of the 583 patients (71.2%) in the TC and 446 of the 605 participants (73.7%) in the VC. The VC had similar baseline characteristics to the TC. There was no statistical difference between the TC and VC. A comparison of patients with and without HRV in the entire cohort is provided in Supporting Table S1.

Fig. 1
figure 1

Flow chart of the study population

Table 1 General characteristics and comparison of TC and VC

Predictors of HRV in the TC

Univariate analyses were performed to detect variables associated with the presence of HRV. Table 2 shows that age, albumin, total bilirubin, international normalized ratio, PLT, LS, and SA were significantly different (all p < 0.05). ROC analyses showed that the area under the receiver operating characteristic (AUROC) of age was 0.605 (95% confidence interval [CI]: 0.564–0.645), and the best cut-off value was 52 years of age. To create an “easy-to-use” algorithm, 50 years of age was selected for risk stratification. Age was evaluated as dichotomous data in the multivariate analyses. The risk of HRV in patients over 50 years was 2.387-fold greater than those under 50 years of age (Fig. 2a).

Table 2 Predictors of HRV in TC by Univariate analysis and Multivariate analysis respectively

Multivariate analyses showed that age ≥ 50 years, LS, PLT, and SA were independently associated with HRV (Supporting Table S4). In the TC, the predicting performance of PLT, SA and LS for HRV was assessed by a receiver operating curve (ROC) analysis. The AUROC of PLT, SA and LS were 0.781 ([CI] 0.745–0.814), 0.731 ([CI] 0.694–0.767), and 0.735 ([CI] 0.697–0.770), respectively. The Delong test showed that there was no statistical difference between PLT, SA, and LS (Fig. 2b). Therefore, SA could replace LS to help establish a new criterion to predict HRV.

Modified B6C based on age stratification

Table 3 shows the performance of the exploratory data and the new age-based criteria by adjusting the cut-off value of LS and PLT in each age-related subgroup. In patients < 50 years of age, the criteria of PLT > 100 × 109/L and LS < 30 kPa maximized the number of potential EGDs that were avoided while keeping the risk of missing a HRV below the 5% threshold. For patients aged ≥ 50 years, the same result was obtained with a combination of PLT > 125 × 109/L and LS < 20 kPa. We proposed to name this new algorithm the modified B6C (MB6C).

Table 3 Performance of the exploratory data and MB6C and AB6C classification rules based on age in the TC
Fig. 2
figure 2

Age, PLT, SA and LS for predicting HRV. LRV low-risk varices, HRV high-risk varices, PLT platelet count (× 109/L), LS liver stiffness (kPa), SA spleen area. a Prevalence of varices and high-risk varices in age-related subgroups of the entire cohort. The risk of HRV in patients ≥ 50 years was 2.387 times higher than that in patients < 50 years. b Efficacy of PLT, LSM and spleen area in predicting HRV. There was no statistical difference between PLT, SA and LS. Thus, SA could replace LS for build new criteria for predicting HRV

Table 4 reports the performance of MB6C in the VC group. In patients < 50 years of age (n = 269), 187 (69.5%) met the MB6C, among whom 3 (1.6%) had HRV. In patients aged ≥ 50, 123 (36.6%) met the MB6C and 4 (3.3%) had HRV. Overall, MB6C helped avoid EGD in 310 (51.2%) patients, whereas 7 (2.3%) cases of HRV were missed.

Table 4 Performance of the MB6C and AB6C in the VC

Construction of the alternative B6C based on age stratification

SA was selected to replace LS for building the alternative B6C (AB6C). In patients < 50 years of age, the criteria of PLT > 100 × 109/L and SA < 55 cm2 maximized the number of potential EGDs avoided while keeping the risk of HRV missed below 5%. In patients aged ≥ 50, the AB6C used a combination of PLT > 125 × 109/L and SA < 44 cm2 (Table 3).

The performance of AB6C in the VC is shown in Table 4. A total of 174 (67.4%) cases met the AB6C for the subgroup age < 50 (n = 269), among them 2 (1.1%) had HRV. The AB6C for the subgroup aged ≥ 50 was met by 123 (36.6%) cases, and among them 6 (4.9%) had HRV. Overall, the AB6C avoided 297 (49.1%) EGDs with a total of 8 (2.7%) cases of HRV that were missed.

Validation and comparisons of B6C, EB6C, MB6C, and AB6C

In the entire cohort (n = 1188), a total of 377 (31.7%) patients met the B6C. Of these 377 patients, 245 (64.9%) had no varices, 126 (33.4%) were low-risk varices (LRV) (diameter ≤ 5 mm), and only 6 (1.6%) had HRV. Among the 651 (54.8%) patients that met the EB6C, 402 (61.8%) did not have varices, 225 (34.6%) had LRV, and 24 (3.7%) had HRV (Supporting Table S2).

Next, we evaluated the performance of these criteria across the entire cohort and age-related subgroups (Supporting Table S2 and Supporting Table S3). In patients < 50 years of age (n = 516), the B6C, EB6C, and MB6C safely helped avoid 36.8%, 59.1%, and 69.4% EGDs, respectively. In patients ≥ 50 years of age (n = 672), B6C safely avoided 27.8% of EGDs with a < 5% risk of missing HRV. Although the EB6C avoided more EGDs (51.5%), 6.4% of HRV were missed. The MB6C avoided more EGDs (39.9%) than B6C (27.8%) and kept the low risk of missing HRV (3.7%). Overall, among the entire cohort, MB6C helped avoid 52.7% of potential EGDs, with only 2.2% HRV that were missed.

AB6C also showed a good performance across the entire cohort. Using AB6C in patients aged < 50 (n = 516) and ≥ 50 (n = 672), 64.3% and 37.5% of the patients avoided EGD with 1.5% and 4.8% of HRV being missed, respectively. Overall, AB6C safely spared 49.2% EGDs while missing only 2.9% HRV (Fig. 3).

Fig. 3
figure 3

Comparison of the performance of B6C, EB6C, MB6C, and AB6C in ruling out of HRV in the entire cohort and age-related subgroups. B6C Baveno VI criteria, EB6C expanded Baveno VI criteria, MB6C modified Baveno VI criteria, AB6C alternative Baveno VI criteria, EGD esophagogastroduodenoscopy, HRV high-risk varices

Discussion

In this study, we proposed and validated a new screening criterion of HRV based on age stratification in patients with CLC. In similarity to Berger et al. [15], we found that the prevalence of HRV was 12.3% across the entire cohort. However, risk of HRV increased significantly with age (15.9% in patients ≥ 50 years of age versus 7.6% in patients < 50 years of age). Hepatitis B (HBV), followed by hepatitis C (HCV), were the main causes of liver cirrhosis in our cohort (86.7%), and their natural history was closely related to age. Thabut et al. [16] analyzed 891 patients with HBV and/or HCV in their prospective cohort from 35 centers in France, and found that age, an absence of viral suppression, and an unfavorable B6C status were independently associated with poorer overall survival.

It is reasonable and necessary to predict the prognosis of patients with cirrhosis by age stratification. Therefore, we hypothesized that age could be a risk stratification indicator for HRV in patients with CLC. We proposed and validated the established screening strategies based on age stratification. We first validated B6C and EB6C across the entire cohort and the age-stratified subgroups (Supporting Table S2). The number of EGDs avoided using the B6C was relatively low, and 57% of patients were misclassified. Using EB6C enhanced the number of spared EGDs among the entire cohort, but the risk of missing HRV increased to 6.4% in patients ≥ 50 years of age. Therefore, EB6C had less safety in older patients with greater risk of HRV.

Regardless of age, screening criteria for MB6C were milder than B6C. Consistent with our expectations, the screening criteria for MB6C in patients aged < 50 were milder than in patients ≥ 50 years of age. Augustin et al. [9] found that PLT > 100 × 109/L and LS < 30 kPa increased the risk of missing HRV to 8.7%, but we found that this broader criterion worked well in patients < 50 years old, and sharply increased the avoidance of EGD (69.4% versus 36.8% in B6C), whereas the risk of missing HRV slightly increased (1.1%). For patients ≥ 50 years old, the more stringent screening strategy of PLT > 125 × 109/L and LS < 20 kPa were adopted to keep the risk of missing HRV < 5% while the number of avoided EGDs increased by 12.1%, compared to the B6C (Supporting Tables S2 and S3).

The limited implementation of TE in many institutions has restricted the clinical use of B6C. We provided a screening strategy that does not require a LS score, offering doctors in primary hospitals a simpler method to assess and screen HRV. Ultrasound is the most commonly used liver imaging technique for its non-invasive, inexpensive, real-time imaging capability, and repeatability. SA was proposed and classified by Ishibashi et al. [17] in 1991. Their study showed that spleen index/SA correlated with resected spleen volume and guidelines highlight that a SA > 45 cm2 is an indicator of portal hypertension [14]. In our study, we established an alternative B6C named “AB6C” based on a combination of PLT and SA, of which the latter acted as a promising tool independently associated with the presence of HRV. Considering the wide availability of ultrasound in most medical institutions, SA could serve as an appropriate alternative to LS. AB6C retained the PLT cut-off values of MB6C, and 55 cm2 and 44 cm2 were selected as the SA cut-off values for patients aged < 50 and ≥ 50 years, respectively. AB6C achieved a similar efficacy to MB6C in ruling out HRV (Supporting Tables S2 and S3). In total, AB6C safely avoided 49.2% EGD in the entire cohort (n = 1188) regardless of age.

The MB6C and AB6C were formulated for patients in different age stratifications, were internally verified, and showed good performance across the entire cohort. Using these strategies may help improve safety in patients with CLC. Overall, the performance of MB6C was slightly better than AB6C, whereas both performed better than B6C and were safer than EB6C.

Our study had a few limitations. First, this was a retrospective study with data obtained from medical records, which meant that these new criteria require external verification. Second, the examinations were performed by independent operators, although they all had professional training. However, this is a common situation in clinical research.

In summary, our study demonstrated that Baveno VI criteria for avoidance of EGD in patients with CLC could be safely optimized, with LS replaced by SA. Through application of age-based MB6C and AB6C, patient avoidance of EGD achieved over 60% and close to 40% in patients aged < 50 and ≥ 50, respectively. In the entirety of the cohort, MB6C and AB6C safely avoided approximately 50% EGD, which out-performed B6C and EB6C.