Introduction

Hepatocellular carcinoma (HCC) is the fourth cause of cancer death in the world, with an increasing incidence, particularly in Western countries [1]. Many patients present with advanced stage disease, especially if the diagnosis is made outside of a surveillance program [2,3,4]. Sorafenib is a multi-kinase inhibitor, which interferes with neo-angiogenesis [2]. Its use, in the Sorafenib HCC Assessment Randomized Protocol (SHARP) and Asia Pacific trials [5, 6], induced a modest but significant increase in survival (3 months) with respect to the control group, although no radiologic evidence of response to therapy was reported [7]. Relevant side effects limit the use of this drug [5,6,7,8], and it would be crucial to identify specific biomarkers for therapy response prediction, currently not available, although several studies looked for computed tomography (CT) or magnetic resonance (MR) parameters to anticipate the response to treatment [9,10,11]. The arterial enhancement coefficient (ΔArt%) is a simple parameter, which provides information on the grade of tissue vascularisation by arterial phase evaluation of a standard contrast-enhanced CT. Choi et al [10] reported that changes in tumour vascularity were the most specific indicators of treatment response in patients with gastrointestinal stromal tumour on imatinib. Smith et al [12] made similar remarks for metastatic renal cell carcinomas on sorafenib or sunitinib. However, there are only few reports in the context of HCC [13].

In this study, we retrospectively evaluated the possible predictive value of the volume of enhancement of disease (VED), a new radiologic parameter based on ΔArt%, in predicting early response to treatment and survival in a group of patients with advanced HCC treated with sorafenib.

Materials and methods

Patients

The ethics committee of our institution approved this retrospective study on 27 January (ref 2016-435; OSS. 16-260). Each patient was assigned a numerical code to ensure the anonymity of the clinical data. Written informed consent was obtained for sorafenib treatment and for CT scans with contrast agent (CA) administration, according to the principles of the Declaration of Helsinki (revision of Edinburgh, 2000). Patients with advanced HCC followed in the hepatology division of our hospital and treated with sorafenib between October 2012 and May 2016 were considered. They were diagnosed with advanced HCC (BCLC-C) according to the European guidelines [2]. Patients underwent treatment with sorafenib at a dose of 400 to 800 mg/day. Only patients who had undergone contrast-enhanced CT examination before therapy (T0) and after 60–80 days of starting treatment (T1) at the local institution were considered. Patients with less than 45 days of treatment or patients with target lesion not evaluable (e.g. nodules < 1 cm) were excluded from the study.

CT acquisition

All CT exams were performed with a standard protocol, using a 64-row detector scanner (Somatom Sensation CT, Siemens Medical Systems). The images were obtained in the cranial–caudal direction with breath-hold helical acquisition. The scanning parameters were 1.2 × 24 collimation, 120 kV (peak), 140–240 mAs (using automated dose modulation), 5.0 mm slice thickness with a reconstruction interval of 2.0 mm, pitch 1.2 and 0.5 s gantry rotation time. All patients received intravenous non-ionic CA (Ultravist 370, Bayer HealthCare Pharmaceuticals; 370 mg of iodine/1 ml), at a volume of 1.4 ml/kg of body weight, by a bolus at 3 ml/s, using a mechanical power injector (Medrad Stellant CT Injection System), followed by a 40 ml saline flush through a 20-G catheter inserted into an antecubital vein. After unenhanced CT, the time-to-peak aortic enhancement was evaluated by an automatic bolus tracking technique (CARE Bolus CT, Siemens Medical Systems) to determine the optimal scanning delay for the arterial phase. The single-level monitoring low-dose scanning (20 mAs) was initiated 5 s after CA injection, and arterial phase scanning was started automatically 15 s after the trigger threshold (increase of 120 Hounsfield units (HU)) had been reached at the level of suprarenal abdominal aorta. Portal venous (extended to the chest and lower abdomen) and equilibrium-phase acquisitions were obtained at 70 s and 180 s, respectively.

Evaluation of the response to sorafenib

The anonymised images were evaluated in consensus by two abdominal radiologists (10-year experienced), and if discordant, a consensus was reached through a joint review with the study coordinator (30-year experienced). Following the modified Response Evaluation Criteria in Solid Tumors (mRECIST) criteria, the patient’s baseline level was established, annotating the characteristic of eligible lesions as “target” and “non-target” [14]. Selection criteria of the target lesion(s) were diameter > 1 cm, easily measurable and well-defined margins, with intratumoural arterial enhancement; HCC lesions previously treated with locoregional treatments were selected if the lesion showed a well-delineated area of viable tumour (at least 1 cm in longest diameter) on the arterial phase; in the presence of multiple HCC nodules, a maximum of two target lesions was selected; all other lesions or sites of disease were considered non-target lesions, including malignant portal vein thrombosis and lymph nodes detected at the porta hepatis with short axis > 20 mm. At T1, overall patient response was a result of the combined assessment of target lesions, non-target lesions and new lesions [14]. We considered new intrahepatic lesion, the nodules ≥ 1 cm with arterial enhancement with or without washout. The appearance of one or more new lesions indicated progressive disease (PD) regardless of the result of the comparison of target and non-target lesions. For the purpose of this study, only two groups were considered: PD and clinical benefit (CB), the latter comprising complete response (CR), partial response (PR) and stable disease (SD) [14].

VED calculation

After the definition of the response to therapy, in a second session 15 days apart, the readers reviewed the images to calculate the volume of the liver target lesions, their arterial enhancement rate and the VED. If the reviewers were disagreeing, they reached a consensus through a joint review of the recorded images together with the coordinator. The same liver target lesions utilised for the assessment of response according to mRECIST criteria were used for the VED calculation. If more than two lesions were present, the largest were chosen to evaluate a quantity of disease in any case greater than 80%. The volume of the entire lesion, including necrotic areas, was calculated using OsiriX, an open-source Digital Imaging and Communications in Medicine (DICOM) viewer (Fig. 1). The degree of arterial enhancement was assessed at T0 and T1 time points according to the following formula:

$$\Delta \mathrm{Art}\%=\left[\left({\mathrm{HU}}_{\mathrm{arterial}\ \mathrm{phase}}-{\mathrm{HU}}_{\mathrm{unenhanced}\ \mathrm{phase}}\right)/{\mathrm{HU}}_{\mathrm{unenhanced}\ \mathrm{phase}}\right]\times 100$$
Fig. 1
figure 1

VED calculation. ad In the 2D viewer, the ROI is marked on several arterial phase images with the “closed polygon” ROI tool (from the most caudal to the most cranial part of the lesion). Selecting the “ROI/ROI volume/Generate missing ROIs,” ROIs from the slices not included in the previous selection were generated. e After adjusting the contours of the lesion, if necessary, the “ROI/ROI volume/Compute volume” tool is used to obtain the 3D reconstruction, the volume and the mean density in the arterial phase (HU arterial phase) of the selected lesion, by summing the areas of all the ROIs, both selected and generated. After this, the operator copies the ROI of each arterial phase image and pastes it on the same-level unenhanced image. So, the estimation of the mean density at unenhanced phase (HU unenhanced phase) is obtained (not shown in the figure)

Therefore, to weight the ΔArt% of each lesion for the volume of the lesion itself, the new parameter, i.e. the VED, was calculated as follows: volume lesion × ΔArt% / volume lesion. While for a single target lesion ΔArt% = VED, when two target lesions were present, to take into account the possibility of heterogeneous behaviour of them, the VED was calculated according to the following formula:

$$\left(\mathrm{V}1\times \Delta \mathrm{Art}\%1\times \mathrm{V}2\times \Delta \mathrm{Art}\%2\right)/\left(\mathrm{V}1+\mathrm{V}2\right)\times 100$$

where V1 is volume lesion 1, ΔArt%1 is enhancement coefficient lesion 1, V2 is volume lesion 2 and ΔArt%2 is enhancement coefficient lesion 2.

The VED was calculated for each patient, both at baseline (VEDT0) and after therapy (VEDT1).

To evaluate the possible changes in mean enhancement of disease during sorafenib treatment, we calculated the ΔVED, applying the following formula: ΔVED = VEDT1 − VEDT0. The patients were classified as ΔVED(pos), if VEDT1 > VEDT0, and as ΔVED(neg), if VEDT1 < VEDT0. Finally, we compared the volume and VED values between CB and PD patients at T0 and T1 time points. To avoid the possible reproducibility bias due to CA administration rate variation, HU arterial/unenhanced phase values of cancer-free parenchyma have been calculated from an average of 3 circular ROIs (1 cm in diameter) inserted on 3 consecutive slices on the parenchyma surrounding the lesion. So, the ΔArt% of the parenchyma was evaluated at T0 and T1 time points, for each patient.

Ancillary imaging findings and blood chemistries

The presence of malignant portal vein thrombosis, distant metastases and the diameter of enlarged lymph nodes were assessed in all patients. Lymph nodes located at the level of the hepatic hilum were considered as metastatic in the case of a minor axis > 2 cm [14]. The values of the alpha-fetoprotein (AFP) and other serum parameters (total bilirubin, alkaline phosphatase, platelets, gamma glutamyl transferase, alanine aminotransferase, aspartate aminotransferase, international normalised ratio) were evaluated prior to and after therapy for CB and PD patients.

Prediction of the therapy outcome and patient survival time

Survival time was evaluated for the study population and for each patient group. We tried to detect a VEDT0 cutoff value that allowed us to classify the patients in CB and PD groups, with significantly different survival days. Finally, we evaluated the VEDT0 cutoff values, ancillary imaging findings and laboratory parameters that could influence survival.

Statistical analysis

Data were analysed using the SPSS® v.24.0 statistical analysis software (IBM Corp., 131; formerly SPSS Inc.) and Stata/IC 11 (StataCorp). For each variable, normality was evaluated using the Kolmogorov–Smirnov test. Since all the variables were not normally distributed, non-parametric tests (Mann–Whitney U test and Kruskal–Wallis statistical test for independent samples, Wilcoxon signed-rank test for correlated sample, McNemar’s test for paired proportions) were used to compare the distributions between subgroups or between subjects at T0 and T1 time points. Receiver operating characteristic (ROC) curves were used to find the best cutoff value of VEDT0 to discriminate CB from PD patients. The area under the ROC curve was used as predictive power of the test. For different VEDT0 cutoff values (from 10 to 110, in steps of 10), we evaluated patients’ survival, at T0 and T1 time points. Kaplan–Meier curves were used to graphically depict survival probabilities. Survival in different groups (VEDT0 > 70; VEDT0 ≤ 70) was compared, using the log-rank test. Moreover, given that AFP serum levels > 400 ng/ml are considered as diagnostic and specific for HCC [15], we compared the survival with the log-rank test in the following patient groups: AFP > 400 and VEDT0 > 70, AFP ≤ 400 and VEDT0 > 70, AFP > 400 and VEDT0 ≤ 70, and AFP ≤ 400 and VEDT0 ≤ 70. Univariate and multivariate linear regression analyses were used to evaluate the predictive value of VED and AFP (independent variables) for survival (outcome variable). Specifically, both independent variables were dichotomous for VED (> 70 or ≤ 70) and for AFP (> 400 or ≤ 400). For each analysis, a p value ≤ 0.05 was considered statistically significant.

Results

Patients’ characteristics and evaluation of the response to treatment

Forty-eight patients with advanced HCC treated with sorafenib were initially assessed for eligibility (Fig. 2), and a total of 32 patients were included (11 patients with 1 nodule and 21 patients with 2 or more nodules). Characteristics of the 32 patients enrolled in the study are shown in Table 1. A maximum of 2 nodules was selected as target lesion for each patient, for a total of 53 nodules in 32 patients. The median duration of sorafenib therapy was 117 days (range, 45 to 255). The main adverse events during treatment were fatigue (16 patients), diarrhoea (13), hand–foot syndrome (13) and major worsening of liver function (3). At T1 control, the expert reader advice was required for 3 patients. No patients showed a CR, and 1 patient had a PR to therapy, with clear reduction in both size and enhancement (Fig. 3). The CB group included the latter, and 7 patients with SD. Twenty-four patients were in the PD group: ten PD patients showed a greater than 20% increase in the sum of the diameters of viable target lesions, one with 1 target lesion and 9 with 2 target lesions. In the other 14 patients, the sum of these diameters did not reach the 20% threshold as requested by mRECIST criteria to define a PD. However, in this subgroup, disease progression was due to appearance of new lesions and/or appearance or progression of neoplastic portal vein thrombosis (5 patients), lymph node involvement (5 patients) and distant metastases (4 patients).

Fig. 2
figure 2

Patient disposition. HCC, hepatocellular carcinoma; CT, computed tomography; CB, clinical benefit; PD, progressive disease; CR, complete response; SD, stable disease; PR, partial response

Table 1 Characteristics of the 32 patients enrolled in the study
Fig. 3
figure 3

Representative images from a 51-year-old man with advanced HCC and partial response to sorafenib. a Arterial phase spiral CT scan at T0 time shows multiple merging nodular lesions in the right hepatic lobe with a marked enhancement in the arterial phase. b CT scan during arterial phase at T1 time shows a reduction in the size of the right hepatic lobe lesions, with significant lessening of vascularisation

Tumour volume and VED

In CB patients, we found a reduction of about 15% in mean volume, while in PD patients, a significant increase in the volume of the target lesions was found, with an average increase of about 84% (Table 2). Patients with CB had higher baseline VED values than those with PD (Fig. 4) although the significance level of this difference was only borderline, due to the high variability. However, in CB patients, the VED values at the T1 time point were significantly lower than those at T0 (p = 0.018) (Fig. 4). When the ΔVED parameter was analysed, all CB patients fell into the ΔVED(neg) class, while this behaviour was observed in only 9 out of 24 (38%) patients with PD. ΔArt% values of cancer-free parenchyma for the PD and CB groups patients did not show statistically significant differences comparing the two time points.

Table 2 Volume of the target lesions at T0 and T1 time points, and survival time of the patients
Fig. 4
figure 4

Box and whisker plot showing the distribution of VED (%) at T0 and T1 time points in CB and PD patients. Wilcoxon signed-rank test for the paired samples (T0 vs. T1 values) for PD patients (p = 0.097) and for CB patients (p = 0.018). Mann–Whitney U test for the independent samples (CB vs. PD) at T0 (p = 0.070) and at T1 (p = 0.064)

Ancillary imaging findings and blood parameters

At the T0 time point, 10 patients had portal vein thrombosis, 11 had lymph node involvement and 4 had metastases (3 with lung involvement and 1 with bone involvement). Comparing the presence/absence of the findings, no statistically significant differences were found between the 2 time points (Table 3). Blood parameters and their temporal trends are summarised (Table 4). Only the median values of aspartate aminotransferase were significantly different comparing the pre/post-therapy values. In PD patients, both aminotransferase and bilirubin values were significantly different.

Table 3 Presence of portal vein thrombosis, lymph nodes and metastases, at T0 and T1 time points
Table 4 Blood parameters at T0 and T1 time points

Prediction of the response to therapy and patient survival time

To evaluate the significance of VED in the prediction of the outcome of therapy, ROC curves (Fig. 5) showed that a VEDT0 cutoff value of 70% had the highest sensitivity and specificity (100% and 54.2%, respectively) in discriminating CB from PD patients. Survival time from the beginning of sorafenib therapy was highly variable (from 4 months to more than 2 years), and it was significantly longer in CB vs. PD patients (p = 0.001, Table 2). Separating the patients according to different cutoff values of VEDT0, those with VEDT0 > 70% showed a significantly longer survival time than those with lower VEDT0 (506 ± 306 days vs. 266 ± 133 days, p = 0.032; Fig. 6, Table 5). Additionally, patients with ΔVED(neg) showed a tendency to an average survival time longer than those with ΔVED(pos) (493 ± 319 days vs. 328 ± 201 days, p = 0.189). At T0, the presence of portal vein thrombosis, lymph nodes or metastases did not significantly influence survival (p = 0.411, p = 0.327 and p = 0.564, respectively). Among blood parameters, only AFP at T0 significantly influenced survival time. Twenty-one patients with AFPT0 ≤ 400 ng/ml showed an average survival of 478 ± 282 days, while in 11 patients with AFPT0 > 400 ng/ml, survival was 299 ± 243 days (p = 0.02). When VEDT0 values and AFP levels were combined, median survival was significantly longer in patients with VEDT0 > 70% and AFPT0 ≤ 400 ng/ml than in all other combinations (Fig. 6). A multivariate linear regression analysis clarified the role and the weight of the baseline VED and AFP, in predicting survival. The results showed that VEDT0 > 70% predicts a longer survival (β = 209.6; p = 0.037), while AFP lost its predictive role (p = 0.216).

Fig. 5
figure 5

Receiver operating characteristic (ROC) curve. VEDT0 value to discriminate CB from PD patients. The area under the ROC curve is 0.716

Fig. 6
figure 6

Survival analysis. Kaplan–Meier plots by the level of VEDT0 (> 70, ≤ 70) (a) and by the level of AFT (> 400, ≤ 400) and VEDT0 (> 70, ≤ 70) (b). For a, log rank (Mantel–Cox) = 0.007, Breslow (generalised Wilcoxon) = 0.024 and Tarone–Ware = 0.013. For b, log rank (Mantel–Cox) = 0.030, Breslow (generalised Wilcoxon) = 0.019 and Tarone–Ware = 0.024. In particular, survival time (mean) is as follows: if VEDT0 > 70% and AFPT0 ≤ 400 ng/ml, 582 days, and AFPT0 > 400 ng/ml, 208 days; and if VEDT0 ≤ 70% and AFPT0 ≤ 400 ng/ml, 213 days, and AFPT0 > 400 ng/ml, 213 days

Table 5 Survival time by different cutoff values of VED, at T0 and T1 time points

Discussion

After its approval in 2008, sorafenib remained the only first-line treatment for advanced HCC, until the recent approval of lenvatinib [16]. Clinical experience accumulated during these years [17,18,19] indicates that sorafenib improves overall survival of patients with HCC, in the absence of objective response, and that tumour progression is better used as a surrogate of survival. However, to achieve the best results with sorafenib treatment of advanced HCC, a strict selection of patients is needed. Therefore, considerable efforts have been made to identify baseline factors that could predict the response to sorafenib. Minor advances were made with a few biohumoral factors weakly associated with a good response to therapy, while no molecular markers of response were identified [20,21,22]. Patients undergoing sorafenib therapy are often elderly, and the therapy is associated with important side effects, yet affording a limited survival advantage over untreated patients.

In this study, we identified the VED, a parameter based on the degree of arterial enhancement of HCC nodules, weighed by the volume of the target lesion(s), as a relevant factor in the prediction of the response to sorafenib. Several data from our study support the potential utility of this new parameter in the management of patients with HCC. We observed that CB patients tended to have a higher mean VED at baseline and a significant decrease in VED was found at the T1 time in CB patients, as compared with PD, suggesting that a positive outcome of sorafenib therapy is associated with a reduction in this parameter. Importantly, all CB patients fell in the group with higher VEDT0, i.e. > 70%. These data are strongly supported by the analysis of survival in the different groups. In fact, median survival in the VEDT0 > 70% group was almost twice longer than that in patients with lower VED as baseline. In contrast, none of the patients with a VEDT0 ≤ 70% had a CB from sorafenib therapy. Therefore, this parameter might be especially useful to identify the patients who are not likely to respond, characterised by low basal VED. Conversely, among patients with VEDT0 > 70%, 12 out of 20 still did not respond to treatment.

Our results support the hypothesis that low-vascularised HCC nodules are poorly sensitive to sorafenib therapy. This assumption is biologically plausible based on the pharmacological properties of sorafenib, whose main mechanism of action is the reduction of neo-angiogenesis, inhibiting the activity of vascular endothelial and platelet-derived growth factors [23]. In the treatment of other neoplastic diseases, high expression of these factors and/or increased activity of their cognate receptors makes the drug more likely to be effective [24,25,26,27]. Thus, a high pre-therapy VED may be viewed as a proxy for a high pro-angiogenic activity targeted by sorafenib. In agreement, CB patients had a significant reduction in VED at T1.

Our study also provides additional evidence for the negative prognostic role of AFP elevation [28]. In fact, patients with AFP > 400 ng/ml had a significantly shorter survival than the others. Combining the VED and AFP values at T0 allowed us to stratify the patients, where those with VEDT0 > 70% and AFPT0 < 400 ng/ml were more likely to respond, showing an average survival of 17 months (vs. less than 10 months of the other groups) (Fig. 6).

The VED parameter is non-invasive, economic, fast and easy to calculate in standard CT acquisitions, even without specific hardware and software, and it could provide a semi-quantitative and reliable evaluation of the volume of the disease and its perfusion. The reproducibility of VED computation is influenced by various factors, such as the concentration, amount and flow rate of CA administered, in association with the characteristics of each patient, mainly cardiac function. Therefore, it is important to underline that in our study, the dynamic CT was acquired with the bolus tracking technique to ensure that enhanced phases were as comparable as possible among patients. The lack of statistically significant variation of the ΔArt% value in the non-focal parenchyma at T0 and T1 time points supports the high reproducibility of this method.

The possible predictive role of MR diffusion and perfusion techniques in sorafenib-treated patients was evaluated [29,30,31,32,33,34,35,36]. Perfusion CT allows quantitative analysis of various parameters related to the micro-vascularisation of a neoplasm [37,38,39] and seems able to disclose significant differences between tumour tissue and surrounding parenchyma [30, 40]. However, perfusion CT is poorly reproducible and requires a software program which is not widely available, and the radiation dose administered is considerably higher than that of standard CT acquisitions. Also, dynamic contrast-enhanced ultrasound was investigated in evaluating the effectiveness of anti-angiogenic drugs on tumour perfusion in patients with HCC with encouraging results [41,42,43,44,45].

Some limitations of this study should be acknowledged. Firstly, the patients are few, and extension of these results to a larger series is warranted; however, each one was followed till his death. Also, the study has been conducted in a single centre, and reproducibility in other non-specialised facilities should be assessed. Then, to classify CB vs. PD patients, we could not use LI-RADS v2018 (not dealing with the evaluation of the response to systemic treatment). Thus, we chose mRECIST [14], even if we were conscious of some bias related to these criteria (non-volumetric evaluation, non-percentage assessment of the enhancement, incomplete evaluation reliability for systemic therapy) [8, 14, 16, 46,47,48,49], and that a revision of this system has been recently published by the same authors [50]. Furthermore, in this series, the VED was calculated for a maximum of 2 target lesions, which accounted for more than 80% of the overall disease burden. It should be pointed out that VED measurement may be difficult in some patients, especially when multifocal and infiltrative disease, with poor margins and different degrees of enhancement, is present. Finally, although all patients have been investigated with the same protocol and the same CT scanner, the retrospective nature of our work is an additional weakness.

In conclusion, this study identified the VED as a novel parameter obtained by a standard CT, which could be helpful, if confirmed by larger series in a prospective fashion, in predicting the response to treatment, identifying patients who are less likely to respond to sorafenib.