Introduction

Primary brain tumors (PBTs) account for only 1–2 % of adult cancers, according to the Central Brain Tumor Registry of the United States, with over 311,000 new patients with primary brain tumor or central nervous system tumor diagnosed between 2005 and 2009 [1]. Both tumoral lesions (gliomas, non-glioma PBTs, and metastases) and non-tumoral lesions (abscesses, parasitic lesions, demyelinating disease, infarcts, and old hematomas) may present as ring-enhancing masses on conventional magnetic resonance imaging (MRI) sequences. Gliomas, however, are the most frequent brain tumors and their histological differentiation and grading are predictive of aggressiveness and outcome [2]. They are currently divided into: (1) grade I and II tumors, considered low-grade with a protracted evolution, and (2) grade III (anaplastic glioma) and grade IV (glioblastoma) tumors, considered high-grade lesions leading to death within weeks to months when untreated.

Morphological assessment by MRI is very precise, allowing tumor diagnosis and assessment of tumor size, mass effect, edema and hemorrhagic effusion. However, it lacks specificity and does not easily allow the assessment of tumor activity and metabolism. Magnetic resonance spectroscopy (MRS), to increase specificity, investigates the presence of neuronal and membrane metabolites, such as N-acetylaspartate, choline, creatine, inositol or lactates, but it is limited by poor spatial resolution and frequent artifacts due to proximity with spinal fluid or skull bone. On the contrary, molecular imaging with positron-emission tomography (PET) provides information on tumor metabolism and grade, and helps to identify and delineate tumor zones with increased growth activity [3]. As PET with 18F-fluorodeoxyglucose (FDG) is not able to reliably predict the tumoral nature of a lesion due to high uptake in normal brain and unspecific uptake in inflammatory or benign lesions, amino acid tracers have been developed in recent decades.

18F-fluoroethyltyrosine (FET) is an artificial amino acid which is taken up into upregulated tumoral cells via the membranous L-, B0,+- and B0-transporter system and accumulates in the cell; it is not incorporated into proteins like the natural amino acids, such as 11C-methionine, and its uptake is not directly proportional to grade [4, 5]. However, FET can be used to obtain good contrast images in both high- and low-grade tumors, at a lower effective dose as compared to FDG [6, 7]. Qualitative FET-PET uptake does not allow tumor grading, but dynamic FET-PET analysis helps in the differentiation of low- from high-grade tumors [8]. FET-PET has also been shown to be valuable for guiding biopsy, diagnosing brain tumor recurrence after initial therapy, and directing radiosurgery [9]. As FET exhibits lower uptake by inflammatory cells than 11C-methionine and FDG, it can be used for differentiating tumor from radionecrosis [10]. Early changes in FET uptake have also been shown to be helpful in predicting tumor response to treatment and assessing prognosis [11]. As there are only a few published studies with limited patient populations that evaluate FET-PET in newly identified brain lesions, a couple of years ago, we carried out a systematic review and meta-analysis which demonstrated excellent performances of FET-PET for the diagnosis of PBT and glioma [12].

The purpose of this report is to provide an up-to-date systematic review and meta-analysis of the diagnostic performances of FET-PET in patients with suspicion of primary brain tumors and gliomas and to assess the performance of FET uptake quantification.

Materials and methods

Data sources and search

We performed a systematic literature search of the PubMed and EMBASE medical databases, looking for English and non-English publications from the period January 1977 to September 2012. As previously reported [12], the search terms were: “(“O-(2-fluoroethyl)tyrosine”[All fields] OR “(18F)fluoroethyltyrosine”[All fields] OR “Tyrosine/diagnostic use”[Mesh]) AND (“Brain Neoplasms/radionuclide imaging”[Mesh] OR “Glioma”[All fields]) AND “Humans”[Mesh])”. Errata, reviews, preclinical, animal and non-radiopharmaceutical studies were excluded.

Study selection

We considered only human studies in which FET-PET was used in the assessment of newly diagnosed, untreated brain tumors. Inclusion criteria were: (1) FET-PET was used in patients with newly diagnosed brain lesions; (2) patients had received no radiotherapy, surgery or chemotherapy prior to FET-PET, and (3) histology was used as the gold standard. Studies written in languages other than English, French, or German were excluded, as were ones that included fewer than ten patients.

Data extraction and quality assessment

From each study, we collected the following information: first author, year of publication, study population (number of patients with initial assessment of brain tumor, gender, age, and histological analysis); FET-PET results [positive or negative examination as assessed by authors, maximal target-to-background ratio (TBR) and/or mean TBR, when available] and PET imaging protocol (time between injection and image acquisition, static or dynamic imaging protocol, and number of frames). Data at patient level, when provided, were recorded. Study quality and applicability were assessed using the Quality Assessment of Diagnostic Accuracy Studies checklists (QUADAS, scale range 0–14) and the Standards for Reporting Studies of Diagnostic Accuracy checklist (STARD, scale range 0–25) [13, 14].

Statistical analysis

We performed all analyses at patient level using dichotomized histological diagnosis (tumor or not, glioma or not) as the gold standard according to the WHO classification of tumors of the central nervous system [2] and the 3rd edition of the International Classification of Diseases for Oncology (ICD-O-3); glioma was defined by ICD-O-3 codes 9380–9384, 9391–9460, and 9480. We used the criteria described in each study to define FET-PET positivity. Data synthesis used a bivariate mixed-effects regression model and average sensitivity, specificity, positive and negative likelihood ratios (LRs) and odds ratios with their respective 95 % confidence intervals (CIs) were calculated from the maximum likelihood estimates and graphically assessed by a summary receiver operating characteristics (SROC) curve with computation of the area under the curve (AUC). SROC is a composite statistic reflecting the discriminative ability of a diagnostic test and it can overcome some limitations associated with the direct pooling of sensitivity and specificity values of individual studies [15]. We used graphical analysis of Forest plots and statistical analysis with the χ 2 and Cochran’s Q tests to assess heterogeneity of results. Inconsistency of between-study results was assessed using the I 2 statistics (percentage of total variation across studies attributable to heterogeneity rather than chance). The Funnel plot asymmetry test assessed publication bias (tendency to publish only positive results) [16]. This was performed using the regression of logarithmic diagnostic odds ratio against the inverse square root of the effective sample size (ESS) weighted by ESS with a p value <0.05 for the slope coefficient indicating significant asymmetry [17].

We also carried out analyses at patient level to determine the optimal diagnostic threshold of mean TBR (meanTBR = mean activity of the lesion/mean activity of the contralateral brain) and maximum TBR (maxTBR = max activity of the lesion/mean activity of the contralateral brain). In the studies, a variety of methods were used for computing TBR: (1) a region-of-interest (ROI) manually drawn around the tumor based on T1- and/or T2-weighted MR images [1820], or (2) increased FET uptake [8, 2124], (3) automated ROI drawing of the area with increased FET uptake >3 standard deviations of the mean standardized uptake value (SUV) of a control region [25], or (4) based on a 90 % SUVmax threshold isocontour method [26], or (5) a fixed 25-mm2 elliptic ROI centered on the area of most increased FET uptake [27, 28]. In three studies, the method used to define the ROI was not specified [2931]. We performed receiver operating characteristics (ROC) analysis of pooled individual patient values to determine the optimal meanTBR and maxTBR thresholds. Finally, Student’s t test was used to assess the difference in meanTBR or maxTBR according to glioma WHO grade. All statistics were performed with Stata 12.1 (StataCorp, College Station, TX, USA). p values <0.05 were considered to indicate statistical significance. By convention throughout the text, tables and figures, we used the small letter n to describe the number of studies and the capital letter N to describe the number of patients.

Results

Study characteristics

Our initial search of the PubMed and EMBASE databases identified a total of n = 139 articles. After exclusion of non-FET reports (n = 78), errata (n = 2), reviews (n = 10), and preclinical (n = 6) and animal studies (n = 1), 42 clinical articles on FET-PET in patients remained (Fig. 1). After applying the inclusion criteria, 12 studies remained, excluding reports on recurrent disease (n = 18), reports without FET results (n = 4), reports including <10 primary brain lesions (n = 4), reports on non-tumoral brain lesions (n = 3), and our previous meta-analysis (n = 1) [12]. Four additional studies were found through reference screening of the excluded papers (Fig. 1).

Fig. 1
figure 1

Flowchart of study identification and selection. *Note: Among these, a subpopulation of n = 14 studies with N = 525 patients did report target-to-background ratio (TBR) data, which were used for deriving TBR thresholds yielding the best diagnostic performances

Sixteen studies [8, 1832] with a total of N = 641 patients (Table 1) were eligible for inclusion in the present meta-analysis. Six studies had mixed populations with both primary brain tumor and recurrence (#1, 9, 11, 12, 13, and 14, Table 1) and one did not report histology results for all patients (#2, Table 1). Consequently, from these seven studies only patients with FET-PET results in primary assessment of the lesion verified by a subsequent biopsy or resection were included. Thus, N = 568 patients finally remained (median age 45 years; mean age 45 ± 15 years; sex ratio F:M = 0.8:1), of whom 516 (91 %) had a tumor, which was a glioma in N = 498 (88 %) patients. Non-glioma brain tumor was diagnosed in N = 18 patients (N = 5 lymphomas, N = 4 metastases, N = 3 hemangioblastomas, N = 1 cavernoma, N = 1 germinoma, N = 1 teratoma, N = 1 medulloblastoma, N = 1 pinealoblastoma, and N = 1 primitive neuroectodermal tumor). There were also N = 52 non-malignant lesions (N = 22 gliosis lesions, N = 11 infectious disease lesions, N = 5 demyelinating lesions, N = 4 infarcts, N = 4 hemorrhages, N = 2 encephalitis, N = 2 cortical dysplasia, N = 1 adenoma, and N = 1 aneurysm).

Table 1 Characteristics of the studies

Performance for the diagnosis of brain tumor and glioma

Six of the 16 selected studies, for a total of N = 307 patients, could be included in the bivariate mixed-effects regression model (Fig. 1). The other ten studies could not be included in our analysis as they (1) did not report the true negative and false positive rates necessary to compute specificity, or (2) did not report dichotomized FET-PET results. The results of the pooled analysis showed, for FET-PET diagnosis of tumoral vs non-tumoral brain lesions, a sensitivity of 0.80 (95 % CI 0.74–0.85), a specificity of 0.67 (0.44–0.84), a positive LR (LR+) of 2.4 (1.3–4.7), a negative LR (LR−) of 0.29 (0.19–0.45), a diagnostic odds ratio of 8 [323], and an AUC of 0.81 (0.78–0.85). The SROC curve is presented in Fig. 2 with study symbols weighted by patient population size.

Fig. 2
figure 2

Summary receiver operating characteristics (SROC) curve for the diagnosis of tumoral vs non-tumoral brain lesions

Similarly, FET-PET diagnosis of glioma vs non-glioma brain lesions showed a sensitivity of 0.82 (0.74–0.87), a specificity of 0.59 (0.32–0.81), an LR+ of 2.0 (1.0–3.9), an LR− of 0.31 (0.16–0.61), a diagnostic odds ratio of 6 [223], and an AUC of 0.82 (0.78–0.85).

Assessment of the heterogeneity, inconsistency, and quality of the studies

Forest plots (Fig. 3) did not show any significant performance heterogeneity or inconsistency between studies for the diagnosis of brain tumors (Q = 1.5, p = 0.24, and I 2 = 0 %). Likewise, glioma diagnosis did not reveal significant performance heterogeneity (Q = 3.8, p = 0.08) or inconsistency (I 2 = 47 %), although specificity differed significantly between studies (Q = 15.8, p = 0.01). Funnel plots (Fig. 4a) did not reveal any publication bias for brain tumor (p = 0.64), or glioma diagnosis (p = 0.98). Figure 4b summarizes graphically the assessment of study quality using the QUADAS and STARD criteria.

Fig. 3
figure 3

Forest plots of sensitivity and specificity for brain tumor diagnosis of the n = 6 studies included in the meta-analysis

Fig. 4
figure 4

a Funnel plots of the inverse square root of the effective sample size (ESS) vs diagnostic logarithmic odds ratio showing no significant publication bias for tumors or gliomas. b Study quality grading using QUADAS (range 0–14) and STARD scores (range 0–25). Asterisk these studies were included in the meta-analysis

Quantitative tumor-to-background ratio (TBR) threshold for diagnosing brain tumor and glioma

Individual patient meanTBR results were given in n = 9 studies (N = 271 patients) [1822, 2729, 31]) and maxTBR in n = 7 studies (N = 329 patients) [8, 19, 20, 2426, 30]. The ROC curve analysis indicated that meanTBR ≥1.6 and maxTBR ≥2.2 had the best value for diagnosing tumoral vs non-tumoral brain lesions, with a 0.77 (0.71–0.82) sensitivity, 0.81 (0.64–0.92) specificity, 0.96 (0.92–0.98) positive predictive value, and 0.35 (0.25–0.46) negative predictive value for meanTBR, and a 0.65 (0.60–71) sensitivity, 0.82 (57–96) specificity, 0.99 (96–1.0) positive predictive value, and 0.11 (0.06–0.19) negative predictive value for maxTBR. For diagnosing brain glioma vs non-glioma brain disease, meanTBR ≥1.7 and maxTBR ≥2.2 showed the best diagnostic performances, having sensitivity, specificity, positive predictive and negative predictive values of 0.71 (0.65–0.77), 0.72 (0.57–0.84), 0.92 (0.87–0.96), 0.34 (0.25–0.45), and 0.65 (0.60–0.71), 0.67 (0.45–0.84), 0.96 (0.93–0.98), 0.13 (0.08–0.20), respectively.

High-grade gliomas (WHO I–II) had significantly higher meanTBR and maxTBR values than the low-grade gliomas (WHO III–IV), and non-tumoral lesions (2.6 ± 1.0 vs 1.7 ± 0.7 vs 1.5 ± 0.6, p < 0.001; 3.0 ± 1.1 vs 2.4 ± 1.0 vs 2.1 ± 0.8, p < 0.001) (Fig. 5). There was no significant difference in TBR between non-tumoral lesions and low-grade gliomas (p < 0.3).

Fig. 5
figure 5

Target-to-background (TBR) comparison box plot according to histological WHO grading (black box meanTBR, white box maxTBR). *p < 0.001 vs non-tumoral lesions

Fig. 6
figure 6

a T1-weighted sagittal view (left) in a 47-year-old patient with a growing anterior frontal mass with decreased T1-weighted and enhanced T2-weighted (right) signal, with restricted diffusion, perfusion with neoangiogenesis signs, and no Gd-enhancement. Spectroscopy showed a tumoral spectrum with decreased N-acetyl-aspartate (NAA) and an increased choline/creatine ratio of 2. The most probable diagnosis was intermediate-grade oligodendroglioma of corresponding 18F-FET-PET fused with T1-weighted image (bottom) showed an increased tumor-to-background ratio (TBR) of 2.4 (SUVmax50–60′ lesion = 1.7 g/mL; SUVavg50–60′ background = 0.7 g/mL) with a washout indicating high-grade glioma. The final diagnosis was grade III anaplastic oligoastrocytoma. b T1-weighted sagittal view (left) in a 73-year-old patient with a corpus callosum and right cingular gyrus lesion with decreased T1-weighted and enhanced T2-weighted (right) signal, with restricted diffusion and perfusion with neoangiogenesis signs, and no Gd-enhancement. The likely diagnosis was intermediate-grade glioma. The 18F-FET-PET/T1-weighted MR image (bottom) showed an increased TBR of 2.2 (SUVmax50–60′ lesion = 1.8 g/mL; SUVavg50–60′ background = 0.8 g/mL) with a radiotracer washout indicating high-grade glioma. The final diagnosis was grade III anaplastic astrocytoma. For both lesions, the information gained by 18F-FET helped to direct the biopsy to the area of highest TBR and pinpointed higher-grade (III–IV) washout curves. None of these lesions had any Gd-enhancement on MR, making it difficult to select the biopsy target

Discussion

Our updated systematic review and meta-analysis confirms the good performances of FET-PET for the initial diagnosis of brain tumors and gliomas. In this large population of patients with newly diagnosed brain lesions, FET was found to have good sensitivity (80 and 82 %) and moderate specificity (67 and 59 %) for the diagnosis of brain tumors and gliomas, respectively. The high LR+ and low LR− suggest that FET-PET is useful for confirming or excluding brain tumors or gliomas. The high positive predictive value of the proposed mean- and maxTBR thresholds confirms the utility of quantitative analyses for identifying brain tumors and gliomas.

For over a decade, FET-PET has been known to help in distinguishing between proliferating tumoral and non-tumoral lesions [5, 10]. An initial PET comparison study demonstrated that FET uptake ratios correlated with 11C-methionine tumor uptake, with the advantage that FET-PET does not require an on-site cyclotron [6]. This allowed more centers to have access to FET. Comparison studies also showed lower uptake by inflammatory cells [3335], opening the way for better discrimination between infectious and tumoral lesions [36], and between recurrent tumor and radionecrosis [10]. Following these early studies in recurrent tumor [37], several studies assessed FET-PET performances in the initial evaluation of isolated brain tumors. Of the sixteen studies included in this systematic review, only six provided enough details to determine sensitivity and specificity, suggesting that there is a need for better reporting of FET-PET study results. By comparison, MRI had a diagnostic accuracy of 55 %, which increased to 71 % when MRS was added in one large study (N = 176) [38], a value that is still lower than the AUC of 0.81 (0.78–0.85) reported in this updated meta-analysis. This finding confirms our previous results that seemed to show that the ability of FET-PET to assess the tumoral nature of isolated brain lesions was superior to that of MRI alone [12]. This is also known from specific studies combining PET and MR examinations, such as the one by Pauleit et al. [27] in which PET–MR fusion significantly increased the specificity for guiding diagnostic biopsy from 53 % for MRI alone to 94 % (Fig. 6). In another study, Floeth et al. [29] reported that the combination of MRI with FET-PET and MRS yielded a diagnostic accuracy of 97 %.

Quantitative imaging analysis using ratio to normal brain (meanTBR and maxTBR) is useful for characterizing lesion uptake and allows inter- and intra-patient comparisons, as normal brain FET uptake can be quite variable on consecutive PET examinations, even in the same patient. As in our previous meta-analysis [12], meanTBR ≥1.6 and maxTBR ≥2.2 thresholds demonstrated excellent performance for diagnosing brain tumor. This is consistent with results previously reported by Pauleit et al. [27] (meanTBR ≥1.6 for tumoral tissue) and by Pöpperl et al. [37] (maxTBR >2.0 for recurrence). Similarly, for gliomas, a meanTBR ≥1.7 and a maxTBR ≥2.2 demonstrated excellent diagnostic performances, with significantly lower meanTBR and maxTBR in low-grade than in high-grade gliomas. On dynamic acquisitions, several studies [8, 23, 26, 39] reported accurate distinction between low-grade and high-grade untreated gliomas using information from tumor time-activity curves, such as an increasing/decreasing slope uptake pattern [26, 39] or early-to-middle uptake ratio [23]. Whether combining conventional MRI with MRS, diffusion imaging and dynamic FET-PET imaging could provide better performances for diagnosing glioma that has not been established in large populations [40, 41]. This could also be an interesting issue to explore in combinations with the latest hybrid PET/MRI scanners [42].

Although we were able to include only 16 studies in our analysis, they were of good quality with 16/16 (100 %) having QUADAS scores ≥10 and 12/16 (75 %) STARD scores ≥18, and they all used a histological WHO classification gold standard. Moreover, there was no publication bias or inconsistency between the studies, strengthening the validity of our results. Nonetheless, some limitations remain. First, the meta-analysis included only n = 6 studies, due to patient population (absence of true negative cases) or methodological (absence of dichotomized reporting of FET-PET results) limitations. Second, the optimal TBR thresholds identified from our meta-analysis must be interpreted cautiously, as these values are PET protocol-dependent. For instance, differences in uptake kinetics between low- and high-grade tumors exist, and these could influence TBR, particularly if early (10–20 min post-injection) frames are added to late >30-min frames [8, 25]. Moreover, the use of several methods to define regions of interest most certainly influences TBR.

Further investigations are needed and great care should be taken to ensure strict standardization of PET acquisition protocols to increase FET-PET inter-center reproducibility and allow multicenter data analysis. Recommendations on imaging protocols already exist, such as the EANM and German guidelines on amino acid analogs for PET imaging of the brain [43, 44]. The inclusion of more true negative cases would also help to better determine the specificity, although this could be difficult, as patients may not necessarily undergo biopsy if FET-PET is negative. Finally, although cost-effectiveness of FET-PET in the management of patients with brain tumor has recently been studied, the results obtained should be treated with caution due to several limitations of the reported model [45, 46]. Thus, there is still a need for cost-effectiveness studies of FET-PET, used separately or in combination with MRI, in the clinical setting.

Conclusion

The results of this updated systematic review and meta-analysis confirm the good performances of FET-PET for the initial assessment of patients with new, isolated, untreated brain lesions. Strict standardization of PET imaging protocols and large, prospective, multicenter studies investigating the added value of this technique over conventional MRI and new MRI methods are now needed to consolidate FET-PET as a highly relevant tool for patient management.