Introduction

Diagnostic Risk Calculators

Historically, the diagnosis of prostate cancer (PCA) has been dominated by a single test, prostate-specific antigen (PSA) which when combined with age, clinical examination and medical history can allow for a prediction of whether or not it is justified to undertake further clinical testing which until recent times has been normally done by prostate biopsy. At present, for the initial work-up (for example by the primary care physician in the UK), the decision to refer a patient for a biopsy is based entirely on age-specific normal ranges for PSA, which thus fails to take into account other risk factors (which may either increase or decrease underlying risk of significant prostate cancer). Transrectal ultrasound (TRUS) biopsy is then normally undertaken to assess whether histologically confirmed cancer is present. More recently, MRI has been looked at, either to guide the biopsy procedure or as a diagnostic modality in its own right.

Risk prediction models allow for the integration of multiple factors and with increasing markers of disease and of treatment response becoming available it is likely that risk-based models will be further developed and utilised and will continue to gain wider clinical utility and significance. A number of issues remain to be fully assessed though, and there remains particular debate as to how best to define clinically significant disease. This uncertainty adds to the difficulty in developing risk prediction models and in obtaining wide spread clinical usage.

There has been a growing number of prostate cancer risk prediction models developed over recent years aimed at helping the clinician and or the man himself estimate their chance of having prostate cancer either now or in a given future period (e.g. 10 years, lifetime, etc.). The large numbers of risk calculators for different periods of risk has to some extent led to confusion and overload. It is not clear how effective many of these risk predictions might be when employed at the first line medical health check as many were developed in the urology clinic setting. Furthermore, there is very little knowledge or use of these risk calculators by many frontline physicians.

Risk calculators that have potential application in the community setting need to be based on factors and tests that can be readily assessed in the initial diagnostic setting. Among the first of these to gain prominence in Canada and the USA was the Sunnybrook risk calculator which was constructed using all known risk factors to assess an individual’s risk of having prostate cancer for men undergoing a PSA test [1]. It was derived from a study of 3108 Canadian men who underwent a prostate biopsy, 408 of whom had a normal PSA level. Overall, 42 % were diagnosed with prostate cancer, 11 % of which were high grade (Gleason grades 7, 8, 9 and 10). These results were used to construct a nomogram to predict the presence of both “any prostate cancer” and the aggressive form of prostate cancer. Factors included in the model were age, family history of prostate cancer, ethnicity, urinary symptoms (AUA symptom score), PSA, PSA free-to-total ratio and DRE. The area under the receiver-operator curve (AUC) (prediction for presence of prostate cancer) was 0.74 (95 % confidence interval (C.I.) 0.71–0.81) and AUC for high-grade cancer was 0.77 (95 % C.I. 0.74–0.81). Internal validation was carried out by comparing the nomogram with conventional screening using only PSA and digital rectal examination (DRE). The AUC for conventional PCA detection was significantly lower (0.62, 95 % C.I. 0.58–0.66 for the presence of prostate cancer and 0.69, 95 % C.I. 0.65–0.72 for the presence of high-grade cancer). The risk prediction model was well-calibrated, yielding estimates of risk similar to the prevalence at biopsy [1]. Extensions to the original model in terms of evaluating genetic markers in the form of selected single nucleotide polymorphism did not add to the clinical utility of the approach [2]. The calculator has been principally used at selected centres in Canada and the USA as a guide to aid clinical decision-making.

A further leading community-based calculator was developed in the USA using data from the Prostate Cancer Prevention Trial (PCPT). The first version of the PCPT prostate cancer risk calculator (PCPTRC) was published in 2006 [3]. The study used logistic regression to produce a model which includes age at biopsy, race, family history of prostate cancer, PSA level, PSA velocity, DRE result and previous prostate biopsy results. The PCPTRC can be used in settings where regular PSA testing is employed such as the USA.

The PCPTRC model has an average AUC for the prediction of prostate cancer of 0.70 with a standard deviation = 0.006 [3]. It was externally validated in the San Antonio Centre of Biomarkers of Risk for Prostate Cancer (SABOR) cohort of the Early Detection Research Network which was a younger and more ethnically diverse population than the original PCPT trial population and had an AUC of 0.66 [4]. In 2010, this nomogram was further externally validated in a Portuguese population with an AUC of 0.70 (Oliveira et al., 2010).

For comparison, Nam et al. conducted an external validation of their Sunnybrook nomogram-based prostate cancer risk calculator (SRC) and compared with the PCPTRC in a prospective, multi-institutional study of 2130 patients who underwent a prostate biopsy for prostate cancer detection. The AUC for the SRC was 0.67 (95 % CI, 0.65 to 0.69), and the AUC for the PCPTRC was 0.61 (95 % CI, 0.59 to 0.64). The AUC was higher for predicting aggressive disease from the SRC (0.72; 95 % CI, 0.70 to 0.75) compared with that from the PCPRT (0.67; 95 % CI, 0.64 to 0.70) [5].

The PCPTRC was also validated in ten international biopsy cohorts, and the results suggested variations in the AUCs between cohorts. The authors concluded that the variations were due to different criteria for work-up before biopsy within these cohorts [6].

In 2014, a further version of the PCPTRC (Version 2.0) was published [7]. The newer model aims to predict the risk of no vs low-grade (Gleason grade <7) vs high-grade cancer (Gleason grade 7). In this current model, the authors applied percent free-PSA and did not use PSA velocity. The parameters included in the model are therefore PSA, digital rectal examination, age, race, prior biopsy and family history. The new PCPTRC 2.0 either with or without percent free-PSA is claimed to be well-validated externally. However, the inclusion of percent free, or free/total PSA may limit applicability in the initial diagnostic setting as this test may not be routinely available in many settings.

In Europe, the Prostate Cancer Risk Calculator (PCRC) was constructed using data from Rotterdam, one of the study centres involved in the European Randomised Study of Prostate Cancer Screening (ERSPC). The calculator is designed to avoid unnecessary biopsies and to increase detection of potentially aggressive prostate cancer [8, 9]. The ERSPC Prostate Cancer Risk Calculator was based on 6288 Dutch men and utilised their biopsy results to construct a graphic device which uses six steps to estimate risk. The study reported an AUC up to step 3 of 0.77 (95 % C.I. 0.74–0.79) [10]. The risk calculator (up to step 3) was externally validated with 390 Portuguese patients who had ten-core systematic transrectal prostate biopsies; it had a high predictive accuracy of 77.9 % [11].

The Rotterdam group of the ERSPC trial has subsequently developed a modification of the risk calculator to allow for the inclusion of estimated prostate volume by DRE (ERSPC3 + DRE). Previously, prostate volume was obtained by ultrasound of the prostate gland which limited the practicability of the risk calculator in a UK primary care setting for example. The modified ERSPC risk calculator therefore offers a good tool for use in primary care, although its performance in comparison to the Sunnybrook calculator remains to be established. Even with a relatively rough estimation of the total prostate volume at DRE (25, 40 or 60 cc), the adapted ERSPC Risk calculator performs better than PSA alone and PSA in combination with DRE [12].

The European Randomised Study of Screening for Prostate Cancer Risk Calculator (ERSPC RC) has also been updated to predict potentially aggressive prostate cancer, defined as Gleason ≥7 and/or T-stage >T2B. The further version of the calculator showed an AUC of 0.86 (95 % CI 0.84–0.86) on the ERSPC dataset (AUC PSA alone 0.74). AUC’s of the ERSPC RC and the PCPT RC for high-grade PC on a Canadian (N = 982, clinical cohort) validation cohort were 0.76 (0.72–0.79) and 0.68 (0.64–0.72), p < 0.0001, respectively. The AUC of PSA alone was 0.61 [11, 13].

A recent publication has looked specifically at a head to head comparison of prostate cancer risk prediction models [14••]. The meta-analysis used three main criteria including the presence of PSA level in the model, models that have been validated in ≥5 study populations and models that have assessed area under the curve (AUC) for prediction of any or clinically significant PCA. The authors identified 6 out of 127 prostate cancer prediction models met the criteria. These risk prediction models were: Prostataclass, Finne, Karakiewcz, Prostate Cancer Prevention Trial (PCPT), Chun and the European Randomised Study of Screening for Prostate Cancer Risk Calculator 3 (ERSPC RC3). The ERSPC RC3 and Prostataclass had the highest AUC (0.79). The PCPT did not show any difference in its ability to predict any prostate cancer compared to PSA testing alone (AUC = 0.66 for both), but it did have better ability to predict clinically significant prostate cancer (AUC = 0.71 and AUC = 0.66, respectively). Five of these risk predictions except by Chun have incorporated DRE into the model. Chun instead used sampling TRUS from biopsy. Calibration measures of the models were in general poorly reported. Only Finne and Chun have described calibration of the risk algorithm fully [14••]. The conclusion of the meta-analysis was that risk prediction models do perform better than PSA alone.

In addition to the highlighted community-based risk calculators above, there are over a hundred further calculators available which calculate a variety of risks principally for the clinical situation [15, 16].

Approaches to get widespread uptake of any of the calculators in primary care/front line diagnostics remain to be done, and the risk calculators have to be further improved to better detect clinically significant cancers.

Other issues still remain to be evaluated in that costs and cost effectiveness become keys as the diagnostic approach has to be used in large numbers of men. Furthermore, newly emerging risk prediction approaches include specific high and low penetrance genes (rather than questionnaire based on family history) which may affect men’s consideration of their use as they have potential implications for other family members.

The issue of how best to further monitor men after initial risk assessment has been done also remains to be established.

The Future of Diagnostic Risk Assessment

As many of the above calculators offer good potential to aid in the diagnostic pathway for PCA, the choice and use of which calculator needs to be determined by the clinical setting in which it is to be used and the wider full head to head comparisons can often be difficult to evaluate. There remains an important issue in terms of the error inherent in defining the gold standard of diagnosis, i.e. all the calculators above relied on the TRUS biopsy as being definitive but this is known to have error rates as high as 30 %. Hence, the new paradigm is how these calculators will perform in an optimally investigated population (e.g. using MRI and targeted biopsies to define whether cancer is truly present).

Perhaps leading the way in terms of evaluating the use of risk calculators in a near population-based community setting, recent work in Sweden has been undertaken to investigate the clinical advantages of using a combined risk assessment to reduce the over diagnosis of low-grade tumours and the study results are expected to be published soon. It is likely that the study will be able to demonstrate in a setting of wide-scale population-based “screening” the sorts of advantages seen in previous more constrained clinic-based evaluations. The Swedish research team are looking to continue their evaluations to include advances in biopsy procedures including MRI guided biopsies.

In addition, there are a number of further ongoing programs of work evaluating the practical issues involved in changing the diagnostic pathways and to further develop the risk models used in practice. Examples of such programs include the “Riskman” program in the UK which is underway to develop and evaluate the “Riskman” calculator, a risk assessment algorithm for GPs to assess which men should be referred for prostate biopsy [17]. The calculator is being developed for use in the primary care setting in the UK. It will be constructed using the most informative markers of existing community-based calculators together with the best combination of further biomarkers, epidemiological factors and SNPs now available, for potential incorporation to produce a calculator with optimised performance and costs for future wide-scale use in the NHS in the UK. The program is focused not only on further developing the risk prediction models but also on the best translational routes to direct implementation in the NHS.

Future risk calculators will need to further evaluate a range of new and emerging biomarkers both in terms of genetic markers, blood-based proteins and potentially also informative urine markers (see Table 1).

Table 1 Emerging biomarkers for characterising the risk to the individual from their particular cancer

The role of genetic markers also needs further consideration, particularly in terms of individual risk prediction versus population stratification. To date, the rapid gene sequencing advanced by the Human Genome Project and genome-wide association studies have identified over 100 single nucleotide polymorphisms that predict PCA. In the UK, the PROFILE study by Eeles et al. is utilising a risk score based on a combined SNPs profile to identify a population of men at high risk of developing prostate cancer. At present, such tests are likely to be offered through genetic counselling clinics, but in the future, direct-to-consumer (DTC) genetic testing makes it likely that such tests will be commercially marketed directly to the public. Men using DTC’s testing in this way would not have direct access to a doctor or genetic counsellor’s guidance, creating possible anxiety or false reassurance pushing people to make medical decisions on incomplete or misunderstood information [18].

Alternatively, utilising such genetic markers has the potential to be used either to add to existing risk prediction models or to stratify the population by predisposition risk and then employ conventional risk prediction tools.

Issues of risk communication however remain pervasive in the use of risk prediction models. The wider issue of cancer risk communication is not unique to prostate cancer risk calculators. Unresolved at this time are how to best present the information from a risk model and what the patient understands by the outputs. Thus, greater future research should also focus on risk presentation and risk communication [19••]. These aspects are equally as important as building reliable risk calculators. Health professionals need to understand how best to communicate the risk and publics views and need to be able to understand the meaning of risk and potential ways to manage their risk profile. Colditz and Wei (in the parallel paper in this series) argue, as does Moons [20, 21], that the end user must be considered from the first steps in model development to assure transfer to use that will improve health outcomes.

Han et al. investigated patient experience with prostate cancer with risk information and their perceptions of the value of personalised risk information using a focus group approach. They reported most prostate cancer patients had very little experience in utilising risk information. Some patients perceived personalised risk information as less valuable than other types of evidence [22]. How best to communicate and inform the results thus remains to be established.

In sum, risk-based prostate cancer detection in the community offers potentially significant improvements in performance over the current PSA threshold-based approach particularly as a first-line diagnostic testing approach. Nearly all the leading community-based risk calculators, however, still require an assessment of the prostate by digital rectal examination (DRE). This requirement limits their widest possible use, particularly in some ethnic groups and some clinical settings. Further work on improving the accuracy and removing the need for undertaking a DRE assessment would widen still further their utility and likely benefit to improving the diagnostic pathway for PCA. How best to communicate the results, how to create awareness of them and how to use them in the clinical setting remain key issues to be resolved.

Management Options

Once diagnosed and clinically characterised, there is a need to correctly risk stratify those patients that need the most effective treatment be it surgery, radiotherapy, chemotherapy or combinations of any of these. At present, the only robust choices on treatment allocation is the choice between AS and radical therapy, although there is a lot of research effort to more precisely risk assess allow more individualised treatment approaches.

To date, clinicians typically have been using different forms of risk stratification particularly to help with the management of localised and locally advanced non-metastatic prostate cancer. The easiest and most commonly used method of risk stratification is that of D’Amico et al. [23]. Three combinations of parameters, including preoperative PSA, clinical stage and Gleason score which are used to classify patients diagnosed with prostate cancer into low-, intermediate-, and high-risk categories.

In 2012, a critical review by Rodrigues et al. offered evidence that this simplistic model needed modification based on clinical prognostic evidence. The proposed changes included the creation of new risk group for “very-low” risk patients. Further stratification based on the splitting of intermediate-risk into a low- and high-intermediate-risk groups and the clarification of the interface between intermediate- and high-risk disease was desirable [24]. For future enhancements, the authors also recommended the potential use of other novel pathological factors such as percentage of biopsy core positive and subtype of Gleason grade 7.

By comparison, the Cancer of the Prostate Risk Assessment (UCSF-CAPRA) score was invented by incorporating two further factors including age at diagnosis and percentage of positive biopsy core. The CAPRA score ranges from 0 to 10 score with score of 0 to 2 indicative low-risk, score of 3 to 5 indicative intermediate-risk and score of 6 to 10 indicative of high risk [25].

At present, prostate cancer risk stratification mainly relies on clinical stage, initial PSA level, and Gleason score with some risk stratification scheme adding on further prognostic factors such as percentage of positive biopsy cores, PSA doubling time, PSA velocity, etc.

Future risk stratification will incorporate a much wider range of biomarkers. Current promising treatment stratification markers are shown in Table 1, but these are likely to be added to including epigenetic, micro-RNA and even circulated tumour cells themselves. These should provide better ways to characterise the individualised patient’s disease risk and hence appropriate management can be provided and are part of the big drive to “personalised medicine”.

In summary, at the present time, given that the best ways to treat different forms of PCA are not yet fully established has led to some confusion on how to best risk stratify men with diagnosed cancers. Ongoing large scale clinical evaluations of the best way to treat the various forms of prostate cancer, including for example the UK-based ProtecT trial will provide opportunities to further develop improved risk prediction to improve the prediction of best treatment options to aid both patients and clinicians alike. Furthermore, the very recent the International Society of Urological Pathology (ISUP) grading prognostic factors which will likely change all the current risk criteria [26].

Conclusions

Risk offers a unifying approach to both the diagnosis and risk stratification of men both before and after a diagnosis of PCA. Several areas of further development are required and these could best be done in an incremental way whilst starting to use the approach in practice.

Challenges remain including how best to use the individual risk calculators in the context of population stratification. There remains a lack of large scale studies on the actual usage/translation into practice and a number of factors work against this, i.e. who decides which is the best one and how to get widespread uptake and acceptance and how best to evaluate them (including choosing appropriate levels of AUC, calibration and validity). Relative costs will come into play depending on where they are going to be employed. Many risk prediction models derived from one country/healthcare system may not be accepted in other countries. Furthermore, what do men themselves feel about risk—qualitative studies suggests they often decide on particular actions based more in a heuristic (rule of thumb) way. The whole issue of communication of the risk is very important; we are not all the same and whilst some men might think one risk is high whilst others may consider differently. It is further worth noting, different attitudes within ethnic groups to risk, i.e. black men in UK were less likely to act at a given risk than white men [27]. In addition, other co-morbidities must feed into the consideration and is the decision on appropriate further action to be taken made by the medical profession or by the man himself.

Finally, how best can such estimates of PCA risk be used alongside estimates for other cancers/serious conditions which are also rapidly being developed.

In sum, risk prediction for PCA does improve particularly the diagnostic pathway for PCA and has the potential to be further developed to enhance risk stratification to help define better disease management. How best to use existing models in practice and to further enhance them remains to be fully established. If such translational studies are set up, future risk prediction improvements could be further refined in practice and in an ongoing way to more rapidly realise the full potential that the approach offers.