Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Improving diagnostic sensitivity of combined dermoscopy and reflectance confocal microscopy imaging through double reader concordance evaluation in telemedicine settings: A retrospective study of 1000 equivocal cases

  • A. M. Witkowski ,

    Contributed equally to this work with: A. M. Witkowski, J. Łudzik, G. Pellacani

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing

    alexander.witkowski@unimore.it

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • J. Łudzik ,

    Contributed equally to this work with: A. M. Witkowski, J. Łudzik, G. Pellacani

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Writing – original draft

    Affiliations Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy, Department of Biostatistics and Telemedicine, Jagiellonian University Medical College, Krakow, Poland

  • F. Arginelli ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • S. Bassoli ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • E. Benati ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • A. Casari ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • N. De Carvalho ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • B. De Pace ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • F. Farnetani ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • A. Losi ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • M. Manfredini ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • C. Reggiani ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

  • J. Malvehy ,

    Roles Formal analysis, Investigation

    ‡ These authors also contributed equally to this work.

    Affiliation Dermatology Department, Melanoma Unit, Barcelona, Spain

  • G. Pellacani

    Contributed equally to this work with: A. M. Witkowski, J. Łudzik, G. Pellacani

    Roles Conceptualization, Data curation, Project administration, Supervision, Validation, Writing – review & editing

    Affiliation Department of Dermatology, University of Modena and Reggio Emilia, Modena, Italy

Abstract

Background

Reflectance confocal microscopy (RCM) is an imaging device that permits non-invasive visualization of cellular morphology and has been shown to improve diagnostic accuracy of dermoscopically equivocal cutaneous lesions. The application of double reader concordance evaluation of dermoscopy-RCM image sets in retrospective settings and its potential application to telemedicine evaluation has not been tested in a large study population.

Objective

To improve diagnostic sensitivity of RCM image diagnosis using a double reader concordance evaluation approach; to reduce mismanagement of equivocal cutaneous lesions in retrospective consultation and telemedicine settings.

Methods

1000 combined dermoscopy-RCM image sets were evaluated in blind by 10 readers with advanced training and internship in dermoscopy and RCM evaluation. We compared sensitivity and specificity of single reader evaluation versus double reader concordance evaluation as well as the effect of diagnostic confidence on lesion management in a retrospective setting.

Results

Single reader evaluation resulted in an overall sensitivity of 95.2% and specificity of 76.3%, with misdiagnosis of 8 melanomas, 4 basal cell carcinomas and 2 squamous cell carcinomas. Combined double reader evaluation resulted in an overall sensitivity of 98.3% and specificity of 65.5%, with misdiagnosis of 1 in-situ melanoma and 2 basal cell carcinomas.

Conclusion

Evaluation of dermoscopy-RCM image sets of cutaneous lesions by single reader evaluation in retrospective settings is limited by sensitivity levels that may result in potential mismanagement of malignant lesions. Double reader blind concordance evaluation may improve the sensitivity of diagnosis and management safety. The use of a second check can be implemented in telemedicine settings where expert consultation and second opinions may be required.

Introduction

Cutaneous tumor diagnosis can be difficult due to the diverse clinical and dermoscopic presentation of cutaneous lesions. In order to correctly identify an early melanoma (MM) the use of dermoscopy has been shown to significantly increase the sensitivity and specificity of diagnosis when compared with traditional naked-eye examination [1,2]. In equivocal cases benign lesions may be excised when further cytological information is required to rule out malignancy. The efficacy of benign tumor differentiation from MM can be measured by the number needed to excise ratio (NNE), for which dermoscopy ranges between 8.7 to 29.4, according to the level of expertise and clinical setting [3,4].

In the past decade reflectance confocal microscopy (RCM) use in clinical practice has been shown to further improve early MM diagnosis and help to reduce the number of unnecessary excisions in different settings, as summarized in Table 1 [522] and confirmed by recent reviews and meta-analysis [23,24].

thumbnail
Table 1. Previous studies evaluating the accuracy of reflectance confocal microscopy.

https://doi.org/10.1371/journal.pone.0187748.t001

Recently, the reliability of teleconsultation with dermoscopy images only [25,26] or combined dermoscopy-RCM images [5,10,14] has been tested which showed the capability of an accurate diagnosis but risk of mismanagement of potentially dangerous lesions, with diagnostic accuracy depending greatly on the level of expertise. Since RCM is a newly adopted technology its limitation to proper and safe diffusion may be due to the limited number of expert users in the field and lack of dedicated training programs to properly support the necessary knowledge acquisition and experience needed for safe and effective implementation in clinical practice. With the potential for an increase in requests for distant expert consultation of RCM images there is a need to provide proper, accurate and safe management of lesions sent for second expert consultation and in addition provide an available and effective training program for new users. Our goal in this study was to test the diagnostic sensitivity of readers who followed a dedicated dermoscopy-RCM training program and evaluate the difference between single reading versus double reading concordance evaluation of dermoscopy-RCM image sets. Additionally, we intended to determine if it is possible to reduce mismanagement of equivocal cutaneous lesions in retrospective consultation and telemedicine settings.

Materials and methods

Patient population

This was an Ethical Committee approved retrospective study (protocol 71.14; November 11, 2015) within the European Project DIAGNOPTICS (grant n. 621066) based on 1000 consecutive cases that were evaluated with dermoscopy and RCM imaging for diagnostic decision in order to rule out a diagnosis of MM during the period of January 2010 to August 2011 at the Dermatology Department at the University of Modena and Reggio Emilia (UNIMORE). The inclusion criteria were: (i) lesion excised during the first visit or follow-up sequential digital dermoscopy (i.e. videodermoscopy; mole-mapping) control visit with corresponding histology report; (ii) lesion not excised but with at least a 1 year stable digital dermoscopy and RCM imaging follow-up with no significant structural changes in dermoscopy and absence of increased atypia in RCM–considered to be benign and comparable to the histopathology gold standard; (iii) availability of digital dermoscopy images; (iv) availability of a complete standard set of RCM images. The first 1000 cases fitting the inclusion criteria were selected for this study in order to obtain an even number of cases to be evaluated by the participating readers. All patient cases were organized and renamed (0001–1000) by AW into separate folders without the patient name, number or identifying information so that the participating readers were blinded from the actual diagnosis and treatment course listed in the department database.

Reader training

Ten readers that included eight dermatology residents and two newly trained dermatology specialists participated in a tutorial training program for a six month duration with a minimum of 2000 dermoscopy-RCM case reviews and at least three months of clinical exposure to bedside dermoscopy-RCM clinical decision making at the UNIMORE skin cancer unit.

Imaging protocol and evaluation

Dermoscopy images were obtained with a DermLite FOTO System (DermLite Photo; 3Gen, San Juan Capistrano, CA, USA). RCM images were obtained with a reflectance confocal microscope (Vivascope 1500; MAVIG GmbH, Munich, Germany). A minimum of three mosaics (VivaCube; Caliber I.D., Inc., Rochester, NY, USA), images of 0.5 mm x 0.5 mm acquired and stitched into composite images, were obtained at different depths, corresponding to the stratum granulosum/spinosum, the dermal-epidermal junction (DEJ) and the papillary dermis [22]. The 1000 consecutive cases selected for evaluation in this study included the following malignancies: MM, basal cell carcinoma (BCC) and squamous cell carcinoma (SCC) and benign lesions (including naevi, solar lentigos (SL), seborrheic keratosis (SK), lichen planus-like keratosis (LPLK) and actinic keratosis (AK)), or other benign lesions that were not used during the training period of the 10 readers. The image sets were made available for evaluation during a time period of 3 months in which each reader received a total of 200 randomized cases of dermoscopy-RCM image sets. Cases were randomized so that each case was evaluated in blind by two different readers. None of the readers obtained an identical case list and each case was only evaluated twice. Each reader was asked to provide their management decision, degree of confidence and suspected diagnosis based only on the provided image sets into a Microsoft excel file. Management recommendation (hypothetical) was grouped into two categories: (i) excision or (ii) no-excision. Management decision confidence level was graded: (i) low or (ii) high. In order to test concordance of double reading, data from the excel files (Readers 1 to 10) were matched by computer and automatically classified for management with excision when (1) management decision was concordant for excision or when (2) management decision was discordant (i.e. one reader was recommending excision and the other no-excision).

Statistical analysis

Statistical analyses were performed using IBM SPSS Statistics Professional software (release 20.0.0, 2011; SPSS Inc., Chicago, IL, U.S.A.). Mean and standard deviation were calculated for Breslow’s thickness. Diagnostic values of sensitivity and specificity of individual readers were calculated for malignant versus benign lesions. Receiver Operating Characteristic (ROC) was calculated using binary diagnosis values (0: all benign lesion types, 1: all malignant lesion types) as the state variable and combination of “confidence” and “management” obtained by the 2 readers, corresponding to the following scores: -3 high confidence for no-excision of both readers; -2 no-excision suggested by both reader, one with high confidence and the other with low confidence; -1 low confidence for no-excision of both readers; 0 disagreement (one reader suggesting excision, while the other no-excision); +1 low confidence for excision of both readers; +2 excision suggested by both reader, one with high confidence and the other with low confidence; +3 high confidence for excision of both readers.

Results

Based on histologic diagnosis there were 176 malignant cases that included MM (83 cases, mean Breslow’s thickness: 0.537 mm; standard deviation: 0.693 mm), BCC (87 cases) and SCC (6 cases). The remaining 824 cases included 749 naevi, of which 16 were spitz naevi, SL/SK/LPLK/AK (58 cases) and other benign lesions that included dermatofibromas, angiokeratomas, angiomas and other benign lesions classified as other (17 cases). The evaluation of 200 cases by each of the ten confocal readers was compared with the actual diagnosis resulting in 2000 total evaluations (each case evaluated by 2 different readers). Overall reader diagnostic performance is reported in (Table 2) (S1 Table).

thumbnail
Table 2. Individual management performance of single reader evaluations.

https://doi.org/10.1371/journal.pone.0187748.t002

After randomization each reader received an average of 35 malignancies in their case series (ranging from 25 to 42 malignancies) including an average of 16.4 MMs (ranging between 14 to 22 MMs per reader). The overall average sensitivity for single reader evaluation was 95.2% (ranging between 86 to 100%) and specificity was 76.3% (ranging between 67 to 84%). Excision was recommended with high confidence in 142/166 (85.5%) evaluations of 83 MMs, 83/87 (95.4%) BCCs, 4/6 (66.7%) SCCs and 81/749 (10.8%) of benign lesions. Seventeen misdiagnoses of a malignancy were given out of 352 evaluations of malignant lesions, including 9 times for MM diagnosis (one of which was recommended twice for no-excision), 6 times for BCC diagnosis (two of which were recommended twice for no-excision) and twice for SCC diagnosis. In the group of hypothetically mismanaged malignancies readers reported high confidence in 2 out of 6 misdiagnosed invasive MMs, 2 out of 3 misdiagnosed in-situ MMs, 3 out of 6 misdiagnosed BCCs and 1 out of 2 misdiagnosed SCCs.

Considering double reading the overall sensitivity was 98.3% and specificity was 65.5% for a cut-off threshold for excision when at least 1 reader suggested excision, regardless of their confidence level (Table 3). Only 1/83 (1.2%) MMs, corresponding to an in-situ MM, was mismanaged with concordant management decisions for no-excision by both readers. For BCCs 2/87 (2.3%) were mismanaged with concordant decisions of no-excision by both readers. For SCC excision was recommended in all cases. When two readers concordantly chose to manage a case with no-excision with both readers reporting high confidence no malignant lesions were mismanaged (35.4%, corresponding to 292 of 824 benign lesions). In all 3 mismanaged malignancies (1 in-situ MM and 2 BCCs) at least one reader reported a decision with low confidence. A discordant report (one reader suggesting excision and the other no-excision) was given for 12 malignancies (8 MMs, 2 BCCs and 2 SCCs) and 176 out of 824 benign lesions. Concordant agreement to excise resulted in correct management of 161/176 (91.4%) of malignancies and an over diagnosis of 108/824 (13.1%) benign lesions. ROC area under the curve for double reading diagnostic performance was 0.948 (p < 0.001) (Fig 1).

thumbnail
Table 3. Overall management performance of double reader evaluation.

https://doi.org/10.1371/journal.pone.0187748.t003

Individual RCM features were evaluated (based on 2000 evaluations) for their frequencies in the following categories: presence of pagetoid cells, architectural disarray with melanocytic features, BCC tumor islands and/or cords and marked dyskeratosis of the epidermis (Table 4). Pagetoid cells (atypical melanocytes at the level of the epidermis) were reported in 84.3% of MMs, 0.6% of BCCs and none of the SCCs. Pagetoid cells were also reported in benign lesions: 20.3% of naevi (excluding spitz naevi), 81.2% of spitz naevi, 8.8% of SL/SK/LPLK/AK and 2.7% of other benign lesions (p < 0.001). Architectural disarray with melanocytic features at the level of the dermal-epidermal junction was reported in 80.1% of MMs and was completely absent in both BCC and SCC. Frequencies of architectural disarray in benign lesions were the following: 16.6% of naevi (excluding spitz naevi), 43.8% of spitz naevi, 5.3% SL/SK/LPLK/AK and absent in other benign lesions (p < 0.001). For the evaluation of BCC tumor islands and/or cords their presence was reported in 89.1% of BCCs, 2.4% of MMs and 16.7% of SCCs. In all other benign lesions BCC tumor islands and/or cords were reported in less than 1% of cases (p < 0.001). Finally, marked dyskeratosis (greater than one third) of the epidermis was evaluated with the highest frequency in SCCs 58.3% and less than 10% in all other malignant and benign lesions (p < 0.001).

thumbnail
Table 4. Reported frequencies of specific RCM features (based on 2000 evaluations).

https://doi.org/10.1371/journal.pone.0187748.t004

Discussion

The purpose of our study was to compare single reading versus double reading concordance evaluation of dermoscopy-RCM image sets. In the analysis we compared evaluations made by 10 readers to either histopathology diagnosis (excised cases) or what we considered benign lesions, those that were stable after one-year sequential digital dermoscopy in both dermoscopy and RCM imaging. We tested the diagnostic sensitivity and safety of evaluations in a retrospective and telemedicine setting, resulting in acceptable general diagnostic accuracy for both methods.

Unlike previous studies that tested the diagnostic accuracy of RCM based on single reader evaluations in non-telemedicine settings where one evaluation made on-site resulted in one simultaneous final management decision, we tested the possibility to implement RCM towards equivocal lesions that in real-world settings may be sent for a second opinion. By adding a second RCM reader and testing whether reader concordance and/or confidence has an impact on potential improvement of sensitivity we aimed to reduce the limitations of single RCM reader evaluation in both dermoscopic equivocal and confocal equivocal cases.

Previous RCM studies tested performance of single evaluators and reported variable sensitivities and specificities, mainly related to different study settings, study populations and evaluator expertise, but all studies demonstrated a good diagnostic performance in settings of difficult to diagnose lesions [522]. Previous studies include two main types: non-interventional and interventional studies. Non-interventional studies tested diagnostic accuracy in multiple study populations that in total included 3524 lesions, including 879 MMs. These studies mostly evaluated diagnostic performance of RCM alone and differential diagnosis was predominantly based on dermoscopically equivocal lesions [5,710,1322]. Reported diagnostic sensitivity was high in all cases (ranging between 84 to 100%) and specificity variable, lower in more difficult study populations such as those containing pink lesions [5,7] and equivocal lesions that presented with architectural changes during sequential digital dermoscopic follow-up [8,9], demonstrating the usefulness and reliability of this methodology for achieving a precise MM diagnosis. The always superior specificity and comparable sensitivity, when compared with dermoscopy alone, suggests the benefit of combined use of these technologies in clinical settings [18,20].

The second study type, prospective interventional, is characterized by the measurement of accuracy in real world settings with clinicians making decisions and bearing legally responsibility while using RCM [6,11,12]. In these studies diagnostic and management decision was taken using all available information including clinical and dermoscopic, explaining the excellent sensitivities and on average a lower specificity. The main outcome measure was reported as the NNE, corresponding to the number of benign lesions excised to detect one MM. Pellacani et al. reported a NNE of 6.8, in contrast to 14.6 without RCM, when RCM was used for management recommendation in a more conservative setting of moderately equivocal lesions [11]. On the other hand, Alarcon et al. stressed the use of RCM in a cohort of dermoscopically positive lesions, reaching a reduction of the NNE from 2.87 to 1.12 [12]. Of note, the recent study by Borsari et al. studied a large sample size of 1279 lesions that underwent RCM examination in a real-world setting because of dermoscopy indication, similar to Alarcon et al., confirming the achievement of an excellent NNE and reported sensitivity of 95.3% [6]. From literature data we can conclude that RCM use in clinical settings is both powerful and cost-effective with the ability to save unnecessary excisions and a very low risk to mismanage a MM, especially evident when applied to dermoscopically positive lesions [27].

However, one limitation for RCM introduction into clinical practice is due to the demonstrated need of dedicated expert training before performing at a clinically comfortable and safe level. Rao et al. reported a good overall diagnostic performance of RCM in a telemedicine setting, but perfect management of all MMs was proposed only by the expert reader whereas the less experienced reader mismanaged 1 out of 9 MMs [14]. A subsequent case revision from the same group showed improvement of diagnostic performance in the same clinical setting after several additional years of RCM practice in a clinical setting [28]. Similarly, Farnetani et al. showed variable accuracy, ranging from a sensitivity of 88.9% and specificity of 79.3%, correlated directly with the expertise of each evaluator [10], confirming the common assertion that RCM is accurate and safe only in expert hands. The need for rapid transfer of essential RCM knowledge and expertise in order to immediately offer the clinical benefits of RCM into practice is supportive of telemedicine application in this field of non-invasive skin cancer screening [29]. However, a telemedicine setting is only able to transfer part of the information and does not permit the "in person patient-doctor experience" which may sometimes be relevant to perceive diagnostic hints and clinical clues able to instinctively direct the clinician to the correct management.

In this study we aimed to test the validity of a RCM telemedicine setting to support diagnosis at a distance and to identify potential strategies that may minimize the risk of MM mismanagement in clinical settings. In our study there were no expert confocal readers involved, but each of the 10 readers received identical dedicated training during a 6 month period for the purpose of future telemedicine evaluation. All 10 readers in our study followed a specific training program that included the individual study and courses entailing dermoscopy and RCM textbooks for skin cancer diagnosis as well as a minimum of 3 months daily exposure to our dermoscopy and RCM outpatient clinic with at least 2000 clinical cases observed and discussed with a tutor.

The dermoscopy-RCM image sets in our study assessed by single reader evaluation reproduced the context of a telemedicine setting and resulted in an overall sensitivity of 95.2% (range: 86 to 100%) and specificity of 76.3% (minimum: 67.5%). Our data is consistent with published literature showing that RCM has a high sensitivity for malignancies: 94.0% for MM, 96.5% for BCC and 87.5% for SCC as well as high specificity for naevi (excluding spitz naevi) (77.5%) and the group of SK/SL/LPLK/AK (79.6%). Overall single reader evaluation resulted in the mismanagement of 8 individual MMs, 4 BCCs and 2 SCCs (based on 2000 total evaluations). These initial results illustrate the potential limitation of single reader evaluation where the diagnostic sensitivity may not be high enough for safe management of MM and allow for potentially dangerous misdiagnosis of lesions that are already invasive or can be life threatening in their progression. This limitation can be attributed to a variety of factors including: lack of sufficient training, limited clinical exposure to RCM, confidence variability, different evaluation criteria followed, fatigue or expediting completion of a high volume of cases in different scenarios.

The addition of a second reader with management for excision made when excision was recommended in at least one of the two evaluations improved the overall sensitivity to 98.3% and lowered specificity to 65.5% (10.8% reduction) resulting in hypothetical mismanagement of only 1 in-situ MM and 2 BCCs. The effect of double reader performance decreased the likelihood of mismanagement of an invasive MM in the study population while maintaining an acceptable specificity where the number of unnecessary excision of benign lesions was comparable to that of single reading and to current data from real-world settings [6,11,12]. The mismanaged in-situ MM was a diagnostic challenge upon initial presentation. Upon histopathology examination this lesion was characterized by a predominantly lentiginous atypical melanocytic proliferation, consistent with the pattern of lentiginous MM, that represents a slowly progressing variant of MM [30,31]. This lesion was placed into sequential digital dermoscopy follow-up during the first patient visit and was detected during second visit control examination and subsequently removed at that time. The two BCCs cases mismanaged were both small superficial types present on the trunk. Since BCC rarely metastasis these mismanaged false negative lesions would not present significant health risk to these patients and most likely would be diagnosed on a future follow-up digital dermoscopy visit when dermoscopic features would become mature and more apparent.

For individual RCM features we evaluated the frequency of the presence of pagetoid cells, architectural disarray with melanocytic features, BCC tumor islands and/or cords and marked dyskeratosis of the epidermis. Pagetoid cells were reported in 84.3% of MM, 81.2% of spitz naevi and 20.3% of naevi (excluding spitz naevi) as well as architectural disarray with presence of melanocytic features at the DEJ in in 80.1% of MM, 43.8% of spitz naevi and 16.6% of naevi (excluding spitz naevi). These frequencies demonstrate that both pagetoid cells and architectural disarray with melanocytic features at the DEJ are prevalent features of MM. Since both of these features overlap into the category spitz naevi our data is in line with recent publications showing the limitation of RCM in distinguishing spitz naevi from MM, in our study overall single reader specificity was 6.2% and double reader specificity was 0% [32,33]. In the naevi category (excluding spitz naevi) both of these features were present in low but notable frequencies demonstrating their presence in selected groups of melanocytic naevi, particularly those that are congenital, traumatized and/or dysplastic [34,35]. Additionally, the frequencies of BCC tumor islands and/or cords in the BCC population (89.1%) and dyskeratosis in the SCC population (58.3%) were also consistent with literature highlighting their presence as specific for BCC and common in SCC [3638].

Compared to previously reported methods and technology utilized for skin cancer diagnosis, RCM resulted in a superior combined sensitivity and specificity when applied to dermoscopically equivocal lesions. As examples, multispectral imaging studies reported high sensitivity but have always been associated with poor specificity [39,40]. Monheit et al. performed a prospective, multicenter, blinded study of 1632 lesions and used multispectral imaging to differentiate cutaneous MMs from melanocytic skin lesions using histology as the reference standard. Standard images and patient information for a subset of 50 randomly selected lesions (25 MMs) were used in a reader study of 39 independent dermatologists, resulting in an overall sensitivity of 98.3% and specificity of 9.9% [39]. Lui et al. evaluated 1022 lesions testing Raman-spectroscopy in-vivo and reported that for high sensitivities (95% to 99%) the specificities ranged between 15% and 54% [41], data confirmed in other studies [42,43]. The OCT method is very popular with BCC diagnosis [44,45], but has limited application in differentiating melanocytic tumors due to its lower resolution than RCM [46]. Although Gambichler et al. reported a sensitivity of 74.1% and specificity of 92.4% in the differentiation of cutaneous MM and melanocytic naevi, the authors only included clear MMs and benign lesions, not equivocal ones, which may have given OCT an artificially positive outlook in its application to pigmented lesions [47]. Due to the high resolution and the capability to detect cell metabolic activity, in-vivo multiphoton microscopy is a promising research tool in the field of MM diagnosis [48]. Dimitrow et al. tested in-vivo multiphoton microscopy application in 53 melanocytic lesions (of which there were 26 MMs, 8 of which were in-situ) and reported sensitivity up to 95% and specificity up to 97% in the detection of MM [49]. This technology seems to be very accurate but is limited in clinical settings due to its single point view of a skin lesion prohibiting full-field of view and in time-restrained settings where complete visualization of a cutaneous lesion architecture is necessary to make an accurate diagnosis, mismanagement of MM is possible [50].

In our study it is important to consider the limitation of reader performance based on a retrospective study setting where readers may obtain a higher performance when liability is absent. In consideration of this variable, results in real-world clinical and telemedicine settings may be lower than our own. Therefore, the application of double reading concordance may supplement this potential reduction of performance. Additionally, several malignant cases were managed as benign with high confidence by single reader evaluation when they were in fact malignant and invasive. When the same lesion management was paired in a double reading concordance setting this potentially dangerous mismanagement was reduced due to decision to excise lesions when the double reading was discordant or management decision was concordant for excision.

Double reading enhanced the diagnostic sensitivity and reduced the likelihood to mismanage an invasive MM with the limitation shown to be in in-situ lesions, impossible to detect on initial presentation with traditional naked-eye examination as well as a low chance of detection using dermoscopy. These in-situ lesions can be well managed with annual screenings and registration of suspicious lesions with sequential digital dermoscopy monitoring. The optimal threshold of safety and accuracy in our study was when two readers managed a skin lesion with concordant benign diagnosis resulting (no-excision) and management of excision when at least one reader was proposing management with excision (discordance). The expected sensitivity/specificity tradeoff must always be considered with any type of screening test and in our study the use of double reading brought sensitivity closer to 100% while reducing specificity from its single reader baseline by only 10.8%. Since an additional benefit of RCM implementation is the reduction of unnecessary excisions, calculated by NNE, this variable should be considered [3].

In conclusion, integration of dermoscopy-RCM imaging is valid in a clinical setting and may be reliable for second expert consultation via telemedicine by adding confidence to equivocal dermoscopic evaluation. The double reader concordance approach may minimize the risk to mismanage a potentially life-threatening skin cancer while maintaining acceptable specificity. Patient screening and sequential digital dermoscopy follow-up should be recommended for all equivocal lesions that are not excised on first visits in order to detect changes over time that may not be visible at initial presentation and initiation of sequential digital dermoscopy (mole-mapping) programs should be considered. Given the high variability of sensitivity and specificity amongst the readers it is important to consider implementation of standardized training and reading programs to help minimize the risk of potential mismanagement using RCM in both clinical and store-and-forward telemedicine settings. Furthermore, a prospective study seems necessary in order to evaluate this promising technology in a real-time telemedicine scenario.

Supporting information

References

  1. 1. Vestergaard ME, Macaskill P, Holt PE, Menzies SW. Dermoscopy compared with naked eye examination for the diagnosis of primary melanoma: a meta-analysis of studies performed in a clinical setting. Br J Dermatol. 2008 Sep;159(3):669–76. pmid:18616769
  2. 2. Kittler H, Pehamberger H, Wolff K, Binder M. Diagnostic accuracy of dermoscopy. Lancet Oncol. 2002 Mar;3(3):159–65. pmid:11902502
  3. 3. Argenziano G, Cerroni L, Zalaudek I, Staibano S, Hofmann-Wellenhof R, Arpaia N et al. Accuracy in melanoma detection: a 10-year multicenter survey. J Am Acad Dermatol. 2012 Jul;67(1):54–9. pmid:21982636
  4. 4. Argenziano G, Moscarella E, Annetta A, Battarra VC, Brunetti B, Buligan C et al. Melanoma detection in Italian pigmented lesion clinics. G Ital Dermatol Venereol. 2014 Apr;149(2):161–6. pmid:24819635
  5. 5. Łudzik J, Witkowski AM, Roterman-Konieczna I, Bassoli S, Farnetani F, Pellacani G. Improving Diagnostic Accuracy of Dermoscopically Equivocal Pink Cutaneous Lesions with Reflectance Confocal Microscopy in Telemedicine Settings: Double Reader Concordance Evaluation of 316 Cases. PLoS One. 2016 Sep 8;11(9).
  6. 6. Borsari S, Pampena R, Lallas A, Kyrgidis A, Moscarella E, Benati E et al. Clinical Indications for Use of Reflectance Confocal Microscopy for Skin Cancer Diagnosis. JAMA Dermatol. 2016 Oct 1;152(10):1093–1098. pmid:27580185
  7. 7. Guitera P, Menzies SW, Argenziano G, Longo C, Losi A, Drummond M et al. Dermoscopy and in vivo confocal microscopy are complementary techniques for diagnosis of difficult amelanotic and light-coloured skin lesions. Br J Dermatol. 2016 Dec;175(6):1311–1319. pmid:27177158
  8. 8. Stanganelli I, Longo C, Mazzoni L, Magi S, Medri M, Lanzanova G et al. Integration of reflectance confocal microscopy in sequential dermoscopy follow-up improves melanoma detection accuracy. Br J Dermatol. 2015 Feb;172(2):365–71. pmid:25154446
  9. 9. Lovatto L, Carrera C, Salerni G, Alós L, Malvehy J, Puig S. In vivo reflectance confocal microscopy of equivocal melanocytic lesions detected by digital dermoscopy follow-up. J Eur Acad Dermatol Venereol. 2015 Oct;29(10):1918–25. pmid:25752663
  10. 10. Farnetani F, Scope A, Braun RP, Gonzalez S, Guitera P, Malvehy J et al. Skin Cancer Diagnosis With Reflectance Confocal Microscopy: Reproducibility of Feature Recognition and Accuracy of Diagnosis. JAMA Dermatol. 2015 Oct;151(10):1075–80. pmid:25993262
  11. 11. Pellacani G, Pepe P, Casari A, Longo C. Reflectance confocal microscopy as a second-level examination in skin oncology improves diagnostic accuracy and saves unnecessary excisions: a longitudinal prospective study. Br J Dermatol. 2014 Nov;171(5):1044–51. pmid:24891083
  12. 12. Alarcon I, Carrera C, Palou J, Alos L, Malvehy J, Puig S. Impact of in vivo reflectance confocal microscopy on the number needed to treat melanoma in doubtful lesions. Br J Dermatol. 2014 Apr; 170(4): 802–808. pmid:24124911
  13. 13. Ferrari B, Pupelli G, Farnetani F, De Carvalho NT, Longo C, Reggiani C et al. Dermoscopic difficult lesions: An objective evaluation of reflectance confocal microscopy impact for accurate diagnosis. J Eu Acad Dermatol Venereol 2015;29(6):1135–1140.
  14. 14. Rao BK, Mateus R, Wassef C, Pellacani G. In vivo confocal microscopy in clinical practice: comparison of bedside diagnostic accuracy of a trained physician and distant diagnosis of an expert reader. J Am Acad Dermatol. 2013 Dec;69(6):e295–300. pmid:24035553
  15. 15. Longo C, Farnetani F, Ciardo S, Cesinaro AM, Moscarella E, Ponti G et al. Is confocal microscopy a valuable tool in diagnosing nodular lesions? A study of 140 cases. Br J Dermatol. 2013:169(1):58–67. pmid:23374159
  16. 16. Guitera P, Menzies SW, Longo C, Cesinaro AM, Scolyer RA, Pellacani G. In vivo confocal microscopy for diagnosis of melanoma and basal cell carcinoma using a two-step method: analysis of 710 consecutive clinically equivocal cases. J Invest Dermatol. 2012 Oct;132(10):2386–94. pmid:22718115
  17. 17. Guitera P, Pellacani G, Crotty KA, Scolyer RA, Li LX, Bassoli S et al. The impact of in vivo reflectance confocal microscopy on the diagnostic accuracy of lentigo maligna and equivocal pigmented and nonpigmented macules of the face. J Invest Dermatol. 2010 Aug;130(8):2080–91. pmid:20393481
  18. 18. Guitera P, Pellacani G, Longo C, Seidenari S, Avramidis M, Menzies SW. In vivo reflectance confocal microscopy enhances secondary evaluation of melanocytic lesions. J Invest Dermatol. 2009 Jan;129(1):131–8. pmid:18633444
  19. 19. Segura S, Puig S, Carrera C, Palou J, Malvehy J. Development of a two-step method for the diagnosis of melanoma by reflectance confocal microscopy. J Am Acad Dermatol. 2009 Aug;6192):216–29.
  20. 20. Langley RG, Walsh N, Sutherland AE, Propperova I, Delaney L, Morris SF et al. The diagnostic accuracy of in vivo confocal scanning laser microscopy compared to dermoscopy of benign and malignant melanocytic lesions: a prospective study. Dermatology. 2007;215(4):365–72. pmid:17912001
  21. 21. Pellacani G, Guitera P, Longo C, Avramidis M, Seidenari S, Menzies S. The impact of in vivo reflectance confocal microscopy for the diagnostic accuracy of melanoma and equivocal melanocytic lesions. J Invest Dermatol. 2007:127(12):2759–2765. pmid:17657243
  22. 22. Pellacani G, Cesinaro AM, Seidenari S. Reflectance-mode confocal microscopy of pigmented skin lesions-improvement in melanoma diagnostic specificity. J Am Acad Dermatol 2005:53(6):979–985. pmid:16310058
  23. 23. Edwards SJ, Osei-Assibey G, Patalay R, Wakefield V, Karner C. Diagnostic accuracy of reflectance confocal microscopy using VivaScope for detecting and monitoring skin lesions: a systematic review. Clin Exp Dermatol. 2017 Feb 20. [Epub ahead of print].
  24. 24. Xiong YQ, Ma SJ, Mo Y, Huo ST, Wen YQ, Chen Q. Comparison of dermoscopy and reflectance confocal microscopy for the diagnosis of malignant skin tumours: a meta-analysis. J Cancer Res Clin Oncol. 2017 Mar 13. [Epub ahead of print].
  25. 25. Piccolo D, Smolle J, Argenziano G, Wolf IH, Braun R, Cerroni Let al. Teledermoscopy—results of a multicentre study on 43 pigmented skin lesions. J Telemed Telecare. 2000;6(3):132–7. pmid:10912329
  26. 26. Arzberger E, Curiel-Lewandrowski C, Blum A, Chubisov D, Oakley A, Rademaker M et al. Teledermoscopy in High-risk Melanoma Patients: A Comparative Study of Face-to-face and Teledermatology Visits. Acta Derm Venereol. 2016 Jan 18.
  27. 27. Pellacani G, Witkowski A, Cesinaro AM, Losi A, Colombo GL, Campagna A et al. Cost-benefit of reflectance confocal microscopy in the diagnostic performance of melanoma. J Eur Acad Dermatol Venereol. 2016 Mar;30(3):413–9. pmid:26446299
  28. 28. Giambrone D, Alamgir M, Masud A, Bronsnick T, Rao B. The diagnostic accuracy of in vivo confocal microscopy in clinical practice. J Am Acad Dermatol. 2015 Aug;73(2):317–9. pmid:26183976
  29. 29. Witkowski A, Łudzik J, Soyer HP. Telediagnosis with Confocal Microscopy: A Reality or a Dream? Dermatol Clin. 2016 Oct;34(4):505–512. pmid:27692456
  30. 30. Mooi WJ. "Lentiginous melanoma": full-fledged melanoma or melanoma precursor? Adv Anat Pathol. 2014 May;21(3):181–7. pmid:24713988
  31. 31. King R. Lentiginous melanoma. Arch Pathol Lab Med. 2011 Mar;135(3):337–41. pmid:21366457
  32. 32. Pellacani G, Longo C, Ferrara G, Cesinaro AM, Bassoli S, Guitera P et al. Spitz nevi: In vivo confocal microscopic features, dermatoscopic aspects, histopathologic correlates, and diagnostic significance. J Am Acad Dermatol 2009; 60(2):236–247. pmid:19091443
  33. 33. Guida S, Pellacani G, Cesinaro AM, Moscarella E, Argenziano G, Farnetani F et al. Spitz naevi and melanomas with similar dermoscopic pattern: can confocal microscopy differentiate? Br J Dermatol. 2016;174(3):610–616. pmid:26554394
  34. 34. Pellacani G, Farnetani F, Gonzalez S, Longo C, Cesinaro AM, Casari A et al. In vivo confocal microscopy for detection and grading of dysplastic nevi: a pilot study. J Am Acad Dermatol. 2012 Mar;66(3):e109–21. pmid:21742408
  35. 35. Gill M, Longo C, Farnetani F, Cesinaro AM, González S, Pellacani G. Non-invasive in vivo dermatopathology: identification of reflectance confocal microscopic correlates to specific histological features seen in melanocytic neoplasms. J Eur Acad Dermatol Venereol. 2014 Aug;28(8):1069–78. pmid:24147614
  36. 36. Witkowski AM, Łudzik J, DeCarvalho N, Ciardo S, Longo C, DiNardo A et al. Non-invasive diagnosis of pink basal cell carcinoma: how much can we rely on dermoscopy and reflectance confocal microscopy? Skin Res Technol. 2016 May;22(2):230–7. pmid:26338448
  37. 37. Kadouch DJ, Schram ME, Leeflang MM, Limpens J, Spuls PI, de Rie MA. In vivo confocal microscopy of basal cell carcinoma: a systematic review of diagnostic accuracy. J Eur Acad Dermatol Venereol. 2015 Oct;29(10):1890–7. pmid:26290493
  38. 38. Peppelman M, Nguyen KP, Hoogedoorn L, van Erp PE, Gerritsen MJ. Reflectance confocal microscopy: non-invasive distinction between actinic keratosis and squamous cell carcinoma. J Eur Acad Dermatol Venereol. 2015 Jul;29(7):1302–9. pmid:25357235
  39. 39. Monheit G, Cognetta AB, Ferris L, Rabinovitz H, Gross K, Martini M et al. The performance of MelaFind: a prospective multicenter study. Arch Dermatol. 2011 Feb;147(2):188–94. pmid:20956633
  40. 40. Song E, Grant-Kels JM, Swede H, D'Antonio JL, Lachance A, Dadras SSet al. Paired comparison of the sensitivity and specificity of multispectral digital skin lesion analysis and reflectance confocal microscopy in the detection of melanoma in vivo: A bcross-sectional study. J Am Acad Dermatol. 2016 Dec;75(6):1187–1192.e2. pmid:27693007
  41. 41. Lui H, Zhao J, McLean D, Zeng H. Real-time Raman spectroscopy for in vivo skin cancer diagnosis. Cancer Res. 2012 May 15;72(10):2491–500. pmid:22434431
  42. 42. Zhao J, Lui H, Kalia S, Zeng H. Real-time Raman spectroscopy for automatic in vivo skin cancer detection: an independent validation. Anal Bioanal Chem. 2015 Nov;407(27):8373–9. pmid:26231688
  43. 43. Dimitrow E, Ziemer M, Koehler MJ, Norgauer J, Konig K, Elsner P et al. Sensitivity and specificity of multiphoton laser tomography for in vivo and ex vivo diagnosis of malignant melanoma. J Invest Dermatol. 2009 Jul;129(7):1752–8. pmid:19177136
  44. 44. Ulrich M, von Braunmuehl T, Kurzen H, Dirschka T, Kellner C, Sattler E et al. The sensitivity and specificity of optical coherence tomography for the assisted diagnosis of nonpigmented basal cell carcinoma: an observational study. Br J Dermatol. 2015 Aug;173(2):428–35. pmid:25904111
  45. 45. Olsen J, Themstrup L, De Carvalho N, Mogensen M, Pellacani G, Jemec GB. Diagnostic accuracy of optical coherence tomography in actinic keratosis and basal cell carcinoma. Photodiagnosis Photodyn Ther. 2016 Dec;16:44–49. pmid:27519350
  46. 46. Maher NG, Blumetti TP, Gomes EE, Cheng HM, Satgunaseelan L, Lo S et al. Melanoma diagnosis may be a pitfall for optical coherence tomography assessment of equivocal amelanotic or hypomelanotic skin lesions. Br J Dermatol. 2016 Nov 18. [Epub ahead of print]. pmid:27861726
  47. 47. Gambichler T, Schmid-Wendtner MH, Plura I, Kampilafkos P, Stücker M, Berking C et al. A multicenter pilot study investigating high-definition optical coherence tomography in the differentiation of cutaneous melanoma and melanocytic naevi. J Eur Acad Dermatol Venereol. 2015 Mar;29(3):537–41.
  48. 48. Seidenari S, Arginelli F, Dunsby C, French PM, König K, Magnoni C et al. Multiphoton laser tomography and fluorescence lifetime imaging of melanoma: morphologic features and quantitative data for sensitive and specific non-invasive diagnostics. PLoS One. 2013 Jul 26;8(7):e70682. pmid:23923016
  49. 49. Dimitrow E, Ziemer M, Koehler MJ, Norgauer J, König K, Elsner Pet al. Sensitivity and specificity of multiphoton laser tomography for in vivo and ex vivo diagnosis of malignant melanoma. J Invest Dermatol. 2009 Jul;129(7):1752–8. pmid:19177136
  50. 50. Manfredini M, Arginelli F, Dunsby C, French P, Talbot C, Konig K et al. High-resolution imaging of basal cell carcinoma: A comparison between multiphoton microscopy with fluorescence lifetime imaging and reflectance confocal microscopy. Skin Res Technol 2013;19(1).