MEETING REPORT

Transpl Int, 16 October 2023
https://doi.org/10.3389/ti.2023.11783

Banff Digital Pathology Working Group: Image Bank, Artificial Intelligence Algorithm, and Challenge Trial Developments

www.frontiersin.orgAlton B. Farris1*, www.frontiersin.orgMariam P. Alexander2, www.frontiersin.orgUlysses G. J. Balis3, www.frontiersin.orgLaura Barisoni4, www.frontiersin.orgPeter Boor5,6, www.frontiersin.orgRoman D. Bülow5, www.frontiersin.orgLynn D. Cornell2, www.frontiersin.orgAnthony J. Demetris7, www.frontiersin.orgEvan Farkash3, www.frontiersin.orgMeyke Hermsen8, www.frontiersin.orgJulien Hogan1,9, www.frontiersin.orgRenate Kain10, www.frontiersin.orgJesper Kers11,12, www.frontiersin.orgJun Kong13,14, www.frontiersin.orgRichard M. Levenson15, www.frontiersin.orgAlexandre Loupy16, www.frontiersin.orgMaarten Naesens17, www.frontiersin.orgPinaki Sarder18, www.frontiersin.orgJohn E. Tomaszewski19, www.frontiersin.orgJeroen van der Laak8,20, www.frontiersin.orgDominique van Midden8, www.frontiersin.orgYukako Yagi21 and www.frontiersin.orgKim Solez22
  • 1Department of Pathology and Laboratory Medicine, Emory University, Atlanta, GE, United States
  • 2Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, United States
  • 3Department of Pathology, University of Michigan, Ann Arbor, MI, United States
  • 4Department of Pathology and Medicine, Duke University, Durham, NC, United States
  • 5Institute of Pathology, Rheinisch-Westfälische Technische Hochschule (RWTH) Aachen University Clinic, Aachen, Germany
  • 6Department of Nephrology and Immunology, RWTH Aachen University Clinic, Aachen, Germany
  • 7Department of Pathology, University of Pittsburgh, Pittsburgh, PA, United States
  • 8Department of Pathology, Radboud University Medical Center, Nijmegen, Netherlands
  • 9Nephrology Service, Robert Debré Hospital, University of Paris, Paris, France
  • 10Department of Pathology, Medical University of Vienna, Vienna, Austria
  • 11Department of Pathology, Amsterdam University Medical Centers, Amsterdam, Netherlands
  • 12Department of Pathology, Leiden University Medical Center, Leiden, Netherlands
  • 13Georgia State University, Atlanta, GA, United States
  • 14Emory University, Atlanta, GA, United States
  • 15Department of Pathology, University of California Davis Health System, Sacramento, CA, United States
  • 16Institut National de la Santé et de la Recherche Médicale, UMR 970, Paris Translational Research Centre for Organ Transplantation, and Kidney Transplant Department, Hôpital Necker, Assistance Publique-Hôpitaux de Paris, University of Paris, Paris, France
  • 17Department of Microbiology, Immunology and Transplantation, KU Leuven, Leuven, Belgium
  • 18Division of Nephrology, Hypertension, and Renal Transplantation, Department of Medicine, Intelligent Critical Care Center, College of Medicine, University of Florida at Gainesville, Gainesville, FL, United States
  • 19Department of Pathology, The State University of New York at Buffalo, Buffalo, NY, United States
  • 20Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
  • 21Memorial Sloan Kettering Cancer Center, New York, NY, United States
  • 22Department of Pathology, University of Alberta, Edmonton, AB, Canada

The Banff Digital Pathology Working Group (DPWG) was established with the goal to establish a digital pathology repository; develop, validate, and share models for image analysis; and foster collaborations using regular videoconferencing. During the calls, a variety of artificial intelligence (AI)-based support systems for transplantation pathology were presented. Potential collaborations in a competition/trial on AI applied to kidney transplant specimens, including the DIAGGRAFT challenge (staining of biopsies at multiple institutions, pathologists’ visual assessment, and development and validation of new and pre-existing Banff scoring algorithms), were also discussed. To determine the next steps, a survey was conducted, primarily focusing on the feasibility of establishing a digital pathology repository and identifying potential hosts. Sixteen of the 35 respondents (46%) had access to a server hosting a digital pathology repository, with 2 respondents that could serve as a potential host at no cost to the DPWG. The 16 digital pathology repositories collected specimens from various organs, with the largest constituent being kidney (n = 12,870 specimens). A DPWG pilot digital pathology repository was established, and there are plans for a competition/trial with the DIAGGRAFT project. Utilizing existing resources and previously established models, the Banff DPWG is establishing new resources for the Banff community.

Introduction

The Banff Digital Pathology Working Group (DPWG) was formed in 2019, followed by a publication describing the DPWG’s main goals and the current state of transplant digital pathology [1]. Since then, the DPWG meets regularly in video conferences (nearly every 2 weeks) to discuss new digital pathology initiatives, innovative investigations, and digital pathology’s current status and future (2), particularly computer vision applied to transplantation, considering the fact that digital pathology has enabled the development of “computational pathology” as a new science [24]. “Computational Pathology” is a novel approach to precision medicine incorporating multiple data sources using artificial intelligence (AI) to generate actionable knowledge to improve disease diagnosis, prognostication, and prediction [5].

The development of new digital pathology-based tools, computer vision algorithms, and machine learning (ML) models for the study of kidney diseases has stimulated the pathology and nephrology community to build large digital pathology repositories to allow for the integration of data from clinical, molecular, pathology, and other domains. While this effort has been in place for over a decade for native kidney diseases [5], the use of digital pathology repositories, computer vision, and computational pathology in transplant pathology remains largely unexplored.

As also detailed in the last Banff Meeting Report [6] and the DPWGs’ first paper [1], the DPWG’s goals are detailed in Tables 1, 2; Figure 1. Notably, future plans can be summarized in three aims:

• Aim 1: Image banks and/or digital pathology repositories for benchmarking algorithms so that research groups can test their AI and other algorithms similar to what is being done in the computer science community overall, with ImageNet supervised natural image classification being a main example.1

• Aim 2: Algorithms will developed for the transplant community. One future goal potentially includes the release of “official” Banff algorithms that could be used by the Banff community and beyond. As mentioned in the previous Banff DPWG working group paper, these could include targeted, handcrafted algorithms (e.g., for parameters such as fibrosis, inflammation, steatosis, etc.) [1]; or these could include thoroughly validated AI/ML algorithms. Furthermore, data pipelines for the integration of “–omic” data could be provided so that centers could have mechanisms for mining data within their center as well as sharing with other centers.

• Aim 3: Competitions or trials will be conducted so that groups can compare their algorithms on standard transplant pathology image sets.

TABLE 1
www.frontiersin.org

TABLE 1. Banff digital pathology working group (DPWG) issues and plans. The Banff DPWG issues and future plans are depicted as laid forth in the original DPWG paper (1).

TABLE 2
www.frontiersin.org

TABLE 2. Banff digital pathology working group (DPWG) issues and plans. The Banff DPWG issues and future plans are depicted as further refined through DPWG discussions (1).

FIGURE 1
www.frontiersin.org

FIGURE 1. The Banff Digital Pathology Working Group (DPWG) main aims are shown. The primary aims of the Banff DPWG include 1) image bank/collection establishment, to possibly include other data in digital pathology repositories (digital pathology repositories); 2) algorithm sharing platform initialization; and 3) competition/trial organization. Multiple solutions for each of these aims may be possible. After competition/trial conduction among the Banff community and other collaborators, the algorithm performance will be characterized in a process that will affect the future performance and sharing of algorithms; and thus, the competition(s)/trial(s) will provide “feedback” to algorithm sharing. Ultimately, effective, precision patient care could be provided with Banff algorithm scores. (The “Banff Conference” and “Aim 3” image were produced by Kim Solez using DALL-E 2.).

This current DPWG paper serves as an update on the DPWG’s progress with selected examples and is not a comprehensive review, and we apologize for related studies that are not cited. The DPWG’s survey research on the current state of digital transplant pathology will be covered, and additional details regarding each of the three aims above will be discussed.

Image Bank Survey

A survey was conducted from 27 April, 2020, to 23 July, 2020, primarily to determine image bank possibilities for the DPWG. Questions were sent via SurveyMonkey (Palo Alto, California, United States) to both the NEPHROL and NEPHNPPT Discussion Groups (with 701 members and 456 members, respectively) moderated by Kim Solez aimed primarily toward renal pathologists and clinicians interested in renal pathology. The NEPHROL group includes mostly nephrologists and pathologists, and the NEPHNPPT group is a subset of the Renal Pathology Society (RPS) membership.

The Banff DPWG Image Bank Survey had 35 respondents from 13 countries, 19 from the US, 4 Canada, 2 Netherlands, and one each from 10 countries (Supplementary Material). Most (24 or 69% of respondents) specified pathology as their specialty. Of these, 16 (46%) specified that they had a server to manage whole slide images (WSIs) from multiple institutions, and these used various server software and image formats and had a range of storage and bandwidths. In this regard, it is recognized that setting up servers and workflows is quite a complex endeavor; and our survey reflected these complexities [710].

Of 13 answering a question regarding the ability of their server to de-identify slide information (including the slide label) automatically, 9 (69%) responded yes; 2 (15%) no; and 2 (15%) not sure. Of 12 answering a question regarding their server’s ability to allow customized and commercial algorithms installation, 8 (67%) answered yes; 2 (17%) no; 1 (8%) only customized algorithms; and 1 (8%) not sure. Of 10 answering a question regarding their server’s ability to allow the correction/standardization of staining variability and other variables in images from multiple laboratories, 9 (90%) answered yes.

Survey questions regarding the possibility of image bank hosting were asked; and of nine responding, 7 (78%) had an associated cost; and only 2 (22%) had no associated cost. The two responding there would be no cost were contacted; and it became clear that one of these would not be able to host the image bank due to logistical issues. Thus, based on the survey, only one respondent at Georgia State University could host an initial image bank pilot. Later discussions in the community revealed another image bank could possibly be hosted at RWTH Aachen University in the future, particularly regarding specimens in Europe subject to European Union General Data Protection Regulation (EU GDPR).

Survey questions also covered existing image bank material available among respondents. Of 28 respondents responding to the question of whether they had an existing transplant WSI repository, 16 (57%) said they had such a repository. When asked for the number of their specimens, the combined specimens included 12,870 kidney, 670 heart, 55 pancreas/islet, 50 lung, 30 liver, 20 intestine, and 2 vascularized composite allograft. Thus, the survey showed that the community already has a substantial specimen number; however, the number of specimens obtained is likely an underestimate.

It is likely that this survey could be repeated in the future with an increased response rate, since interests in AI, ML, and deep learning (DL) are likely increasing [11]. Furthermore, the survey was conducted during the COVID-19 pandemic, which could have hindered response rates. In the future, such a survey could likely find additional servers and material for collaboration.

Aim 1: Image Bank and Digital Pathology Repository Pilot

Our Banff DPWG conducts discussions, planning, testing, and implementations of appropriate vehicles for pathology AI method dissemination, deployment, and comparison readily accessible by end users. An image bank or digital pathology repository is a goal that the Banff DPWG would like to achieve, similar to the “Big Picture” European digital pathology project,2 the Nephrotic Syndrome Study Network (NEPTUNE),3 and Kidney Precision Medicine Project (KPMP4). In contrast to desktop applications, web-based platforms are preferred by many since they do not require any user-involved installation process [12]. Although some web-based tools have been developed, they are either commercial software with license purchase requirement [12] or limited for new algorithm integration (e.g., Omero [13]).

The one respondent available to host a pilot for the DPWG is the Digital Pathology Laboratory (DPLab5), a publicly available web platform allowing researchers to visualize, annotate, analyze, and share 2D and 3D pathology images via web-enabled devices. This platform allows users to upload their own WSIs, annotate regions of interest, invoke AI analysis methods, visualize analysis outputs, and download outputs for follow-up statistical comparisons. Due to its web-based framework, DPLab enables WSI annotation and analysis data sharing. Since AI method training and execution relies on a reliable and powerful computational infrastructure, DPLab allows running these AI methods without local computational resources. All requested analysis jobs from the front end are executed through a backend computational environment, addressing a frequent WSI analysis computational obstacle. Currently, DPLab is equipped with numerous WSI analysis algorithms, ranging from color deconvolution, cell detection, nuclei segmentation, histology component quantification, to serial WSI image registration (with some demonstrated in Figure 2). Because DPLab is designed as an open environment, AI methods from the research community can be contributed for method enrichment, validation, and comparison. In the future, additional components are planned for DPLab, such as a quality control component (e.g., similar to those seen in the open-source tool HistoQC [14]). As this software becomes more mature, we envision it and others like it can become useful tools for digital pathology community [12].

FIGURE 2
www.frontiersin.org

FIGURE 2. Digital Pathology Laboratory: a publicly available web platform for multi-dimensional pathology image analytics example image manipulations are shown, including the following: (A) A representative WSI visualized from DPLab at multiple image resolutions; (B) Cell detection result in a user-annotated rectangle region; (C) Liver fibrosis quantification with a region annotated by a free-hand annotation tool; (D) Detailed 3D liver tissue sub-volume visualization after serial WSI registration and collagen quantifications.

Complete digital pathology implementation will require digitization of all workflow steps. For example, in renal pathology, this will require light, immunofluorescence, and electron microscopy digitization. Regarding this, immunofluorescence staining is an integral part of kidney transplant biopsy evaluation, both for C4d staining for detection of antibody-mediated rejection and for immunoglobulins and other complement components for recurrent and de novo glomerulonephritis detection. Factors to consider include the ability to support automated scanning with minimal operator input, available immunofluorescence filters, scanning speed, automated tissue detection, image quality, tissue focusing ability, scanning magnification, degree of image bleaching (fading), and price. Major challenges with currently available immunofluorescence slide scanners include inability of scanners to focus on tissue, inability to reflect negative/dim staining, and excessive human technologist time for scanning setup (Dr. Lynn Cornell in DPWG communications).

Digital pathology repositories can include a variety of “omic” data types in the future. Digital pathology “pathomic” data can be included with other “omic” data including genomic, transcriptomic, proteomic, and metabolomic data. “Pathomics” refers to the morphological examination of tissue on the macroscopic, microscopic, and ultrastructural level. “Pathomics” was used at least as far back as a 2007 editorial by Robert Colvin (11, 12) commenting on a study investigating microarray analysis of rejection that later become available in the molecular microscope diagnostic (MMDx) system (13). Using this terminology, the “pathome” can refer to the entirety of morphological histology features, particularly when examined using enhanced ancillary techniques; and enhanced techniques to examine the “pathome” can be termed “Next-Generation histoMorphometry (NGM).” Of note, standard “omic” technologies are increasingly being applied in a “spatial” manner (e.g., spatial transcriptomics and spatial proteomics) [15]. Digital pathology repositories will likely be crucial for the integration of “pathomic” with other “omic” data.

Aim 2: AI/Deep Learning Algorithms

To effectively develop deep learning (DL)-based support systems for diagnosis and research, including in transplant pathology, three main prerequisites are needed (e.g., when thinking of setting up transplant digital pathology central resources), including: 1) hardware and software infrastructure, 2) interdisciplinary expert teams, and 3) diverse and clinically annotated datasets [16].

(1) The hardware and software infrastructure are becoming more available and affordable, and many pathology labs now have at least partial digital infrastructure. Based on a particular study’s extent and the computational demands of newer DL architectures, however, the introduction of robust digital pathology resources within a single institution can be challenging. Digital pathology and WSIs produce the largest imaging data in clinical medicine. When setting up large digital pathology repositories, sufficient storage capacity is required, which can easily be in the petabyte (PB) range. Such storage must be secure, both in terms of security of access and sufficient backup. Modern DL systems are increasingly computationally expensive to train due to the model size, with many trainable parameters and large datasets. Thus, central high-performance computing (HPC) centers or cloud providers might be needed for model development. Frequently, such HPC centers (or cloud providers) are not used to handling sensitive medical data and privacy concerns (e.g., HIPPA and GDPR); and the legal aspects can be complicated. Also, such centralized algorithmic training requires secure data transfer between institutions. This may also be challenging for security and compliance. Cloud providers and download possibilities can tackle some of these issues. Another potential solution for this could be the use of federated learning approaches, which are becoming more popular not only in computational pathology. These approaches train (parts of the) models on locally stored data (i.e., without the need to move the data from the hospitals) [1719]. Such federated approaches require scaling up local computing power, which, in our recent experience, is not available everywhere, and sometimes not even considered in some larger repositories. This is not completely surprising, since digital pathology possibilities are still new and emerging. Digital pathology infrastructure maintenance costs (e.g., security updates and other services) need to be kept in mind and can present a challenge when aiming for a long-term digital pathology repository. Thereby, solutions for long-term infrastructure financing are required, and might be a challenge.

(2) AI/DL development and infrastructure maintenance requires experts from information technology (IT), computer science, medicine, research, and other areas [20]. Such an interdisciplinary team is required 1) to ensure a relevant use case and the datasets are defined for meaningful application scenarios in a realistic workflow, 2) a suitable model architecture can be modified to fit the use case, 3) software best practices are followed during training, and 4) to ensure model safety. Ultimately, models should be thoroughly audited before clinical testing, uncovering potential risks and developing mitigation strategies [21]. User studies should test whether systems will be useful in later day-to-day work. The workforce needs of industry vs. academia may be in competition. Generating an environment that motivates IT and AI experts to join academia will be imperative to building up domain-specific expert teams. Also, such teams should have a minimal “critical” size of the particular specialty (e.g., Having only a single AI or IT expert makes the team heavily dependent on a single person, while it does not provide a suitable environment for discussion and exchange for the expert.). It is our experience that large and strongly interdisciplinary teams, directly embedded within the specific application domain might be most efficient in new approach development testing. This direction also helps educate “hybrid” experts (e.g., pathologists with expertise in AI development and AI developers knowledgeable of real-world pathology workflows). Such “hybrid” experts can be augmented by automated systems such as those that help codify the complexity of the Banff classification system [22].

(3) Finally, and currently one of the major challenges in this field, is the availability of relevant, sufficiently large datasets. Sample size is determined by the ML system’s efficiency and the problem complexity. Datasets should be multicentric and reflect the population(s) in which the system will ultimately be used. In addition, it is important to invest time uncovering existing dataset biases before fitting a model to the data and reducing biases as much as possible [23, 24]. To uncover such biases, datasets must be deeply phenotyped, and in the case of pathology, enriched at least with clinical and pathological data. It is essential to validate any DL models using independent cohort(s), which were not used for DL training. While tremendous thought has previously been given to the collection of training datasets [2], only recently have recommendations for the collection of test datasets been issued for the case of computational pathology [20]. Test datasets must be independent from the development datasets. The ML community has long recognized the need for diverse multi-center datasets to reliably assess the generalizability of DL systems. This is now also well established in computational pathology and should be a common standard [2].

One example of how the integration of all prerequisites and joined international cooperation can lead to promising DL algorithm development was previously shown in the DEEPGRAFT study, which involves transplant biopsy weakly-supervised slide-level diagnosis classification using DL [25]. This is currently the largest multicentric dataset of renal transplant biopsies assembled and analyzed centrally, with more than 5,000 WSIs, including consecutive biopsies from a center not included in training, representing a “real-world” scenario and enabling validation and assessment of the model’s generalizability.

Other novel algorithms for efficiently analyzing very large renal tissue biopsy digital WSIs have been integrated into ML pipelines for nephropathology. The developed tools employ a human-AI-loop (HAIL) approach [26] via integrating human with AI for efficiently detecting and segmenting multi-compartmental structures (e.g., glomeruli, tubules, interstitium, and arteries and arterioles). The tool’s performance is shown in computational histologic classification of diabetic nephropathy [27], as well as computational detection and segmentation of interstitial fibrosis and tubular atrophy [28]. The tool has been extended to computationally detect and count podocytes from WSIs, and also subsequent feature extraction for various inference studies [29]. HAIL’s utility has been further shown via integrating the tool with the VIPR (Validated Identification of Pre-Qualified Regions) algorithm [30]. HAIL operates at segmenting large renal structural levels, and VIPR operates at deriving renal micro-compartments using pixel level vector features. In tandem, these tools are being used to conduct unsupervised classification of tubules in the KPMP. Features quantified from HAIL-derived image structures are currently being used for fusing with tissue molecular signatures, such as those derived by CODEX and spatial transcriptomics, to discover new molecularly distinct structural motifs with implications in chronic kidney disease and acute kidney injury. It is anticipated that the tools developed herein will contribute to renal transplant biopsy assessment to automate Banff scoring for chronicity assessment as well as automatically predict graft outcome from pixel level image features.

While retrospective studies have inherent value in showing system applicability or useability, prospective evidence of the clinical benefit of DL systems must be generated through well-designed clinical trials. Promising studies include those examining the classification of rejection versus other diseases [25] and antibody-mediated rejection under Banff criteria [31] in the kidney; and in cardiac endomyocardial biopsies, allograft rejection has been distinguished from benign mimics (Quilty B lesions) using AI [32]. However, clinical trials implementing DL systems are currently largely missing in the field of computational pathology, but in some scenarios might also be hard to provide.

Aim 3: Competition/Trial and Current Image Analysis Trial Work

As mentioned previously, our last aim deals with competitions or trials will be conducted so that groups can compare their algorithms on standard transplant pathology image sets. In this regard, the Banff DPWG has an ongoing collaboration that has been discussed in DPWG meetings entitled “DIAGGRAFT: leveraging artificial intelligence technology for accurate quantitative histological diagnostic assessment of transplant renal biopsies.” The Dutch Kidney Foundation recently awarded a Success Accelerator Grant for the DIAGGRAFT project. DIAGGRAFT was started in January 2022 by Dominique van Midden, Meyke Hermsen, Jeroen van der Laak, et al, and will be executed in close collaboration with the DPWG. This project builds upon former research by Hermsen et al. [33, 34] that developed AI (more specifically: DL) for automated assessment of histopathologic features in digitized kidney tissue sections. DIAGGRAFT aims to take developed AI a step further, extending these techniques and preparing them for large-scale research- and even diagnostic use. The DIAGGRAFT consortium will organize a so-called “grand challenge”: an international competition, similar to challenges previously organized (e.g., PANDA [35] for prostate cancer,6 CAMELYON [3638] for breast cancer and lymph node metastasis, and other Kaggle efforts7, 8). In the DIAGGRAFT challenge, a large, annotated, multi-center data set will be established and provided to participants with the goal to collectively build AI for inflammatory cell detection in periodic-acid Schiff-stained slides. The best inflammatory cell detection model(s) from the DIAGGRAFT challenge will be combined with existing structure segmentation AI to quantify Banff classes. In the last part of DIAGGRAFT, AI will be validated on a large patient cohort, originating from multiple international medical centers and scored by an expert renal pathologist panel. DIAGGRAFT aims to develop powerful DL tools for objective and reproducible Banff scoring, validated in a multicenter setting against graft function and survival. The resulting DL models will be made available to the Banff community for subsequent validation studies. DIAGGRAFT is visually displayed in Figure 3.

FIGURE 3
www.frontiersin.org

FIGURE 3. The DIAGGRAFT Challenge Work Plan is shown. Abbreviations: FFPE, formalin-fixed paraffin-embedded; PAS, periodic acid Schiff; WSI, Whole Slide Images; IHC, immunohistochemistry.

Conclusion

The Banff DPWG plans to continue the efforts of fostering the establishment of image banks and digital pathology repositories, of stimulating algorithm development, and supporting the validation of these algorithms. The DPWG’s efforts will be disseminated through a variety of venues (e.g., during the annual meeting of the American Society of Transplantation), to stimulate engagement of the entire transplant community. Funding sources are being explored to financially support efforts of the DPWG. Digital pathology techniques allow computational pathology, which provides automated histopathology analyses with more throughput scalability, reproducibility, and precision [5, 15, 3942]. Indeed, these new technologies will essentially allow numerous novel manipulations, such as the translation/augmentation of one stain to another [43, 44]. It is possible that AI/ML will serve as a “gold standard” in some sense, although we foresee AI/ML augmenting pathologists rather than replacing them as the “gold standard.” Algorithms and other advances for the Banff community may eventually arise from these efforts, with the ultimate goal of providing more effective, precision patient care.

Author Contributions

AF and KS devised the survey and manuscript structure. AF drafted the manuscript along with contributions from LB, PB, RB, LC, MH, JeK, JuK, MN, PS, JvL, and DvM. All authors contributed to the article and approved the submitted version.

Funding

PB is supported by the German Research Foundation (DFG, Project IDs 322900939 & 445703531), European Research Council (ERC Consolidator Grant No 101001791), the Federal Ministries of Education and Research [BMBF, STOP-FSGS-01GM2202C; Economic Affairs and Climate Action (EMPAIA, No. 01MK2002A), and Health (Deep Liver, No. ZMVI1-2520DAT111), and the Innovation Fund of the Federal Joint Committee (Transplant.KI, No. 01VSF21048)]. JuK is supported in part by a grant from National Institutes of Health U01CA242936. JvL and RK have received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No. 945358. This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation program and EFPIA. The Dutch Kidney Foundation supported JeK (DEEPGRAFT, 17OKG23), MH and DvM (DIAGGRAFT, 21OK+012) JeK was additionally funded by the Human(e) AI research priority area of the University of Amsterdam. MN is supported by the Research Foundation Flanders (FWO), with projects S003422N and G087620N, and by a KU Leuven C3 internal grant (C32/17/049). MN is senior clinical investigator of the FWO (supported by grant 1844019N).

Conflict of Interest

JvL has been a member of the advisory boards of Philips, Netherlands and ContextVision, Sweden, and received research funding from Philips, Netherlands, ContextVision, Sweden, and Sectra, Sweden in the last 5 years. He is chief scientific officer (CSO) and shareholder of Aiosyn BV, Netherlands. JeK is a consultant for Aiosyn BV and Novartis AG Switzerland and received speaker fees from Chiesi Pharmaceuticals, Netherlands.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We acknowledge the Banff Foundation for Allograft Pathology for making the regular Banff meetings possible.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontierspartnerships.org/articles/10.3389/ti.2023.11783/full#supplementary-material

Abbreviations

2D, 3D, 2-dimensional, 3-dimensional; AI, Artificial intelligence; DL, deep learning; DPWG, digital pathology working group; EU GDPR, European Union General Data Protection Regulation; HPC, High-performance computing; IFTA, interstitial fibrosis and tubular atrophy; IT, information technology; ML, machine learning; NLP, natural language processing; WSI, whole slide image/imaging.

Footnotes

1https://www.image-net.org/

2https://bigpicture.eu/

3https://www.neptune-study.org/

4https://www.kpmp.org

5https://dplab.gsu.edu/

6https://www.kaggle.com/c/prostate-cancer-grade-assessment/overview

7https://www.kaggle.com/c/hubmap-kidney-segmentation, https://cns.iu.edu/docs/publications/2021-Godwin-FTUs.pdf

8https://hubmapconsortium.github.io/ccf/pages/kaggle2.html

References

1. Farris, AB, Moghe, I, Wu, S, Hogan, J, Cornell, LD, Alexander, MP, et al. Banff Digital Pathology Working Group: Going Digital in Transplant Pathology. Am J Transpl (2020) 20:2392–9. doi:10.1111/ajt.15850

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Abels, E, Pantanowitz, L, Aeffner, F, Zarella, MD, van der Laak, J, Bui, MM, et al. Computational Pathology Definitions, Best Practices, and Recommendations for Regulatory Guidance: A White Paper From the Digital Pathology Association. J Pathol (2019) 249:286–94. doi:10.1002/path.5331

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Louis, DN, Feldman, M, Carter, AB, Dighe, AS, Pfeifer, JD, Bry, L, et al. Computational Pathology: A Path Ahead. Arch Pathol Lab Med (2015) 140:41–50. doi:10.5858/arpa.2015-0093-SA

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Fuchs, TJ, and Buhmann, JM. Computational Pathology: Challenges and Promises for Tissue Analysis. Comput Med Imaging Graph (2011) 35:515–30. doi:10.1016/j.compmedimag.2011.02.006

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Barisoni, L, Lafata, KJ, Hewitt, SM, Madabhushi, A, and Balis, UGJ. Digital Pathology and Computational Image Analysis in Nephropathology. Nat Rev Nephrol (2020) 16:669–85. doi:10.1038/s41581-020-0321-6

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Loupy, A, Haas, M, Roufosse, C, Naesens, M, Adam, B, Afrouzian, M, et al. The Banff 2019 Kidney Meeting Report (I): Updates on and Clarification of Criteria for T Cell- and Antibody-Mediated Rejection. Am J Transpl (2020) 20:2318–31. doi:10.1111/ajt.15898

CrossRef Full Text | Google Scholar

7. Cheng, JY, Abel, JT, Balis, UGJ, McClintock, DS, and Pantanowitz, L. Challenges in the Development, Deployment, and Regulation of Artificial Intelligence in Anatomic Pathology. Am J Pathol (2021) 191:1684–92. doi:10.1016/j.ajpath.2020.10.018

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Zarella, MD, McClintock, DS, Batra, H, Gullapalli, RR, Valante, M, Tan, VO, et al. Artificial Intelligence and Digital Pathology: Clinical Promise and Deployment Considerations. J Med Imaging (Bellingham) (2023) 10:051802. doi:10.1117/1.JMI.10.5.051802

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Kim, I, Kang, K, Song, Y, and Kim, TJ. Application of Artificial Intelligence in Pathology: Trends and Challenges. Diagnostics (Basel) (2022) 12:2794. doi:10.3390/diagnostics12112794

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Ahmad, Z, Rahim, S, Zubair, M, and Abdul-Ghafar, J. Artificial Intelligence (AI) in Medicine, Current Applications and Future Role With Special Emphasis on its Potential and Promise in Pathology: Present and Future Impact, Obstacles Including Costs and Acceptance Among Pathologists, Practical and Philosophical Considerations. A Comprehensive Review. Diagn Pathol (2021) 16:24. doi:10.1186/s13000-021-01085-4

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Farris, AB, Vizcarra, J, Amgad, M, Cooper, LAD, Gutman, D, and Hogan, J. Artificial Intelligence and Algorithmic Computational Pathology: An Introduction With Renal Allograft Examples. Histopathology (2021) 78:791–804. doi:10.1111/his.14304

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Shen, A, Wang, F, Paul, S, Bhuvanapalli, D, Alayof, J, Farris, AB, et al. An Integrative Web-Based Software Tool for Multi-Dimensional Pathology Whole-Slide Image Analytics. Phys Med Biol (2022) 67:224001. doi:10.1088/1361-6560/ac8fde

CrossRef Full Text | Google Scholar

13. Allan, C, Burel, JM, Moore, J, Blackburn, C, Linkert, M, Loynton, S, et al. OMERO: Flexible, Model-Driven Data Management for Experimental Biology. Nat Methods (2012) 9:245–53. doi:10.1038/nmeth.1896

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Chen, Y, Zee, J, Smith, A, Jayapandian, C, Hodgin, J, Howell, D, et al. Assessment of a Computerized Quantitative Quality Control Tool for Whole Slide Images of Kidney Biopsies. J Pathol (2021) 253:268–78. doi:10.1002/path.5590

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Bulow, RD, Holscher, DL, Costa, IG, and Boor, P. Extending the Landscape of Omics Technologies by Pathomics. NPJ Syst Biol Appl (2023) 9:38. doi:10.1038/s41540-023-00301-9

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Boor, P. Artificial Intelligence in Nephropathology. Nat Rev Nephrol (2020) 16:4–6. doi:10.1038/s41581-019-0220-x

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Kaissis, G, Ziller, A, Passerat-Palmbach, J, Ryffel, T, Usynin, D, Trask, A, et al. End-to-End Privacy Preserving Deep Learning on Multi-Institutional Medical Imaging. Nat Machine Intelligence (2021) 3:473–84. doi:10.1038/s42256-021-00337-8

CrossRef Full Text | Google Scholar

18. Saldanha, OL, Quirke, P, West, NP, James, JA, Loughrey, MB, Grabsch, HI, et al. Swarm Learning for Decentralized Artificial Intelligence in Cancer Histopathology. Nat Med (2022) 28:1232–9. doi:10.1038/s41591-022-01768-5

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Warnat-Herresthal, S, Schultze, H, Shastry, KL, Manamohan, S, Mukherjee, S, Garg, V, et al. Swarm Learning for Decentralized and Confidential Clinical Machine Learning. Nature (2021) 594:265–70. doi:10.1038/s41586-021-03583-3

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Homeyer, A, Geissler, C, Schwen, LO, Zakrzewski, F, Evans, T, Strohmenger, K, et al. Recommendations on Compiling Test Datasets for Evaluating Artificial Intelligence Solutions in Pathology. Mod Pathol (2022) 35:1759–69. doi:10.1038/s41379-022-01147-y

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Liu, X, Glocker, B, McCradden, MM, Ghassemi, M, Denniston, AK, and Oakden-Rayner, L. The Medical Algorithmic Audit. Lancet Digit Health (2022) 4:e384–e397. doi:10.1016/S2589-7500(22)00003-6

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Yoo, D, Goutaudier, V, Divard, G, Gueguen, J, Astor, BC, Aubert, O, et al. An Automated Histological Classification System for Precision Diagnostics of Kidney Allografts. Nat Med (2023) 29:1211–20. doi:10.1038/s41591-023-02323-6

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Schmitt, M, Maron, RC, Hekler, A, Stenzinger, A, Hauschild, A, Weichenthal, M, et al. Hidden Variables in Deep Learning Digital Pathology and Their Potential to Cause Batch Effects: Prediction Model Study. J Med Internet Res (2021) 23:e23436. doi:10.2196/23436

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Oakden-Rayner, L, Dunnmon, J, Carneiro, G, and Re, C. Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging. Proc ACM Conf Health Inference Learn (2020) 2020:151–9. doi:10.1145/3368555.3384468

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Kers, J, Bulow, RD, Klinkhammer, BM, Breimer, GE, Fontana, F, Abiola, AA, et al. Deep Learning-Based Classification of Kidney Transplant Pathology: A Retrospective, Multicentre, Proof-of-Concept Study. Lancet Digit Health (2022) 4:e18–e26. doi:10.1016/S2589-7500(21)00211-9

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Lutnick, B, Ginley, B, Govind, D, McGarry, SD, LaViolette, PS, Yacoub, R, et al. An Integrated Iterative Annotation Technique for Easing Neural Network Training in Medical Image Analysis. Nat Mach Intell (2019) 1:112–9. doi:10.1038/s42256-019-0018-3

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ginley, B, Lutnick, B, Jen, KY, Fogo, AB, Jain, S, Rosenberg, A, et al. Computational Segmentation and Classification of Diabetic Glomerulosclerosis. J Am Soc Nephrol (2019) 30:1953–67. doi:10.1681/ASN.2018121259

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Ginley, B, Jen, KY, Han, SS, Rodrigues, L, Jain, S, Fogo, AB, et al. Automated Computational Detection of Interstitial Fibrosis, Tubular Atrophy, and Glomerulosclerosis. J Am Soc Nephrol (2021) 32:837–50. doi:10.1681/ASN.2020050652

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Govind, D, Becker, JU, Miecznikowski, J, Rosenberg, AZ, Dang, J, Tharaux, PL, et al. PodoSighter: A Cloud-Based Tool for Label-Free Podocyte Detection in Kidney Whole-Slide Images. J Am Soc Nephrol (2021) 32:2795–813. doi:10.1681/ASN.2021050630

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Hipp, JD, Cheng, JY, Toner, M, Tompkins, RG, and Balis, UJ. Spatially Invariant Vector Quantization: A Pattern Matching Algorithm for Multiple Classes of Image Subject Matter Including Pathology. J Pathol Inform (2011) 2:13. doi:10.4103/2153-3539.77175

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Becker, JU, Mayerich, D, Padmanabhan, M, Barratt, J, Ernst, A, Boor, P, et al. Artificial Intelligence and Machine Learning in Nephropathology. Kidney Int (2020) 98:65–75. doi:10.1016/j.kint.2020.02.027

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Lipkova, J, Chen, TY, Lu, MY, Chen, RJ, Shady, M, Williams, M, et al. Deep Learning-Enabled Assessment of Cardiac Allograft Rejection From Endomyocardial Biopsies. Nat Med (2022) 28:575–82. doi:10.1038/s41591-022-01709-2

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Hermsen, M, de Bel, T, den Boer, M, Steenbergen, EJ, Kers, J, Florquin, S, et al. Deep Learning-Based Histopathologic Assessment of Kidney Tissue. J Am Soc Nephrol (2019) 30:1968–79. doi:10.1681/ASN.2019020144

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Hermsen, M, Ciompi, F, Adefidipe, A, Denic, A, Dendooven, A, Smith, BH, et al. Convolutional Neural Networks for the Evaluation of Chronic and Inflammatory Lesions in Kidney Transplant Biopsies. Am J Pathol (2022) 192:1418–32. doi:10.1016/j.ajpath.2022.06.009

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Bulten, W, Pinckaers, H, van Boven, H, Vink, R, de Bel, T, van Ginneken, B, et al. Automated Deep-Learning System for Gleason Grading of Prostate Cancer Using Biopsies: A Diagnostic Study. Lancet Oncol (2020) 21:233–41. doi:10.1016/S1470-2045(19)30739-9

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Ehteshami Bejnordi, B, Veta, M, Johannes van Diest, P, van Ginneken, B, Karssemeijer, N, Litjens, G, et al. Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer. JAMA (2017) 318:2199–210. doi:10.1001/jama.2017.14585

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Litjens, G, Bandi, P, Ehteshami Bejnordi, B, Geessink, O, Balkenhol, M, Bult, P, et al. 1399 H&E-Stained Sentinel Lymph Node Sections of Breast Cancer Patients: The CAMELYON Dataset. Gigascience (2018) 7:giy065. doi:10.1093/gigascience/giy065

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Bandi, P, Geessink, O, Manson, Q, Van Dijk, M, Balkenhol, M, Hermsen, M, et al. From Detection of Individual Metastases to Classification of Lymph Node Status at the Patient Level: The CAMELYON17 Challenge. IEEE Trans Med Imaging (2019) 38:550–60. doi:10.1109/TMI.2018.2867350

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Marsh, JN, Matlock, MK, Kudose, S, Liu, TC, Stappenbeck, TS, Gaut, JP, et al. Deep Learning Global Glomerulosclerosis in Transplant Kidney Frozen Sections. IEEE Trans Med Imaging (2018) 37:2718–28. doi:10.1109/TMI.2018.2851150

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Marsh, JN, Liu, TC, Wilson, PC, Swamidass, SJ, and Gaut, JP. Development and Validation of a Deep Learning Model to Quantify Glomerulosclerosis in Kidney Biopsy Specimens. JAMA Netw Open (2021) 4:e2030939. doi:10.1001/jamanetworkopen.2020.30939

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Bullow, RD, Marsh, JN, Swamidass, SJ, Gaut, JP, and Boor, P. The Potential of Artificial Intelligence-Based Applications in Kidney Pathology. Curr Opin Nephrol Hypertens (2022) 31:251–7. doi:10.1097/MNH.0000000000000784

CrossRef Full Text | Google Scholar

42. Nakagawa, K, Moukheiber, L, Celi, LA, Patel, M, Mahmood, F, Gondim, D, et al. AI in Pathology: What Could Possibly Go Wrong? Semin Diagn Pathol (2023) 40:100–8. doi:10.1053/j.semdp.2023.02.006

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Bouteldja, N, Holscher, DL, Klinkhammer, BM, Buelow, RD, Lotz, J, Weiss, N, et al. Stain-Independent Deep Learning-Based Analysis of Digital Kidney Histopathology. Am J Pathol (2023) 193:73–83. doi:10.1016/j.ajpath.2022.09.011

PubMed Abstract | CrossRef Full Text | Google Scholar

44. de Haan, K, Zhang, Y, Zuckerman, JE, Liu, T, Sisk, AE, Diaz, MFP, et al. Deep Learning-Based Transformation of H&E Stained Tissues Into Special Stains. Nat Commun (2021) 12:4884. doi:10.1038/s41467-021-25221-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Banff, digital pathology, artificial intelligence, machine learning, image analysis

Citation: Farris AB, Alexander MP, Balis UGJ, Barisoni L, Boor P, Bülow RD, Cornell LD, Demetris AJ, Farkash E, Hermsen M, Hogan J, Kain R, Kers J, Kong J, Levenson RM, Loupy A, Naesens M, Sarder P, Tomaszewski JE, van der Laak J, van Midden D, Yagi Y and Solez K (2023) Banff Digital Pathology Working Group: Image Bank, Artificial Intelligence Algorithm, and Challenge Trial Developments. Transpl Int 36:11783. doi: 10.3389/ti.2023.11783

Received: 08 July 2023; Accepted: 22 September 2023;
Published: 16 October 2023.

Copyright © 2023 Farris, Alexander, Balis, Barisoni, Boor, Bülow, Cornell, Demetris, Farkash, Hermsen, Hogan, Kain, Kers, Kong, Levenson, Loupy, Naesens, Sarder, Tomaszewski, van der Laak, van Midden, Yagi and Solez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Alton B. Farris, abfarri@emory.edu