Diabetic Retinopathy Grading by a Source-Free Transfer Learning Approach
Introduction
Diabetic retinopathy (DR) is a major cause of blindness in people aged 20–64 [1]. It is crucial for ophthalmologists to screen for DR and track its progression in the early stage of the disease, protecting patients against vision loss. Popular screening technology employs non-mydriatic color retinal cameras to collect fundus photos (Fig. 1), which capture the essential characteristics for judging the DR grade. The most common lesion signs appear red or bright, indicating different lesion stages. The red dots in retinal images are mainly microaneurysms (MAs), focal dilatations of retinal capillaries. In addition, dot hemorrhage lesions (HEs), slightly larger than MAs, are found wherever capillary walls are weak inside the retina. Bright lesions, or intra-retinal lipid exudates (EXs), result from the breakdown of the blood-retinal barrier: exuded fluid rich in lipids and proteins leaves the parenchyma, leading to retinal edema and exudation. Moreover, progressive DR also causes macular edema, neovascularization (NV), and retinal detachment in later stages. Consequently, early detection of diabetic retinopathy gives patients the best chance of effective treatment: according to experts, if the primary stage of DR can be detected, the vision of almost 90 percent of DR patients can be saved [2]. Ophthalmologists can perform DR screening manually from the patient's fundus images.
However, the symptoms of early DR are tiny, and the detection workload in clinical practice is heavy. Meanwhile, the difference between two consecutive stages of symptoms is difficult to distinguish (Fig. 1). For instance, MAs are microscopic blood-filled bulges in the capillary walls; they are the earliest signs of DR and are challenging for ophthalmologists to notice. HEs, 'blot'-shaped lesions slightly larger than MAs, appear in moderate non-proliferative DR in addition to MAs, making them hard to discriminate from MAs. Therefore, manual screening of many patients is inefficient and fatiguing for ophthalmologists and may produce misdiagnoses during long working hours. In recent years, with the development of artificial intelligence, many automatic DR detection methods have been proposed, most of which depend on deep learning. DR screening systems built on artificial intelligence are therefore more reliable, faster, and more efficient than previous manual procedures.
Current automated screening methods, especially those using convolutional neural networks (CNNs), play a significant role in improving DR detection performance. For example, Abramoff et al. [3] proposed a deep-learning-enhanced algorithm integrated into the IDx-DR device for DR detection, demonstrating the effectiveness of deep learning compared with algorithms that rely on classical machine learning alone. Shanthi et al. [4] used a modified AlexNet architecture to detect DR stages and achieved better classification performance. Nevertheless, these methods operate under a supervised framework, which requires a great deal of annotated data and creates several obstacles for clinical practice. Because annotating sufficient retinal images is far more time-consuming and expensive than annotating natural images, it is necessary to relax the requirement for annotations. Here, transfer learning reveals its effectiveness for solving these problems thanks to its capability of distilling knowledge from another dataset, and it has received extensive research attention in recent years. For instance, Li et al. [5] first used a fine-tuned, fully trained CNN model to learn representations of input retinal images and then fed these representations to a robust support vector machine, suggesting that CNN-based domain adaptation methods can achieve satisfactory grading accuracy on small datasets. Besides, Yang et al. [6] presented a residual CycleGAN to remove camera-brand differences, which improves transfer learning performance. These works support the potential of transfer learning and have reached satisfactory performance in DR detection.
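The two-stage pipeline of Li et al. [5] described above (a frozen CNN feature extractor followed by a shallow classifier) can be illustrated with a minimal NumPy sketch. Everything here is a stand-in: global average pooling plays the role of the fine-tuned CNN backbone, a nearest-centroid rule plays the role of the SVM, and the toy images and class structure are fabricated for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

def extract_features(images):
    # Stand-in for a fine-tuned CNN backbone: global average pooling
    # over each (H, W, C) image yields a C-dimensional feature vector.
    return images.mean(axis=(1, 2))

def fit_centroids(features, labels):
    # Nearest-centroid rule as a stand-in for the SVM classifier.
    classes = np.unique(labels)
    centroids = np.stack([features[labels == c].mean(axis=0) for c in classes])
    return classes, centroids

def predict(features, classes, centroids):
    # Assign each sample to the class with the nearest centroid.
    dists = np.linalg.norm(features[:, None, :] - centroids[None, :, :], axis=2)
    return classes[np.argmin(dists, axis=1)]

# Toy data: 20 "retinal images" of shape 8x8x3 in two classes;
# class 1 is made brighter so the classes are separable.
images = rng.random((20, 8, 8, 3))
labels = np.array([0] * 10 + [1] * 10)
images[labels == 1] += 0.5

feats = extract_features(images)
classes, centroids = fit_centroids(feats, labels)
preds = predict(feats, classes, centroids)
```

The point of the two-stage design is that the expensive representation learner is trained once (or reused from another domain), while only the lightweight classifier is refit on the small labeled set.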
The remarkable performance of transfer learning methods in DR detection is reached by fully utilizing knowledge from adequate annotated data in the source domain [7], [5], [6], and these existing transfer learning models usually assume that the labeled source retinal data is accessible during training. However, a large amount of annotated data may not always be available in medical image analysis, for the following reasons:
- 1.
Medical image annotation is a time-consuming and labor-intensive manual task, and professional annotators need a suitable working environment to maintain excellent and accurate performance.
- 2.
Due to the privacy and security of medical data, the source images themselves are often inaccessible for transfer learning, whereas a sufficiently trained source model can still be used when training a grading model for the target data.
Therefore, it is necessary to develop an automated DR detection method that improves performance without any annotated retinal images, relying only on a source model.
Motivated by the above observations, this study proposes a novel Source-Free Transfer Learning (SFTL) model for referable DR screening, which fully exploits the knowledge learned by the source prediction model together with the provided target images (Fig. 2). The proposed method mainly consists of a target generation module and a collaborative consistency module. Specifically, conventional data-based transfer learning methods [5], [6] classify retinal images in the target domain using source images and labels together with unannotated target images. In contrast, our proposed SFTL model distills knowledge from the pre-trained classification model into the target data given only the unannotated target images. That is to say, the model pre-trained on source data is accessible, but medical institutions do not provide the source data itself.
To achieve this goal, we first introduce a generator and a discriminator in the target generation module, where the discriminator distinguishes whether an input image is real or fake through an adversarial loss. Secondly, a target reconstruction loss is attached to the target generator to improve generative performance, and a semantic similarity constraint is imposed to make the generator collaborate with the classifier throughout training. In the collaborative consistency module, a model consistency loss acting at both the feature level and the parameter level constrains the classifier from drifting too far from the source model, making training more stable and improving the prediction results. Moreover, the objective of the prediction model is combined with a clustering-based regularization to make the features more compact in the distribution space.
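The individual loss terms named above can be sketched in NumPy. The exact functional forms are assumptions (the paper's formulas are not reproduced in this excerpt): L1 is assumed for the reconstruction loss, L2 for the feature- and parameter-level consistency terms, and a prediction-entropy penalty is one common instantiation of a clustering-style regularizer.

```python
import numpy as np

def reconstruction_loss(x, x_rec):
    # Target reconstruction loss: L1 distance between a target image
    # and its re-generated version (L1 form is an assumption).
    return np.mean(np.abs(x - x_rec))

def feature_consistency_loss(f_tgt, f_src):
    # Feature-level consistency: keep target-classifier features close
    # to those of the frozen source model (L2 form assumed).
    return np.mean((f_tgt - f_src) ** 2)

def parameter_consistency_loss(theta_tgt, theta_src):
    # Parameter-level consistency: penalize classifier weights that
    # drift far from the pre-trained source weights.
    return sum(float(np.sum((wt - ws) ** 2))
               for wt, ws in zip(theta_tgt, theta_src))

def entropy_regularizer(probs, eps=1e-8):
    # Clustering-style regularization: low per-sample prediction entropy
    # pushes target features toward compact, confident clusters.
    return -np.mean(np.sum(probs * np.log(probs + eps), axis=1))
```

In training, these terms would be weighted and summed with the adversarial loss; the weighting scheme is not specified in this excerpt.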
An extensive study on public retinal image datasets shows that our SFTL method can effectively solve retinal image classification by source-data-free transfer learning. The contributions of this paper can be summarized as below:
- •
We develop a Source-Free Transfer Learning (SFTL) model that transfers valuable information from a source pre-trained classification model and further exploits predicted knowledge from the unlabeled target domain using only target retinal images, a setting in which none of the existing DR detection approaches is feasible.
- •
To enhance adversarial learning and improve generative performance, we propose a target generation module with an attached reconstruction loss, together with a collaborative consistency module that uses a feature consistency loss to keep the features learned by the target prediction model from drifting far from those of the source pre-trained model.
- •
We conduct transfer learning experiments between the EyePACS and APTOS 2019 datasets, proving that our SFTL method can effectively improve referable DR screening performance in the absence of source data.
Automated diabetic retinopathy grading
A series of research attempts has been devoted to improving the efficiency and accuracy of DR screening [8], [9], [10], [11], [12], [13], [14], [15]. We review the DR screening literature and consolidate it into a bar graph of the number of papers published per year, as shown in Fig. 3. Through machine learning, deep learning, and transfer learning, remarkable progress has been made in automated DR screening.
Initially, researchers are inspired by machine
The proposed method
In this study, a novel transfer learning architecture, named the Source-Free Transfer Learning (SFTL) model, is proposed to automatically grade retinal fundus images in the absence of source data. The overall architecture of the SFTL model is shown in Fig. 4. In detail, our framework consists of two crucial modules, target generation and collaborative consistency, jointly trained from a pre-trained source classification model and massive unlabeled target images.
In the target generation module,
Experiments
In this section, extensive experiments are performed on two datasets (EyePACS [37] and APTOS 2019 [38]) to demonstrate that our proposed SFTL model can effectively handle the domain shift and improve the diabetic retinopathy diagnosis performance of the target prediction model. In our experiments, the EyePACS and APTOS 2019 datasets are used as the source and target domains, respectively.
We perform three steps of experiments: 1) Train a ResNet50 [19] model using source images as the source
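The evaluation side of the protocol above can be illustrated with a small sketch. Referable DR is commonly defined as moderate non-proliferative DR or worse, i.e. grade 2 or higher on the 0-4 scale used by EyePACS and APTOS 2019; treating that threshold as this paper's operating definition is an assumption, and the grades below are fabricated toy values.

```python
import numpy as np

def to_referable(grades):
    # Binarize 0-4 DR grades into referable (grade >= 2) vs non-referable.
    # The >= 2 threshold is the common convention, assumed here.
    return (np.asarray(grades) >= 2).astype(int)

def accuracy(y_pred, y_true):
    # Fraction of samples where prediction matches the ground truth.
    y_pred, y_true = np.asarray(y_pred), np.asarray(y_true)
    return float((y_pred == y_true).mean())

# Toy example: five target images with predicted and true DR grades 0-4.
pred_grades = [0, 1, 2, 3, 4]
true_grades = [0, 2, 2, 3, 4]
acc = accuracy(to_referable(pred_grades), to_referable(true_grades))
```

Note that under the referable binarization, a grade-1 prediction for a grade-2 eye counts as an error, while confusions within the referable range (e.g. 3 vs 4) do not.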
Conclusion
This work develops a novel transfer learning method that diagnoses referable diabetic retinopathy without the help of source data. Since many fundus images are hard to annotate and are not always accessible, our proposed SFTL method is more practical for training an effective DR diagnosis model. Both the classification and generator modules are reinforced reciprocally via collaborative optimization by incorporating generated target-style images into the transfer learning stage. Additionally, a
CRediT authorship contribution statement
Chenrui Zhang: Conceptualization, Methodology, Investigation, Writing – original draft, Formal analysis, Supervision. Tao Lei: Data curation, Validation, Writing – original draft, Resources. Ping Chen: Visualization, Investigation, Writing – review & editing.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
This research was funded by the National Natural Science Foundation of China (61801437, 61871351, 61971381); Natural Science Foundation of Shanxi Province (201801D221206, 201801D221207); Scientific and Technological Innovation Programs of Higher Education Institutions in Shanxi (2020L0683); the National Natural Science Foundation of China under Grant 61461025, Grant 61871259, Grant 61811530325 (IEC\NSFC\170396, Royal Society, U.K.), and Grant 61861024; the Key research and development plan of
References (50)
- et al., Modified AlexNet architecture for classification of diabetic retinopathy images, Comput. Electr. Eng. (2019)
- et al., Identification and classification of microaneurysms for early detection of diabetic retinopathy, Pattern Recogn. (2013)
- et al., Convolutional neural networks for diabetic retinopathy, Proc. Comput. Sci. (2016)
- et al., MustGAN: Multi-stream generative adversarial networks for MR image synthesis, Med. Image Anal. (2021)
- et al., LGAN: Lung segmentation in CT scans using generative adversarial network, Comput. Med. Imaging Graph. (2021)
- et al., Super-resolution of cardiac MR cine imaging using conditional GANs and unsupervised transfer learning, Med. Image Anal. (2021)
- The use of the area under the ROC curve in the evaluation of machine learning algorithms, Pattern Recogn. (1997)
- et al., The evolving diabetes burden in the United States, Ann. Internal Med. (2004)
- et al., Role of early screening for diabetic retinopathy in patients with diabetes mellitus: an overview, Indian J. Commun. Med. (2011)
- et al., Improved automated detection of diabetic retinopathy on a publicly available dataset through integration of deep learning, Invest. Ophthalmol. Visual Sci. (2016)
- Residual-CycleGAN based camera adaptation for robust diabetic retinopathy screening
- Application of higher order spectra for the identification of diabetes retinopathy stages, J. Med. Syst.
- DREAM: Diabetic retinopathy analysis using machine learning, IEEE J. Biomed. Health Inf.
- Classification of diabetic retinopathy images based on customised CNN architecture
- CANet: Cross-disease attention network for joint diabetic retinopathy and diabetic macular edema grading, IEEE Trans. Med. Imag.
- Classification of lesions in retinal fundus images for diabetic retinopathy using transfer learning
- ImageNet classification with deep convolutional neural networks, Commun. ACM
- Deep residual learning for image recognition
- Cancer classification with data augmentation based on generative adversarial networks, Front. Comput. Sci.
- SiameseGAN: A generative model for denoising of spectral domain optical coherence tomography images, IEEE Trans. Med. Imaging