Fine-grained leukocyte classification with deep residual learning for microscopic images

doi:10.1016/j.cmpb.2018.05.024

Computer Methods and Programs in Biomedicine

Volume 162, August 2018, Pages 243-252

https://doi.org/10.1016/j.cmpb.2018.05.024

Highlights

• Drawing on domain knowledge of the white blood cell test, a leukocyte classifier (Cell3Net) based on deep residual learning theory is constructed.
• Multiple training strategies are properly selected and combined to give the classifier better generalization performance.
• A benchmark dataset containing about 100,000 leukocytes belonging to 40 categories is built for training and testing.

Abstract

Background and objective: Leukocyte classification and cytometry have wide applications in the medical domain, and previous research usually exploits machine learning techniques to classify leukocytes automatically. However, these methods are constrained by the past development of machine learning techniques: for example, extracting distinctive features from raw microscopic images is difficult, and the widely used SVM classifier has relatively few parameters to tune. As a result, they cannot efficiently handle fine-grained classification when the white blood cells fall into as many as 40 categories.

Methods: Based on deep learning theory, this paper conducts a systematic study of fine-grained leukocyte classification. First, a leukocyte classifier based on a deep residual neural network is constructed, which can imitate the domain expert’s cell recognition process and extract salient features robustly and automatically. Then the classifier’s topology is adjusted according to prior knowledge of the white blood cell test. After that, a microscopic image dataset with almost one hundred thousand labeled leukocytes belonging to 40 categories is built, and combined training strategies are adopted to give the designed classifier good generalization ability.

Results: The proposed deep residual neural network based classifier was tested on a microscopic image dataset with 40 leukocyte categories. It achieves a top-1 accuracy of 77.80% and a top-5 accuracy of 98.75% during the training procedure. The average accuracy on the test set is approximately 76.84%.

Conclusions: This paper presents a fine-grained leukocyte classification method for microscopic images, based on deep residual learning theory and medical domain knowledge. Experimental results validate the feasibility and effectiveness of our approach. Extended experiments support that the fine-grained leukocyte classifier could be used in real medical applications to assist doctors in diagnosing diseases and to reduce manual effort significantly.

Introduction

The ratio of leukocytes (white blood cells) in the blood is usually an indicator of disease. The leukocyte differential count supports diagnosis by counting the number and ratio of leukocytes in the blood. The white blood cell (WBC) test plays a crucial role in detecting and treating diseases such as leukemia and anemia.

Traditionally, white blood cell classification and counting is achieved primarily through a time-consuming and laborious manual process in which errors often occur. An automated and intelligent cell classification approach is therefore of great significance, since it allows samples to be processed continuously or in batches. The white blood cell classification problem is intrinsically complex: for example, the morphology and features of leukocytes may vary significantly depending on cell type, so no rigid rules that are robust and general enough can be applied to white blood cell category recognition. In previous work, researchers usually tackle the WBC test problem by exploiting machine learning techniques. Bikhet et al. [1] propose a white blood cell shape analysis and classification approach based on the morphological characteristics of the outer contour and nuclei of the cells. Denoised gray images are used as input for the classification system, and ten features, including the area of the cell, the area of the nucleus and the cell circularity, are selected and extracted from the original images. In the experiment, 71 cell samples belonging to 5 categories are tested. However, this approach relies mainly on the contour information of the white blood cells, and its ability to distinguish categories is limited. Sinha et al. [2] develop an automatic system for differential blood count using color images of blood smears. Shape, color and texture features are each extracted and tested in this system, and various classifiers are explored on the feature sets. In the experiments, the best accuracies are obtained with neural networks and the support vector machine (SVM). However, although the chosen features are relatively simple, the system cannot distinguish cells that happen to be touching. Hiremath et al. [3] propose an automatic approach for the differential counting of white blood cells.
Geometric features such as area, major axis length, minor axis length, perimeter, circularity and the ratio of the areas of nucleus and cytoplasm are used to identify the different types of white blood cells. Three leukocyte types, namely lymphocyte, monocyte and neutrophil, are classified in this approach. Although only low-level geometric information is computed when comparing different types of white blood cells, pre-processing steps are still required. Moreover, empirically chosen thresholds are applied for classification, which may not be robust enough. Khashman et al. [4], [5] investigate three different topologies of conventional neural networks for blood cell type identification. First, the global pattern averaging descriptor is used to extract feature vectors from the cell images; then three neural networks, whose input layers have different numbers of neurons, are trained. In the experiments, all three neural networks could identify red blood cells, white blood cells and platelets. However, this approach does not take background factors into account, so its recognition performance is not very good in practice. Sabino et al. [6] propose a texture approach to leukocyte recognition. Five texture features, namely inertia, energy, entropy, correlation and homogeneity, are calculated from gray level co-occurrence matrices (GLCM) to differentiate five types of normal leukocytes and chronic lymphocytic leukemia. The performance of the texture features is illustrated through experiments on 730 leukocyte images, but the accuracy seems to be somewhat low. Huang et al. [7] propose a computer assisted approach for leukocyte nucleus segmentation and recognition in blood smear images. First, the leukocyte nucleus is segmented using Otsu’s method. Then the co-occurrence matrix is used as a textural measure of the segmented images. Finally, genetic algorithm based k-means is used to classify five kinds of leukocytes.
However, these approaches are constrained by the past development of machine learning techniques: for example, extracting distinctive features from raw microscopic images is difficult, and the widely used SVM classifier has relatively few parameters to tune [8]. They mainly target three-class or five-class leukocyte classification. In fact, leukocytes can be classified into 40 categories according to their medical characteristics, and 40-class classification is more challenging and more valuable for the clinical diagnosis of diseases.
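To make the texture descriptors mentioned for Sabino et al. [6] more concrete, the following is a minimal sketch of a gray level co-occurrence matrix and the five named features in plain Python. The formulas are the commonly used Haralick-style definitions, which is an assumption: the exact definitions, pixel offsets and gray-level quantization used in [6] are not given here, and the function names and the toy image are purely illustrative.

```python
import math


def glcm(image, levels, dx=1, dy=0):
    """Normalized gray level co-occurrence matrix for the pixel offset (dx, dy)."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    total = sum(map(sum, counts))
    return [[n / total for n in row] for row in counts]


def texture_features(p):
    """Five texture features of a normalized co-occurrence matrix p (assumed definitions)."""
    levels = len(p)
    cells = [(i, j, p[i][j]) for i in range(levels) for j in range(levels)]
    mu_i = sum(i * v for i, _, v in cells)
    mu_j = sum(j * v for _, j, v in cells)
    var_i = sum((i - mu_i) ** 2 * v for i, _, v in cells)
    var_j = sum((j - mu_j) ** 2 * v for _, j, v in cells)
    cov = sum((i - mu_i) * (j - mu_j) * v for i, j, v in cells)
    return {
        "inertia": sum((i - j) ** 2 * v for i, j, v in cells),
        "energy": sum(v * v for _, _, v in cells),
        "entropy": -sum(v * math.log(v) for _, _, v in cells if v > 0),
        # Correlation is undefined for a constant image; return 0.0 in that case.
        "correlation": cov / math.sqrt(var_i * var_j) if var_i > 0 and var_j > 0 else 0.0,
        "homogeneity": sum(v / (1 + abs(i - j)) for i, j, v in cells),
    }


# Toy 4x4 image with 4 gray levels (made-up data, not from any dataset).
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
matrix = glcm(image, levels=4)
features = texture_features(matrix)
```

Hand-crafted descriptors of this kind reduce each cell image to a handful of numbers, which helps explain why the surveyed methods were limited to a small number of leukocyte classes.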

Deep learning moves machine learning a big step toward its original goal: implementing true artificial intelligence. Deep learning has achieved significant breakthroughs on many problems in various domains since 2006, triggering a research boom in the AI and machine learning communities. It is about learning multiple levels of representation and abstraction that help to make sense of data such as sound, image and video [9], [10], [11], [12]. Many deep learning approaches have since been developed for recognition and classification tasks in multiple fields. Most notably, Krizhevsky et al. [13] propose a classical deep learning architecture called AlexNet and show significant improvements over previous approaches on the image classification task. Following the success of AlexNet, several works have been proposed to improve its performance. Among them, five representative works are ZFNet [14], VGGNet [15], ResNet [16], GoogLeNet [17] and DenseNet [18]. In addition, Long et al. [19] train a fully convolutional network (FCN) for semantic segmentation, in which input images are mapped directly to dense label maps. However, these networks mainly aim at recognizing objects in natural scenes and cannot be applied directly to microscopic images.

In this paper, we propose a fine-grained white blood cell classification approach with deep residual learning. Compared to traditional shallow machine learning algorithms, our approach addresses the problem in a simple end-to-end learning manner, avoiding error-prone processes such as denoising, white balancing, feature extraction and feature selection. Hence the classification and cytometry accuracy can be improved.
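As a reminder of what "residual learning" refers to in ResNet [16], a block learns a residual function F(x) and outputs F(x) + x through a shortcut connection. The sketch below is a toy fully connected version in plain Python under that general definition. It is not the convolutional architecture of the classifier proposed in this paper, and all function names and values are illustrative assumptions.

```python
def relu(v):
    """Element-wise rectified linear unit."""
    return [max(0.0, x) for x in v]


def linear(weights, bias, v):
    """Dense layer; weights is a list of rows, one row per output unit."""
    return [sum(w * x for w, x in zip(row, v)) + b for row, b in zip(weights, bias)]


def residual_block(v, w1, b1, w2, b2):
    """Compute relu(F(v) + v), where F is two dense layers with a ReLU in between."""
    hidden = relu(linear(w1, b1, v))
    residual = linear(w2, b2, hidden)
    # Shortcut connection: add the block input to the learned residual.
    return relu([r + x for r, x in zip(residual, v)])


# With all-zero weights the residual branch outputs zeros, so the block
# reduces to the identity mapping for non-negative inputs.
zero_w = [[0.0] * 3 for _ in range(3)]
zero_b = [0.0] * 3
x = [0.5, 1.0, 2.0]
y = residual_block(x, zero_w, zero_b, zero_w, zero_b)
```

Because the identity mapping is available through the shortcut, stacking more blocks does not force the network to relearn what earlier layers already represent, which is the usual argument for why very deep residual networks remain trainable.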

Section snippets

Methods

Leukocyte classification is a highly intelligent activity. Those who classify the cells manually must have rich knowledge and experience in this domain, and need to go through complex reasoning processes to accomplish the work.

Fig. 1 illustrates the overall workflow for designing a deep network classifier. The pipeline contains six steps:

1) Acquire enough sample data from local hospitals, and build white blood cell data sets for training and performance testing.
2)

Results

The proposed approach is implemented in Python and C++ with the Caffe framework [24], using PyCharm as the IDE. The experiments are conducted on a single GPU (Nvidia Tesla K80) under Ubuntu 16.04. The optimization completes after running for 3611.51 minutes, covering 225 epochs and 80,000 iterations. The deep residual neural network classifier for leukocytes achieves a top-1 accuracy of 77.80% and a top-5 accuracy of 98.75%. Fig. 9 shows the accuracy curve during the training
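For readers unfamiliar with the top-1 and top-5 metrics reported above, the following minimal sketch shows how top-k accuracy is typically computed from per-class scores. It is a generic illustration in plain Python, not the Caffe evaluation code used in the paper; the function name and the toy scores are assumptions.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy scores for 4 samples over 3 classes (made-up values).
scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
    [0.4, 0.35, 0.25],
]
labels = [1, 1, 0, 2]
top1 = top_k_accuracy(scores, labels, k=1)
top2 = top_k_accuracy(scores, labels, k=2)
```

With 40 categories, a large gap between top-1 and top-5 accuracy suggests that most errors are confusions among a few visually similar cell types rather than arbitrary mistakes.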

Quantitative evaluations

The accuracies of recognizing each leukocyte category on the test set are summarized in Table 2. The third column is the number of images contained in each category. The fourth column is the number of images correctly recognized by the classifier. The fifth column shows the recognition accuracy for each category. The accuracies of 4 categories (i.e. basophilic rod-shaped nucleus, polychromatic erythroblast, metarubricyte, mature lymphocyte) are greater than 80%, 6
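The per-category columns described for Table 2 (image count, correctly recognized count, accuracy) can be derived from true and predicted labels as in the sketch below. The label lists are made-up toy data that only borrow two category names from the text; they are not values from Table 2, and the function name is illustrative. How the paper averages accuracy over the test set is not stated in this snippet, so both common conventions are shown.

```python
from collections import Counter


def per_class_accuracy(true_labels, predicted_labels):
    """Map each category to (image count, correct count, accuracy)."""
    totals = Counter(true_labels)
    correct = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    return {c: (totals[c], correct[c], correct[c] / totals[c]) for c in sorted(totals)}


# Toy labels (made-up data).
true_labels = ["metarubricyte"] * 4 + ["mature lymphocyte"] * 2
predicted_labels = [
    "metarubricyte", "metarubricyte", "metarubricyte", "mature lymphocyte",
    "mature lymphocyte", "metarubricyte",
]

table = per_class_accuracy(true_labels, predicted_labels)

# Overall accuracy weights every image equally.
overall = sum(correct for _, correct, _ in table.values()) / len(true_labels)
# Macro average weights every category equally, regardless of its size.
macro = sum(acc for _, _, acc in table.values()) / len(table)
```

The two averages can differ noticeably when categories are imbalanced, which is common for rare leukocyte types.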

Conclusions

In conclusion, this paper systematically studies the problem of fine-grained white blood cell category recognition in microscopic images. Its novelty and contributions lie in the following aspects:

Neural network architecture. Drawing on domain knowledge of the white blood cell test, a leukocyte classifier based on deep residual learning theory is constructed. It can extract salient features and generate feature representations layer by layer. It is notable that our classifier’s performance degrades if a

Conflict of interest

The authors do not have financial and personal relationships with other people or organizations that could inappropriately influence (bias) their work.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (Nos. 61502129, 61602140, 61602139), the Zhejiang Provincial Natural Science Foundation of China (No. LQ16F020004), the Open Project Program of State Key Lab of CAD&CG, Zhejiang University (Nos. A1803, A1817), the Science and Technology Program of Zhejiang Province (No. 2017C33049), China Postdoctoral Science Foundation (No. 2017M620470) and Co-Innovation Center for Information Supply & Assurance Technology,

References (28)

D.M.U. Sabino et al. A texture approach to leukocyte recognition. Real-Time Imag. (2004)
D.-C. Huang et al. A computer assisted method for leukocyte nucleus segmentation and recognition in blood smear images. J. Syst. Software (2012)
Y. Peng et al. Discriminative extreme learning machine with supervised sparsity preserving for image classification. Neurocomputing (2017)
S.F. Bikhet et al. Segmentation and classification of white blood cells. IEEE International Conference on Acoustics, Speech, and Signal Processing (2000)
N. Sinha et al. Automation of differential blood count. TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region (2003)
P.S. Hiremath et al. Automated identification and classification of white blood cells (leukocytes) in digital microscopic images. Int. J. Comput. Appl. (2010)
A. Khashman. Investigation of Different Neural Models for Blood Cell Type Identification (2012)
A. Khashman. Blood cell identification using emotional neural networks. J. Inf. Sci. Eng. (2009)
Y. Bengio. Learning deep architectures for AI. Found. Trends® Mach. Learn. (2009)
Y. Li et al. Joint embeddings of shapes and images via CNN image purification. ACM Trans. Graph. (2015)
F. Qin et al. Research on intelligent fault diagnosis of mechanical equipment based on sparse deep neural networks. J. Vibroeng. (2017)
F. Qin et al. A deep learning approach to the classification of 3D CAD models. J. Zhejiang Univ. Science C (2014)
A. Krizhevsky et al. ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems (2012)
M.D. Zeiler et al. Visualizing and understanding convolutional networks. European Conference on Computer Vision (2014)
