Introduction

In December 2019, a group of patients with atypical pneumonia, cause unknown, was associated to the consumption of bat meat bought at an exotic animal meat market in Wuhan, Hubei, China. The disease has quickly spread to other corners of the world and, on March 11, 2020, the World Health Organization declared the COVID-19 global pandemic outbreak, which is still ongoing (Zhu et al. 2020; Yang et al. 2020). By using impartial sequencing of samples from patients, it was possible to identify a new type of beta coronavirus. This novel coronavirus, named 2019-nCoV, was compared to Severe Acute Respiratory Syndrome (SARS) and Middle East Respiratory Syndrome (MERS), presented a lower mortality rate and a higher transmission capacity (Zhu et al. 2020; Yang et al. 2020).

Coronaviruses are RNA viruses with a lipid envelope able to cause respiratory, enteric, liver, and neurological diseases in domestic animals and human beings (Zhu et al. 2020; Weiss and Leibowitz 2011). The existence of wild viruses in nature is well known by researchers. They are highly prevalent, broadly distributed, genetically diverse, and frequently genetically recombined. There is a higher probability of natural hosts disseminating these viruses to human beings as the man-animal interface activity is increasing.

Both SARS and MERS were associated with a zoonotic origin and transmitted by civets and camels, respectively. The appearance of diseases such as SARS and MERS, supposedly caused by the new coronavirus, tends to become more frequent events when no barriers are imposed between human society and wild nature (Zhu et al. 2020; Cui et al. 2019).

As there is no specific drug or vaccine to treat the new coronavirus, early detection is crucial so that the patient is isolated from the healthy population as soon as possible (Ai et al. 2020). Thus, the research for methods of early detection has become essential to fight this pandemic outbreak.

Currently, the gold standard of the COVID-19 diagnosis is made by viral nucleic acid detection using reverse transcription-polymerase chain reaction (RT-PCR) in real-time, although its effective accuracy is 30 to 50% (Ai et al. 2020; Zhang et al. 2020; Ozturk et al. 2020).

An underlying problem with this method is how unavailable it is in several regions and countries impacted; this generates logistic and political issues for providing enough test kits for the increasing number of patients suspected of having the disease (Zhang et al. 2020). Moreover, the delay in processing and getting results and the significant number of false negatives urged researchers from all over the world to try to find a solution for this problem in various areas of knowledge (Ozturk et al. 2020).

Medical image processing is one of the areas which has been contributing to promising studies. Research in this area is being done to clinically aid to diagnose the disease for patients who develop lung atypical pneumonia, using chest X-ray images or computed tomography (CT) scans (Ozturk et al. 2020). As several patients with COVID-19 develop lung infection, CT scans are useful to detect lung impairment, as well as to classify its progression (Zhang et al. 2020; Dai et al. 2020).

Radiological images from COVID-19 patients may present similarities with those from patients with bacterial or viral pneumonia, specifically the ones caused by SARS and MERS. Thus, the ability to accurately differentiate diseases by analyzing medical images has become a vital challenge, and overcoming it means helping healthcare professionals with detecting the disease early and isolating affected patients as soon as possible (Ozturk et al. 2020; Chung et al. 2020).

Medical image processing area has several studies aimed at developing machine learning methods able to help to diagnose COVID-19, either using CT scan images or chest X-ray ones (Ozturk et al. 2020). Problems using CT scans rather than chest X-rays include low availability of equipment, radiologists, and physicians; higher cost; and longer time to obtain images (Zhang et al. 2020).

A machine learning technique used in research is called deep learning. It allows computer models with several layers of processing to learn how to represent data in various levels of abstraction (Zhang et al. 2020; Lecun et al. 2015; Martin et al. 2020). This technique allows the design of applications which can perform recognition, such as speech recognition, visual recognition, and object detection.

Medical images of patients with COVID-19 presents common features that might show a pattern. Deep learning is an effective technique used by researchers to help healthcare professionals to analyze vast volumes of data generated by chest X-ray images, for instance (Zhang et al. 2020; Martin et al. 2020).

When it comes to applying artificial intelligence and machine learning to the medical field, convolutional neural networks (CNNs) stand out. CNNs are deep artificial neural networks which can be used to classify images, group them by similarity, and run object recognition. These networks are inspired in the human visual cortex processing and used for medical images where irregularities in tissue morphology may be used to classify tumors. CNNs can detect patterns, which are hard to be found by human specialists, for instance, initial stages of disease in tissue samples (Balas et al. 2019).

This paper proposes a new Convolutional Neural Network model called CNN-COVID, to classify images of COVID-19 patients and differentiate them from those who do not have COVID-19, with a focus in analysis, classification, and high accuracy. The CNN-COVID model and its related work are presented in the following sections.

Related work

In the study from Zhang et al. (2020), a model to detect anomalies was developed by using deep learning. The goal is to do a quick and trustworthy triage of patients with COVID-19. One hundred chest X-ray images from 70 patients with COVID-19 were sourced from a GitHub repository to assess the performance of the model, and 1431 chest X-ray images from 1008 patients with other types of pneumonia were sourced from ChestX-ray14, a public data pool. The Zhang model in (Zhang et al. 2020) is composed of three components, namely, a backbone network, a classification head, and an anomaly detection head. Thus, the first component extracts the high-level features from a chest X-ray which will be the input data for the classification head, and then, the input data for the anomaly detection head. So, with this paper, it resulted 90.00% sensitivity, 87.84% specificity (when parameter T from the study was 0.25), or 96.00% sensitivity and 70.65% specificity (when parameter T was 0.15). Nonetheless, this model presented some limitations, such as missing 4% of COVID-19 cases and having an approximately 30% false positives.

In a similar study, Sethy and Behera (2020) suggested a model to detect COVID-19 from chest X-rays using deep learning. Support vector machine (SVM) classified images of patients suffering from this disease differentiating from images of patients who suffer from other diseases. A subset of 25 COVID-19 images were sourced from the GitHub repository, and a subset of 25 pneumonia images was sourced from the Kaggle repository. The ResNet50 deep neural network model with SVM classification has proven to be the best approach to detect COVID-19, with 95.38% accuracy, 97.2% sensitivity, and 93.4% specificity.

In Wang and Wong (2020), a CNN was adapted to detect COVID-19 cases. It was called COVID-Net, and it used chest X-rays images from open source code and available to the general audience. Thus, COVIDX was created, a database sourcing samples from five different databases (Wang and Wong 2020): 100 samples of chest X-rays of healthy patients, 100 samples of pneumonia patients, and 100 images of COVID-19 patients. COVID-NET presented the following results: 93.3% accuracy, 91.0% sensitivity, and 99.9% specificity when detecting COVID-19 X-ray images out of healthy cases and severe acute respiratory syndrome cases.

The study from Abbas et al. (2020) uses the transfer learning technique, an effective mechanism to provide a promising solution by transferring knowledge from generic tasks for object recognition to domain-specific tasks. Therefore, DeTraC, a deep CNN was adapted with functions to decompose, transfer, and compose samples to classify chest X-ray images as COVID-19. They sourced 80 chest X-ray images (4020 × 4892 pixels) of healthy patients from the Japanese Society of Radiological Technology (JSRT), 105 images (4248 × 3480 pixels) of COVID-19 patients, and 11 images (4248 × 3480 pixels) of SARS patients from GitHub. DeTraC presented the following results: 95.12% accuracy, 97.91% sensitivity, and 91.87% specificity when detecting COVID-19 X-ray images out of healthy cases and severe acute respiratory syndrome cases.

Methods

CNN-COVID was trained and tested using images from two different databases (public domain) containing chest X-ray who tested positive for COVID-19. The 434 images from Dataset I was obtained at the beginning of the pandemic, and the 4030 images from Dataset II, in October 2020. Due to the limited availability of public images at the beginning of the pandemic, Dataset I was expanded using data augmentation techniques. For Dataset II, a data augmentation was not necessary, and the images were randomly selected for each phase of training and testing. Figure 1 represents the proposed methodology for image classification.

Fig. 1
figure 1

Diagram of the proposed methodology for image classification (COVID and NON-COVID). Created by the author

Dataset I

Two databases compose Dataset I. The first one has 217 chest X-ray images from the COVID-19 image data collection (Cohen et al. 2020). In this database, the images are from 141 patients who tested positive for COVID-19 was labeled as COVID. From the second database, ChestXray14 (Wang et al. 2019), 1126 images were used. All these images correspond to chest X-ray images labeled for the presence of 14 common chest radiographic observations that, in this paper, were labeled as NON-COVID.

A subset of 166 images, out of the 217 images from the COVID-19 set were selected randomly (Cohen et al. 2020). Out of these 166 selected images, around 75% were for the training phase and 25% were for the testing phase. From the NON-COVID set, 1000 images were selected randomly, 80% were for the training phase and 20% were for the testing phase.

For the validation phase, 126 images were selected from the COVID-19 set (out of the 217 total images) and 126 images from the NON-COVID set (out of the 1126 total images), as shown in Table 1.

Table 1 Number of images in Dataset I used for CNN-COVID

Data augmentation

As the state of pandemic was declared on March 2020, one of the challenges of this study was to find suitable COVID-19 images to work with. So, we had a limited number of X-rays images from COVID-19 patients for training the deep learning models. To overcome that issue, we used the ImageDataGenerator, that provided us with new images to be generated for the training phase. The new images were generated from digital processing, using geometric transformations of the original images.

These geometric transformations, such as be translation, rotation, patch extraction, and reflection, do not make changes to the image object properties, making “data augmentation” possible. The positive side of this technique is that it increases the ability to generalize CNN-COVID when trained with an augmented dataset (Aggarwal et al. 2018; Chollet 2016). Thus, overfitting, which is when the network is no longer able to generalize when presented with new data, can be reduced.

The following common methods were used for dataset augmentation:

  • range rotation

  • width shift range

  • height shift range

  • zoom range

  • horizontal flip

  • vertical flip

After these changes, it was possible to balance the Dataset I from both COVID and NON-COVID classification on the testing and training sets. This database augmentation happens during run time, when chest X-ray images are presented as an input for CNN-COVID.

Dataset II

Dataset II is composed of images from the Valencian Region Medical ImageBank (BIMCV) repository (Vayá et al. 2020) containing chest X-ray and computed tomography (CT) imaging of COVID-19+ patients. From this repository, two databases were created: COVID and NON-COVID. The COVID has 2025 chest X-ray images from patients who tested positive for COVID-19, and NON-COVID database, 2025 chest X-ray images from patients who tested negative for the disease.

Among the 2025 images, chosen from the COVID and NON-COVID databases, approximately 75% of them made up the training phase, 15% the testing phase, and 15% the validation phase, as shown in Table 2.

Table 2 Number of images in Dataset II used for CNN-COVID

Created by the author

CNN-COVID creation

A CNN is composed of two stages: feature extraction stage and classification stage. In the CNN, the pooling and convolution layers act as a stage of feature extraction. In contrast, the classification stage is made of one or more fully connected layers followed by a sigmoid function layer (Wani et al. 2020), which are presented below. For our proposed CNN-COVID method, we used Python programming language (using Keras library) (Chollet, 2016) to create and train CNN-COVID. The work was developed using a I7-8750H Intel processor, 2.21 GHz CPU, 16.0 GB RAM, and a GeForce GTX 1060 graphic card with Max-Q Design.

Convolution layer

In the proposed CNN-COVID, a new convolution operation is established for the convolutional layer, in which a kernel is used to map the activations from one layer into the next. The convolution operation places the kernel in each possible position in the image (or hidden layer) so that the kernel overlaps the entire image and executes a dot product between the kernel parameters and its corresponding receptive field—to which a kernel is applied—in the image. The convolution operation is executed in every region of the image (or hidden layer) to define the next layer (in which activations keep their spatial relations in the previous layer) (Ponti and Da Costa, 2018; Aggarwal et al. 2018; Lecun et al. 2015).

There may be several kernels in the convolutional layer. Every kernel has a feature, such as an edge or a corner. During the forward pass, every kernel is slid to the image width and height (or hidden layer), thus, generating the feature map (Ponti and Da Costa, 2018; Aggarwal et al. 2018; Lecun et al. 2015; Balas et al. 2019

Adaptive moment estimation (ADAM)

CNN-COVID uses Adaptive Moment Estimation (ADAM), an adaptive optimization technique that saves an exponentially decaying average of previously squared gradients vt. Besides, ADAM also computes the average of the second moments of the gradients mt (Wani et al. 2020; Kingma and Ba 2014).

Average and non-centered variance values mt are presented in Eqs. 1 and 2, respectively:

$$ {m}_t={\beta}_1\ {m}_{t-1}+\left(1-{\beta}_1\right) gt $$
(1)
$$ {v}_t={\beta}_2{v}_{t-1}+\left(1-{\beta}_2\right)g{t}^2 $$
(2)

ADAM updates exponential moving averages of the gradient and the squared gradient where the hyperparameters β1, β2 ∈ [0, 1] control the decay rates of these moving averages (Eqs. 3 and 4):

$$ {\hat{m}}_t=\frac{m_t}{1-{\beta}_1^t} $$
(3)
$$ {\hat{v}}_t=\frac{v_t}{1-{\beta}_2^t} $$
(4)

The final formula for the update is presented in Eq. (5):

$$ {w}_{t+1}={w}_t-\frac{\alpha .{\hat{m}}_t}{\sqrt{{\hat{v}}_t}+\epsilon } $$
(5)

Where α is the learning rate and ϵ is a constant added to the denominator for quick conversion methods in order to avoid the division by 0 (Wani et al. 2020; Kingma and Ba 2014).

Dropout technique

CNN-COVID uses the Dropout technique, the most popular technique to reduce overfitting. Dropout refers to dropping out neurons in a neural network during training. Dropping out a neuron means temporarily disconnecting it, as well as all its internal and external connections, from the network.

Dropped-out neurons neither contribute to the forward pass nor add to the backward pass. By using the dropout technique, the network is forced to learn the most robust features as the network architecture changes with every input (Wani et al. 2020; Balas et al. 2019).

Activation functions

An activation function feeds the output of every convolutional layer. The activation function layer consists of an activation function which uses the resource map produced by the convolutional layer and generates the activation map as the output. The activation function is used to change a neuron activation level in an output signal. Thus, it performs a mathematical operation and generates the neuron activation level at a specific interval, for instance, 0 to 1 or −1 to 1 (Wani et al. 2020). The functions used were the following:

  • Sigmoid/Logistic activation function: The sigmoid function \( \sigma (x)=\frac{1}{1+{e}^{-x}} \) is a curve shaped like an S (Ponti and Da Costa, 2018).

  • The activation function f(x) =  max (0, x) is called Rectified Linear Unit–ReLU (Ponti and Da Costa, 2018) and generates a non-linear activation map.

Pooling layer

In the CNN-COVID pooling layer, or downsampling layer, is used to reduce the receptive field spatial size, thus, reducing the number of network parameters. The pooling layer selects each convolutional layer feature map and creates a reduced sample. Max-pooling was the technique used for this work. It generates the maximum value in the receptive field. The receptive field is 2 × 2; therefore, max-pooling will issue the maximum of the four input values (Wani et al. 2020).

Fully connected layer

After the convolution and pooling processes, the next step is to decide based on the features detected. This is done by adding one or more fully connected layers to the end. In the fully connected layer, each neuron from the previous layer is connected to each neuron from the following one. All values contribute to predict how strong a value correlates to a given class (Wani et al. 2020). The fully connected layers may be layered on top of one another to learn even more sophisticated feature combinations. The output of the last fully connected layer is fed by an activation function which generates the class scores. The sigmoid activation function is the one used for CNN-COVID. It produces class scores, and the class with the highest score is treated as the correct one (Wani et al. 2020).

CNN-COVID structure development

Convolutional Neural Networks (CNNs) were proposed to assess image data. The name comes from the convolution operator, a simple way of doing complex operations using the convolution kernel (Ravi et al. 2017).

Many variations of CNN were proposed, such as AlexNet (Krizhevsky et al. 2012), Clarifai (Zeiler and Fergus 2014), and GoogleNet (Szegedy et al. 2015). CNN-COVID structure is also a variation of CNN with the following architecture: an input layer, a convolutional layer, a dense layer, and an output layer, as per Fig. 2.

Fig. 2
figure 2

Deep neural network classification scheme. Created by the author

CNN-COVID detailed architecture is illustrated in Table 3. Our proposed network consists of conventional layers, including input, convolution layer, max-pooling layer, and fully-connected layers.

Table 3 CNN-COVID architecture. The network contains the input (I), the convolution (C), the max-pooling (M) layers, and the fully connected network (F). Created by the author

Besides, a rectified linear unit (ReLU) activation function is used after each convolution layer (1st, 3rd, 5th, and 7th) and dense layers (9th, 10th, 11th, and 12th). To reduce the possibility of overfitting, a dropout rate of 20% was implemented to the first four fully connected layers (9th, 10th, 11th, and 12th).

CNN-COVID training

In the training phase, weights are initialized randomly. The network was trained as per the ADAM model (Wani et al. 2020). Standard parameters β1 = 0.9 and β2 = 0.999 were used (Kingma and Ba, 2014), as well as the initial learning rate α = 0.001 reduced by a factor of 10.

The ADAM training model (Wani et al. 2020) has a better performance compared with other adaptive techniques; it has a quick conversion rate; thus, it reduces the chances of error and increases accuracy. It also overcomes problems faced by other optimization techniques, such as decaying learning rate, high variance in updates, and slow convergence (Wani et al. 2020).

CNN-COVID input parameters

Several options were tested when choosing the input parameter values and the CNN-COVID batch size, considering the performance capacity of the available hardware. For input sizes 200 × 200 and 220 × 220, the batch size was 20. For larger input sizes, the batch size was 10. The input parameter tests were run in 500 epochs. The test with better accuracy results was the one with input size 300 × 300 and batch size 10, as per Fig. 3.

Fig. 3
figure 3

Choice of CNN-COVID Input Size (200 × 200 300 × 300). Created by the author

CNN-COVID training, testing, and validation

For Dataset I and II, the training and testing phases were performed as follows:

  • Regarding Dataset I, the network was tested 10 times varying the parameters used by the ImageDataGenerator class to generate new samples on the image databases for both COVID-19 and NON-COVID classes. For the validation phase, the network with the best accuracy in the test phase (10 tests) was considered.

  • Furthermore, for Dataset II, the network was also tested 10 times, and the training and testing database images were randomly selected. For the validation phase, the network with the best accuracy in the test phase (during 10 tests) was considered.

Performance metrics evaluation

The following metrics were used to validate the CNN-COVID system:

  • Accuracy (ACC): accurate classification rate as per the total number of elements.

  • Precision (P): ratio of true positive classifieds.

  • Recall/Sensitivity (SE): true positive rate.

  • Specificity (SP): true negative rate.

  • F1-Score: relationship between precision and recall/sensitivity.

They are commonly used to assess the performance of classification algorithms (Ruuska et al. 2018; Skansi 2018; Khatami et al. 2017). There is a standard way to show the number of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN) to be more visual. This method is called the confusion matrix. For a classification of two classes, the confusion matrix is presented in Table 4.

Table 4 Example of a confusion matrix. Adapted from (Skansi 2018)

The confusion matrix allows us to determine the following metrics (Narin et al. 2020; Ruuska et al. 2018; Skansi 2018; Khatami et al. 2017):

$$ \mathrm{Accuracy}:\mathrm{ACC}=\frac{\mathrm{TN}+\mathrm{TP}}{\mathrm{TN}+\mathrm{TP}+\mathrm{FN}+\mathrm{FP}} $$
(6)
$$ \mathrm{Precision}:\mathrm{P}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FP}} $$
(7)
$$ \mathrm{Recall}/\mathrm{Sensitivity}:\mathrm{SE}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}} $$
(8)
$$ \mathrm{Specificity}:\mathrm{SP}=\frac{\mathrm{TN}}{\mathrm{TN}+\mathrm{FP}} $$
(9)
$$ \mathrm{F}1\ \mathrm{Score}:\mathrm{F}1-\mathrm{Score}=\frac{2\times \left(\mathrm{Precision}\times \mathrm{Recall}\right)}{\left(\mathrm{Precision}+\mathrm{Recall}\right)} $$
(10)

Performance metrics help assess the CNN-COVID network and must be interpreted as follows: accuracy is the proportion of correct classification, considering the samples classified; precision, in turn, shows the odds of a positive anomaly be confirmed, recall is the ability of the model to identify all positive examples, and F-score is the harmonic average between precision and recall, best results being closer to 1 and worst, close to 0.

It is also possible to get the Receiver Operating Characteristic Curve (ROC curve). ROC analysis is often called the ROC accuracy ratio, a common technique for judging the accuracy of default probability models (Shirazi et al. 2018).

Results

A total of 10 trainings and tests were done with CNN-COVID. A total of 2000 epochs were applied to each training and test in both Dataset I and II. The results are shown in Fig. 4. The average accuracy of Dataset I and II was 0.9787 and 0.9839, respectively, and the overall average between the two datasets was 0.9813.

Fig. 4
figure 4

Dataset I and II: CNN-COVID results of each test and overall average (Avg) of all 10 tests. Source: Created by the author

To perform the validation phase, we take the weights of the best accuracy from the 10 tests obtained in CNN-COVID. So, the confusion matrix for Dataset I was generated with 126 COVID-19 images and 126 NON-COVID images which resulted in 252 images for validating the model. Otherwise, the confusion matrix for Dataset II was generated with 303 COVID-19 images and 303 NON-COVID images which resulted in 606 images for validating the model. Figure 5 shows the confusion matrix generated for Dataset I (Fig. 5a) and Dataset II (Fig. 5b), respectively.

Fig. 5
figure 5

CNN-COVID confusion matrix: a Dataset I; b Dataset II. Source: Created by the author

Using the TP, TN, FP, and FN parameters, the accuracy, sensitivity, and specificity were calculated for both Dataset I and Dataset II, as registered in Table 5.

Table 5 Metrics results. Source: Created by the author

According to the results presented in Table 5, the ROC curve was calculated for both datasets as (1-SP) and SE, as x and y, respectively. As per the ROC assessment, the area (AUC) is shown in Fig. 6.

Fig. 6
figure 6

CNN-COVID ROC curve: a Dataset I/AUC = 0.972; b Dataset II/AUC = 0.988. Source: Created by the author

From the results of Table 5, we calculated the average of the ACC, SE, and SP metrics for the two datasets, which was compared with the results of the state-of-the-art methods. Our method (CNN-COVID) presents better results for SE (98.83%) and ACC (98.03%) metrics, in comparison with other studies as shown in bold text in Table 6. In addition, the methodology applied at CNN-COVID is broader than that of the studies analyzed, as it uses the average of 10 tests, that is, the average accuracy of 10 tests made with CNN-COVID was considered.

Table 6 Comparison of CNN-COVID with state-of-the-art methods. Created by the author

Discussion

For the chest X-ray images that have various sizes over 1000 × 1000 pixels, a study was needed to decide which image sizes should be remodeled to have the best input size for CNN-COVID. The initial input size was 200 × 200, and then, it was increased by 20 until it reached 300 × 300. CNN-COVID processed every input size in 500 epochs. This study showed that the higher the input size, the more accurate the network was. As per Fig. 2, when the input size was 300 × 300, the accuracy was 0.9697.

The algorithm for gradient-based optimization of stochastic objective functions chosen was the Adaptive Moment Estimation (ADAM). It is an adaptive optimization technique which leverages both AdaGrad (the ability to deal with sparse gradients) and RMSProp (the ability to deal with non-stationary objectives) (Wani et al. 2020). Besides, the method is straightforward to implement and requires little memory.

For Dataset I, which presented a reduced number of COVID-19 images, augmenting the database was needed, to increase the CNN-COVID generalization. The database augmentation was done during run time. In the training phase, 126 COVID-19 images generated 252,000 new images, and 800 NON-COVID images generated 1,600,000 new images bringing it all to a total of 1,852,000 images. In the test phase, 40 COVID-19 images generated 80,000 new images, and 200 NON-COVID images generated 400,000 new images bringing it all to a total of 480,000 images. In the validation phase, the data generator was not used.

In this work, we trained and tested CNN-COVID 10 times, in order to obtain a more reliable value for ACC. The average accuracy of the 10 tests was 0.9787 for Dataset I and 0.9839 for Dataset II (according to Fig. 4). As for the validation, the accuracy obtained was 0.9722 for Dataset I and 0.9884 for Dataset II (according to Table 5). With this result, it was found that the accuracy of the validation of Dataset I was close to the average accuracy of the tests with a percentage error less than 1%.

In Dataset II, the validation accuracy was higher than the average of the tests, showing that the average accuracy is actually more reliable. Thus, these results indicate that the investment of time, human, financial, and computational resources in the creation and improvement of techniques based on machine learning is a promising approach to assist professionals in the prognosis of the new coronavirus through chest X-ray images.

Conclusion and future work

This paper proposed a deep neural network, called CNN-COVID, for prognosis of the COVID-19 virus. Two different datasets were used: Dataset I is formed by a COVID-19 image data repository (Cohen et al. 2020), as well as the ChestXray14 data set (Wang et al. 2019), and Dataset II, on the other hand, is formed by the BIMCV COVID-19 + repository (Vayá et al. 2020). After completing the 10 tests, the average accuracy of Dataset I and Dataset II was 0.9787 and 0.9839, respectively. The weights of the best test results were applied in the validation, obtaining values of accuracy 0.9722 for Dataset I and 0.9884 for Dataset II.

The results proved that the CNN-COVID model is a promising tool to help physicians classify chest images with pneumonia, considering atypical pneumonia caused by COVID-19 and pneumonia due to other causes. We hope this technology enhances the provision of healthcare services, contributing to the disease prognosis through straightforward exams, such as chest X-ray, and broadening the access to information through tools that help with using images for diagnosing.

For future work, we plan to improve the CNN-COVID accuracy as new COVID-19 data is collected.