Recurrent convolutional neural network based multimodal disease risk prediction

doi:10.1016/j.future.2018.09.031

Future Generation Computer Systems

Volume 92, March 2019, Pages 76-83

https://doi.org/10.1016/j.future.2018.09.031 Get rights and content

Highlights

•
We propose a new MD-RCNN for disease risk prediction.
•
We propose a feature fusion scheme based on DBN.
•
We experiment with the medical big data of a Chinese grade-A hospital of second class.

Abstract

With the rapid growth of biomedical and healthcare data, machine learning methods are used in more and more work to predict disease risk. However, most works use single-mode data to predict disease risk and only few works use multimodal data to predict disease risk. Thus, a new multimodal data-based recurrent convolutional neural network (MD-RCNN) for disease risk prediction is proposed. This model not only can use patient’s structured data and text data, but also can extract structured and unstructured features in fine-grained. Furthermore, in order to obtain the highly non-linear relationships between structured data and unstructured data, we use deep belief network (DBN)to fuse the features. Finally, we experiment with the medical big data of a Chinese two grade hospital during 2013–2015. Experimental results show that the accuracy of MD-RCNN algorithm can reaches 96% and outperforms several state-of-the-art methods.

Introduction

According to McKinsey Report [1], half of Americans suffer from chronic diseases. Furthermore, America spends 27 thousand billion dollars on chronic diseases every year. That accounts for 18% of the annual GDP of America [2]. This situation is also common in other countries. For example, in china, the rate of death rises continuously, accounting for 86.6% of total number of dead people [3]. Therefore, it is crucial to predict the risk of chronic diseases [4].

With the development of deep learning [5] and wearable computing [6], [7], more and more researches predict disease risks from the perspective of big data analysis [8], [9]. Especially, electronic medical record (EHR) [10] becomes more and more convenient. The statistical information [11], detection result and medical history of patients are all recorded in EHR [12]. With some advanced information fusion mechanisms based on distributed systems like [13], data collected from different types of medical sensors and wearables can be used to enhance the information accuracy [14]. Therefore, the potential opportunity centering on data is offered for us to research medical treatment cases and decrease expenditure. For the risk prediction of chronic diseases, many researchers improve the accuracy of risk classification by automatically extracting features from a large amount of structured data [15], rather than the previously selected characteristics [16], [17]. Such as, Yang et al. [18] introduced a multi-label learning algorithm based on convolutional neural network (CNN). Under this algorithm, features can be extracted automatically from EHR for diagnosing diseases.

Furthermore, some research tried to use unstructured text data to predict the risk of chronic diseases [19], [20]. Weng et al. [21] used machine learning technology based on natural language processingto extract features for classification of clinical diagnosis. Jonnagaddala et al. [22] used unstructured electronic medical treatment record to operate text mining for predicting the risk of coronary artery disease. Framingham risk scoring was used to evaluate and calculate the risk of coronary heart disease in ten years, and imputation strategy was used to compensate the lost information. Khalifa et al. [23] suggested using clinical record to evaluate cardiovascular danger factors. However, most existing disease risk assessment schemes only considered structured data or unstructured data [24]. Few works considered multimodal disease risk assessment. For example, Chen et al. [25] propose multimodal disease risk prediction algorithm based on CNN (CNN-MDRP). The result indicated that the combination of structured data and unstructured data could improve the accuracy of model comparing with the only use of text data.

However, existing methods still has the following two problems: (i) As for extracting the features of medical text data, the existing most schemes do not consider context information of text. Therefore, the fine-grained feature of text cannot be extracted [26]. (ii) As for fusing the features of multimodal data, most existing fusion schemes do not consider the highly non-linear relationships among different modes of data [27]. Therefore, the degree of fusion is low.

In order to solve these challenges, we process the structured data and unstructured data of cerebral infarction patients to predict the patients’ disease risk. In details, we propose MD-RCNN for disease risk prediction. We first use RCNN to process unstructured text data to get the textual features of patients’ disease risk. At the same time, referring to the data of patients’ physical examination, the structured features of patients can be got, such as patients’ stature, weight, age, gender, disease and blood type. Then, the deep belief network (DBN) is used to combine structured features and unstructured features deeply. Finally, patients’ disease risk can be predicted. The main contributions of the paper are summarized as follows.

•
We propose a MD-RCNN algorithm based on the structured and unstructured data. The RCNN is used to effectively extract fine-grained features of unstructured textual data.
•
In order to deal with the highly nonlinear relationship between multimodal data and better fuse the features of structured data and unstructured text data, we propose a feature fusion scheme based on DBN.
•
We experiment with the medical big data of a Chinese grade-A hospital of second class. Experimental results show that the proposed MD-RCNN algorithm is better than other prediction algorithms. The accuracy of MD-RCNN can reach 96%.

The rest of the paper is organized as follow. The framework of disease risk prediction is given in Section 2. The MD-RCNN for disease risk prediction model is describe in Section 3. Our result and discussions are given in Section 4. Finally, Section 5 concludes this paper.

Section snippets

Disease risk prediction framework

In this section, we give the framework of disease risk prediction, as shown in Fig. 1. The framework contains the following two aspects: presentation of patients’ unstructured textual data and MD-RCNN algorithm. In details, (i) for the structured data which include patients’ physical examination data and assay data, relevant data is extracted from data set according to the discussion with doctors. Then supplement missing data and make correlation analysis to look for the relation among data.

Multimodal data-based RCNN for disease risk prediction

In this section, we propose the MD-RCNN algorithm for disease risk prediction. Specially, we first introduce RCNN algorithm for processing unstructured medical textual data. Then, we give the detailed procedures of MD-RCNN algorithm.

Experimental results

In this section, we give experimental results. We run the MD-RCNN in data center containing 84core CPU and 336G RAM. We give the experiment analysis from datasets, evaluation methods and experimental results.

Conclusions

In this paper, we predict risk of disease aiming at cerebral infarction and propose MD-RCNN model using collected patients’ structured data and unstructured data. This algorithm can not only use RCNN to extract the features of unstructured textual data in fine-grained but also use DBN to fuse structured features and unstructured features. The non-linear relationship between the two kinds of data can be well reflected. Furthermore, we experiment with the healthcare big data of a Chinese grade-A

Acknowledgment

The authors extend their appreciation to the Deanship of Scientific Research at King Saud University, Riyadh, Saudi Arabia for funding this work through the research group project no. RGP-229.

Yixue Hao received the B.E. degree in Henan University, China, and his Ph.D. degree in computer science from Huazhong University of Science and Technology (HUST), China, 2017. He is currently working as a post-doctoral scholar in School of Computer Science and Technology at Huazhong University of Science and Technology. His research includes 5G network, internet of things, edge computing and healthcare.

References (36)

ZhangY. et al.
PEA: Parallel electrocardiogram-based authentication for smart healthcare systems
J. Netw. Comput. Appl.
(2018)
LinK. et al.
Emotion-aware system design for the battlefield environment
Inf. Fusion
(2019)
ChenM. et al.
Edge cognitive computing based smart healthcare system
Future Gener. Comput. Syst.
(2018)
ZhangY. et al.
SOVCAN: Safety-oriented vehicular controller area network
IEEE Commun.
(2017)
JonnagaddalaJ. et al.
Coronary artery disease risk assessment from unstructured electronic health records using text mining
J. Biomed. Inform.
(2015)
QianY. et al.
EARS: Emotion-aware recommender system based on hybrid information fusion
Inf. Fusion
(2019)
GrovesP. et al.
The ‘Big Data’revolution in Healthcare: Accelerating Value and Innovation
(2016)
ChenM. et al.
Deep features learning for medical image analysis with convolutional autoencoder neural network
IEEE Trans. Big Data
(2017)
HeK. et al.
DeyPoS: Deduplicatable dynamic proof of storage for multi-user environments
IEEE Trans. Comput.
(2016)
XiaM. et al.
A two-stage approach for remaining useful life prediction of bearings using deep neural networks
IEEE Trans. Ind. Inf.
(2018)

ChenM. et al.

Wearable 2.0: Enable human-cloud integration in next generation healthcare system

IEEE Commun.

(2017)

WanJ. et al.

Cloud-enabled wireless body area networks for pervasive healthcare

IEEE Netw.

(2013)

JiangH. et al.

Smart home based on wifi sensing: A survey

IEEE Access

(2018)

JensenP.B. et al.

Mining electronic health records: towards better research applications and clinical care

Nature Rev. Genet.

(2012)

ZhouY. et al.

Statistical study of view preferences for online videos with cross-platform information

IEEE Trans. Multimed.

(2018)

TianD. et al.

An adaptive fusion strategy for distributed information estimation over cooperative multi-agent networks

IEEE Trans. Inform. Theory

(2017)

ChenM. et al.

Task offloading for mobile edge computing in software defined ultra-dense network

IEEE J. Sel. Areas Commun.

(2018)

ChenJ. et al.

Dominating set and network coding-based routing in wireless mesh networks

IEEE Trans. Parallel Distrib. Syst.

(2015)

Cited by (58)

Multimodal fine-tuning of clinical language models for predicting COVID-19 outcomes
2023, Artificial Intelligence in Medicine
Clinical prediction models tend only to incorporate structured healthcare data, ignoring information recorded in other data modalities, including free-text clinical notes. Here, we demonstrate how multimodal models that effectively leverage both structured and unstructured data can be developed for predicting COVID-19 outcomes. The models are trained end-to-end using a technique we refer to as multimodal fine-tuning, whereby a pre-trained language model is updated based on both structured and unstructured data. The multimodal models are trained and evaluated using a multicenter cohort of COVID-19 patients encompassing all encounters at the emergency department of six hospitals. Experimental results show that multimodal models, leveraging the notion of multimodal fine-tuning and trained to predict (i) 30-day mortality, (ii) safe discharge and (iii) readmission, outperform unimodal models trained using only structured or unstructured healthcare data on all three outcomes. Sensitivity analyses are performed to better understand how well the multimodal models perform on different patient groups, while an ablation study is conducted to investigate the impact of different types of clinical notes on model performance. We argue that multimodal models that make effective use of routinely collected healthcare data to predict COVID-19 outcomes may facilitate patient management and contribute to the effective use of limited healthcare resources.
Risk factor refinement and ensemble deep learning methods on prediction of heart failure using real healthcare records
2023, Information Sciences
The prediction of heart failure (HF) is crucial in preventing disease progression by implementing lifestyle changes and pharmacological interventions before the onset of heart diseases. While there have been numerous attempts to predict HF, many have failed to consider the coexisting risk factors and their complex relationships with one another. In this research paper, we present an early warning and prediction method for HF using deep learning approaches. Our proposed method involves a risk factor selection method to identify significant risk factors that contain relevant and valuable information for HF prediction. Additionally, we present an anomaly detection method to eliminate abnormal data that may be caused by mood changes or environmental factors. Finally, we propose an ensemble deep learning model for HF prediction based on scalable conjugate-gradient concept and back propagation learning algorithm that aims to predict and provide early warning of HF in massive medical data. We evaluate our proposed method based on our real research project, HeartCarer, and achieve an accuracy of 98.5%, which surpasses other state-of-the-art methods and our prior work (90%).
Heart disease prediction using hybrid optimization enabled deep learning network with spark architecture
2023, Biomedical Signal Processing and Control
Analyzing massive amounts of data that contain many sorts of data is known as big data analytics. Additionally, the bulk of applications in the actual world need a significant amount of information. Machine learning techniques are used to automatically identify the types and severity of cardiac disease due to the rapid increase of biomedical and healthcare information. However, the ML approach has a number of drawbacks, and because of the complexity of the material, it did not always produce the best results. As a result, an improved deep learning strategy offers a superior fix for this problem. In this study, a brand-new method for diagnosing heart disease—the Grey Wolf Horse Herd optimization-based Shepard Convolutional Neural Network is developed. Here, the master node and slave node-based Spark architecture is used to carry out the heart disease detection process. Preprocessing and feature fusion are carried out in the slave node, whilst heart disease detection is done in the master node. Z-score normalization and missing value imputation are used in this case for pre-processing. The feature fusion is then carried out utilizing Hellinger distance and Deep Q Network (DQN). Furthermore, the ShCNN, which was trained using the created GWHHO algorithm, is used to identify heart disease. The Grey Wolf Optimizer (GWO) and Horse Herd Optimization (HHO) algorithms are also incorporated into the newly developed GWHHO algorithm. Additionally, employing the VA Long Beach dataset, the experimental of the developed model yields improved results in terms of testing accuracy, sensitivity, and specificity of 0.9325, 0.9472, and 0.9142.
An efficient IoT-Artificial intelligence-based disease prediction using lightweight CNN in healthcare system
2023, Measurement: Sensors
In recent years, chronic disease (heart or kidney disease) is a leading cause of death worldwide. Chronic disease prediction is a very complicated task. Doctors who are experienced and familiar with this disease can only be able to predict chronic disease. In a healthcare system, the Internet of Things (IoT) is the key technology. In this paper, an EO optimized Lightweight Automatic modulation classification Network named the EO-LWAMCNet model is proposed to accurately predict a patient's chronic health condition (kidney or heart disease). A sensor implanted in the patient's body can able to collect every data and it uses a gateway to transmit the data toward the cloud. Based on the achieved sensor data, the EO-LWAMCNet model initiates the classification process to predict chronic disease. The model undergoes a testing and training stage. The disease is predicted using CKD and HD datasets. Here, the preprocessed data is used for classification in the training stage. Once the training process is completed, the Cloud server's (CS) sensor data is tested and categorized into abnormal (heart or kidney disease) and normal. The awareness message is sent to the doctor to treat a patient in case of an abnormal result. The performance of the model is evaluated using accuracy, MCC, F1-score, and miss rate. This model can accurately predict the presence or absence of heart or kidney disease with an accuracy of 93.5% using the CKD dataset and an accuracy of 94% using the HD dataset. Also, the miss rate of the model is less in classification.
Big Data in Forecasting Research: A Literature Review
2022, Big Data Research
Citation Excerpt :
According to Fig. 5, the big data in forecasting research can be generally categorized into three major types by source: UGC data (generated by the users on social media or other web platforms), including online textual data, online photo data, etc.; device data (monitored by devices), including meteorological data, smart meter data, traffic flow data, etc.; log data (recording activities or operations), including bio-medical data, web search data, stock exchange data, online marketing data, etc. Different types of big data have provided different new, rich knowledge to prediction: for example, users' emotions, opinions and attention toward prediction target-related events or issues from UGC data [16,57,58]; the sensor-level, real-time dynamics of weather environment (e.g., real-time temperature and wind speed) [25], electricity consumption [39] and traffic behaviors [59] from device data; medical insights (e.g., regarding pathological features and gene expressions) [60,61], public attention and preferences (in prediction-related activities) [62], market dynamics [63] and marketing behaviors and preferences [64] from log data. This informative knowledge facilitated various challenging forecasting tasks: in the domain of society, the forecasting hotspots were the dynamics of human behaviors [13], market factors [19], social events [65] and transportation [66]; in the nature, big data primarily served the prediction for weather factors [23], environmental factors [25], engineering issues [28] and material properties [29]; in the biology, popular research fields were biomedicine [67], biotechnology [35] and animal and plant science [37].
With the boom in Internet techniques and computer science, a variety of big data have been introduced into forecasting research, bringing new knowledge and improving prediction models. This paper is the first attempt to conduct a literature review on full-scale big data in forecasting research. By source, big data in forecasting research fell into user-generated content data (from the users on social media in texts, photos, etc.), device-monitored data (by meteorological monitors, smart meters, GPS, etc.) and activity log data (for web searching/visiting, online/offline marketing, clinical treatments, laboratory experiments, etc.). Different data types, bearing distinctive information and characteristics, dominated different forecasting tasks, required different analysis technologies and improved different forecasting models. This survey provides an overall review of big data-based forecasting research, details what (regarding data types and sources), where (forecasting hotspots) and how (analysis and forecasting methods used) big data improved prediction, and offers insights into future prospects.
Cardiac murmur grading and risk analysis of cardiac diseases based on adaptable heterogeneous-modality multi-task learning
2024, Health Information Science and Systems

View all citing articles on Scopus

Mohd Usama is a Ph.D. candidate in Embedded and Pervasive Computing Lab of Huazhong University of Science and Technology (HUST), China. His research includes Pervasive Computing, The Internet of things, Edge Computing, healthcare. etc.

Jun Yang received Bachelor and Master degree in Software Engineering from HUST, China in 2008 and 2011, respectively. Currently, he does Postdoc research at Embedded and Pervasive Computing (EPIC) Lab in School of Computer Science and Technology, HUST. His research interests include cognitive computing, software intelligence, Internet of Things, cloud computing and big data analytics, etc.

M. Shamim Hossain is a Professor at the Department of Software Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia. Prof. Shamim is also an Adjunct Professor, School of Electrical Engineering and Computer Science (EECS), University of Ottawa, Canada. Prof. Shamim received his Ph.D. in Electrical and Computer Engineering from the University of Ottawa, Canada. His research interests include Cloud networking, smart environment (smart city, smart health), social media, IoT, edge computing and multimedia for healthcare, deep learning approach for multimedia processing, and multimedia big data. He has authored and coauthored approximately 190 publications including refereed IEEE/ACM/Springer/Elsevier journals, conference papers, books, and book chapters. Recently, his publication has been recognized as the ESI Highly Cited Paper. He has served as a member of the organizing and technical committees of several international conferences and workshops. He has served as co-chair, general chair, workshop chair, publication chair, and TPC for over 12 IEEE and ACM conferences and workshops. Currently, he is the co-chair of the 1st IEEE ICME workshop on Multimedia Services and Tools for smart-health (MUST-SH 2018). He is a recipient of a number of awards, including, the Best Conference Paper Award, the 2016 ACM Transactions on Multimedia Computing, Communications and Applications (TOMM) Nicolas D. Georganas Best Paper Award, and the Research in Excellence Award from the College of Computer and Information Sciences (CCIS), King Saud University (3 times in a row). He is on the editorial board of IEEE Network, IEEE Multimedia, IEEE Access, Journal of Network and Computer Applications (Elsevier), Computers and Electrical Engineering (Elsevier), Human-centric Computing and Information Sciences (Springer), Games for Health Journal, and International Journal of Multimedia Tools and Applications (Springer). Currently, he serves as a lead guest editor of Future Generation Computer Systems (Elsevier), IEEE Network Magazine, and IEEE Access. Previously, he served as a guest editor of IEEE Communication Magazine, IEEE Transactions on Information Technology in Biomedicine (currently JBHI), IEEE Transactions on Cloud Computing, International Journal of Multimedia Tools and Applications (Springer), Cluster Computing (Springer), Future Generation Computer Systems (Elsevier), Computers and Electrical Engineering (Elsevier), Sensors (MDPI), and International Journal of Distributed Sensor Networks. Prof. Shamim is a Senior Member of IEEE, a Senior member of ACM and ACM SIGMM.

Ahmed Ghoneim [M’10] received his M.Sc. degree in software modeling from the University of Menoufia, Egypt, and the Ph.D. degree from the University of Magdeburg, Germany in the area of software engineering, in 1999 and 2007, respectively. He is currently an assistant professor in the department of Software Engineering, College of Computer and Information Sciences (CCIS), King Saud University. His research activities address software evolution; service oriented engineering, software development methodologies, quality of services, net-centric computing, and human computer interaction (HCI). He is a member of the IEEE.

View full text

Recurrent convolutional neural network based multimodal disease risk prediction

Highlights

Abstract

Introduction

Section snippets

Disease risk prediction framework

Multimodal data-based RCNN for disease risk prediction

Experimental results

Conclusions

Acknowledgment

J. Netw. Comput. Appl.

Inf. Fusion

Future Gener. Comput. Syst.

IEEE Commun.

J. Biomed. Inform.

Inf. Fusion

The ‘Big Data’revolution in Healthcare: Accelerating Value and Innovation

Deep features learning for medical image analysis with convolutional autoencoder neural network

IEEE Trans. Big Data

DeyPoS: Deduplicatable dynamic proof of storage for multi-user environments

IEEE Trans. Comput.

A two-stage approach for remaining useful life prediction of bearings using deep neural networks

IEEE Trans. Ind. Inf.

Wearable 2.0: Enable human-cloud integration in next generation healthcare system

IEEE Commun.

Cloud-enabled wireless body area networks for pervasive healthcare

IEEE Netw.

Smart home based on wifi sensing: A survey

IEEE Access

Mining electronic health records: towards better research applications and clinical care

Nature Rev. Genet.

Statistical study of view preferences for online videos with cross-platform information

IEEE Trans. Multimed.

An adaptive fusion strategy for distributed information estimation over cooperative multi-agent networks

IEEE Trans. Inform. Theory

Task offloading for mobile edge computing in software defined ultra-dense network

IEEE J. Sel. Areas Commun.

Dominating set and network coding-based routing in wireless mesh networks

IEEE Trans. Parallel Distrib. Syst.