Enhancing self-management in type 1 diabetes with wearables and deep learning

Zhu, Taiyu; Uduku, Chukwuma; Li, Kezhi; Herrero, Pau; Oliver, Nick; Georgiou, Pantelis

doi:10.1038/s41746-022-00626-5

Download PDF

Article
Open access
Published: 27 June 2022

Enhancing self-management in type 1 diabetes with wearables and deep learning

npj Digital Medicine volume 5, Article number: 78 (2022) Cite this article

8185 Accesses
24 Citations
15 Altmetric
Metrics details

Subjects

Abstract

People living with type 1 diabetes (T1D) require lifelong self-management to maintain glucose levels in a safe range. Failure to do so can lead to adverse glycemic events with short and long-term complications. Continuous glucose monitoring (CGM) is widely used in T1D self-management for real-time glucose measurements, while smartphone apps are adopted as basic electronic diaries, data visualization tools, and simple decision support tools for insulin dosing. Applying a mixed effects logistic regression analysis to the outcomes of a six-week longitudinal study in 12 T1D adults using CGM and a clinically validated wearable sensor wristband (NCT ID: NCT03643692), we identified several significant associations between physiological measurements and hypo- and hyperglycemic events measured an hour later. We proceeded to develop a new smartphone-based platform, ARISES (Adaptive, Real-time, and Intelligent System to Enhance Self-care), with an embedded deep learning algorithm utilizing multi-modal data from CGM, daily entries of meal and bolus insulin, and the sensor wristband to predict glucose levels and hypo- and hyperglycemia. For a 60-minute prediction horizon, the proposed algorithm achieved the average root mean square error (RMSE) of 35.28 ± 5.77 mg/dL with the Matthews correlation coefficients for detecting hypoglycemia and hyperglycemia of 0.56 ± 0.07 and 0.70 ± 0.05, respectively. The use of wristband data significantly reduced the RMSE by 2.25 mg/dL (p < 0.01). The well-trained model is implemented on the ARISES app to provide real-time decision support. These results indicate that the ARISES has great potential to mitigate the risk of severe complications and enhance self-management for people with T1D.

Engineering digital biomarkers of interstitial glucose from noninvasive smartwatches

Article Open access 02 June 2021

Digital health application integrating wearable data and behavioral patterns improves metabolic health

Article Open access 25 November 2023

Predicting changes in glycemic control among adults with prediabetes from activity patterns collected by wearable devices

Article Open access 21 December 2021

Introduction

Diabetes is a group of chronic metabolic disorders that affect almost half a billion people worldwide¹, and around 10% of them have type 1 diabetes (T1D)². Due to an absolute deficiency of endogenous insulin caused by pancreatic β-cell loss, the management of T1D relies on exogenous insulin delivery and adherence to a group of self-care behaviors, such as estimating dietary carbohydrate and exercise, and titrating insulin therapy. The primary objective of T1D self-management is to prevent immediate adverse glycemic events, including hypoglycemia and hyperglycemia, and minimize the risk of long-term diabetes complications. Severe hypoglycemia may cause seizures, brain damage, and intellectual impairment³, while hyperglycemia is a risk factor for cardiovascular diseases, neuropathy, nephropathy and retinopathy ⁴.

The development of continuous glucose monitoring (CGM) has led to therapeutic benefits in diabetes management^5,6. The usage of real-time CGM systems has been demonstrated to reduce the number of severe hypoglycemic events for T1D subjects with multiple daily injection (MDI)⁷. As a wearable device that automatically measures glucose levels with a fixed frequency (e.g. five minutes), CGM can be combined with an insulin pump as sensor-augmented therapy or an artificial pancreas for closed-loop glycemic control^8,9. Smartphone apps to log daily events^10,11 and calculate bolus insulin are increasingly being adopted to successfully reduce the daily burden associated with T1D self-management. Other wearables, such as wristbands, have been used in recent literature to estimate physical activity for T1D subjects^12,13. Nonetheless, the clinical efficacy of apps and sensor wristbands remains unproven¹⁴, and there is a lack of an integrated platform that synchronizes the real-time physiological measurements of sensor wristbands and other wearable devices to improve decision support^14,15.

Despite CGM enabling correction of glucose concentrations outside of the target range ([70, 180] mg/dL), self-management can be challenging for people with T1D due to the variable pharmacokinetics and pharmacodynamics of insulin¹⁶ and the multiple endogenous and exogenous influences on glucose. Combined with CGM systems, a predictive low-glucose suspend feature, commonly found in continuous subcutaneous insulin infusion (CSII) systems, has been shown to significantly reduce exposure to hypoglycemia¹⁷. Accurate glucose prediction is, therefore, a useful tool to enable proactive interventions and timely medication administration to enhance T1D self-management. However, the performance of physiological and rule-based prediction models are still limited by the influence of various external factors and high inter and intra-subject variability on glucose dynamics¹⁸.

The widespread use of wearable devices and smartphone apps yields a substantial amount of granular data and has boosted machine learning-based algorithms in the literature¹⁹. Previous work has explored several classic machine learning approaches for the prediction of glucose levels or glycemic events^{20,21,22,23,24} using prediction horizons between 15 and 60 min. In a recent study, non-invasive wearable measurements combined with food logs were employed as digital biomarkers to estimate interstitial glucose using a machine learning method²⁵. As indicated by a recent review²⁶, deep learning technologies have attracted increasing attention in the field of diabetes, such as diabetic retinopathy^27,28, neuropathy²⁹, and glycemic control^30,31. Empowered by various deep neural networks, deep learning has also achieved the state of the art in glucose prediction^{26,32,33,34,35,36,37,38} and has been applied to detect hypoglycemia using non-invasive vital signs, e.g., electrocardiograms (ECG)³⁹.

In this work, we introduce ARISES (Adaptive, Real-time, and Intelligent System to Enhance Self-care), a smartphone-based platform, to facilitate decision support and enhance self-management for people with T1D. It is based on an innovative mobile app with an embedded deep learning model for real-time glucose prediction and hypo- and hyperglycaemia warnings, which integrates data from CGM (Dexcom G6, Dexcom Inc., San Diego, CA, US) and a clinically validated physiological data acquisition sensor (Empatica E4 wristband, Boston, MA, US). In particular, we develop the prediction algorithm with an architecture of the recurrent neural network (RNN), leveraging a number of recent advances in deep learning, including attention mechanisms⁴⁰, evidential regression⁴¹, and model-agnostic meta-learning (MAML)⁴². Fig. 1 shows the overall system architecture. The app interacts with the wearable devices via Bluetooth connections and has a new graphical user interface (Supplementary Fig. 1), which is specifically designed according to the feedback of T1D users, aiming to reduce cognitive burden and facilitate the visualization of information. The app allows users to record various daily activities, including meal composition, insulin injection, exercise, and health conditions, view the glucose trajectories, historical daily logs and predictions with the metaphor underlying the bifocal display⁴³, and receive warnings of potential adverse glycemic events.

**Fig. 1: System architecture and clinical scheme of ARISES.**

Results

Participant characteristics

Table 1 presents the demographic and clinical characteristics of the 12 T1D participants in the phase I prospective study. We collected a median (IQR) of 1113.5 (1059.0–1184.0) and 832.5 (733.0–953.0) hours of glucose data and sensor wristband data, respectively, and received a total of 5767 daily entries with a median (IQR) of 396 (237–732.3) interactions (Supplementary Table 1 and 2), including carbohydrates, protein, fat, insulin bolus, exercise, alcohol, stress, and illness, where the carbohydrate entries account for the largest portion.

Table 1 Demographic characteristics and clinical characteristics of the 12 T1D participants in the phase I clinical study.

Full size table

Independent predictors using non-invasive physiological data

The association of the non-invasive physiological measurements with adverse glycemic events over a 60-minute prediction horizon by mixed effects logistic regression is shown in Fig. 2. Hypoglycemia is negatively associated with a larger range of inter-beat intervals (IBIs) (odds ratio (OR): 0.72, 95% confidence interval (CI): 0.57–0.91; p < 0.01), while higher mean IBIs and mean skin temperature increases the OR of hypoglycemia (OR: 1.23, 95% CI: 1.17–1.30; p < 0.01; and OR: 1.18, 95% CI: 1.07–1.29; p < 0.01, respectively). Similarly, we observe that, besides the IBIs and skin temperature, variables derived from electrodermal activity (EDA) and acceleration are also significant predictive factors for hyperglycemia prediction. Considering all the physiological signals are significantly associated with adverse glycemic events, we, therefore, combined these non-invasive measurements with CGM and daily entries to extract a total of 20 real-time features (Supplementary Table 3), which were used in feature selection for the deep learning-based prediction model.

**Fig. 2: Forest plots of mixed effects logistic regression showing the association between non-invasive physiological measurements and adverse glycemic events.**

Glucose level prediction

Table 2 presents the performance of the personalized ARISES model with 15, 30, 45, 60-minute prediction horizons. The proposed model outperformed all the considered baseline methods in terms of root mean square error (RMSE), glucose-specific RMSE (gRMSE), mean absolute error (MAE), mean absolute percent error (MAPE), and the time lag. The results of the baseline methods are presented in Supplementary Table 2. The ARISES obtained significant improvement in RMSE, gRMSE, MAE, and MAPE, when compared with the best performance of the baseline methods (convolutional recurrent neural networks (CRNNs)³⁴; p < 0.01). When only one day of data was used for fine-tuning, the MAML approach obtained the average RMSE of 39.37 ± 7.14 for the 60-minute prediction horizon, which is much smaller than the RMSE obtained by a baseline method of transfer learning³⁶ (RMSE: 43.07 ± 8.41; p < 0.05).

Table 2 Results of glucose level prediction (Mean ± SD) evaluated on 12 clinical T1D subjects.

Full size table

In addition, Fig. 3 shows the results of ablation analysis, where we removed certain components from the model and evaluated their impact on the prediction performance. In particular, the use of MAML and wristband input respectively reduced the average RMSE by 1.41 and 2.25 mg/dL (p < 0.05) for the 60-minute prediction horizon.

**Fig. 3: Ablation analysis on the prediction performance of glucose levels.**

Hypoglycemia and hyperglycemia prediction

Tables 3 and 4 respectively show the results of hypoglycemia and hyperglycemia prediction using the lower and upper bounds derived from evidential deep learning. We observe that the proposed ARISES model achieved the accuracy of 88.58% with the sensitivity of 70.30% and the accuracy of 87.20% with the sensitivity of 86.62% for hypoglycemia and hyperglycemia prediction over the 60-minute prediction horizon, respectively. For the considered baseline methods (Supplementary Tables 5 and 6), we used the predicted glucose levels, i.e., single trajectories, to detect adverse glycemic events. Among these, the autoregressive moving average (ARMA)⁴⁴ and the physiologically-based kinetic model (PKM)⁴⁵ achieved the best baseline results for hypoglycemia and hyperglycemia prediction, respectively, which are reported in Tables 3 and 4 for comparison. It is worth noting that, compared with the ARMA, the ARISES model significantly increased sensitivity by 13.35% and reduced the mean deviation (MD) by 13.29 mg/dL for hypoglycemia prediction. Compared with the PKM for hyperglycemia prediction, the ARISES model significantly increased specificity and precision by 5.38% and 5.43%, respectively, while reducing the MD by 13.80 mg/dL. As shown in Fig. 4, we observe that the use of lower bounds and wristband input data enhanced the average Matthews correlation coefficient (MCC) scores by 0.34 (p < 0.01) for hypoglycemia prediction with the 60-minute prediction horizon.

Table 3 Results of hypoglycemia prediction (Mean ± SD) evaluated on 12 clinical T1D subject.

Full size table

Table 4 Results of hyperglycemia prediction (Mean ± SD) evaluated on 12 clinical T1D subjects.

Full size table

**Fig. 4: Ablation analysis on the prediction of adverse glycemic events, evaluated on the 12 T1D subjects.**

Discussion

This study proposes a deep learning algorithm embedded in a smartphone-based platform to predict glucose levels and hypo- and hyperglycemia, with the input of CGM, daily entries, and real-time measurements from the physiological sensor wristband. Notably, the integration of the wristband data has improved the results of both glucose level prediction and hypo- and hyperglycemia detection.

Figure 5 depicts the predicted trajectories and CGM measurements of a participant over a two-day period. We present the 7-day trajectories of four selected participants in Supplementary Fig. 2 We observe that the daily activities, including meal intake and insulin bolus delivery, have a significant impact on the glucose levels. The glycemic homeostasis is affected by these external factors and internal changes in the T1D subject. Thus, the accuracy degrades as the prediction horizon increases in Tables 2, 3 and 4.

**Fig. 5: Two-day period CGM and prediction trajectories of a T1D adult over a 30-minute prediction horizon.**

As a safety-critical system, the reliability of predictions is essential, especially when glucose levels are approaching the threshold of hypoglycemia. In clinical settings, the occurrence of hypoglycemia is more dangerous than that of hyperglycemia, which may lead to life-threatening complications⁴⁶. To this end, we used an evidential deep learning approach⁴¹ to train the models and map model uncertainty. Most previous studies used mean square error as the loss function and computed a single prediction value to indicate whether there will be risk of hypo- and hyperglycemia^{20,21,26,33,35}. However, hypo- and hyperglycemic events may fail to be detected when the confidence of a prediction is low. In the experiments, we noted that hypoglycemic events with short duration were likely to be missed when single trajectory values are used in detection (Fig. 5). Therefore, we use lower and upper bounds derived to determine adverse glycemic events and assist decision support in T1D self-management with the ARISES app (Supplementary Fig. 1). Displaying these informative bounds on the app is a preferable feature according to the requirements of the phase I participants. As highlighted by the eclipses in Fig. 5, the use of lower bounds successfully identified two hypoglycemic events that are likely to be missed using single prediction values.

The proposed ARISES model has achieved superior performance and outperformed six considered baseline methods (Supplementary Tables 2, 5 and 6). It is observed that the machine learning and deep learning baseline models obtained better RMSE performance for glucose level prediction, but smaller MCC scores for hypo- and hyperglycemia prediction, when compared with the physiological and statistical baseline methods. One possible explanation is that the machine learning and deep learning baseline models were optimized in a supervised learning process with the targets of actual CGM measurements, but the prediction of adverse glycemic events was not considered. In this regard, the introduced lower and upper bounds in the ARISES model enabled a good balance between glucose level prediction and hypo- and hyperglycemia detection.

We compared the MCC scores using these bounds against the results of single curve prediction in Fig. 4, where the classification based on the bounds exhibited better performance. We noticed that hypoglycemia is a minority class in the dataset, which accounts for 2.91 ± 1.93% of total glucose measurements (Table 1). In general, the classifier is less sensitive to detecting a minority class. Nevertheless, in this work, the sensitivity can be further enhanced by reducing the thresholds of lower bounds at a cost of potential alarm fatigue. This trade-off can be decided by clinicians on an individual case basis.

We used the MAML approach to train population models and personalized models, which outperformed the transfer learning approach with a small amount of available data. This fast adaption feature of the MAML approach can mitigate the cold-start issues when we provide the software to new T1D users with limited personal data. It is a common scenario in actual clinical settings since data collection is expensive and time-consuming. Moreover, the MAML also improved the final average RMSE results in the ablation analysis (Fig. 3).

The chronological partition of training, validation and testing set in this work was carefully selected. Random cross-validation can be found in previous work, which trained and validated machine learning models on the same dataset^24,39. However, during the experiments, we noticed that there were temporal dependencies between the data points from nearby locations, especially in adjacent ones. The features were derived with the small resolution of CGM, so the difference between consecutive time steps is sometimes negligible. In this regard, the use of random or stratified splitting methods would introduce underlying temporal correlation into training and testing sets, which could result in serious overestimation of model accuracy⁴⁷.

The ARISES app (Supplementary Fig. 1) is based on the iOS operating system and integrates with Dexcom CGM (G5 or G6) and Empatica E4 wristband. The source code of the app is not publicly available. We analyzed the performance of the app on an iPhone XS Max over 50 runs. The whole app has an initial storage size of 39.9 MB and consumed an average of 50.5 MB and 39.3 MB memory while running in foreground and background, respectively. The trained deep learning models were converted to mobile compatible format via TensorFlow Lite, which has a storage size of 1.2 MB. When the app received a new CGM measurement, it took 5.7 ms and 1.8 MB memory to compute real-time glucose prediction through model inference on the edge, which require one-hour historical data of CGM measurements, sensor wristband measurements, and daily entries (if any). Model fine-tuning is performed by Amazon S3 buckets and SageMaker in the Amazon Web Services (AWS) cloud (Fig. 1) and requires at least one-week historical data.

Our results suggest that measurements obtained from wearable physiological wristband data sensors could be integrated alongside CGM data to improve the prediction of glucose levels and adverse glycemic events. Interestingly, the IBI measured by the sensor wristband is the only predictor that has significant effect on both hypoglycemia and hyperglycemia prediction (Fig. 2), which was also selected as the input of the deep learning model with the best validation performance. It indicates IBI or other heart rate variability (HRV) could be useful biomarkers in T1D decision support, which accords with the findings of previous studies^48,49,50. However, the sensors in Empatica E4 are quite sensitive to motion artifacts, so it is difficult to obtain accurate measurements with too many hand movements⁵¹. In future work, an algorithm to detect exercise and reduce measurement error for the wristband will be developed. Meanwhile, data extracted from manually recorded daily events have the potential to be used for the analysis of the drivers and patterns of the changes in plasma or interstitial glucose concentrations. During feature pre-processing, we calculated insulin on board and the carbohydrate on board with fixed duration (i.e., time of decay) and constant absorption rate of carbohydrates, respectively. Nonlinear insulin on board and carbohydrate on board based on physiological models with personalized parameters., such as the variable appearance rate of glucose and plasma insulin estimation⁵², will be considered in the future, aiming to improve quality of input features and further enhance prediction accuracy. We collected the dietary data from the T1D participants under free-living conditions, so the dietary reporting is variable in quality but reflects the real-world use of carbohydrate counting and self-management. Although we manually checked the carbohydrate amount for each meal record to confirmed that there are no unrealistic values, such as negative or larger than 500 g, it would be interesting to investigate how the accuracy of dietary reporting affects prediction performance, which could be done by analyzing the results obtained from datasets collected in inpatient trials with standardized meals. It is noted that the percentages of time spent below range (Table 1) are small, and there is a modest carbohydrate intake of 160 (102–220) grams per day (Supplementary Table 2). Although these values are not unusual for people living with T1D, especially for those who use CGM to visualize post-prandial glucose peaks, it is a potential limitation in the development of the algorithm to predict hypoglycemic events. Future work will include validating the proposed system on a T1D cohort with greater variance in carbohydrate intake and glycemic variability. Currently, there are no publicly available T1D datasets containing all the data fields that we need in the ARISES model, but it is important to further test the generalization of the proposed algorithm using an independent validation dataset with a larger cohort size. In this case, we also recommend to analyze covariates in the T1D population, such as age and insulin delivery mode. In addition, there is a lack of system testing of the whole ARISES in real-world settings. It might be challenging to simultaneously administer the multiple wearable devices, smartphone app, and cloud services with reliable wireless connectivity. A deep learning model with only CGM input and daily entries needs to be implemented as a sub-optimal solution when the wristband data is not available, e.g., when the wristband is taken off for battery charging.

Methods

Phase I prospective study

This was a six-week longitudinal prospective study (NCT ID: NCT03643692) using a clinically validated real-time physiological data acquisition sensor (Empatica E4) and CGM (Dexcom G6) to identify correlations between measurable physiological parameters and glycemia. Under free-living conditions, twelve adults (18 years old and older) with a median age (IQR) of 40 years (30–50) were equally stratified by gender and mode of insulin delivery (MDI and CSII). Participants were recruited from the Imperial College Healthcare NHS trust T1D outpatient clinics, registered research databases, and interested participants who contact us. Throughout the duration of the study, participants wore the Empatica E4 and Dexcom G6 devices with alarm thresholds of glucose levels set at <4 mmol/L and >11 mmol/L. Participants were asked to log daily events such as, insulin doses in units, meal macronutrient composition in grams, alcohol intake in units, stress, illness, and exercise in the mySugr smartphone app, which are used to develop the input features of glucose prediction models. The study was conducted under a trial protocol (18/LO/1096) approved by London - Fulham Research Ethics Committee, and each participant was informed and signed consent.

Analysis of sensor wristband data

Different from most of the previous studies using CGM and daily manual logs^{20,21,22,23,24,26}, an objective of this work is to better understand the effect of the non-invasive physiological data on the prediction of glycemic events. Using the package lme4 in R, a mixed effects logistic regression was applied calculate the logarithm of ORs to interpret the relationship between physiological measurements and the binary outcome of adverse glycemic events (i.e., hypoglycemia (<70 mg/dL) or hyperglycemia (> 180 mg/dL) in Fig. 2⁵³. The measured physiological variables applied to the regression analysis include the mean values, standard deviation, range, and maximum and minimum differential values of EDA, IBI, acceleration, and skin temperature signals.

Multi-modal feature engineering

As a clinically validated, commercially available, and non-invasive device, the Empatica E4 wristband uses a photoplethysmography sensor to measure blood volume pulse (BVP), two electrodes to obtain EDA, a pair of accelerometers and a gyroscope to detect the level of physical activity, and a peripheral temperature sensor to monitor skin temperature. In previous clinical studies, BVP and ECG signals are the primary sources to identify IBIs for HRV analysis⁵⁴. In particular, we applied a band-pass Butterworth filter to remove noise in BVP signals and employed a slope sum function⁵⁵ to detect the local maxima. Then we used a sliding window with decision rules⁵⁵ to search peaks, as the systoles in cardiac cycles. The IBIs were computed by the difference of consecutive peaks.

We extracted short-term HRV features with a 5-minute window to indicate early HRV changes⁴⁸ in temporal and frequency domains⁵⁶. To obtain skin conductance levels (SCLs) and skin conductance responses (SCRs), we continuously decomposed EDA data into tonic and phasic components via a high-pass filter⁵⁷. There are two open-source software tools involved in EDA and BVP processing^58,59. Together with physical activity levels and skin temperature, the outcomes of these features in the past five minutes were averaged and aligned with the time steps of CGM readings. HRV is an established indicator that reflects cardiac autonomic activities, while EDA is related to the status of the nervous system. These biomarkers have been used in previous studies to predict and detect hypoglycemia for T1D^48,50,60.

The daily entries were converted to insulin on board and carbohydrate on board via physiological models. We assumed insulin bolus lasts for four hours in the human body with different slopes, as a common setting used by many commercialized pumps^24,61. Similarly, the carbohydrate was assumed to be absorbed at a rate of 2.5 g/min 15 min after the time of meal ingestion²⁴.

Feature pre-processing and selection

We obtained a total of 20 features from the pre-processed multi-modal data (Supplementary Table 3). There are some inevitable errors in the sensor data, e.g., compression artifacts, signal loss, and sensor calibration. To this end, we performed feature selection in the following steps. First, we analyzed the missing fraction of CGM and wristband measurements to identify the quality of features. The median value of the missing percentages of CGM and wristband data are 3.02% and 23.05%, respectively, which are reasonable since the wristband needs to be charged for around 4–5 hours every day. We linearly interpolated the gaps that occurred in the middle of input sequences and extrapolated the gaps at the tail to guarantee that future information is not involved in current predictions. Then, min-max normalization was adopted to scale the selected features to [0, 1]. Finally, we performed collinearity analysis, considering correlated bias is prone to degrade the stability and interpretability of machine learning models⁶². We noted that features derived from the same measurement exhibited strong a correlation with each other. Hence, each time we retained one feature in IBI or EDA feature group (Supplementary Table 3) and selected the best combination according to the error scores that summed up RMSE results for the four prediction horizons in model validation.

Model training, validation, and testing

Considering the personalized models are provided to the T1D subjects at a midterm clinical visit (Fig. 1), we divided the data of each subject into a training set and a testing set that include the first 50% data and the remaining 50% data, respectively. The last 20% data of each training set were used as a hold-out personalized validation set. To simulate a clinical scheme with two phases (Fig. 1), we employed a population set containing the training sets of 11 subjects and a personalized set with the data of the remaining subject, assuming it is a new subject (e.g., a participant in phase II). Data of each subject in the population set were used to optimize the population model. The population models were validated with leave-one-subject-out cross validation. Then we used the training data of the personalized set to fine-tune the population model to develop a personalized model, and used the testing data of the personalized set for evaluation. The Hyperband algorithm⁶³ of Keras tuner was used to select the best hyperparameters of the deep learning models (Supplementary Table 7). Besides, we used early stopping to mitigate overfitting and improve generalization.

Developing population and personalized models

With the population and personalized data sets, we applied a well-established MAML framework to develop population models⁴². Each subject is regarded as a learning task in the inner loop of MAML. Then, the learned parameters for each task guided the population model to achieve meta-optimization via stochastic gradient descent in the outer loop. The first-order approximation was performed to accelerate the training process⁶⁴. In the personalization phase, we fine-tuned the meta-model with a small learning rate⁶⁵.

We performed an experiment to compare the MAML population model with the pre-trained model by classic transfer learning³⁶. For each subject, the data collected on the fist day of the trial, i.e., the first 288 data samples (5-minute CGM resolution) in a personalized training set, were used to fine-tune both models. Then, we evaluated the performance of the fine-tuned models using the testing data of the personalized set.

Attention-based RNN architecture

The recurrent structure is well-suited to learn short and long-term temporal dependencies in sequence processing. Thus, RNN-based models are emerging in the literature of diabetes management and have been shown to exhibit superior performance in glucose prediction^26,33,36,66. However, the vanilla RNNs face the challenge of gradient exploding and vanishing, which largely limits the learning performance on long-term temporal dependencies. Fortunately, long short-term memory (LSTM)⁶⁷ and gated recurrent units (GRUs)⁶⁸ were proposed to solve this problem. The GRU uses reset and update gate functions with less parameters than LSTM (Fig. 6)³⁶.

**Fig. 6: Architecture of the deep learning model.**

After pre-processing the features, we developed an attention-based RNN with GRUs for glucose prediction and hypo- and hyperglycemia detection. The multivariate input data for the RNN model were selected according to validation performance, which include CGM, carbohydrate amount, insulin bolus, time index, IBIs, and SCRs.

At each time step t with a CGM measurement G_t, the target of the algorithm is to predict a glucose level at t + w, where w is calculated as the prediction horizon divided by the CGM resolution. Here, we define the normalization function as m and the prediction targets as y_t = m(G_t+w − G_t), using glucose change to minimize the bias^35,36. The model input consists of a multidimensional vector ${{{{\bf{X}}}}}_{t}={[{{{{\bf{x}}}}}_{t},{{{{\bf{x}}}}}_{t-1},\ldots ,{{{{\bf{x}}}}}_{t+1-l}]}^{{\mathsf{T}}}\in {{\mathbb{R}}}^{l\times c}$, where l is the length of the input sequence; c is the feature dimension.

Leveraging the previous and future information in a sequence, we modeled a bidirectional structure to simultaneously compute two states in forward direction (${\overrightarrow{{\mathop{{\bf{h}}}}}}_{t}$) and backward direction ($\overleftarrow{{\bf{h}}}_t$) and merged them into an output vector ${{{\mathbf{h}}}}_t^b = \left[ {\overleftarrow {{{{\mathbf{h}}}}_t} ;\,\overrightarrow {{{{\mathbf{h}}}}_t} } \right]$^32,69. Fig. 6 correspondingly illustrates the unfolded block diagrams. The output of the bidirectional RNN is sent to another GRU-based RNN layer to obtain high-level hidden representations, of which the cell output is denoted as h_t.

We employed the attention mechanism, as one of the latest advances in deep learning, to extract temporal dependencies regardless of distance. Introducing attention in deep neural networks has shown success in a variety of tasks, especially in natural language processing⁷⁰. Instead of only using the output of the final state, the attention mechanism assigns attention weights to the hidden state h_t on each time step and then combines them to compute the final representation vector. In particular, we modified the general form of Luong’s multiplicative attention⁴⁰ and implemented the many-to-one attention weight a_t at time step t as:

$${a}_{t}=\frac{\exp ({{{{\bf{h}}}}}_{T}^{\top }{{{{\bf{W}}}}}_{a}{{{{\bf{h}}}}}_{t})}{\mathop{\sum}\nolimits_{t^{\prime} }\exp ({{{{\bf{h}}}}}_{T}^{\top }{{{{\bf{W}}}}}_{a}{{{{\bf{h}}}}}_{t^{\prime} })}$$

(1)

where ${{{{\bf{h}}}}}_{T}^{\top }$ is the final cell state. The attention output vector v is computed with hidden weights W_v and $\tanh$ activation, which is defined as follows

$${{{\bf{v}}}}=\tanh ({{{{\bf{W}}}}}_{v}[\mathop{\sum}\limits_{t}{a}_{t}{{{{\bf{h}}}}}_{t};{{{{\bf{h}}}}}_{T}]),$$

(2)

Lower and upper bounds of predictions

To determine the reliability and confidence of the predictions, model uncertainty is estimated by a higher-order evidential distribution⁴¹. Assuming the predictions are drawn from a Gaussian distribution with unknown mean μ and variance σ², i.e., $\mu \sim {{{\mathcal{N}}}}(\gamma ,{\sigma }^{2}/\lambda )$, σ² ~ Γ⁻¹(α, β), we fed the attention output vector into an evidential layer to map the normal-inverse-gamma distribution as (μ, σ²) ~ N − Γ⁻¹(γ, λ, α, β). In this case, the final model output comprises four parameters (γ_t, λ_t, α_t, β_t), which are computed by a dense layer with four neurons, where ${\hat{y}}_{t}={\gamma }_{t}$. The output of the evidential layer (evid) and epistemic uncertainty (i.e., model uncertainty) u_t are defined as

$${\hat{y}}_{t},{\lambda }_{t},{\alpha }_{t},{\beta }_{t}={{{\rm{evid}}}}({{{\bf{v}}}}),\quad {u}_{t}=\sqrt{\frac{{\beta }_{t}}{{\lambda }_{t}({\alpha }_{t}-1)}}.$$

(3)

Thus, the corresponding glucose prediction ${\hat{G}}_{t+w}$, lower bound ${B}_{t+w}^{l}$, and upper bound ${B}_{t+w}^{u}$ can be denoted as

$$\begin{array}{ll}{\hat{G}}_{t+w}={G}_{t}+{m}^{-1}({\hat{y}}_{t}),\\ {B}_{t+w}^{l}={\hat{G}}_{t+w}-{k}^{l}{m}^{-1}({u}_{t}),\\ {B}_{t+w}^{u}={\hat{G}}_{t+w}+{k}^{u}{m}^{-1}({u}_{t}),\end{array}$$

(4)

where m⁻¹ is the inverse function of the normalization; k^l and k^u are the thresholds of the uncertainty. Clinicians are allowed to adjust the thresholds to obtain specific clinical efficacy. For instance, increasing the value of k can enhance the sensitivity of the classifier to avoid missing the warnings of adverse glycemic events. During the model training, the negative log-likelihood loss function to optimize the parameters with maximum likelihood estimation can be solved by a Student-t distribution according to Bayesian probability theory⁴¹.

Performance evaluation

The glucose predictions were estimated by the mean values of the evidential distribution of the model output. The regression performance was evaluated by the RMSE, gRMSE, MAE, MAPE, and the time lag. In particular, gRMSE penalizes the prediction errors that could lead to harmful events, such as overestimation in hypoglycemia and underestimation in hyperglycemia, to demonstrate clinical impact ⁷¹, which is defined as follows:

$$\begin{array}{ll}{{{\rm{gRMSE}}}}\qquad\qquad=\sqrt{\frac{1}{N}\mathop{\sum }\limits_{t=1}^{N}P({G}_{t+w,}{\hat{G}}_{t+w}){({G}_{t+w}-{\hat{G}}_{t+w})}^{2}},\\ P\left.({G}_{t+w},{\hat{G}}_{t+w})\right)=1+{\alpha }_{L}{\bar{\sigma }}_{{G}_{t+w}\le {T}_{L},{\beta }_{L}}({G}_{t+w}){\sigma }_{{\hat{G}}_{t+w}\ge {G}_{t+w},{\gamma }_{L}}({\hat{G}}_{t+w},{G}_{t+w})\\ \qquad\qquad\qquad\qquad+\,{\alpha }_{H}{\sigma }_{{G}_{t+w}\ge {T}_{H},{\beta }_{H}}({G}_{t+w}){\bar{\sigma }}_{{\hat{G}}_{t+w}\le {G}_{t+w},{\gamma }_{H}}({\hat{G}}_{t+w},{G}_{t+w}),\end{array}$$

(5)

where $P({G}_{t+w,}{\hat{G}}_{t+w})\ge 1$, and values of α_L, β_L, γ_L, T_L, α_H, β_H, γ_H, T_H equal to 1.5, 30, 10, 85, 1, 100, 20, 155, respectively. The time lag is derived by the cross-correlation of predicted glucose levels and actual CGM measurements^34,35, which denotes the time-shift between two time series. A smaller time lag indicates a faster response of the prediction method to the changes in CGM trends and thus better prediction performance.

The thresholds of lower and upper bounds were selected in model validation according to MCC scores, which are respectively used to detect hypoglycemia and hyperglycemia. In particular, a hypoglycemic or hyperglycemic event is defined as three consecutive CGM measurements (i.e., at least 15 min) below 70 mg/dL or above 180 mg/dL, as recommended by previous studies⁷². A true positive means that an adverse glycemic event is correctly identified, while a false negative indicates a missed prediction. We evaluated the classification performance of hypo- and hyperglycemia prediction using a set of standard metrics, including accuracy, sensitivity, specificity, precision, and MCC^22,23,24. Good MCC scores can be obtained only if the classifier performs well in all confusion matrix categories, which is a more reliable and informative score than accuracy and the F1 score in binary classification⁷³. In addition, we introduced MD scores calculated by the MAE for the glucose sequences in missed predicted hypoglycemic or hyperglycemic events.

We used the results for the 60-minute prediction horizon as the primary outcomes, since predicting glucose over such a long prediction horizon is challenging. The converted TensorFlow Lite models were evaluated to simulate on-device inference in the ARISES app. To compare the proposed model with existing approaches, we employed a set of classic machine learning and deep learning baseline methods (Supplementary Tables 2, 5 and 6), including support vector regression (SVR) with the RBF kernel²¹, artificial neural networks (ANNs) with three fully-connected layers²⁰, bidirectional long short-term memory (Bi-LSTM)³², and CRNNs³⁴. Besides, we also used a statistical model, the ARMA with exogenous inputs⁴⁴, and a physiological model, the PKM, which is based on the composite minimal model of plasma glucose and insulin kinetics with personalized insulin sensitivity, time to maximum glucose rate of appearance, and time to maximum insulin absorption^45,74. The PKM has been validated on both the in silico data from the UVA/Padova T1D simulator⁷⁵ and real data from clinical trials⁷⁶ in terms of glucose prediction⁴⁵. The input features of baseline models were identical to those of the proposed model, except that the PKM only used the information of CGM measurements, carbohydrate intake and insulin bolus. To calculate the statistical significance with respect to the considered baseline results, we performed paired t tests after evaluating the normality by Shapiro–Wilk tests.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The dataset used in this study is not publicly available due to the proprietary nature of the data and privacy concerns. Interested researchers should contact the corresponding authors to inquire about the access.

Code availability

The free and open-source programming languages R 3.6.3 and Python 3.8.5 with the deep learning framework TensorFlow 2.2.0 were used in this study. The source code of the deep learning model and smartphone app is available upon reasonable request from the corresponding authors.

References

Saeedi, P. et al. Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045: Results from the International Diabetes Federation Diabetes Atlas. Diab. Res. Clin. Practice 157, 107843 (2019).
Article Google Scholar
Katsarou, A. et al. Type 1 diabetes mellitus. Nat. Rev. Disease Primers 3, 1–17 (2017).
Google Scholar
Yale, J.-F., Paty, B. & Senior, P. A. Hypoglycemia. Can J Diabetes 42, S104–S108 (2018).
Article PubMed Google Scholar
Gregg, E. W., Sattar, N. & Ali, M. K. The changing face of diabetes complications. Lancet Diabetes Endocrinol. 4, 537–547 (2016).
Article PubMed Google Scholar
Rodbard, D. Continuous glucose monitoring: a review of successes, challenges, and opportunities. Diabetes Technol. Therapeutics 18, S2–3 (2016).
Article CAS Google Scholar
Juvenile Diabetes Research Foundation Continuous Glucose Monitoring Study Group. Continuous glucose monitoring and intensive treatment of type 1 diabetes. N. Engl. J. Med. 359, 1464–1476 (2008) .
Heinemann, L. et al. Real-time continuous glucose monitoring in adults with type 1 diabetes and impaired hypoglycaemia awareness or severe hypoglycaemia treated with multiple daily insulin injections (HypoDE): a multicentre, randomised controlled trial. Lancet 391, 1367–1377 (2018).
Article CAS PubMed Google Scholar
Herrero, P., Georgiou, P., Oliver, N., Johnston, D. G. & Toumazou, C. A bio-inspired glucose controller based on pancreatic β-cell physiology. J. Diabetes Sci. Technol. 6, 606–616 (2012).
Article PubMed PubMed Central Google Scholar
Oliver, N., Reddy, M., Marriott, C., Walker, T. & Heinemann, L. Open source automated insulin delivery: addressing the challenge. npj Digital Med. 2, 1–5 (2019).
Article Google Scholar
Kirwan, M., Vandelanotte, C., Fenning, A. & Duncan, M. J. Diabetes self-management smartphone application for adults with type 1 diabetes: randomized controlled trial. J. Med. Internet Res. 15, e235 (2013).
Article PubMed PubMed Central Google Scholar
Ryan, E. A. et al. Improved A1C levels in type 1 diabetes with smartphone app use. Can. J. Diabetes 41, 33–40 (2017).
Article PubMed Google Scholar
Sevil, M. et al. Determining physical activity characteristics from wristband data for use in automated insulin delivery systems. IEEE Sensors J. 20, 12859–12870 (2020).
Article CAS Google Scholar
Ozaslan, B., Patek, S. D. & Breton, M. D. Impact of daily physical activity as measured by commonly available wearables on mealtime glucose control in type 1 diabetes. Diabetes Technol. Ther. 22, 742–748 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wu, Y. et al. Mobile app-based interventions to support diabetes self-management: a systematic review of randomized controlled trials to identify functions associated with glycemic efficacy. JMIR mHealth uHealth 5, e35 (2017).
Article PubMed PubMed Central Google Scholar
Lithgow, K., Edwards, A. & Rabi, D. Smartphone app use for diabetes management: evaluating patient perspectives. JMIR Diabetes 2, e2 (2017).
Article PubMed PubMed Central Google Scholar
Mathieu, C., Gillard, P. & Benhalima, K. Insulin analogues in type 1 diabetes mellitus: getting better all the time. Nat. Rev. Endocrinol. 13, 385 (2017).
Article CAS PubMed Google Scholar
Battelino, T., Nimri, R., Dovc, K., Phillip, M. & Bratina, N. Prevention of hypoglycemia with predictive low glucose insulin suspension in children with type 1 diabetes: a randomized controlled trial. Diabetes Care 40, 764–770 (2017).
Article CAS PubMed Google Scholar
Herrero, P. et al. Enhancing automatic closed-loop glucose control in type 1 diabetes with an adaptive meal bolus calculator–in silico evaluation under intra-day variability. Comput. Methods Progr. Biomed. 146, 125–131 (2017).
Article Google Scholar
Woldaregay, A. Z. et al. Data-driven blood glucose pattern classification and anomalies detection: machine-learning applications in type 1 diabetes. J. Medical Internet Res. 21, e11030 (2019).
Article Google Scholar
Pérez-Gandía, C. et al. Artificial neural network algorithm for online glucose prediction from continuous glucose monitoring. Diabetes Technol. Ther. 12, 81–8 (2010).
Article PubMed Google Scholar
Georga, E. I. et al. Multivariate prediction of subcutaneous glucose concentration in type 1 diabetes patients based on support vector regression. IEEE J. Biomed. Health Informatics 17, 71–81 (2013).
Article CAS Google Scholar
Gadaleta, M., Facchinetti, A., Grisan, E. & Rossi, M. Prediction of adverse glycemic events from continuous glucose monitoring signal. IEEE J. Biomed. Health Informatics 23, 650–659 (2019).
Article Google Scholar
Vehí, J., Contreras, I., Oviedo, S., Biagi, L. & Bertachi, A. Prediction and prevention of hypoglycaemic events in type-1 diabetic patients using machine learning. Health Informatics J. 26, 703–718 (2020).
Article PubMed Google Scholar
Dave, D. et al. Feature-based machine learning model for real-time hypoglycemia prediction. J. Diabetes Sci. Technol. (2020).
Bent, B. et al. Engineering digital biomarkers of interstitial glucose from noninvasive smartwatches. npj Digital Med. 4, 1–11 (2021).
Article Google Scholar
Zhu, T., Li, K., Herrero, P. & Georgiou, P. Deep learning for diabetes: A systematic review. IEEE J. Biomed. Health Informatics 25, 2744–2757 (2021).
Article Google Scholar
Fogel, A. L. & Kvedar, J. C. Artificial intelligence powers digital medicine. npj Digital Med. 1, 1–4 (2018).
Article Google Scholar
Arcadu, F. et al. Deep learning algorithm predicts diabetic retinopathy progression in individual patients. npj Digital Med. 2, 1–9 (2019).
Article Google Scholar
Williams, B. M. et al. An artificial intelligence-based deep learning algorithm for the diagnosis of diabetic neuropathy using corneal confocal microscopy: a development and validation study. Diabetologia 63, 419–430 (2020).
Article PubMed Google Scholar
Zhu, T., Li, K., Herrero, P. & Georgiou, P. Basal glucose control in type 1 diabetes using deep reinforcement learning: An in silico validation. IEEE J. Biomed. Health Informatics 25, 1223–1232 (2021).
Article Google Scholar
Zhu, T., Li, K., Kuang, L., Herrero, P. & Georgiou, P. An insulin bolus advisor for type 1 diabetes using deep reinforcement learning. Sensors 20, 5058 (2020).
Article CAS PubMed Central Google Scholar
Sun, Q., Jankovic, M. V., Bally, L. & Mougiakakou, S. G. Predicting blood glucose with an LSTM and Bi-LSTM based deep neural network (2018). 2018 14th Symposium on Neural Networks and Applications (NEUREL).
Zhu, T., Li, K., Herrero, P., Chen, J. & Georgiou, P. A deep learning algorithm for personalized blood glucose prediction (2018). The 3rd International Workshop on Knowledge Discovery in Healthcare Data, IJCAI-ECAI 2018.
Li, K., Daniels, J., Liu, C., Herrero, P. & Georgiou, P. Convolutional recurrent neural networks for glucose prediction. IEEE J. Biomed. Health Informatics 24, 603–613 (2020).
Article Google Scholar
Li, K., Liu, C., Zhu, T., Herrero, P. & Georgiou, P. GluNet: A deep learning framework for accurate glucose forecasting. IEEE J. Biomed. Health Informatics 24, 414–423 (2020).
Article Google Scholar
Zhu, T., Li, K., Herrero, P., Chen, J. & Georgiou, P. Dilated recurrent neural networks for glucose forecasting in type 1 diabetes. J. Healthcare Informatics Res. 1–17 (2020) .
Deng, Y. et al. Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients. npj Digital Med. 4, 1–13 (2021).
Article CAS Google Scholar
Zhu, T. et al. IoMT-enabled real-time blood glucose prediction with deep learning and edge computing. IEEE Internet of Things Journal https://doi.org/10.1109/JIOT.2022.3143375 (2022).
Porumb, M., Stranges, S., Pescapè, A. & Pecchia, L. Precision medicine and artificial intelligence: a pilot study on deep learning for hypoglycemic events detection based on ECG. Scientific Rep. 10, 1–16 (2020).
CAS Google Scholar
Luong, M. T., Pham, H. & Manning, C. D. Effective approaches to attention-based neural machine translation (2015). Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing.
Amini, A., Schwarting, W., Soleimany, A. & Rus, D. Deep evidential regression (2020) . Advances in Neural Information Processing Systems .
Finn, C., Abbeel, P. & Levine, S. Model-agnostic meta-learning for fast adaptation of deep networks (2017). Proceedings of the 34th International Conference on Machine Learning.
Spence, R. & Apperley, M. Data base navigation: an office environment for the professional. Behavi. Inform. Technol. 1, 43–54 (1982).
Article Google Scholar
Turksoy, K. et al. Hypoglycemia early alarm systems based on multivariable models. Ind. Eng. Chem. Res. 52, 12329–12336 (2013).
Article CAS Google Scholar
Liu, C. et al. Long-term glucose forecasting using a physiological model and deconvolution of the continuous glucose monitoring signal. Sensors 19, 4338 (2019).
Article CAS PubMed Central Google Scholar
Preissig, C. M. & Rigby, M. R. A disparity between physician attitudes and practice regarding hyperglycemia in pediatric intensive care units in the united states: a survey on actual practice habits. Critical Care 14, 1–8 (2010).
Article Google Scholar
Roberts, D. R. et al. Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40, 913–929 (2017).
Article Google Scholar
Bekkink, M. O., Koeneman, M., de Galan, B. E. & Bredie, S. J. Early detection of hypoglycemia in type 1 diabetes using heart rate variability measured by a wearable device. Diabetes Care 42, 689–692 (2019).
Article Google Scholar
Rothberg, L. J., Lees, T., Clifton-Bligh, R. & Lal, S. Association between heart rate variability measures and blood glucose levels: implications for noninvasive glucose monitoring for diabetes. Diabetes Technol. Ther. 18, 366–376 (2016).
Article CAS PubMed Google Scholar
Cichosz, S. L., Frystyk, J., Hejlesen, O. K., Tarnow, L. & Fleischer, J. A novel algorithm for prediction and detection of hypoglycemia based on continuous glucose monitoring and heart rate variability in patients with type 1 diabetes. J. Diabetes Sci. Technol. 8, 731–737 (2014).
Article CAS PubMed PubMed Central Google Scholar
Schuurmans, A. A. et al. Validity of the empatica e4 wristband to measure heart rate variability (HRV) parameters: a comparison to electrocardiography (ECG). J. Medical Systems 44, 1–11 (2020).
Article Google Scholar
Hovorka, R. et al. Nonlinear model predictive control of glucose concentration in subjects with type 1 diabetes. Physiol. Meas. 25, 905 (2004).
Article PubMed Google Scholar
Larsen, K., Petersen, J. H., Budtz-Jørgensen, E. & Endahl, L. Interpreting parameters in the logistic regression model with random effects. Biometrics 56, 909–914 (2000).
Article CAS PubMed Google Scholar
Peper, E., Harvey, R., Lin, I.-M., Tylova, H. & Moss, D. Is there more to blood volume pulse than heart rate variability, respiratory sinus arrhythmia, and cardiorespiratory synchrony? Biofeedback 35 (2007) .
Zong, W., Heldt, T., Moody, G. & Mark, R. An open-source algorithm to detect onset of arterial blood pressure pulses (2003). Computers in Cardiology, 2003.
Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology. Heart rate variability: standards of measurement, physiological interpretation and clinical use. Circulation 93, 1043–1065 (1996).
Benedek, M. & Kaernbach, C. A continuous measure of phasic electrodermal activity. J. Neurosci. Methods 190, 80–91 (2010).
Article PubMed PubMed Central Google Scholar
Carreiras, C. et al. BioSPPy: Biosignal processing in Python https://github.com/PIA-Group/BioSPPy/ (2015).
Makowski, D. et al. Neurokit2: A Python toolbox for neurophysiological signal processing https://github.com/neuropsychology/NeuroKit (2020).
Marling, C., Xia, L., Bunescu, R. & Schwartz, F. Machine learning experiments with noninvasive sensors for hypoglycemia detection (2016). Proceedings of IJCAI Workshop on Knowledge Discovery in Healthcare Data. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
Zisser, H. et al. Bolus calculator: a review of four “smart” insulin pumps. Diabetes Technol. Ther. 10, 441–444 (2008).
Article CAS PubMed Google Scholar
Toloşi, L. & Lengauer, T. Classification with correlated features: unreliability of feature ranking and solutions. Bioinformatics 27, 1986–1994 (2011).
Article PubMed CAS Google Scholar
Li, L., Jamieson, K., DeSalvo, G., Rostamizadeh, A. & Talwalkar, A. Hyperband: a novel bandit-based approach to hyperparameter optimization. J. Machine Learning Res. 18, 6765–6816 (2017).
Google Scholar
Nichol, A., Achiam, J. & Schulman, J. On first-order meta-learning algorithms Preprint at https://arxiv.org/abs/1803.02999 (2018).
Raghu, A., Raghu, M., Bengio, S. & Vinyals, O. Rapid learning or feature reuse? towards understanding the effectiveness of MAML (2019). International Conference on Learning Representations.
Zhu, T., Yao, X., Li, K., Herrero, P. & Georgiou, P. Blood glucose prediction for type 1 diabetes using generative adversarial networks (2020). The 5th International Workshop on Knowledge Discovery in Healthcare Data, ECAI 2020.
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
Chung, J., Gulcehre, C., Cho, K. & Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling (2014). NIPS 2014 Workshop on Deep Learning, December 2014.
Schuster, M. & Paliwal, K. K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673–2681 (1997).
Article Google Scholar
Bahdanau, D., Cho, K. & Bengio, Y. Neural machine translation by jointly learning to align and translate (2015). 3rd International Conference on Learning Representations, ICLR.
Del Favero, S., Facchinetti, A. & Cobelli, C. A glucose-specific metric to assess predictors and identify models. IEEE Trans. Biomed. Eng. 59, 1281–1290 (2012).
Article PubMed Google Scholar
Danne, T. et al. International consensus on use of continuous glucose monitoring. Diabetes Care 40, 1631–1640 (2017).
Article PubMed PubMed Central Google Scholar
Chicco, D. & Jurman, G. The advantages of the matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics 21, 6 (2020).
Article PubMed PubMed Central Google Scholar
Herrero, P. et al. Robust fault detection system for insulin pump therapy using continuous glucose monitoring. J. Diabetes Sci. Technol. 6, 1131–1141 (2012).
Article PubMed PubMed Central Google Scholar
Dalla Man, C. et al. The UVA/PADOVA type 1 diabetes simulator: new features. J. Diabetes Sci. Technol. 8, 26–34 (2014).
Article Google Scholar
Liu, C. et al. A modular safety system for an insulin dose recommender: a feasibility study. J. Diabetes Sci. Technol. 14, 87–96 (2020).
Article PubMed Google Scholar

Download references

Acknowledgements

The work has been supported by EPSRC EP/P00993X/1 and the President’s PhD Scholarship at Imperial College London. We would like to thank Prof. Robert Spence for his contribution in designing the interface of the ARISES app.

Author information

Authors and Affiliations

Centre for Bio-Inspired Technology, Department of Electrical and Electronic Engineering, Imperial College London, London, UK
Taiyu Zhu, Kezhi Li, Pau Herrero & Pantelis Georgiou
Division of Diabetes, Endocrinology and Metabolism, Faculty of Medicine, Imperial College London, London, UK
Chukwuma Uduku & Nick Oliver
Institute of Health Informatics, University College London, London, UK
Kezhi Li

Authors

Taiyu Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Chukwuma Uduku
View author publications
You can also search for this author in PubMed Google Scholar
Kezhi Li
View author publications
You can also search for this author in PubMed Google Scholar
Pau Herrero
View author publications
You can also search for this author in PubMed Google Scholar
Nick Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Pantelis Georgiou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.Z. and K.L. conceptualized the methods and contributed to model development. T.Z. contributed to manuscript preparation. C.U. and N.O. contributed to the acquisition, analysis, and interpretation of data. K.L., P.H., N.O., and P.G. contributed to the critical revision of the manuscript for important intellectual content. P.H., N.O., and P.G. contributed to the study design, project administration, and funding acquisition. All authors edited the manuscript.

Corresponding authors

Correspondence to Taiyu Zhu or Kezhi Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Supplementary Information

Clinical Trial Protocol

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhu, T., Uduku, C., Li, K. et al. Enhancing self-management in type 1 diabetes with wearables and deep learning. npj Digit. Med. 5, 78 (2022). https://doi.org/10.1038/s41746-022-00626-5

Download citation

Received: 18 August 2021
Accepted: 01 June 2022
Published: 27 June 2022
DOI: https://doi.org/10.1038/s41746-022-00626-5

This article is cited by

Machine learning in precision diabetes care and cardiovascular risk prediction
- Evangelos K. Oikonomou
- Rohan Khera
Cardiovascular Diabetology (2023)
Insulin detection in diabetes mellitus: challenges and new prospects
- Eva Vargas
- Ponnusamy Nandhakumar
- Joseph Wang
Nature Reviews Endocrinology (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Participant characteristics

Independent predictors using non-invasive physiological data

Glucose level prediction

Hypoglycemia and hyperglycemia prediction

Discussion

Methods

Phase I prospective study

Analysis of sensor wristband data

Multi-modal feature engineering

Feature pre-processing and selection

Model training, validation, and testing

Developing population and personalized models

Attention-based RNN architecture

Lower and upper bounds of predictions

Performance evaluation

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links