Acute kidney injury detection using refined and physiological-feature augmented urine output

Alkhairy, Sahar; Celi, Leo A.; Feng, Mengling; Zimolzak, Andrew J.

doi:10.1038/s41598-021-97735-0

Download PDF

Article
Open access
Published: 01 October 2021

Acute kidney injury detection using refined and physiological-feature augmented urine output

Sahar Alkhairy¹,
Leo A. Celi^1,2,
Mengling Feng^1,3^na1 &
…
Andrew J. Zimolzak^4,5^na1

Scientific Reports volume 11, Article number: 19561 (2021) Cite this article

2088 Accesses
2 Citations
3 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 09 November 2021

This article has been updated

Abstract

Acute kidney injury (AKI) is common in the intensive care unit, where it is associated with increased mortality. AKI is often defined using creatinine and urine output criteria. The creatinine-based definition is more reliable but less expedient, whereas the urine output based definition is rapid but less reliable. Our goal is to examine the urine output criterion and augment it with physiological features for better agreement with creatinine-based definitions of AKI. The objectives are threefold: (1) to characterize the baseline agreement of urine output and creatinine definitions of AKI; (2) to refine the urine output criteria to identify the thresholds that best agree with the creatinine-based definition; and (3) to build generalized estimating equation (GEE) and generalized linear mixed-effects (GLME) models with static and time-varying features to improve the accuracy of a near-real-time marker for AKI. We performed a retrospective observational study using data from two independent critical care databases, MIMIC-III and eICU, for critically ill patients who developed AKI in intensive care units. We found that the conventional urine output criterion (6 hr, 0.5 ml/kg/h) has specificity and sensitivity of 0.49 and 0.54 for MIMIC-III database; and specificity and sensitivity of 0.38 and 0.56 for eICU. Secondly, urine output thresholds of 12 hours and 0.6 ml/kg/h have specificity and sensitivity of 0.58 and 0.48 for MIMIC-III; and urine output thresholds of 10 hours and 0.6 ml/kg/h have specificity and sensitivity of 0.49 and 0.48 for eICU. Thirdly, the GEE model of four hours duration augmented with static and time-varying features can achieve a specificity and sensitivity of 0.66 and 0.61 for MIMIC-III; and specificity and sensitivity of 0.66 and 0.64 for eICU. The GLME model of four hours duration augmented with static and time-varying features can achieve a specificity and sensitivity of 0.71 and 0.55 for MIMIC-III; and specificity and sensitivity of 0.66 and 0.60 for eICU. The GEE model has greater performance than the GLME model, however, the GLME model is more reflective of the variables as fixed effects or random effects. The significant improvement in performance, relative to current definitions, when augmenting with patient features, suggest the need of incorporating these features when detecting disease onset and modeling at window-level rather than patient-level.

Development of a prediction score for in-hospital mortality in COVID-19 patients with acute kidney injury: a machine learning approach

Article Open access 24 December 2021

Machine-learning model for predicting oliguria in critically ill patients

Article Open access 11 January 2024

Machine learning algorithm to predict mortality in critically ill patients with sepsis-associated acute kidney injury

Article Open access 30 March 2023

Introduction

Acute kidney injury (AKI) is a sudden decrease in kidney function, resulting in fluid dysregulation, electrolyte abnormalities, and/or retention of waste products¹. Approximately seven percent of patients in hospitals, and over half of patients in intensive care units (ICUs) are thought to develop AKI during hospital stay². Multiple studies have shown a very strong association between AKI and consequent septic shock³ and mortality in adults^4,5,6,7,8 and in children⁹. Early intervention is known to lower the severity of AKI¹⁰ making rapid prognostication an important goal¹¹.

The detection and treatment of AKI, however, can be challenging as the ailment may result from one or more renal insults (pre-renal, post-renal, and/or intrinsic). Existing definitions of AKI (RIFLE, AKIN, and KDIGO) have similar predictive abilities of AKI patients, and have had associated biomarkers of renal injury studied^12,13.

The RIFLE criteria¹⁴ stratify AKI risk into five groups: risk, injury, failure, loss, and end stage renal disease. These criteria were validated in studies of tens of thousands of patients^15,16,17, and in systematic reviews¹⁸, all of which correlated the criteria with mortality and/or other adverse outcomes.

The acute kidney injury network (AKIN) criteria¹⁹ are a modification of RIFLE that have been validated in several studies^20,21,22, including one study of over 300,000 patients, thereby making them more popular for research studies²³. The more recent KDIGO criteria are similar to AKIN in the urine output aspect with more elaborate creatinine aspect²⁴.

While the details of the criteria may differ, they are united by their use of creatinine (CR) and urine output (UO) to independently define AKI^25,26. Furthermore, their lowest level criteria for AKI have a common requirement of a maximum urine output of 0.5 ml/kg/h for at least 6 h and creatinine level of greater than 1.5 \(\times \) the baseline.

The independence of the urine output and creatinine definitions, however, often leads to conflicting conclusions. The urine output definition has the advantage of being more readily available (as creatinine is often measured only once a day)^19,27, but it is also less strongly associated with ICU outcomes than the creatinine definition. This is because the relationship between AKI and urine output depends on the type of renal injury (pre-renal, post-renal, or intrinsic). For example, pre-renal issues are associated with oliguria, post-renal issues often result in anuria, and intrinsic renal issues have varying effects on urine output (sometimes even increasing it), depending on the region injured and the extent of injury.

The relationship between urine output and AKI have been studied in detail²⁸. Urine output as a marker of AKI is probably confounded by multiple factors²⁹. That is, fluctuations in urine output can be confounded by variables unrelated to AKI. Overall, low urine output may indicate AKI in some patients but not others, and certain clinical variables should be considered before urine output is used to make the diagnosis. Unlike urine output, multiple investigators have indicated a strong preference for the creatinine definition of AKI^2,11 and have found it to have an overall low false positive rate³⁰. However, research has also shown that utilizing both creatinine and UO significantly increase the detection power of AKI as compared to only using creatinine^31,32.

Because the urine- and creatinine-based definition “limits timely and accurate AKI diagnosis”, a variety of additional biomarkers for AKI have been investigated³³. The goal is a marker of AKI that is more specific and sensitive than existing criteria, and which ideally becomes detectable before a rise in creatinine. One biomarker clinically available in several countries is neutrophil gelatinase-associated lipocalin (NGAL), and another test, known as “Nephrocheck,” is formed by the combination of two markers of cell cycle arrest^33,34. Such biomarkers are not measured in all patients, and it is not yet clear when or in what populations they should be measured, as they may add to healthcare costs²⁹. Unfortunately, existing biomarkers have shown mixed prognostic ability³⁵.

We hypothesize that urine output can indicate AKI before a rise in creatinine, and that improved sensitivity and specificity can be achieved if the time courses of other easily measured physiologic variables are taken into account. This combination could be considered a “digital biomarker,” rather than a chemical one such as NGAL.

Our goals are: (1) to characterize the agreement between the urine output and creatinine definitions for AKI, (2) to determine what time and volume thresholds of the urine definition best agree with the creatinine definition, and (3) build generalized estimating equation (GEE)³⁶ and generalized linear mixed-effects (GLME) models³⁷ with static and time-varying features to improve agreement with the creatinine-based definition, without sacrificing expediency.

We perform this study on two independent large retrospective clinical archives. We do not intend to formulate a new, unitary definition of AKI that will supplant the measurement of creatinine. Rather, our aim is to determine a urine output-based detector that is more aligned with the creatinine criteria for AKI.

Methods

Data set and feature extraction

Data for this study were extracted from two independent intensive care databases with clinical and physiological data, MIMIC-III³⁸ and eICU³⁹. Multiparameter Intelligent Monitoring in Intensive Care III (MIMIC-III) database includes data from over 38,590 Beth Israel Deaconess Medical Center adult ICU patients. The database covers patients who were admitted between 2008 and 2014 to the adult ICUs at Beth Israel Deaconess Medical Center, a tertiary care university academic medical center located in Boston, Massachusetts. It includes physiologic information from bedside monitors and hospital information systems. The data in MIMIC-III were de-identified, and the use of the database for research was approved by the Institutional Review Boards of the Massachusetts Institute of Technology and Beth Israel Deaconess Medical Center. eICU Collaborative Research Database (eICU), includes patient data from a telehealth system developed by Philips Healthcare. The database includes de-identified clinical and physiological data for more than 139,360 patients admitted to one of 335 units at 208 hospitals between 2014 and 2015.

For each patient sample, we extracted static features including age, gender, first measured weight, height, lean body mass (LBM, derived from the weight, height, and gender), and binary indicators for diabetes, heart disease, cancer, and prior use of diuretics. We also extracted time varying features such as serum creatinine, and hourly measures of urine output, vasopressor use, fluid intake, and mean arterial pressure (MAP) from the first 48 h of ICU stay. These features have been shown to be indicators of AKI^{24,25,40,41,42}. Drugs that were considered vasopressors are: dobutamine, dopamine, epinephrine, isuprel, levophed, vasopressin, milrinone, neosynephrine, norepinephrine, and phenylephrine. We computed fluid balance by subtracting fluid output from input and normalized it by the patient’s first measured weight. Inclusion of features such as diuretics would account for increase in urine output that can be factored out in determining if a patient has AKI.

Pre-processing and inclusion/exclusion criteria

Patients with less than four hours urine output measurements were excluded. Of those with more than four hourly measures, we excluded any patients with a normalized urine output less than or equal 0.5 ml/kg/h during the first 6 h of admission given that they will require data collected prior to ICU admission which the current databases do not capture. As urine output measures occurred at irregular intervals, we estimated the urine output at the end of the sixth hour, when the measure was not recorded, using interpolation between the two nearest measures. Lastly, we excluded the first urine measurement that inconsistently includes urine output in the Emergency Department, in the operating room or the hospital ward prior to ICU admission.

We excluded part of the database from analyses because we are concerned only with patients with sufficient data who developed AKI during their ICU stay. The data went through two stages of filtering as illustrated in Fig. 1 . The two cohorts resulting from the two stages are Analyses cohort and subsequently the GEE/GLME cohort.

Table 1 Study population characteristics.

Full size table

The Analyses cohort is used in characterizing the baseline symmetry between the urine output and creatinine criteria of AKI, and in evaluating the performance of various combinations of time and volume thresholds. It included only patients who had normal kidney function at ICU admission. Therefore, we excluded patients if they had undergone dialyses prior to ICU admission, or if they had a first creatinine measure greater than 1.2 mg/dl, or had an average urine output less than 0.5 ml/kg/h for the first 6 h. Additionally, we excluded patients that had missing data, and ones with too few observations to reliably extract information from (e.g. had less than four measurements of urine output data).

The GEE/GLME cohort is used in identifying a urine output based model that is augmented with other static and dynamic features to predict AKI onset. This cohort is a subset of the Analyses cohort but additionally excluded any patient with missing values for the static and dynamic features used in the model. These features are: age, gender, use of diuretics, use of vasopressors, average MAP, and fluid intake.

Baseline symmetry and time/volume refinement

All three AKI standards (RIFLE, AKIN, and KDIGO) have similar criteria for their lowest levels of AKI classification. Stage 1 of KDIGO and AKIN and the risk stage of RIFLE require urine output that characterizes AKI by time and volume thresholds of 6 h and 0.5 ml/kg/h and a creatinine level of greater than 1.5 \(\times \) the baseline. The creatinine-based criteria for classifying patients as having AKI (\(AKI_{Cr}\)) is based on the creatinine measurements within the first 48 h of ICU admission where we define AKI as either (1) an increase in creatinine greater than or equal 0.3 mg/dl from hospital stay minimum, or (2) a 50% or more increase from hospital stay minimum¹⁶. The urine output based criterion (\(AKI_{UO}\)) classifies patients as having AKI if any time window of a given length threshold has an average weight-normalized urine output less than the volume threshold. We investigated the baseline symmetry between the creatinine and urine criteria of AKI. In particular, we determined its classification performance as indicated by sensitivity and specificity of time and volume thresholds of 6 h and 0.5 ml/kg/h with the creatinine-based definition of AKI as reference. We also refined the choice of time and volume threshold combinations that allowed for the greatest overlap between \(AKI_{UO}\) and acute kidney injury based on creatinine (\(AKI_{Cr}\)). The time thresholds we investigated ranged from 2 to 12 h in increments of 2 while the volume thresholds ranged from 0 to 1 ml/kg/h in increments of 0.1. For each combination of thresholds, we calculated specificity, sensitivity, J-point distance, and net reclassification index (NRI). J-point is the point on the ROC curve that has the least Cartesian distance to 100% sensitivity and specificity.

Multivariable modeling

Urine output is time-varying, with future values correlated to past values. This makes standard generalized linear modeling approaches invalid. To address this, we employed a generalized estimating equation (GEE), which estimates the parameters of a generalized linear model without any assumptions about the covariance structure of the data, allowing us to use multiple correlated urine observations for model parameter estimation.

The following features were included in the GEE model to predict AKI onset according to the creatinine criteria: age, having diabetes, having heart disease, having cancer, prior diuretic use, prior vasopressor use, first creatinine measure, lean body mass (LBM), time-averaged mean arterial pressure, and fluid balance. All these variables are considered as fixed effects in the GEE model. In comparison, for the GLME model, we consider age, prior vasopressor use, first creatinine measure, LBM, time-averaged mean arterial pressure, and fluid balance to be fixed effects; and a patient having diabetes, heart disease, cancer, and been given diuretic prior as random effects. This better representation could potentially lead to greater agreement with creatinine-based definition. The GLME model integrates out the random effects, but is limited to categorical variables. The extended GLMM model⁴³ is able to model continuous random effects using Monte Carlo simulation and expectation maximization, which makes it computationally infeasible for the size of the database we are using.

We computed fluid balance within a certain time window by subtracting the total urine output within the window from the adjusted fluid intake and normalizing it by the patient’s first measured weight. The adjusted fluid intake is the sum of fluid intake up to and including during the time window minus the total urine output up to the start of the time window.

As in our refinement analyses, we explored various time window lengths and observed their impact on model performance in prediction of AKI onset with reference to creatinine based AKI criteria. Specifically, we explored time thresholds ranging from 2 to 12 h in increments of 2.

We generated the GEE model using GEEQBOX toolkit³⁶ and the GLME model using Matlab’s GLME function using a randomly selected training set comprising of two-thirds of the GEE/GLME cohort, and tested the performance of our fitted models by predicting \(AKI_{Cr}\) on the unseen test set (one-third of GEE/GLME cohort). We plotted the receiver operating characteristic (ROC) curve for each of the six models (one model for each time window), and examined the model coefficients, odds ratios, 95% confidence intervals, and p-values for each model.

For each model, we calculated the area under the ROC curve (AUC), J-point specificity and sensitivity, J-point distance, and net reclassification index (NRI). For computing the NRI for the various models, we binarized the prediction of AKI for the validation set using the probability threshold of the J-point.

Model variables

In order to obtain the features, we extracted the average UO per window per time threshold the same way we computed \(AKI_{UO}\).

For the other time-varying features (1) MAP (2) fluid balance (3) use of vasopressors, we used the normalized start time of each window. For the MAP, we obtained the median value one to three hours prior. For the fluid balance, we obtained the difference between fluid input and output and normalized it by weight. For the vasopressors, we checked to see if any vasopressor was used prior to the start time of the window.

To obtain \(AKI_{Cr}\) for each window, we labeled each creatinine measurement with 0 or 1 (0: no AKI, 1: has AKI) based on the \(AKI_{Cr}\) definition. We also, removed any UO window that overlap with serum creatinine measurements (because it is difficult to know which measurement it would belong to) and any window after the last measurement. We labeled each window based on the next nearest creatinine measurement.

Net reclassification index

In order to measure the improvement in performance of the various refinements in time and volume thresholds and GEE/GLME models with respect to the standard urine output threshold of 0.5 ml/kg/h for a duration of at least 6 h, we computed their net reclassification improvement (NRI)^44,45. NRI is the difference between the probability of correct reclassification and the probability of incorrect reclassification. It is also the difference between the sum of the sensitivity and specificity of the new model and the sum of the sensitivity and specificity of the old model.

Use of experimental animals, and human participants

This is a retrospective study using openly available datasets and does not deal with human participants or groups. Therefore, need for consent is not applicable. Only computational methods were used and no clinical or experimental methods were carried out. All methods were carried out in accordance with relevant guidelines and regulations.

Results

Characteristics of patients and population sizes for the Primary cohort, Analyses cohort, and cohort of best performing GEE/GLME model for the MIMIC-III and eICU databases are shown in Table 1. We note that the GEE/GLME cohort differs from the Primary cohort in all characteristics in both databases with the exception of cancer indicator, use of diuretics, height, and age in MIMIC-III; and age in eICU . This is to be expected as we only include patients with specific characteristics from the general and heterogeneous patient population.

We also note a significant difference in the number of patients that have heart disease and that have cancer between the MIMIC and eICU databases—heart disease (MIMIC: 68%, eICU: 11.4%), cancer (MIMIC: 17%, eICU: 2.3%). The diagnoses included in the heart disease and cancer categories for MIMIC and eICU include similar diverse set of diagnoses. Johnson et al.³⁸ had similar statistics for the percentage of patients with heart disease (71.4%) and Pollard et al.³⁹ mentioned that 11.15% and 4.7% of the patients in the eICU had heart disease and cancer respectively, similar to our findings. Supported by existing work, the differences in the percentages of patients with diseases between the MIMIC and eICU datasets suggest that the two sets of patients are significantly different.

Table 2 GEE and GLME multivariable models’ estimated parameters.

Full size table

Table 3 Performance metrics across various models.

Full size table

Additionally, there was a noticeable drop in the percentage of patients that meet the creatinine-based definition of AKI in the eICU database between the Primary and Analyses cohorts (56.4–34.4%). The reason behind this drop is due to there being a large intersection between the patients with abnormal kidney function at ICU admission and the ones who meet the definition of developing creatinine-based AKI. When filtering out the ones with prior abnormal kidney function from the Primary cohort a significant portion of the patients that had further increase in creatinine during their ICU stay were also excluded resulting in the sharp decrease.

The congruence between creatinine-based definition of AKI and mortality has a sensitivity of 0.61 and specificity of 0.48 for MIMIC-III; and sensitivity of 0.47 and specificity of 0.67 for eICU. The baseline symmetry between the standard AKI (\(AKI_{UO}\)) definition of urine output less than 0.5 ml/kg/h for 6 h and the reference AKI (\(AKI_{Cr}\)) definition based on creatinine levels has a sensitivity of 0.54 and specificity of 0.49, with a distance of 0.68 from 100% sensitivity and specificity for the MIMIC-III database; and a sensitivity of 0.56 and specificity of 0.38, with a distance of 0.76 from 100% sensitivity and specificity for the eICU database.

The results of refining AKI urine output and time thresholds are depicted in Fig. 2 and supplementary Table S1. For each of the two databases MIMIC-III and eICU, there are volume and time threshold combinations for the urine-based AKI definition that have better congruence with the creatinine-based AKI definition than the standard volume and time thresholds of 0.5 ml/kg/h and 6 h.

For the MIMIC-III database, ranking based on J-point distance results in the optimal time and volume thresholds of \(AKI_{UO}\) as UO less than 0.6 ml/kg/h for 12 h. This combination has a sensitivity of 0.48, specificity of 0.58, J-point distance of 0.67, and NRI of 0.027. Ranking the threshold combinations based on NRI values, results in the same optimal time and volume thresholds of \(AKI_{UO}\). For the eICU database, ranking based on J-point distance results in the optimal time and volume thresholds of \(AKI_{UO}\) as UO less than 0.6 ml/kg/h for 10 h. This combination has a sensitivity of 0.48, specificity of 0.49, distance of 0.73 from 100% sensitivity and specificity, and NRI of 0.026. Ranking the threshold combinations based on NRI values, results in the optimal time and volume thresholds of \(AKI_{UO}\) as UO less than 1 ml/kg/h for 2 h. This combination has a sensitivity of 0.92, specificity of 0.074, distance of 0.93 from 100% sensitivity and specificity, and NRI of 0.046.

The mortality percentage of patients meeting the volume and duration thresholds of urine-based definition of AKI decreases as the normalized urine output threshold increases and increases as the time duration threshold increases as shown in Fig. 3.

The area under the ROC curve (AUC) for the GEE/GLME multivariable models augmented physiological features for two partitions are plotted in Fig. 4.

Performance trend across partitions is generally consistent. Ranking of each of GEE and GLME models according to AUC values results in a best performing model with a time window of 4 h for both MIMIC-III and eICU.

The GEE model with a time window of 6 h—the same duration of data as the standard criteria– has a sensitivity of 0.65, a specificity of 0.62, J-point distance of 0.517, and NRI of 0.21 for MIMIC-III; and sensitivity of 0.65, a specificity of 0.64, J-point distance of 0.50, and NRI of 0.34 for eICU. The GLME model with a time window of 6 h has a sensitivity of 0.57, a specificity of 0.65, J-point distance of 0.56 and NRI of 0.19 for MIMIC-III; and sensitivity of 0.61, a specificity of 0.66, J-point distance of 0.52, and NRI of 0.31 for eICU.

The best performing GEE model has a sensitivity of 0.61, specificity of 0.66, J-point distance of 0.512, and NRI of 0.256 for MIMIC-III; and a sensitivity of 0.64, specificity of 0.66, J-point distance of 0.50, and NRI of 0.35 for eICU. The best performing GLME model has a sensitivity of 0.55, specificity of 0.71, J-point distance of 0.54, and NRI of 0.25, for MIMIC-III; and a sensitivity of 0.60, specificity of 0.66, J-point distance of 0.52, and NRI of 0.31, for eICU.

GEE model has better performance than the GLME model for MIMIC and eICU databases. However, we include the GLME model as it is more reflective of fixed and random effects, integrating out random effects.

For the best performing model according to AUC (4 h of data), the odds ratio, and 95% confidence intervals for significant features are tabulated in Table 2.

First creatinine measurement, LBM, prior vasopressor use, and fluid balance were found to exhibit a statistically significant association with \(AKI_{Cr}\) in both MIMIC-III and eICU. Additionally, heart disease was a significant indicator in MIMIC-III in the GEE model, while diuretics use and MAP were significant features in eICU in the GEE model. Specifically, increased first creatinine measurement, positive fluid balance, and decreased LBM showed a positive association with AKI. In MIMIC-III, heart disease and vasopressor use showed negative association with AKI. In eICU, use of diuretics and vasopressors use showed positive association, whereas mean arterial pressure showed negative association.

Summary of performance across the various non-parametric and parametric models is tabulated in Table 3. For both databases, MIMIC-III and eICU, J-point distance is reduced for non-parametric model over the standard urine-based AKI definition. Additionally, the distance is substantially reduced for the parametric GEE and GLME models over the non-parametric model.

We also tested the MIMIC-trained model on eICU and vice versa using both GEE and GLME models. The significantly lower performance compared to models trained and tested on the same database leads to the conclusion that there are significant differences between the patients cohorts not captured in the databases. These differences may partially arise from distinctions in qualitative procedures and quantitative variables not part of the database.

Discussion

Over the past 3 decades, the incidence of AKI has increased over 20-fold, making it an important problem in critical care medicine. The purpose of this paper was to investigate the complex factors mediating the relationship between urine output and creatinine in AKI, and to develop a time varying multivariable model that identifies factors mediating the relationship based on augmentation of urine output with physiological features.

For the diagnosis of AKI, serum creatinine remains the AKI reference in practice. Creatinine, however, reflects kidney function and not kidney damage. This is problematic because functional changes tend to occur only after the kidney has suffered significant damage¹⁰. Recent studies have shown the potential of other biomarkers to be better predictors of AKI^33,46 that are not readily measured. Indeed, it has been reported that kidney damage may begin up to 48 hours before it is detected by changes in creatinine. This fact was the motivation for the development of urine output criteria of AKI in the first place⁴⁶.

In the realm of urine output criteria, the congruence between urine output and creatinine-based AKI is greater in MIMIC-III than in eICU. This may be a result of a much larger portion of patients that meet the creatinine-based AKI definition in MIMIC-III (54% in MIMIC-III vs 34 % in eICU). Additionally, the performance of the optimal time and volume threshold combinations both according to J-point distance and according to NRI had only a slightly better agreement with creatinine-based AKI definition than the standard urine-output based definition. We argue that the additional 4 or 6 hours of data required for this modified threshold does not merit the small improvement in classification performance.

In actuality, the relationship between urine output and creatinine is likely confounded by multiple factors. Fluctuations in urine output are also likely to be driven independently by variables completely unrelated to AKI. Overall, low urine output may translate into AKI in some patients but not in others, and potentially confounding clinical factors should be considered before urine output is used to make a diagnosis. Although it is known to be less accurate, there are known advantages to using the urine output criteria.

Ultimately AKI is a highly heterogeneous disease²⁹ and it may be naïve to assume that a single feature (be it urine or creatinine) will correctly predict the same ailment for all patients. As suggested by De Corte, one future path forward may be to condition the definition of AKI on the population in question¹⁰. Our work presented here is a step towards incorporating this heterogeneity through physiological features.

We saw a significant improvement in the predictive performance of feature-augmented time varying GEE and GLME models with a window of 6 h (time duration of standard urine output) compared to the standard urine output based AKI definition in terms of sensitivity, specificity, and J-point distance in both databases.

Additionally, the prediction performance of all the feature-augmented time varying models consistently outperformed the prediction performance of the original urine output based definition of AKI or any refinement of its time and volume thresholds according to any of the metrics used (sensitivity, specificity, J-point distance, and NRI). Importantly, there is no trade off between any of these metrics such as an increase in specificity at the expense of sensitivity. This suggests that having a time varying model augmented with static and dynamic features is necessary for significantly improved prediction of AKI.

Furthermore, our results provide insight into features other than urine output that might improve the prediction performance of AKI. In both MIMIC and eICU, first creatinine measurement, fluid balance, and LBM were significantly associated with creatinine-based AKI. First, a higher baseline creatinine was associated with future rise in creatinine. This is a noteworthy finding as we specifically excluded patients with “abnormal” baseline creatinine—thus even a “high normal” baseline creatinine is associated with AKI. Second, positive fluid balance was associated with future rise in creatinine. It is worth noting here that we did not directly investigate the type of fluid received by the patients, which has been reported as a potential driver of AKI by others in the literature⁴⁷. Third, a greater LBM decreased the probability of developing creatinine-based AKI. This finding is substantiated by the work of Liu et al.⁴⁸ where they found that underweight patients had a greater chance of developing AKI in ICU as adequate nutritional intake is thought to reduce ICU length of stay and improve chances of recovery^49,50.

In MIMIC-III, use of vasopressors and heart disease are associated with decreased risk of AKI. In MIMIC-III, more than 60% of the patients have heart disease. Of those patients, 39% were given vasopressors; Only 20% of the patients without heart disease were given vasopressors. Vasopressors stabilizes the abnormally low blood pressure and blood perfusion caused by heart disease and restores end-organ perfusion leading to better outcomes.

In eICU, use of diuretics was associated with increased chance of developing AKI. This may be due to forced diuresis leading to volume overload⁵¹. Also, decreased MAP had a positive association with future rise in creatinine. Low average MAP within a time window was associated with a future rise in creatinine, as expected from decreased renal perfusion. Additionally, use of vasopressors was associated with the development of AKI as also previously noted⁵². Reduction of blood flow to tissues for patients with increased fluid overload can cause harm⁵³.

It is interesting to note that direction of association of a given feature depends on the underlying population. Vasopressor use was negatively associated with AKI in MIMIC whereas it was positively associated in eICU, as MIMIC has significantly more patients with heart disease diabetes, and cancer than eICU. This emphasises the importance of taking into account the patient population characteristics when making treatment decisions.

Even prior to the development of the RIFLE criteria¹⁴ and the AKIN modification¹⁹, experts remarked “none of the definitions (of AKI) used to date take into account the modifying effects of age, gender, and race on creatinine generation”⁵⁴. Even the most recent clinical practice guidelines state that the urine output criteria are not well validated, require further investigation, and that the effects of fluid balance and other factors should be considered. One recent study of 2171 patients performs such an adjustment based on fluid balance⁵⁵, but our work here considers fluid balance in addition to multiple other factors suggested by prior investigators.

Our findings that UO alone is not a powerful indicator of AKI but UO along with other features such as blood pressure and use of vasopressors can be a sensitive indicator are supported by Prowle et al.⁵⁶ and Macedo et al.⁵⁷, although it should be noted that their conclusions are based on very limited data. Prowle et al included 239 patients in their study of which 23 further developed AKI, and Macedo et al included only 75 patients of which 21 developed AKI.

Both studies sought to determine if changes in UO could be a sensitive marker of AKI using creatinine-based definition as the gold standard. However, both studies used urine output and not fluid balance to detect AKI, which is necessary as increase in fluid intake while maintaining the same UO does raise concerns about kidney function. Additionally, they used summary statistics such as mean, median and interquartile range (IQR) for continuous variables and percentages and CI for categorical variables rather than utilizing higher resolution of variables. Our modeling at a window-level rather than a patient level allows use of the appropriate corresponding values. It allows accounting for the time difference between events such as use of vasopressors and change in fluid balance as the impact of drugs lessens over time.

The last decade’s research on the topic of AKI has focused primarily on the discovery of more reliable biomarkers for laboratory diagnosis of AKI. Several biomarkers can give an indication before serum creatinine rises, but unfortunately they may perform no better than standard criteria in unselected populations, and have not been linked to improved outcomes^29,46. Additionally, the biomarkers are not readily measured, making impossible to perform large retrospective studies on it.

With the advent of digital health records, we have the opportunity to re-calibrate consensus definitions and clinical guidelines traditionally based on expert opinion, and/or data from relatively small sample populations. This allows us to test the robustness of physiologic concepts developed based on animal experiments or studies on healthy human volunteers in the setting of critical illness. When AKIN first created a definition of AKI, large databases that relate creatinine to hourly urine output, like the Multiparameter Intelligent Monitoring in Intensive Care III database (MIMIC-III) and Collaborative Research Database (eICU), were not as readily available. Using two independent large retrospective clinical archives with significantly different patient populations we have re-examined the agreement between the two components of this definition.

While our results are robust, this improved detection cannot replace the measurement of creatinine for the definition of AKI. In the future, other definitions, and even guidelines, based on expert opinion and existing data should be revisited in this manner, based on new repositories of patient data linked with clinical outcomes, and we believe that our work presented here can serve as a prototype for this approach.

Conclusion

In this paper, we refined the urine-based definition of AKI by optimizing urine volume and duration criteria, and also introduced a time varying detection model that incorporated physiological features that confound the relationship between hourly urine output measurements and creatinine. This was conducted using two independent data sets with different patient populations. In both data sets we consistently showed that a model which monitors repeated urine output measures in addition to other covariates (such as average MAP) has enhanced associations with future rise in creatinine, as compared to applying a fixed criterion of 0.5 ml/kg/hour of urine for 6 hours or any of its refinements. Thus, urine output and other patient characteristics could be continuously monitored in real time by a bedside algorithm. Once the multivariable definition of AKI is met in a given patient, critical steps (such as interventions to treat AKI, or adjusting the dose of medications cleared by the kidneys) could be undertaken.

Data availability

The openly available datasets supporting the conclusions of this article are from https://mimic.physionet.org/ and https://eicu-crd.mit.edu/.

Change history

09 November 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41598-021-01415-y

References

Thadhani, R., Pascual, M. & Bonventre, J. Failure. N. Engl. J. Med. 334, 1448–1460 (1996).
Article CAS PubMed Google Scholar
Rahman, M., Shad, F. & Smith, M. C. Acute kidney injury: A guide to diagnosis and management. Am. Fam. Phys. 86, 631–639 (2012).
Google Scholar
Leedahl, D. D. et al. Derivation of urine output thresholds that identify a very high risk of AKI in patients with septic shock. Clin. J. Am. Soc. Nephrol. 9, 1168–1174 (2014).
Article PubMed PubMed Central Google Scholar
Celi, L. A. G. et al. A clinical database-driven approach to decision support: Predicting mortality among patients with acute kidney injury. J. Healthc. Eng. 2, 13 (2011).
Article Google Scholar
Córdova-Sánchez, B. M., Herrera-Gómez, Á. & \({\tilde{{\rm N}}}\)amendys-Silva, S. A. Acute kidney injury classified by serum creatinine and urine output in critically ill cancer patients. BioMed Res. Int. 2016 (2016).
Ralib, A. M., Pickering, J. W., Shaw, G. M. & Endre, Z. H. The urine output definition of acute kidney injury is too liberal. Critical Care 17, R112 (2013).
Article Google Scholar
Han, S. S. et al. Additional role of urine output criterion in defining acute kidney injury. Nephrol. Dial. Transplant. 27, 161–165 (2012).
Article PubMed Google Scholar
Jin, K. et al. Intensive monitoring of urine output is associated with increased detection of acute kidney injury and improved outcomes. Chest 152, 972–979 (2017).
Article PubMed Google Scholar
Kaddourah, A. et al. Oliguria and acute kidney injury in critically ill children: Implications for diagnosis and outcomes. Pediatr. Crit. Care Med. 20, 332–339 (2019).
Article PubMed Google Scholar
De Corte, W., De Laet, I. & Hoste, E. Shifting paradigms in acute kidney injury. In Annual Update in Intensive Care and Emergency Medicine 2014. 541–552 (Springer, 2014).
Legrand, M. & Payen, D. Understanding urine output in critically ill patients. Ann. Intensive Care 1, 13 (2011).
Article PubMed PubMed Central Google Scholar
Pickering, J. W. & Endre, Z. H. Linking injury to outcome in acute kidney injury: A matter of sensitivity. PLoS One 8, e62691 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Rodrigues, F. B. et al. Incidence and mortality of acute kidney injury after myocardial infarction: A comparison between Kdigo and Rifle criteria. PloS One 8, e69998 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Bellomo, R. et al. Acute renal failure-definition, outcome measures, animal models, fluid therapy and information technology needs: The second international consensus conference of the acute dialysis quality initiative (adqi) group. Critical Care 8, R204 (2004).
Article PubMed PubMed Central Google Scholar
Uchino, S., Bellomo, R., Goldsmith, D., Bates, S. & Ronco, C. An assessment of the rifle criteria for acute renal failure in hospitalized patients. Crit. Care Med. 34, 1913–1917 (2006).
Article PubMed Google Scholar
Ostermann, M. & Chang, R. W. Acute kidney injury in the intensive care unit according to rifle. Crit. Care Med. 35, 1837–1843 (2007).
Article PubMed Google Scholar
Bagshaw, S. M., George, C., Dinu, I. & Bellomo, R. A multi-centre evaluation of the rifle criteria for early acute kidney injury in critically ill patients. Nephrol. Dial. Transplant. 23, 1203–1210 (2008).
Article PubMed Google Scholar
Ricci, Z., Cruz, D. & Ronco, C. The rifle criteria and mortality in acute kidney injury: A systematic review. Kidney Int. 73, 538–546 (2008).
Article CAS PubMed Google Scholar
Mehta, R. L. et al. Acute kidney injury network: Report of an initiative to improve outcomes in acute kidney injury. Crit. Care 11, R31 (2007).
Article PubMed PubMed Central Google Scholar
Barrantes, F., Tian, J., Vazquez, R., Amoateng-Adjepong, Y. & Manthous, C. A. Acute kidney injury criteria predict outcomes of critically ill patients. Crit. Care Med. 36, 1397–1403 (2008).
Article PubMed Google Scholar
Joannidis, M. et al. Acute kidney injury in critically ill patients classified by akin versus rifle using the saps 3 database. Intensive Care Med. 35, 1692–1702 (2009).
Article PubMed Google Scholar
Mandelbaum, T. et al. Outcome of critically ill patients with acute kidney injury using the akin criteria. Crit. Care Med. 39, 2659 (2011).
Article PubMed PubMed Central Google Scholar
Thakar, C. V., Christianson, A., Freyberg, R., Almenoff, P. & Render, M. L. Incidence and outcomes of acute kidney injury in intensive care units: A veterans administration study. Crit. Care Med. 37, 2552–2558 (2009).
Article PubMed Google Scholar
Goren, O. & Matot, I. Perioperative acute kidney injury. BJA Br. J. Anaesth. 115, ii3–ii14 (2015).
Article PubMed Google Scholar
Kellum, J. A. et al. Classifying AKI by urine output versus serum creatinine level. J. Am. Soc. Nephrol. 26, 2231–2238 (2015).
Article CAS PubMed PubMed Central Google Scholar
Xu, Y., Liu, X., Sun, X. & Wang, Y. The impact of serum uric acid on the natural history of glomerular filtration rate: A retrospective study in the general population. PeerJ 4, e1859 (2016).
Article PubMed PubMed Central Google Scholar
Solomon, A. W. et al. Urine output on an intensive care unit: Case-control study. BMJ 341, c6761 (2010).
Article PubMed PubMed Central Google Scholar
Prowle, J. & Bellomo, R. Urine output and the diagnosis of acute kidney injury. in Annual Update in Intensive Care and Emergency Medicine 2012. 628–640 (Springer, 2012).
Ostermann, M. & Joannidis, M. Biomarkers for AKI improve clinical practice: No (2015).
Lin, J. et al. False-positive rate of AKI using consensus creatinine-based criteria. Clin. J. Am. Soc. Nephrol. 10, 1723–1731 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wlodzimirow, K. A. et al. A comparison of rifle with and without urine output criteria for acute kidney injury in critically ill patients. Crit. Care 16, R200 (2012).
Article PubMed PubMed Central Google Scholar
Koeze, J. et al. Incidence, timing and outcome of AKI in critically ill patients varies with the definition used and the addition of urine output criteria. BMC Nephrol. 18, 70 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pickkers, P. et al. The intensive care medicine agenda on acute kidney injury. Intensive Care Med. 43, 1198–1209 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ronco, C. Acute kidney injury: From clinical to molecular diagnosis. Crit. Care 20, 1–4 (2016).
Article Google Scholar
Zhou, J. et al. A comparison of rifle, akin, kdigo, and cys-c criteria for the definition of acute kidney injury in critically ill patients. Int. Urol. Nephrol. 48, 125–132 (2016).
Article CAS PubMed Google Scholar
Ratcliffe, S. J. et al. Geeqbox: A matlab toolbox for generalized estimating equations and quasi-least squares. J. Stat. Softw. 25, 1–14 (2008).
Article Google Scholar
Rabe-Hesketh, S. & Skrondal, A. Generalized linear mixed-effects models. Longitud. Data Anal. 79 (2008).
Johnson, A. E. et al. Mimic-iii, a freely accessible critical care database. Sci. Data 3, 1–9 (2016).
Article Google Scholar
Pollard, T. J. et al. The EICU collaborative research database, a freely available multi-center database for critical care research. Sci. Data 5, 180178 (2018).
Article PubMed PubMed Central Google Scholar
An, M., Ni, Y., Li, X. & Gao, Y. Effects of arginine vasopressin on the urine proteome in rats. PeerJ 5, e3350 (2017).
Article PubMed PubMed Central Google Scholar
Quan, S. et al. Prognostic implications of adding urine output to serum creatinine measurements for staging of acute kidney injury after major surgery: a cohort study. Nephrol. Dial. Transplant. 31, 2049–2056 (2016).
Article CAS PubMed Google Scholar
Priyanka, P. et al. The impact of acute kidney injury by serum creatinine or urine output criteria on major adverse kidney events in cardiac surgery patients. J. Thoracic Cardiovasc. Surg. (2020).
McCulloch, C. E. & Neuhaus, J. M. Generalized Linear Mixed Models (Statistics Reference Online, Wiley StatsRef, NY, 2014).
Google Scholar
Mayaud, L. et al. Dynamic data during hypotensive episode improves mortality predictions among patients with sepsis and hypotension. Crit. Care Med. 41, 954 (2013).
Article PubMed PubMed Central Google Scholar
Pencina, M. J., D’Agostino, R. B. Sr., D’Agostino, R. B. Jr. & Vasan, R. S. Evaluating the added predictive ability of a new marker: From area under the roc curve to reclassification and beyond. Stat. Med. 27, 157–172 (2008).
Pickering, J. W. & Endre, Z. H. Acute kidney injury urinary biomarker time-courses. PLoS One 9, e101288 (2014).
Article ADS PubMed PubMed Central Google Scholar
Citerio, G. et al. Year in review in intensive care medicine 2014: I. Cardiac dysfunction and cardiac arrest, ultrasound, neurocritical care, icu-acquired weakness, nutrition, acute kidney injury, and miscellaneous. Intensive Care Med. 41, 179–191 (2015).
Liu, A. Y. L., Wang, J., Nikam, M., Lai, B. C. & Yeoh, L. Y. Low, rather than high, body mass index is a risk factor for acute kidney injury in multiethnic Asian patients: A retrospective observational study. Int. J. Nephrol. 2018 (2018).
Di Iorio, B., Torraca, S., Gustaferro, P., Fazeli, G. & Heidland, A. High-frequency external muscle stimulation in acute kidney injury (AKI): Potential shortening of its clinical course. Clin. Nephrol. 79, S37–S45 (2013).
PubMed Google Scholar
Xavier, S., Goes, C., Bufarah, M., Balbi, A. & Ponce, D. Handgrip strength and weight predict long-term mortality in acute kidney injury patients. Clin. Nutrit. ESPEN 17, 86–91 (2017).
Article CAS Google Scholar
Ejaz, A. A. & Mohandas, R. Are diuretics harmful in the management of acute kidney injury?. Curr. Opin. Nephrol. Hypertens. 23, 155–160 (2014).
Article CAS PubMed Google Scholar
Kim, C. S. et al. Incidence, predictive factors, and clinical outcomes of acute kidney injury after gastric surgery for gastric cancer. PLoS One 8, e82289 (2013).
Article ADS PubMed PubMed Central Google Scholar
Okusa, M. D. & Davenport, A. Reading between the (guide) lines–the kdigo practice guideline on acute kidney injury in the individual patient. Kidney Int. 85, 39–48 (2014).
Article PubMed Google Scholar
Mehta, R. L. & Chertow, G. M. Acute renal failure definitions and classification: Time for change?. J. Am. Soc. Nephrol. 14, 2178–2187 (2003).
Article PubMed Google Scholar
Moore, E. et al. The impact of fluid balance on the detection, classification and outcome of acute kidney injury after cardiac surgery. J. Cardiothorac. Vasc. Anesth. 29, 1229–1235 (2015).
Article PubMed Google Scholar
Prowle, J. R. et al. Oliguria as predictive biomarker of acute kidney injury in critically ill patients. Crit. Care 15, R172 (2011).
Article PubMed PubMed Central Google Scholar
Macedo, E., Malhotra, R., Claure-Del Granado, R., Fedullo, P. & Mehta, R. L. Defining urine output criterion for acute kidney injury in critically ill patients. Nephrol. Dial. Transplant. 26, 509–515 (2011).
Article PubMed Google Scholar

Download references

Acknowledgements

The contents of this publication do not represent the views of the U.S. Department of Veterans Affairs or the United States Government.

Funding

This study was supported by the National Institute of Biomedical Imaging and Bioengineering grant R01 EB001659. This study was also partially supported by the National University of Singapore Start-up Grant R-608-000-172-133; and in part by the Center for Innovations in Quality, Effectiveness and Safety (CIN 13-413), Michael E. DeBakey VA Medical Center, Houston, TX; and the Gordon and Betty Moore Foundation.

Author information

These authors jointly supervised this work: Mengling Feng and Andrew J. Zimolzak.

Authors and Affiliations

Massachusetts Institute of Technology, Cambridge, MA, USA
Sahar Alkhairy, Leo A. Celi & Mengling Feng
Beth Israel Deaconess Medical Center, Boston, MA, USA
Leo A. Celi
Saw Swee Hock School of Public Health, National University Health System, National University of Singapore, Singapore, Singapore
Mengling Feng
Baylor College of Medicine, Houston, TX, USA
Andrew J. Zimolzak
Michael E. DeBakey VA Medical Center, Houston, TX, USA
Andrew J. Zimolzak

Authors

Sahar Alkhairy
View author publications
You can also search for this author in PubMed Google Scholar
Leo A. Celi
View author publications
You can also search for this author in PubMed Google Scholar
Mengling Feng
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Zimolzak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.A., A.J.Z., L.C., and M.F. conceived and designed the analysis, L.C. and A.J.Z. provided clinical insights, S.A. performed data extraction and processing, S.A. performed the analyses, S.A. and A.J.Z. Wrote the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Sahar Alkhairy.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this Article was revised: The original version of this Article contained an error in the Funding section. “This study was supported by the National Institute of Biomedical Imaging and Bioengineering grant R01 EB001659. This study was also partially supported by the National University of Singapore Start-up Grant R-608-000-172-133” now reads: “This study was supported by the National Institute of Biomedical Imaging and Bioengineering grant R01 EB001659. This study was also partially supported by the National University of Singapore Start-up Grant R-608-000-172-133; and in part by the Center for Innovations in Quality, Effectiveness and Safety (CIN 13-413), Michael E. DeBakey VA Medical Center, Houston, TX; and the Gordon and Betty Moore Foundation.”

Supplementary Information

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Alkhairy, S., Celi, L.A., Feng, M. et al. Acute kidney injury detection using refined and physiological-feature augmented urine output. Sci Rep 11, 19561 (2021). https://doi.org/10.1038/s41598-021-97735-0

Download citation

Received: 17 February 2021
Accepted: 18 June 2021
Published: 01 October 2021
DOI: https://doi.org/10.1038/s41598-021-97735-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Development of a prediction score for in-hospital mortality in COVID-19 patients with acute kidney injury: a machine learning approach

Machine-learning model for predicting oliguria in critically ill patients

Machine learning algorithm to predict mortality in critically ill patients with sepsis-associated acute kidney injury

Introduction

Methods

Data set and feature extraction

Pre-processing and inclusion/exclusion criteria

Baseline symmetry and time/volume refinement

Multivariable modeling

Model variables

Net reclassification index

Use of experimental animals, and human participants

Results

Discussion

Conclusion

Data availability

Change history

09 November 2021

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Table S1.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links