Main

Skin color is considered to be an essential aspect in the physical examination of the newborn (1). However, its evaluation is neither accurate nor reproducible, being subject to individual interpretation and several variables other than pathology. Color is also a complex concept to describe and communicate. Besides jaundice (2), subjective skin color changes are known to be associated with several physiopathologic conditions of the newborn, including hypoxia (3), infection (4), anemia (5), and stress response (6).

In common practice it has long been believed that skin color in the neonate can indicate the degree of severity of illness, although this hypothesis has not been rigorously tested. Nowadays, instruments are available that are easy to use, portable, of relatively low cost, and able to accurately measure color. In the present study we have analyzed the hypothesis that objective skin color evaluation in the first day of admission can be used to predict illness severity in high-risk newborn infants.

METHODS

Subjects.

The study was conducted in a level III NICU and was approved by the Institutional Review Board, with informed consent of the parents. One hundred seven randomly selected (random algorithm; Minitab release 9.2 statistical software package; Minitab, Inc., State College, PA, U.S.A.), newborn white infants (47 males, 60 females) admitted to the NICU were enrolled. Exclusion criteria were known skin disease; infants born to mothers with chronic severe anemia or intrapartum hemorrhage; gestational diabetes or other endocrine disorders; illicit drug abuse; maternal intrapartum fever; or fetal hemolysis. Cases of discharge or death at <24 h after admission to the NICU were also excluded. To evaluate the possible risk of infection related to the use of an instrument to be shared among different neonates, the rate of nosocomial infections was recorded and compared with that of a similar period in the same NICU. Nosocomial infections were defined according to the National Nosocomial Infections Surveillance System (7). Biologic data, demographic information, and clinical outcomes, independently obtained by two neonatologists who were unaware of the results of the color recordings, were prospectively recorded.

Evaluation of illness severity.

The newborns were categorized as either high or low severity of illness, defined by death at age <28 d or severe neonatal morbidity, as determined by the presence of at least one of the following: sepsis or pneumonia; bronchopulmonary dysplasia; intraventricular hemorrhage ≥3; periventricular leukomalacia ≥2; retinopathy of prematurity ≥3; and necrotizing enterocolitis. Sepsis and pneumonia were defined as described (8). Bronchopulmonary dysplasia was defined by the need for supplemental oxygen after the first 28 d of life, in association with respiratory signs and typical radiographic findings (9). Necrotizing enterocolitis (10) and retinopathy of prematurity (11) (grades 1–5) were determined according to internationally recognized criteria. Infants were assessed for intraventricular hemorrhage (grades 1–4) and periventricular leukomalacia (grades 1–3), which were classified according to standard systems (12).

In addition to the clinical assessment, illness severity at 24 h was evaluated using the SNAP, which records deviations from the normal range of 34 routinely available vital signs and laboratory test results (13). Therapeutic intensity was also calculated as described (14).

Quantitative color measurements.

Skin color was measured with a Chroma Meter CR-300 (Minolta, Osaka, Japan), a portable tristimulus colorimeter with a standard D65illuminant (correlated color temperature of 6504 K) and 2° angle supplementary standard observer, after calibration against standard white before each series of measurements (mean percentage of deviation from calibration standard, L*, ±1.23%; a*, ±0.29%; and b*, ±0.29% points). Colorimetry measures colors with quantitative physical methods and can define them within well-established numerical values (15). Here they are expressed using the standard CIE L*a*b* system (16). L*, a*, and b* values (nondimensional units) refer to the three axes of the system: a lightness axis (white-black; L*); and two axes representing both hue and chroma, one red-green (a*) and the other blue-yellow (b*). This system provides an unambiguous description of color and has the advantage that color differences between samples can be determined using simple computer programs (15). To minimize bias caused by intra- or interobserver variability, readings were performed independently by two experienced dermatologists, unaware of the clinical findings, each of whom made two readings at 10 body sites on one arbitrarily chosen side of the body (forehead, cheek, forearm, palm, upper chest, abdomen, back, buttock, leg, and sole), at three different times: time 0 (admission to the NICU) and +12 and +24 h. Readings were not taken either during the first 10 min after birth or during apnea or seizures, to exclude the effects of immediate early postnatal adaptation on the infants' skin color (3). Vernix caseosa was carefully removed. In cases in which phototherapy was being used, an interval of at least 30 min was used before performing the color measurements. Color readings were not taken in skin areas with common benign ephemeral cutaneous lesions and were taken (as well as skin temperature at each site) while the infants were either awake (not crying) or asleep. Environmental light, temperature, and relative humidity conditions were maintained constant inside the NICU and open radiant warmer beds or thermoregulated incubators.

The colorimeter probe was disinfected with a chlorhexidine 1:10,000 solution each time. To account for possible influences of inheritable characteristics on the infant skin color, the CIE L*a*b* values were obtained for the parents' constitutional skin (buttock).

Data analysis.

Data are expressed as mean ± SD for continuous normally distributed data and medians with interquartile range (25th and 75th percentiles) for nonnormal distributions. The t test, Mann-Whitney U test, and χ2test or Fisher's exact test were used to compare continuous normally distributed data, nonparametric continuous data, and categorical data, respectively. Correlation analysis between the CIE L*a*b* values and SNAP scores was performed with the use of Spearman's rho (rs), as variables were not normally distributed. Predictive accuracy for identifying newborns with severe neonatal morbidity or adverse outcome for different cutoff values of L*a*b* at different anatomic sites, was calculated using ROC curve analysis (17). The predictive accuracy of color readings was compared with that of SNAP. Model calibration was evaluated using χ2to compare the observed values (according to the classification of the infants into the two illness severity groups) with the expected values (according to the CIE L*a*b* values). The effects of potential confounders on the relationship between colorimetric variables from different body sites and the study end point (illness severity) were examined using multivariable logistic regression, and factors with p values of <0.25 at univariate analysis were included in the models (18). Statistica software (StatSoft, Tulsa, OK, U.S.A.), SPSS release 6.1 statistical package (SPSS Inc., Chicago, IL, U.S.A.), and MedCalc software (MedCalc Software, Mariakerke, Belgium) were used. A two-sided p value of <0.05 was considered to be statistically significant, and the Bonferroni corrected significance levels were used for multiple t tests.

RESULTS

Study population characteristics.

The demographic and clinical characteristics of the infants examined are shown in Table 1. Median postnatal age on admission to the NICU was 0.10 h (interquartile range, 0.06–0.25 h). SNAP scores ranged from 0 to 42. During the examination period, there was no significant increase in the rate of nosocomial infections, which could have been attributed to the introduction of an additional instrument (3.6 versus 4.7 infections/1000 patient-d, p = 1.0).

Table 1 Demographic and clinical characteristics of the neonatal study group

According to the predefined criteria, 50 neonates (46.7%; 21 males, 29 females) were allocated to the high severity group and 57 (26 males, 31 females) to the low severity group (Table 2). Male to female ratio, gestational age, and birth weight were not different between the two severity groups. There was a significant difference between the arterial blood pH, arterial Po2, and the Po2 to fraction inspired oxygen ratio values between the groups, which were lower in the high severity group, and SNAP scores, which were higher (p < 0.0001). No significant differences between concentrations of Hb and total bilirubin peak values were observed (p ≥ 0.11). In the high severity group there were a greater number of infants requiring endotracheal intubation, assisted ventilation, oxygen supplementation, exogenous surfactant administration, use of vasopressor agents, and prolonged stay in the NICU (data not shown, p ≤ 0.01). No significant differences in the CIE L*a*b* values for the parents' constitutional skin were observed between the two groups (data not shown, p ≥ 0.68).

Table 2 Comparison of relevant clinical characteristics of two illness severity categories of newborns

Colorimetry and illness severity.

The total number of readings per observer was 3210 (107 infants × 3 × 10 sites), and median time needed for each was 9.2 s (interquartile range, 8.8–9.4 s;n = 50). Intra- and interobserver coefficients of variations were between 3.5% (L*, 2.0%; a*, 3.5%; b*, 1.0%) and 8% (L*, 3.5%; a*, 8.0%; b*, 2.5%), respectively. The median skin temperature of the examined sites was 35°C (interquartile range, 34–35.5°C).

In Table 3, the L*a*b* values for each measurement site are shown (only the values for time 0 are included, as those for the successive time measurements were not significantly different;p ≥ 0.9). Median L* values ranged from 53.58 (palm) to 60.48 (cheek), a* ranged from 13.67 (abdomen) to 20.09 (sole), and b* ranged from 10.71 (sole) to 18.96 (cheek). CIE b* values were significantly different between the two severity groups (p < 0.0001) (Table 4). This applied to all the sites. However, for the L* and a* values, significant differences were observed at fewer sites. Moreover, a significant inverse correlation between the b* values at each site and SNAP was found:rs values ranging from −0.37 (sole) to −0.71 (back) (all p values <0.0001;Table 5). To a lesser degree, the a* component is also inversely correlated with the SNAP score, rs ranging from −0.04 (palm) to −0.43 (buttock), whereas the L* values were unrelated to SNAP, with the exception of the chest and leg recording sites. There was no significant correlation between the L*a*b* values and skin temperature, gestational age, birth weight, mode of delivery, Apgar score, sex, and skin colorimetry values of the parents (rs [absolute value] ≤ 0.08;p ≥ 0.70).

Table 3 Skin colorimetry results, expressed as CIE L*a*b* values, in the entire population of infants examined
Table 4 Comparison of skin colorimetric analyses of two illness severity categories of newborns at time 0
Table 5 Spearman rank correlations between skin colorimetry at time 0 and SNAP values

The results of the analysis of predictive accuracy of a low b* value are shown in Table 6: all average area-under-the-curve values were significantly different from 0.5, ranging from 0.917 (sole) to 0.994 (back). The most sensitive individual site was found to be the forearm (96.3%), whereas the back showed the highest specificity (96.9%). The results of an ROC curve comparison analysis indicated that values from the abdomen, back, chest, and leg were not significantly different from SNAP (p > 0.25). Overall, the abdomen b* value showed the highest predictive accuracy (abdomen, positive predictive value 95.98%, negative predictive value 93.7%; back, positive predictive value 92.57%, negative predictive value 98.24%), with adequate model calibration (χ2= 0.02, df = 1, p = 0.89). After controlling for potential confounders, a low b* value for the abdomen was found to be significantly associated with illness severity (adjusted odds ratio, 14.7; 95% confidence interval, 6.4 to 33.8).

Table 6 Predictive accuracy of multiple anatomic site CIE b* and SNAP values i identifying high severity newborn infants: results of ROC curve analyses

DISCUSSION

A relationship between skin color and illness severity in the newborn is commonly acknowledged but until now has not been validated. The results of the present study indicate that skin color during the first 24 h from admission reflects illness severity in high-risk newborns and that colorimetry can accurately predict initial clinical severity and adverse outcome.

Colorimetry can provide an unambiguous description of color hue, and the technique is not affected by the factors typically associated with subjective interpretation, such as environmental light source, background contrast, and observer-related differences. To date, medical applications of colorimetry are limited to the assessment of skin microcirculation (19) and pigmentary changes (20). With the sole exception of transcutaneous bilirubinometry (2), little information is currently available regarding quantitative color data in the newborn (2124).

Using the CIE L*a*b* system, colorimetry describes color using three values: L*, a*, and b*. Our study indicates that the b* value has the highest predictive efficiency, whereas L* and a* appear to be of limited predictive relevance. In particular, a low b* and, to a lesser extent, low a* readings (but not L*) have been shown to be strongly associated with high 24-h SNAP scores. The observed color changes in the examined newborns were independent of parental constitutional skin color. They were present at each of the anatomic sites tested and at each serial time measurement, which means that these color measurements could be determined using a single site. Although the values at particular anatomic sites can be dependent on each other, of all the sites examined, the values for the abdomen had the greatest predictive efficiency with adequate calibration. Values recorded at time 0 have been shown to be fully informative for the evaluation of the clinical status of the newborn, whereas measurements at successive times did not add further significant information.

Several neonatal severity scoring systems have been developed (13, 2530) for predicting mortality, morbidity, or long-term sequelae and comparing NICU performance (29). The Apgar score (3) is the only commonly used severity scoring method that includes color, but its use is restricted to the immediate postnatal adaptation period and evaluation is subjective and gives limited information. As an independent illness severity predictor, skin colorimetry offers important positive aspects, including 1) independence of other outcome predictions, notably gestational age and birth weight;2) correlation with clinical outcome and a standard severity score (SNAP);3) absence of invasiveness;4) possible application at time 0 and immediacy of results; and 5) possible usefulness in different clinical contexts. The importance of noninvasive procedures in neonatology is obvious, as blood sampling for diagnostic tests is a major cause of anemia in premature infants (31) and there is increasing awareness concerning possible long-term effects of pain in neonates in intensive care (32). Moreover, colorimetry does not interfere with the usual neonatologic procedures. Our findings also indicate that skin colorimetry is safe and suitable for use in the NICU.

Possible limitations to the present study may be mainly related to the population size examined, but in our statistical scenario differences were highly significant. Besides, the population examined were all white infants, but preliminary observations of high-risk newborns in a multiracial setting indicate that a significant inverse association between b* value and clinical severity persists, as the difference between the skin colorimetry values of those newborns and the currently examined study group appears to reside in the L* value. In addition, a falsely elevated predictive performance of skin color readings at admission may arise when mortality and morbidity are combined; further research is needed to assess the value of skin colorimetry for identifying the risk of each independent morbidity.

The physicochemical mechanisms underlying the low b* value in severely ill neonates remain unclear. Peripheral desaturation of Hb and poor perfusion may be partly responsible for these changes, because Hb saturation and concentration modulate the apparent skin color response mainly in the blue region of the visible spectrum, a wavelength range that significantly contributes to the b* component of the CIE L*a*b* color space (33).

In conclusion, skin color remains a rather undervalued measurement for diagnostic, prognostic, and therapeutic purposes for the newborn infant. Nevertheless, our findings suggest that skin colorimetry is an objective, noninvasive, quick and easy-to-operate, reproducible, low-cost, and accurate means of predicting clinical severity in the high-risk newborn.