Introduction

There is wide practice variation in the application of transfusion guidelines and the indications used to transfuse packed red blood cell (pRBC) in the Neonatal Intensive Care Unit (NICU). Despite this variation, the vast majority of extremely low gestational age neonates (ELGANs, <28 weeks’ gestation) receive at least 1 pRBC transfusion, with the median number ranging from 3 to 8 during their hospitalization, the majority of which occur in the first month after birth.1,2

A better understanding of the potential risks and benefits associated with pRBC transfusions is needed to help guide appropriate transfusion practices in the NICU. Potential short-term risks include infection, inflammation, transfusion-related lung injury, and transfusion-related necrotizing enterocolitis.3,4,5,6,7,8 A potential long-term risk associated with pRBC transfusion includes neurodevelopmental (ND) modulation. In a previous study evaluating erythropoiesis-stimulating agents (ESAs) that utilized a restrictive pRBC transfusion strategy, cognitive scores at 18–22 months were inversely correlated with transfusion volume.9 However, the long-term effect of anemia vs. pRBC transfusion on ND is unclear, as reports are few and conflicting.10,11,12 The recently published “Effects of Transfusion Thresholds on Neurocognitive Outcome of Extremely Low Birth-Weight (ELBW) Infants (ETTNO) Study13,14 (NCT01393496) showed no difference in outcomes between liberal and restrictive transfusion approaches (N-1013). The ongoing Transfusion of Prematures (TOP) Trial (NCT01702805) will shed further light on this question. However, neither the ETTNO or TOP study addresses the interaction of transfusion with the use of ESAs, and the ETTNO study did not compare outcomes of those infants who received transfusions with those who were transfusion-free.

Using data from the Preterm Erythropoietin Neuroprotection Trial (PENUT; NCT01378273), we now investigate the relationship between the number and volume of transfusions given to ELGANs and their ND outcome at 22–24 months of age. In addition, after adjusting for covariates, we evaluate whether infants who remain transfusion-free have better cognitive outcomes than ELGANS who receive ≥1 pRBC transfusions.

Methods

Study design and population

The PENUT Trial was a multi-center, randomized, placebo-controlled, double masked study examining the use of erythropoietin (Epo) for neuroprotection in infants born between 24-0/7 and 27-6/7 weeks of gestation. A total of 941 infants at 19 sites across the United States were enrolled between December 2013 and September 2016. Parental consent, enrollment, and initial treatment with study drug occurred within 24 h of birth. Subjects received Epo 1000 U/kg/dose or placebo intravenously every 48 h for 6 doses, followed by Epo 400 U/kg/dose by subcutaneous injection or sham (no actual needle stick) 3 times a week until the subjects completed 32 weeks postmenstrual age (PMA). An investigational New Drug (IND) application was approved for the PENUT Trial by the Food and Drug Administration (IND#12656). The PENUT Trial was registered in ClinicalTrials.gov (NCT01378273) and was approved by each site’s Institutional Review Board. Details of eligibility, enrollment, and randomization have been published elsewhere.15,16 PENUT was powered for the primary outcome of death or ND impairment (NDI) at 2 years. This paper focuses on randomized infants who survived and underwent 2-year ND assessment in compliance with protocol.

Guidelines for iron supplementation were included in the study protocol. When feedings were initiated, a standard iron-containing formula was used if breast milk was unavailable. When subjects reached an enteral intake of 60 mL/kg/day and were at least 7 days of age, 3 mg/kg/day enteral iron was recommended. Iron dosing was increased to 6 mg/kg/day when infants reached an enteral intake of 100 to 120 mL/kg/day. To assess infant iron status, serum ferritin or zinc protoporphyrin-to-heme ratio17 was to be measured at 14 and 42 days, and iron dosing increased or decreased accordingly. Infants who were not able to tolerate enteral feedings were given maintenance iron parenterally (3 mg/kg/week of iron dextran or iron sucrose, dose adjusted based on iron indices).

As there are no consensus recommendations for pRBC transfusions in critically ill neonates, each site followed their own pRBC transfusion protocol and procedures. We documented the formal transfusion guidelines for sites when available. Transfusion records were considered complete it they included both volume and donor information. Information from infants with no transfusion was also included in the analysis.

In addition to clinical characteristics and demographic information, pRBC transfusion number, volume, and donor exposures were recorded for each subject. Infants without complete 2-year developmental follow-up assessment were excluded from these analyses.

Outcomes

The primary ND outcome of interest was the Bayley Scales of Infant Development Third Edition (BSID-III) evaluated at 22–26 months, which has 3 composite components: cognitive, motor, and language scores. Secondary outcomes included severe NDI, defined as the presence of any one of the following: severe cerebral palsy (CP), (defined as Gross Motor Function Classification System (GMFCS) >2), BSID-III cognitive score <70, or BSID-III motor score <70; and moderate-to-severe NDI, defined as the presence of any one of the following: moderate-to-severe CP (GMFCS ≥2), BSID-III cognitive score <85, or BSID-III motor score <85.15

Statistical analysis

Our post hoc statistical analyses included randomized infants who received at least the first dose of study treatment, survived, and underwent 2-year ND assessment. All analyses accounted for potential sibship correlation by using Generalized Estimating Equations (GEE)18 with robust standard errors.

The primary goal was to evaluate the associations between mean BSID-III component scores and three transfusion variables: the total number of pRBC transfusions, the cumulative transfusion volume, and the number of unique donor exposures. These associations were examined among all participants using GEE models adjusted for treatment group, gestational age at birth (grouped as 24/25-week or 26/27 weeks), recruitment site, and covariates that potentially impact receipt of transfusions. The potential confounding covariates included birth weight, Apgar score at 5 min, suspected or confirmed chorioamnionitis, and the use of ventilator for respiratory support at enrollment. The associations between BSID-III scores and transfusion exposures were further assessed within the 2 treatment arms using GEE models adjusted for gestational age group, recruitment site, and the covariates considered as potential confounders.

In secondary analyses, we used similar GEE models to evaluate associations between BSID-III scores and transfusion status (i.e., transfusions yes/no) for all participants and within each treatment arm. We also evaluated associations between the secondary outcomes of interest and transfusion exposures or transfusion status among all participants using GEE loglinear models, adjusting for treatment group, and gestational age group. Associations were again stratified based on treatment group. We did not adjust for recruitment site and other confounding covariates at baseline in these analyses due to low count of events relative to a large number of sites and the number of covariates.

In sensitivity analyses, we assessed the primary associations between BSID-III component scores and pRBC exposures for subgroups defined by sex or gestational age (24, 25, 26, and 27 weeks).

All statistical tests were two sided and statistical significance was set at level 0.05. All analyses were post hoc and not adjusted for multiple testing. Statistical analyses were performed using the R statistical software package (version 3.5.1, Vienna, Austria).19

Results

The 628 infants (315 placebo and 313 Epo treated) who survived and had 2-year ND outcomes collected per protocol and who were included in these analyses comprised 67% of the total 941 subjects enrolled in the PENUT Trial. There were 2711 complete pRBC transfusion records available across all neonates. Screening, inclusion, and available analysis data are summarized as a CONSORT diagram in Fig. 1. Maternal demographics and neonatal descriptors are shown in Table 1. Of the 628 infants, 29 (9%) in the placebo group and 87 (28%) in the Epo group remained transfusion-free (p < 0.01). The 315 infants in the placebo group received a mean of 5.4 (SD = 4.8) transfusions and 83.7 (SD = 87.6) mL cumulative pRBC volume from 2.4 (SD = 2.2) unique donors. In comparison, the 313 infants in the Epo group received a mean of 3.2 (SD = 3.7) transfusions and 46.6 (SD = 61.8) mL cumulative pRBC volume from 1.6 (SD = 1.9) unique donors. Infants with missing data on either outcomes or any of the adjusted potential confounding covariates (birth weight, Apgar score at 5 min, suspected or confirmed chorioamnionitis, and the use of ventilator for respiratory support at enrollment), as described in Table 1 footnotes, were excluded from statistical analyses.

Fig. 1: CONSORT diagram.
figure 1

This figure shows the infants screened, enrolled, randomized to Epo or placebo, and transfusion records analyzed.

Table 1 Maternal demographics and neonatal data of the 628 included infants.

There was substantial variability in transfusion practice by site. Of the 19 participating sites, 14 (74%) used formal transfusion guidelines. An additional two sites had general guidelines that included suggested volumes for transfusions and standard storage solutions for pRBCs. For critically ill infants requiring mechanical ventilation, hematocrit triggers for transfusion ranged from 28 to 45%. For stable infants, hematocrit triggers ranged from 21 to 25%.

Results of the primary analysis for the associations between transfusion exposures (number of pRBC transfusions, cumulative volume, and donor exposure) and BSID-III scores (cognitive, motor, and language components) are given in Fig. 2. Accounting for potential sibship correlations, treatment effect, site, gestational age, and other potential confounding variables, each additional transfusion received was associated with: a decrease in mean cognitive score of 0.96 (95% confidence interval (CI) of [−1.34, −0.57], p < 0.01); a decrease in mean motor score by 1.51 (95% CI of [−1.91, −1.12], p < 0.01); and a decrease in mean language score by 1.10 (95% CI of [−1.54, −0.66], p < 0.01). Statistically significant and negative associations were observed for analysis of cumulative volume as well as donor exposure. When stratified by treatment arm, the associations remained statistically significant and negative for all analyses among the placebo infants. For those randomized to Epo, the estimated effect sizes were all negative; however, only the associations between motor score and transfusion exposures were statistically significant.

Fig. 2: Primary analysis of BSID-III component scores and pRBC transfusion exposures.
figure 2

The overall associations between mean BSID-III component scores (cognitive, motor, and language) and pRBC transfusion exposures (number of transfusions, cumulative volume of transfusion, and donor exposure) were examined using GEE models clustering on same-birth siblings, adjusted for fixed effects of treatment group, gestational age group, recruitment site, and other potential confounders at baseline. These associations were also evaluated for each treatment group using similar GEE models. The forest plot shows the estimated effect sizes of transfusion exposures on mean BSID-III scores and corresponding 95% CIs.

As a secondary analysis, we examined the associations between BSID-III scores and transfusion status, with the results shown in Fig. 3. Among all infants and in either treatment arm, transfusion-free infants consistently had higher mean scores across all BSID-III components. While the results were statistically significant when considering all subjects, only the motor component for Epo-treated infants was significant when treatment subgroups were examined separately.

Fig. 3: BSID-III component scores and any pRBC transfusion.
figure 3

The distributions of BSID-III a cognitive scores, b motor scores, and c language scores were summarized for those who were transfusion-free and who had ≥1 pRBC transfusion, overall and by treatment group. The overall associations among all participants were examined using GEE models clustering on same-birth siblings, adjusted for fixed effects of treatment group, gestational age group, recruitment site, and other potential confounding variables at baseline. These associations were also evaluated for each treatment group using similar GEE models.

We also evaluated the relationships between transfusion exposures and other ND outcomes, which included severe NDI and its components (severe CP, BSID-III cognitive <70, BSID-III motor <70) and moderate-to-severe NDI and its components (moderate-to-severe CP, BSID-III cognitive <85, BSID-III motor <85). The results are presented in Fig. 4. The relative risks (RRs) of an adverse ND outcome were higher among those receiving more transfusions, more cumulative volume, or transfusions from more unique donors. Adjusted for potential confounding covariates, these associations were all statistically significant for all subjects and those in the placebo group. The results were also significant for those in the Epo group, except for moderate-to-severe CP and BSID-III cognitive <85.

Fig. 4: Secondary ND outcomes and pRBC transfusion exposures.
figure 4

The overall associations between secondary ND outcomes and pRBC transfusion exposures were examined using GEE loglinear models clustering on same-birth siblings, adjusted for fixed effects of treatment group, gestational age group, and other potential confounding covariates at baseline. These associations were also evaluated for each treatment group using similar GEE models. The forest plot shows the estimated effect sizes of transfusion exposures on the RR of ND outcomes and corresponding 95% CIs. RRs >1.0 indicate that exposure to pRBC transfusions is associated with higher risk of worse ND outcomes.

Table 2 shows the results for the analysis of adverse ND outcomes and transfusion status. The RRs of an adverse ND outcome were estimated to be higher among those who had any transfusion; however, most of these results did not reach statistical significance. The results of sensitivity analyses using subgroups defined by gestational age or sex are given in Supplemental Table S1 (online). Variability in the associations between BSID-III components and transfusion exposures was seen across different subgroups.

Table 2 Secondary analysis of ND outcomes and any pRBC transfusion.

Conclusions and discussions

In this post hoc analysis of 628 ELGAN infants from the PENUT Trial who underwent ND assessment at 22–26 months PMA, we found significantly negative associations between BSID-III score and pRBC transfusions, with BSID-III scores progressively decreasing as a function of number of transfusions, cumulative volume of transfusions, and number of donor exposures. These findings were consistent for cognitive, motor, and language components of the BSID III. Similarly, when subjects were dichotomized into those who received no pRBC transfusions or those with ≥1 transfusions, lower BSID-III scores were seen in those who had been exposed to pRBC transfusions. Further, the odds of severe or moderate and severe NDI were higher in those with increased transfusion exposure (increased number of transfusions, increased cumulative volume of transfusions, or increased donor exposure) when the cohort was examined as a whole. The association between ND assessment and transfusions was less robust in the Epo-treated arm, with several analyses in the Epo-treated arm not reaching statistical significance. Infants who received transfusions despite being on Epo might have been sicker and therefore unable to respond as robustly to the erythropoietic effect of Epo.

We previously reported cognitive results of preterm survivors enrolled in a randomized controlled trial of ESAs.9 Cognitive scores at 18–22 months PMA were inversely correlated with transfusion volume; among infants receiving at least one transfusion, cognitive scores were significantly higher in the ESA-treated group (n = 80). Although not statistically significant, transfusion number and volume trended toward an inverse correlation to cognitive scores at 3.5–4 years. Scores were higher in the ESA than in the placebo group. Our current, larger study corroborates and expands on these findings.

There remains great longstanding uncertainty regarding appropriate management of anemia in neonates. The potential harm of extreme anemia must be balanced with the potential benefit or harm of transfusions. Severe anemia has been associated with apnea,20 poor growth,21 compromised cardiac function,22 increased mortality,23 and potentially, poor ND.9 What constitutes “severe anemia” for the neonatal population, however, is not well defined.24 In contrast, in the adult literature, the recommended transfusion trigger has fallen to a hematocrit of 21% (hemoglobin of 7) as the adverse effects of transfusion have become more apparent.25,26 Neonatal transfusions have been associated with many adverse acute effects, including inflammation,17 extension of intraventricular hemorrhage (IVH),27 bronchopulmonary dysplasia,28 retinopathy of prematurity,29 necrotizing enterocolitis,29,30 and death.31 In addition to the short-term effects, the long-term ND effects are also of concern. The randomized controlled trials of liberal vs. stringent transfusion guidelines showed no clear benefit or harm to either approach.14,20,32 In a post hoc analysis, the Bell study (N = 100) showed an increase in severe brain injury (grade-IV IVH or periventricular leukomalacia) in the restrictive group (6 vs. 0). However, up to 2 transfusions were allowed prior to enrollment, the median day of enrollment was 3 days of age, and cranial ultrasounds were not done prior to enrollment, so it is not clear whether the findings reflect the transfusion approach.32 Follow-up of 44 of the 100 infants enrolled showed cerebral white matter to be substantially reduced in both preterm groups at 12 years of age, but more so for the liberal transfusion group.12,33 The Premature Infants in Need of Transfusion study (N = 451) did not show a difference in acute brain injury, and follow-up of these subjects at 18–20 months corrected age showed no statistical difference in ND outcomes.10 The ETTNO trial14 (N = 1013) showed no difference in ND outcome measured by BSID-II at 24 months corrected age; however, they did not compare children who received transfusions with those who remained transfusion-free, nor did they evaluate the effect of increasing numbers of transfusions. None of these studies evaluated ESAs in their treatment arms.

The mechanisms by which pRBC transfusions affect development are unclear. Possible explanations include the promotion of the inflammatory cascade,4,17,34 suppression of endogenous Epo, pro-oxidant injury, and ferroptosis.35 In this study, the relationship between adverse ND outcomes and transfusion exposure was less robust in the Epo-treated subjects. This may be due to a wider confidence interval in the Epo group, as these subjects received fewer transfusions, which leads to an even smaller number of infants with adverse outcomes and who had transfusions in this group. Alternatively, as has been shown in preclinical studies,36 Epo may interrupt or modulate the potential injurious effects of pRBC transfusion.

The primary limitation of this study is that given its post hoc observational nature, it is difficult to ascertain whether increased transfusions are causative of adverse outcomes or rather are merely markers of clinical illness that dictate worse outcomes. We attempted to account for this by adjusting for markers of illness severity, including gestational age, birth weight, 5-min Apgar, chorioamnionitis, and mechanical ventilator use. Additional limitations include the lack of standardized transfusion guidelines (though variation in practice permitted the necessary variability needed to compare the effects of different transfusion exposures) and the absence of data on phlebotomy losses for each infant. Despite these limitations, the PENUT Trial is a well-designed and executed randomized trial with a large cohort of well-characterized participants, which permits exploratory or post hoc analyses such as ours. Our study emphasizes the urgent need to shed further light on this clinically important topic of transfusions and ND, given the frequency at which ELGANs are transfused. This important gap in knowledge will be partially informed by results from the ETTNO and TOP trials, exclusive of the effects of ESAs. The Darbepoetin Trial to Improve Red Cell Mass and Neuroprotection in Preterm Infants (NCT 03169881), will inform this issue, as it does involve an ESA.

We conclude that transfusions in preterm infants may contribute to poor ND outcomes measured at 2 years of age. Strategies to minimize the need for pRBC transfusions, such as delayed cord clamping, judicious phlebotomy practices, appropriate iron supplementation, adoption of transfusion guidelines,37 and use of ESAs, should be encouraged in order to improve long-term ND in critically ill, ELGANs. These children should be followed for a longer term to determine whether the effects of transfusions observed at 2 years of age persist into school age and beyond.