Development and Validation of a Prediction Rule for Growth Hormone Deficiency Without Need for Pharmacological Stimulation Tests in Children With Risk Factors

Clément, Florencia; Grinspon, Romina P.; Yankelevich, Daniel; Martín Benítez, Sabrina; De La Ossa Salgado, María Carolina; Ropelato, María Gabriela; Ballerini, María Gabriela; Keselman, Ana C.; Braslavsky, Débora; Pennisi, Patricia; Bergadá, Ignacio; Finkielstain, Gabriela P.; Rey, Rodolfo A.

doi:10.3389/fendo.2020.624684

ORIGINAL RESEARCH article

Front. Endocrinol., 03 February 2021

Sec. Pediatric Endocrinology

Volume 11 - 2020 | https://doi.org/10.3389/fendo.2020.624684

Development and Validation of a Prediction Rule for Growth Hormone Deficiency Without Need for Pharmacological Stimulation Tests in Children With Risk Factors

Florencia Clément¹

Romina P. Grinspon^1†

Daniel Yankelevich^2,3

Sabrina Martín Benítez⁴

María Carolina De La Ossa Salgado¹

María Gabriela Ropelato¹

María Gabriela Ballerini¹

Ana C. Keselman^1†

Débora Braslavsky^1†

Patricia Pennisi^1†

Ignacio Bergadá^1†

Gabriela P. Finkielstain^1,5

Rodolfo A. Rey^1*†

¹Centro de Investigaciones Endocrinológicas “Dr César Bergadá” (CEDIE), CONICET-FEI-División de Endocrinología, Hospital de Niños Ricardo Gutiérrez, Buenos Aires, Argentina
²Practia S.A., Buenos Aires, Argentina
³Fundación para el Desarrollo Argentino (FUNDAR), Buenos Aires, Argentina
⁴Hospital Dr Humberto J. Notti, Mendoza, Argentina
⁵Takeda Pharma, Buenos Aires, Argentina

Introduction: Practice guidelines cannot recommend establishing a diagnosis of growth hormone deficiency (GHD) without performing growth hormone stimulation tests (GHST) in children with risk factors, due to the lack of sufficient evidence.

Objective: Our goal was to generate an evidence-based prediction rule to diagnose GHD in children with growth failure and clinically identifiable risk factors.

Methods: We studied a cohort of children with growth failure to build the prediction model, and a second, independent cohort to validate the prediction rule. To this end, we assessed the existence of: pituitary dysgenesis, midline abnormalities, (supra)sellar tumor/surgery, CNS infection, traumatic brain injury, cranial radiotherapy, chemotherapy, genetic GHD, pituitary hormone deficiencies, and neonatal hypoglycemia, cholestasis, or hypogenitalism. Selection of variables for model building was performed using artificial intelligence protocols. Specificity of the prediction rule was the main outcome measure in the validation set.

Results: In the first cohort (n=770), the resulting prediction rule stated that a patient would have GHD if (s)he had: pituitary dysgenesis, or two or more anterior pituitary deficiencies, or one anterior pituitary deficiency plus: neonatal hypoglycemia or hypogenitalism, or diabetes insipidus, or midline abnormalities, or (supra)sellar tumor/surgery, or cranial radiotherapy ≥18 Gy. In the validation cohort (n=161), the specificity of the prediction rule was 99.2% (95% CI: 95.6–100%).

Conclusions: This clinical rule predicts the existence of GHD with high specificity in children with growth disorders and clinically identifiable risk factors, thus providing compelling evidence to recommend that GHD can be safely diagnosed without recurring to GHST in neonates and children with growth failure and specific comorbidities.

Introduction

Growth is a good indicator of a child’s health, and growth failure prompts the pediatrician to search for nutrition disorders, subclinical chronic diseases, or hormone deficiencies. Growth hormone deficiency (GHD) is characterized by the insufficient production of growth hormone (GH), which leads to deficient insulin-like growth factor 1 (IGF1) synthesis and secretion leading to growth failure in children. An accurate diagnosis is crucial for timely initiation of treatment in order to optimize child growth and adult height and to avoid co-morbidities resulting in impaired quality of life (1).

Auxologic evaluation lies at the basis of the diagnosis of GHD in children (2–4), and the Growth Hormone Research Society clearly defined in its consensus guidelines released in year 2000 the clinical criteria that should prompt immediate investigation of GHD in childhood and adolescence (5). Many national endocrine societies have set up procedures to diagnose GHD, and the health authorities of several countries have established national or regional boards that review and monitor GH prescriptions (2, 6). Multiple GH stimulation tests (GHSTs) have been designed to evaluate GH sufficiency in children in an attempt to reach the most accurate diagnosis of GHD (7, 8). Although they have limitations, GHSTs are still used as the gold standard for the diagnosis of pediatric GHD in most countries (1, 8).

The guidelines of the US Pediatric Endocrine Society advocate for restricting GH testing (1). For instance, in newborns with hypoglycemia and/or neonatal cholestasis the diagnosis of GHD is an emergency, and GHSTs may be dangerous (9). In a child with growth failure, the presence of micropenis and cryptorchidism or of craniofacial midline abnormalities are other putative predictors of GH deficiency (1, 5). Acquired GHD may be suspected in patients with intracranial tumors, severe traumatic brain injury or cranial radiotherapy in whom a common co-morbidity is hypothalamic obesity, associated with blunted response during GHSTs (10). However, none of these studies provide predictive values that can guide medical decisions. Therefore, due to the lack of sufficient evidence, the guidelines cannot recommend establishing the diagnosis of GHD without GHSTs in patients with these conditions (1, 11).

Pharmacological GHSTs remain a standard practice in pediatric patients—usually due to requirements from health systems (6)—even in children with clearly identifiable potential risk factors for GHD (12–14) in whom the implementation of GHSTs might be considered redundant (1, 6, 15). The aim of the present study was to assess predictors of GHD in children with growth failure by analyzing a large cohort of pediatric patients in whom GHSTs had been performed in a tertiary referral center. Our primary objective was to develop and validate an accurate clinical prediction rule with high enough specificity to allow confirmation of the diagnosis of GHD in children without recurring to GHSTs. We, therefore, designed and validated a multivariable prediction model in accordance with the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement (16).

Patients and Methods

Study Design and Data Sources

We performed a study designed to develop and validate a prediction rule to diagnose GHD with the highest specificity rate in children with growth failure and clinically identifiable risk factors, who underwent GHSTs. We analyzed clinical and biochemical characteristics and brain imaging findings in all patients younger than 18 years of age who underwent GHSTs at the Division of Endocrinology of the Hospital de Niños Ricardo Gutiérrez, a tertiary pediatric public hospital in the city of Buenos Aires, Argentina, between August 1, 2004 and July 31, 2014. Additionally, we validated the predictive rule in a second, independent cohort including GHSTs performed between February 1, 2017 and January 31, 2019.

We included GHSTs performed in patients who:

1. Met the criteria required for performing a GHST according to the Summary Statement of the Growth Hormone Research Society (5), as follows:

a. - Severe short stature, defined as a height >3 SD below the mean, or

b. - Height >1.5 SD below the mid-parental height, or

c. - Height >2 SD below the mean and a height velocity over 1 year >1 SD below the mean for chronological age, or a decrease in height SD of >0.5 over 1 year in children over 2 years of age, or

d. - In the absence of short stature, a height velocity >2 SD below the mean over 1 year or >1.5 SD sustained over 2 years, or

e. - Signs indicative of an intracranial lesion, or

f. - Signs of multiple pituitary hormone deficiency, or

g. - Neonatal symptoms and signs of GHD.

2. Underwent two sequential GHSTs (arginine-clonidine), or only one test in patients weighing <10 kg.

We excluded GHSTs performed in patients:

1. With incomplete medical records.

2. With poorly controlled chronic disorders (e.g., hypothyroidism, or gastrointestinal, immunological, nephrogenic or hematological diseases).

3. In whom the GHST was performed for re-testing reasons.

Case Ascertainment, Control Selection and Predictors

GHD (case) was diagnosed when maximal stimulated GH concentration was <6.1 ng/ml (from 2004 to 2011) (17) or <4.7 ng/ml (from 2012 to 2019) (18) during sequential arginine-clonidine tests, or a single arginine test in children weighing <10 Kg, according to previously validated cut-off values (19). Controls, i.e., non-GHD patients meeting the criteria required for GHST, were all the patients with at least one GH peak ≥6.1 ng/ml (2004–2011) or ≥4.7 ng/ml (since 2012).

To develop the model, we tested 15 potentially predictive dichotomous variables, as defined in Table 1, and auxologic data and IGF1 and IGFBP3 serum levels, as continuous variables.

TABLE 1

Table 1 Definitions of predictors used in the model building.

Clinical Evaluation

Auxologic data and past medical and family history were collected from medical records. Height and weight were expressed as standard deviation scores (SDS) using Argentine standards (25). Growth velocity was assessed considering a period of at least 6 months. Pubertal stage was assessed at the time of GHSTs according to Marshall and Tanner (26, 27). Penile size was compared to standardized data of the Argentine population (28).

Hormone Measurements

Plasma GH was measured using a chemiluminescent immunometric assay (ICMA; Immulite 2000; Siemens Healthcare Diagnostics, Gwynedd, UK) at baseline, and 30, 45, 60 and 90 min after iv administration of arginine-HCl (0.5 g/kg), and 30, 60, 90, and 120 min following oral administration of clonidine (0.1 mg/m²) (29). Intra- and inter-assay coefficients of variation were <4%. GH standards were IS-80/505 from 2004 to 2011 (17) and rhGH IS 98/574 from 2012 to 2019 (18). Total IGF1 was measured by radioimmunoassay (30) and, since October 2009, by ICMA (Immulite 2000, Siemens) (31). Serum levels of T4, free T4, T3, TSH, cortisol, ACTH, and prolactin were determined by electrochemiluminescence (Elecsys Cobas e411; Roche, Indianapolis, IN, USA) (22, 32).

Imaging

Pre- and post-gadolinium enhanced T1- and T2-weighted images of magnetic resonance imaging (MRI) studies of the brain and hypothalamic-pituitary region were evaluated. When MRI was not performed, it was considered as “missing value.”

Prediction Model-Building Procedures and Statistical Analysis

We followed the TRIPOD guideline (16) for development, validation, and reporting of the proposed score. Most of the work was done using the KNIME software, version 3.7.0 (which is open source under GNU General Public License), and we also performed calculations using Microsoft^® Excel^® for Microsoft 365 MSO version 16.0.13001.20266 and GraphPad Prism^® version 8.4.3.

It is key to the analysis that we did not look for an unbiased model. The predictive criteria were intended to detect cases with GHD with the highest specificity. Type II errors (false negatives) were tolerated since, in real world practice, those cases would be subsequently diagnosed by the GHSTs. Conversely, type I errors (false positives) would have a very high impact since a patient would be diagnosed as having GHD without undergoing GHST and receive GH treatment. Therefore, the model should have the highest specificity (low rate of false positives) while keeping an acceptable sensitivity (rate of false negatives) to be clinically relevant. Since the predictors we considered were mostly binary, we could not construct ROC curves. Therefore, to develop and validate an accurate clinical prediction rule intended to diagnose GHD with the highest specificity in children without recurring to GHSTs, we used the following methodology (33, 34):

Step 1: Data Exploration

Data gathered from clinical experience and from existing criteria (1, 5) included the 15 dichotomous variables as potential predictors for diagnosing GHD without GHSTs defined in Table 1. We also included auxologic data (height and weight) and IGF1 and IGFBP3 serum levels, as continuous variables. Data exploration (cohort 2004–2014) consisted of: a) establishing linear dependencies between variables, using Pearson’s chi-squared test and product-moment coefficients for discrete and continuous variables respectively (a summary of all pairwise correlations is presented in Supplementary Figure 1); b) analyzing distributions of each continuous variable (distribution summary in Supplementary Figure 2), and c) establishing a-priori probabilities for GHD using a frequency table (Supplementary Table 1) for every variable (for instance, see a-priori for insipid diabetes in Supplementary Table 2, this analysis was done for each nominal variable). Data exploration and metrics used followed those previously described by Gelman (35). Finally, a model was automatically built using an information gain algorithm. We chose to construct a decision tree using the algorithm described by Quinlan (36). Decisions trees have the advantage that they have a graphical representation, and they give relevant information when inspected. The decision tree was subsequently used in the step 2 to select the predictors to be used in the final model.

Step 2: Feature Selection

Finally, we calculated conditional probabilities (from frequency tables) for the cases where two variables could give similar information. All variables selected by statistical means were checked from the point of view of clinical criteria.

Feature selection was done by combining statistical analyses with clinical criteria. First, from a quantitative point of view, we analyzed the decision tree built during step 1 and selected the variables that maximized entropy reduction. We also built a random forest to derive an analysis of feature relevance in order to validate the robustness of the set of conditions selected, as described (37). The algorithm used in KNIME to establish variable importance using random forests was the Tree Ensemble Learner (Supplementary Table 3), as previously established (38). Finally, all variables selected by statistical means were checked from the point of view of clinical criteria.

Step 3: Model Building

Model building was done in an iterative way: a model built using a machine learning algorithm was discussed from the point of view of clinical criteria. The model was then refined to build a new version. The process was iterated until the prediction rule was satisfactory from both points of view: the quantitative analysis and the clinical criteria.

We started by building a decision tree using the machine learning algorithm mentioned in step 2, fed only with the selected predictors. The result was discussed from the point of view of clinical criteria. The model was then refined to build a new version, by adjusting the parameters of the algorithm. The algorithm used in KNIME to build the decision trees was the Decision Tree Learner node. Some characteristics of this algorithm are as follows: numeric splits are always binary, dividing the domain in two partitions at a given split point. Nominal splits can be either binary or they can have as many outcomes as nominal values. The quality measure used for split calculation was the gain ration, no pos pruning method was used during the execution and the minimum number of records per node was set to 2. No root column was forced. We “pruned” the branch that had more false positives from the refined model, which led to lower sensitivity and higher specificity, which was the original goal. The final tree is shown in Supplementary Figure 3.

Step 4: Validation

In order to validate the prediction model, a dataset from a different cohort of patients (2017–2019) was used. This second cohort was independent from the first one, and the data set had not been used in the derivation of the model nor the analysis.

Validation included three axes: a) Safety, interpreted as no false positives. We aimed to keep type I error near 0 in the validation cohort; b) Usefulness, translated to sensitivity of the model, set at >0.2, meaning that the prediction rule would provide a diagnosis in at least 20% of all patients with suspected GHD, and c) significance: to establish significance, the null hypothesis H0 was that the proposed criterion did not imply GHD (as tested by GHSTs), and hence it was independent. We did not assume any further conditions nor particular distribution of the data. Statistical significance was not analyzed in the original dataset (2004–2014) since these data were used to derive the diagnostic procedure and to perform exploratory analysis of the features. For the validation data (2017–2019), we set an α-value of 0.00001 in order to build a very conservative model.

Results

A total of 1,006 GHSTs were eligible for evaluation: 770 out of the 834 in the 2004–2014 cohort used to build the prediction model, and 161 of the 172 in the 2017–2019 cohort used to validate the model, could be analyzed (Figure 1).

FIGURE 1

Figure 1 Flowchart of the selection process in the two cohorts of this study.

Clinical characteristics of the 2004–2014 cohort are summarized in Table 2. It includes GHSTs performed in 491 boys and 279 girls (1.8:1), 79% were prepubertal, with a median age of 7.74 years and median height SDS of −2.51. Of the 770 GHSTs analyzed, 150 (19.5%) yielded results defined as GHD. The auxologic features were similar in patients without GHD (controls) and patients with GHD (cases), and within the latter, clinical features were similar in those predicted by the model and those not predicted. Both groups included congenital and acquired conditions (Supplementary Table 4). An MRI, used to define “pituitary dysgenesis” (Table 1) was available in 218 of the 770 patients (120 of 150 with GHD and 98 of 620 without GHD),

TABLE 2

Table 2 Clinical characteristics of the 2004−2014 cohort, used for model building, classified according to growth hormone stimulation test (GHST) results.

In this cohort of 770 patients, after tuning the classification model trained using the 15 potential dichotomic predictors, we selected 9 variables of interest, with an odds ratio (OR) >5 and a p-value <0.0001 (Table 3); “cranial radiotherapy,” with an OR=4.4, was also selected given its clinical relevance. The continuous variables height, weight, and serum levels of IGF1 and IGFBP3 were not informative enough to be considered in the model (Table 2). We used a gain ratio, entropy reducing algorithm to automatically build a decision tree from the 10 selected variables, without reduce error pruning or limits on minimum number of records. Anterior pituitary hormone (TSH, ACTH, prolactin) deficiencies were categorized as 0 (no deficiency), 1 (any one deficiency), or ≥1 (multiple pituitary hormone deficiency) (Supplementary Figure 3). This model reached a 99.2% specificity (Table 4, “decision tree” column). Its sensitivity (49.3%) was above the required threshold, and overall accuracy was 89.5%, with a resulting F-measure = 0.646. Cross validation, with a test set of 20% of the cases randomly selected using stratified sampling, showed similar results, which remained consistent after repeating the random selection and changing the parameters (for instance, Gini index instead of gain ratio, or by using pruning). Alternative models, such as Naïve Bayes, also gave similar results.

TABLE 3

Table 3 Categorical distribution and odds ratios for growth hormone deficiency of predictors used for model building (cohort 2004–2014).

TABLE 4

Table 4 Performance of the prediction rule for diagnosing growth hormone deficiency (GHD).

The conditions “pituitary dysgenesis” and “≥1 anterior pituitary hormone (TSH, ACTH or prolactin) deficiency” were selected by the entropy reduction algorithms as the first variables to analyze. Interestingly, this selection was consistent with relevant clinical criteria. We also built a random forest (37) to derive an analysis of feature relevance, that reinforced the robustness of the set of conditions selected (Supplementary Table 3). Finally, we also calculated conditional probabilities for the cases where two variables could give similar information. The resulting prediction rule stated that a patient, who met the criteria required for performing GHSTs according to the Summary Statement of the Growth Hormone Research Society (5), was a case (GHD associated to risk factors) if (s)he met the following conditions:

1. Pituitary dysgenesis on MRI, or

2. Two or more anterior pituitary hormone (TSH, ACTH or prolactin) deficiencies, or

3. At least one anterior pituitary hormone (TSH, ACTH or prolactin) deficiency plus one of the following:

a. Neonatal symptoms of pituitary deficiency (hypoglycemia or hypogenitalism)

b. Central diabetes insipidus

c. Clinical or radiological craniofacial midline abnormalities

d. Suprasellar or sellar tumor/surgery

e. Cranial radiotherapy ≥18 Gy

The proposed criteria were very conservative for specificity (Table 4, “prediction rule” column), and missed 89 cases of GHD of the 2004−2014 cohort, which could then be diagnosed using GHSTs. Therefore, the proposed predictive rule diagnosed 61 GHD cases, which represents 40.7% of all GHD patients.

We validated the predictive rule in an independent cohort of patients (2017−2019). Clinical characteristics of this second cohort are summarized in Table 5. It includes GHSTs of 111 boys and 50 girls (2.2:1), 87% prepubertal, with a median age of 7.67 years and median height SDS of −2.57. This validation group was similar to the 2004−2014 cohort in terms of age, gender, or proportion of pathological tests compared, as well as in IGF1 serum levels. GHD was diagnosed in 36 patients (22.4%). An MRI was available in 46 of the 162 patients (27 of 36 with GHD and 19 of 126 without GHD), In this validation cohort (2017−2019), the prediction rule showed a specificity of 99.2%, and its positive predictive value was 95.2% (Table 4, “validation” column). The false-positive case according to our rule was a 10-year-old boy with height at −3.98 DS, IGF1 level 78 ng/ml (reference for age 60−370), who showed one peak GH level of 6.71 ng/ml at GHST, very close to the cutoff value, and no other remarkable feature. Given the severe growth deficiency, a therapeutic trial with GH treatment resulted in 1.06 DS gain in height after 1 year. GHD could not be ruled out in this patient despite the GHST result. The positive likelihood ratio of the prediction rule was 69.4, and the number needed to test with the rule was 1.19.

TABLE 5

Table 5 Clinical characteristics of the 2017–2019 cohort, used to validate the predictive rule, classified according to Growth Hormone Stimulation Test (GHST) results.

Finally, we tested significance following the methodology presented. The a priori probability of a GHD case in the second cohort was 36/161 (22.4%). We can safely assume that individual patients are independent cases, so we have a set of Bernoulli Trials under a binomial distribution. Thus, the p-value given by the cumulative function was <0.000001.

Discussion

In this study, we identified clinically relevant risk factors for GHD in children, which were applied to build a robust clinical prediction rule to diagnose GHD, which could avoid resorting to GHSTs, in pediatric patients with growth failure and comorbidities. Our conclusion is based on the scientific evidence provided by the use of strict diagnostic criteria and clearly defined and accurately measured exposure variables in the analysis of a large cohort of GHSTs performed in a tertiary pediatric hospital.

The Endocrine Society clinical practice guideline on “Hypothalamic–Pituitary and Growth Disorders in Survivors of Childhood Cancer” advises using the same provocative testing to diagnose growth hormone deficiency in childhood cancer survivors as are used for diagnosing growth hormone deficiency in the non-cancer population as an ungraded good practice statement (11). On the other hand, the “Guidelines for the treatment of children with GHD, idiopathic short stature, and primary insulin-like growth factor 1 deficiency” of the Pediatric Endocrine Society (PES) suggest that in patients with auxological criteria, hypothalamic-pituitary defects and deficiency of at least one additional pituitary hormone, GHD diagnosis could potentially be established without performing GHSTs (1). However, due to the insufficient level of evidence according to the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) consensus (39), these recommendations could only be considered as conditional. Our study provides evidence to increase the strength of these recommendations.

We built a model on the knowledge generated by experts over the last 20 years (1, 5, 39)(and references therein), and applied a rigorous mathematical and machine-learning approach. Feature selection was based on the combination of statistical analyses with clinical criteria, used to refine the model in an iterative way until the prediction rule was satisfactory from both the statistical analysis and the clinical criteria, in a cohort of 770 GHSTs performed in our center between 2004 and 2014. Since the prediction rule was intended to diagnose GHD without the need for a GHST, and therefore type I errors would result in a false diagnosis of GHD leading to GH treatment in real world practice, we set goals of high specificity and positive predictive value for our rule. Of all the potential risk factors considered during model building, we identified the presence of pituitary dysgenesis on MRI or the existence of two or more anterior pituitary hormone deficiencies (TSH, ACTH, or prolactin) as specific enough to diagnose GHD without resorting to GHSTs in children meeting the criteria required for GHST by the Summary Statement of the Growth Hormone Research Society (5). Alternatively, if only one (TSH, ACTH, or prolactin) deficiency was present, the coexistence of central diabetes insipidus, neonatal symptoms of pituitary deficiency (hypoglycemia or hypogenitalism), sellar or suprasellar surgery or tumor (excluding microadenomas), clinical or radiological craniofacial midline abnormalities, or cranial radiotherapy ≥18 Gy, also led to a safe diagnosis. Gonadotropin deficiency was not considered in the analysis because its ascertainment may prove challenging in prepubertal patients.

To test the clinical applicability of the prediction rule, we validated our results using an independent cohort of 161 GHSTs performed between 2017 and 2019. Auspiciously, specificity was 99.2% in this second cohort, supporting the safety of our prediction rule. As expected, sensitivity was relatively low, reaching 55.6% in the validation cohort, indicating that almost half of the patients with GHD would only be identified after referring to GHSTs. Nonetheless, the sensitivity of the prediction rule applied to children meeting the criteria required for GHST by the Summary Statement of the Growth Hormone Research Society (5) would reduce in approximately half of the cases with GHD the need to perform a relatively invasive endocrine test, which underscores the clinical relevance of our results.

An unexpected, clinically relevant result is that only 20% of the children undergoing provocative tests, due to a suspicion of GHD, proved to be GH deficient (150 of 770 in the first cohort and 36 of 161 in the validation cohort). This is particularly significant given that only patients meeting the rigorous criteria defined by the Growth Hormone Research Society for prescribing GHSTs to children (5) were included in our study. This may be explained by the stringent criteria used to ascertain GHD in our center. Indeed, the diagnosis of GHD was based on peak GH levels <6.1 ng/ml between 2004 and 2011 (17), or <4.7 ng/ml between 2012 and 2019 (18), according to previously validated cutoff values (19).

Key strengths of this study are the high number of patients included in the construction of the predictive model as well as in the independent validation sample. It should also be stressed that strict criteria were used to define cases and controls: as mentioned above, the diagnosis of GHD was based on stringent cut-off levels for peak GH in GHSTs. A meticulous analysis of inclusion and exclusion criteria was performed to avoid inclusion bias. The population sample is representative of patients seeking advice from pediatric endocrinologists at referral centers for the assessment of short stature, which renders our results widely applicable.

Our study also has some limitations related to its design. Frequently, observational studies are prone to missing information in their datasets. To minimize memory bias, we limited the assessment of potential predictors to risk factors that were routinely reported in the clinical charts or were available from electronic records of endocrine laboratory or imaging study in our hospital. However, we cannot exclude that risk factors have been missed and, therefore, not included in the prediction rule we generated. Particularly, very few patients had a confirmed genetic diagnosis. This may explain why the condition “familial or sporadic GHD of genetic etiology” was not prioritized by our model. Also, MRI studies were not available in all patients; nonetheless, false positive prediction of GHD due to pituitary dysgenesis did not occur in any of the 98 controls (no GHD, Table 3) who underwent MRI.

In summary, this study developed an algorithm that led to the construction of a predictive rule for decision-making in the diagnosis of GHD in children typically seeking advice from pediatricians for growth failure, on the basis of a reduced number of clinically relevant and easily identifiable risk factors. The application of this rule avoids the need for GHSTs in a significant proportion of the patients in whom testing to assess GHD is presently indicated, and it is especially important for a subgroup of labile or vulnerable patients, such as infants, very low weight patients and children with oncological conditions or other comorbidities.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics Statement

The studies involving human participants were reviewed and approved by Comité de Ética en Investigación, Hospital de Niños Ricardo Gutiérrez, Buenos Aires, Argentina. Written informed consent from the participants’ legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author Contributions

FC, RG, DY, IB, GF, and RR conceived the study and designed the analysis plan. FC, DY, and RR did the statistical analyses. FC, RG, DY, GF, and RR wrote the manuscript. SMB, MS, MR, MB, AK, DB, PP, IB, and GF contributed to obtain and interpret the data. All authors contributed to the article and approved the submitted version.

Funding

This work was partially supported by funding from: CONICET (Consejo Nacional de Investigaciones Científicas y Técnicas) grant PIP-11220130100687, and FONCYT (Fondo para la Investigación Científica y Tecnológica) grant PICT-2014-2490, Argentina.

Conflict of Interest

DY is a founding partner of Practia S.A. GF is employed by Takeda Pharmaceutical. RR and IB report lecture honoraria from Novo Nordisk and non-financial support from Biosidus, Merck, Novo Nordisk, Pfizer, and Sandoz. RG reports lecture honoraria from Novo Nordisk and Raffo, and non-financial support from Merck, Novo Nordisk, Pfizer, and Sandoz.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2020.624684/full#supplementary-material

References

1. Grimberg A, DiVall SA, Polychronakos C, Allen DB, Cohen LE, Quintos JB, et al. Guidelines for Growth Hormone and Insulin-Like Growth Factor-I Treatment in Children and Adolescents: Growth Hormone Deficiency, Idiopathic Short Stature, and Primary Insulin-Like Growth Factor-I Deficiency. Horm Res Paediatr (2016) 86:361–97. doi: 10.1159/000452150

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Werther GA. Growth hormone measurements versus auxology in treatment decisions: the Australian experience. J Pediatr (1996) 128:S47–51. doi: 10.1016/s0022-3476(96)70011-2

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Hintz RL. The role of auxologic and growth factor measurements in the diagnosis of growth hormone deficiency. Pediatrics (1998) 102:524–6.

PubMed Abstract | Google Scholar

4. Rogol AD, Blethen SL, Sy JP, Veldhuis JD. Do growth hormone (GH) serial sampling, insulin-like growth factor-I (IGF-I) or auxological measurements have an advantage over GH stimulation testing in predicting the linear growth response to GH therapy? Clin Endocrinol (Oxf) (2003) 58:229–37. doi: 10.1046/j.1365-2265.2003.01701.x

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Society GHR. Consensus guidelines for the diagnosis and treatment of growth hormone (GH) deficiency in childhood and adolescence: summary statement of the GH Research Society. GH Research Society. J Clin Endocrinol Metab (2000) 85:3990–3. doi: 10.1210/jcem.85.11.6984

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Binder G, Reinehr T, Ibanez L, Thiele S, Linglart A, Woelfle J, et al. GHD Diagnostics in Europe and the US: An Audit of National Guidelines and Practice. Horm Res Paediatr (2019) 92:150–6. doi: 10.1159/000503783

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Sizonenko PC, Clayton PE, Cohen P, Hintz RL, Tanaka T, Laron Z. Diagnosis and management of growth hormone deficiency in childhood and adolescence. Part 1: diagnosis of growth hormone deficiency. Growth Horm IGF Res (2001) 11:137–65. doi: 10.1054/ghir.2001.0203

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Badaru A, Wilson DM. Alternatives to growth hormone stimulation testing in children. Trends Endocrinol Metab (2004) 15:252–8. doi: 10.1016/j.tem.2004.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Binder G, Weidenkeller M, Blumenstock G, Langkamp M, Weber K, Franz AR. Rational approach to the diagnosis of severe growth hormone deficiency in the newborn. J Clin Endocrinol Metab (2010) 95:2219–26. doi: 10.1210/jc.2009-2692

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Loche S, Guzzetti C, Pilia S, Ibba A, Civolani P, Porcu M, et al. Effect of body mass index on the growth hormone response to clonidine stimulation testing in children with short stature. Clin Endocrinol (Oxf) (2011) 74:726–31. doi: 10.1111/j.1365-2265.2011.03988.x

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Sklar CA, Antal Z, Chemaitilly W, Cohen LE, Follin C, Meacham LR, et al. Hypothalamic-Pituitary and Growth Disorders in Survivors of Childhood Cancer: An Endocrine Society Clinical Practice Guideline. J Clin Endocrinol Metab (2018) 103:2761–84. doi: 10.1210/jc.2018-01175

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Finkelstein JW, Rusovici DE, Green E, Foreman S, Kulin HE, D’Arcangelo MR, et al. Children with organic growth hormone deficiency have elevated cortisol responses to stimuli. J Clin Endocrinol Metab (2001) 86:2854–6. doi: 10.1210/jcem.86.6.7594

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Cerbone M, Dattani MT. Progression from isolated growth hormone deficiency to combined pituitary hormone deficiency. Growth Horm IGF Res (2017) 37:19–25. doi: 10.1016/j.ghir.2017.10.005

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Louvel M, Marcu M, Trivin C, Souberbielle JC, Brauner R. Diagnosis of growth hormone (GH) deficiency: comparison of pituitary stalk interruption syndrome and transient GH deficiency. BMC Pediatr (2009) 9:29. doi: 10.1186/1471-2431-9-29

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Pampanini V, Pedicelli S, Gubinelli J, Scire G, Cappa M, Boscherini B, et al. Brain Magnetic Resonance Imaging as First-Line Investigation for Growth Hormone Deficiency Diagnosis in Early Childhood. Horm Res Paediatr (2015) 84:323–30. doi: 10.1159/000439590

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med (2015) 162:55–63. doi: 10.7326/M14-0697

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Chaler E, Belgorosky A, Maceiras M, Mendioroz M, Rivarola MA. Between-assay differences in serum growth hormone (GH) measurements: importance in the diagnosis of GH deficiency in childhood. Clin Chem (2001) 47:1735–8. doi: 10.1093/clinchem/47.9.1735

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Clemmons DR. Consensus statement on the standardization and evaluation of growth hormone and insulin-like growth factor assays. Clin Chem (2011) 57:555–9. doi: 10.1373/clinchem.2010.150631

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Chaler EA, Ballerini G, Lazzati JM, Maceiras M, Frusti M, Bergadá I, et al. Cut-off values of serum growth hormone (GH) in pharmacological stimulation tests (PhT) evaluated in short-statured children using a chemiluminescent immunometric assay (ICMA) calibrated with the International Recombinant Human GH Standard 98/574. Clin Chem Lab Med (2013) 51:e95–7. doi: 10.1515/cclm-2012-0505

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Stocchetti N, Carbonara M, Citerio G, Ercole A, Skrifvars MB, Smielewski P, et al. Severe traumatic brain injury: targeted management in the intensive care unit. Lancet Neurol (2017) 16:452–64. doi: 10.1016/s1474-4422(17)30118-7

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Chemaitilly W, Li Z, Huang S, Ness KK, Clark KL, Green DM, et al. Anterior hypopituitarism in adult survivors of childhood cancers treated with cranial radiotherapy: a report from the St Jude Lifetime Cohort study. J Clin Oncol (2015) 33:492–500. doi: 10.1200/JCO.2014.56.7933

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Braslavsky D, Grinspon RP, Ballerini MG, Bedecarrás P, Loreti N, Bastida G, et al. Hypogonadotropic Hypogonadism in Infants with Congenital Hypopituitarism: A Challenge to Diagnose at an Early Stage. Horm Res Paediatr (2015) 84:289–97. doi: 10.1159/000439051

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Thornton PS, Stanley CA, De Leon DD, Harris D, Haymond MW, Hussain K, et al. Recommendations from the Pediatric Endocrine Society for Evaluation and Management of Persistent Hypoglycemia in Neonates, Infants, and Children. J Pediatr (2015) 167:238–45. doi: 10.1016/j.jpeds.2015.03.057

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Suchy FJ. Neonatal cholestasis. Pediatr Rev (2004) 25:388–96. doi: 10.1542/pir.25-11-388

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Lejarraga H, del Pino M, Fano V, Caino S, Cole TJ. [Growth references for weight and height for Argentinian girls and boys from birth to maturity: incorporation of data from the World Health Organisation from birth to 2 years and calculation of new percentiles and LMS values]. Arch Argent Pediatr (2009) 107:126–33. doi: 10.1590/S0325-00752009000200006

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Marshall WA, Tanner JM. Variations in the pattern of pubertal changes in boys. Arch Dis Child (1970) 45:13–23. doi: 10.1136/adc.45.239.13

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Marshall WA, Tanner JM. Variations in pattern of pubertal changes in girls. Arch Dis Child (1969) 44:291–303. doi: 10.1136/adc.44.235.291

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Anigstein CR. Longitud y diámetro del pene en niños de 0 a 14 años de edad. Arch Argent Pediatr (2005) 103:401–5.

Google Scholar

29. Martinez AS, Domené HM, Ropelato MG, Jasper HG, Pennisi PA, Escobar ME, et al. Estrogen priming effect on growth hormone (GH) provocative test: a useful tool for the diagnosis of GH deficiency. J Clin Endocrinol Metab (2000) 85:4168–72. doi: 10.1210/jcem.85.11.6928

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Domené HM, Bengolea SV, Martínez AS, Ropelato MG, Pennisi P, Scaglia P, et al. Deficiency of the circulating insulin-like growth factor system associated with inactivation of the acid-labile subunit gene. N Engl J Med (2004) 350:570–7. doi: 10.1056/NEJMoa013100

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Ballerini MG, Domené HM, Scaglia P, Martinez A, Keselman A, Jasper HG, et al. Association of serum components of the GH-IGFs-IGFBPs system with GHR-exon 3 polymorphism in normal and idiopathic short stature children. Growth Horm IGF Res (2013) 23:229–36. doi: 10.1016/j.ghir.2013.08.003

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Grinspon RP, Bedecarrás P, Ballerini MG, Iñíguez G, Rocha A, Mantovani Rodrigues Resende EA, et al. Early onset of primary hypogonadism revealed by serum anti-Müllerian hormone determination during infancy and childhood in trisomy 21. Int J Androl (2011) 34:e487–98. doi: 10.1111/j.1365-2605.2011.01210.x

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Toll DB, Janssen KJ, Vergouwe Y, Moons KG. Validation, updating and impact of clinical prediction rules: a review. J Clin Epidemiol (2008) 61:1085–94. doi: 10.1016/j.jclinepi.2008.04.008

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Reilly BM, Evans AT. Translating clinical research into clinical practice: impact of using prediction rules to make decisions. Ann Intern Med (2006) 144:201–9. doi: 10.7326/0003-4819-144-3-200602070-00009

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Gelman A. A Bayesian Formulation of Exploratory Data Analysis and Goodness-of-fit Testing. Int Stat Rev (2003) 71:369–82. doi: 10.1111/j.1751-5823.2003.tb00203.x

CrossRef Full Text | Google Scholar

36. Quinlan JR. C4.5: Programs for machine learning. San Mateo, CA, USA: Morgan Kaufmann Publishers, Inc (1993).

Google Scholar

37. Breiman L. Random forests. Mach Learn (2001) 45:5–32. doi: 10.1023/A:1010933404324

CrossRef Full Text | Google Scholar

38. Strobl C, Boulesteix AL, Zeileis A, Hothorn T. Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinf (2007) 8:25. doi: 10.1186/1471-2105-8-25

CrossRef Full Text | Google Scholar

39. Guyatt GH, Oxman AD, Vist GE, Kunz R, Falck-Ytter Y, Alonso-Coello P, et al. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ (2008) 336:924–6. doi: 10.1136/bmj.39489.470347.AD

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: multiple pituitary hormone deficiencies, pituitary dysgenesis, short stature, growth failure, midline abnormalities

Citation: Clément F, Grinspon RP, Yankelevich D, Martín Benítez S, De La Ossa Salgado MC, Ropelato MG, Ballerini MG, Keselman AC, Braslavsky D, Pennisi P, Bergadá I, Finkielstain GP and Rey RA (2021) Development and Validation of a Prediction Rule for Growth Hormone Deficiency Without Need for Pharmacological Stimulation Tests in Children With Risk Factors. Front. Endocrinol. 11:624684. doi: 10.3389/fendo.2020.624684

Received: 31 October 2020; Accepted: 14 December 2020;
Published: 03 February 2021.

Edited by:

Sandro Loche, Ospedale Microcitemico, Italy

Reviewed by:

Chiara Guzzetti, Ospedale Microcitemico, Italy
Roberto Salvatori, Johns Hopkins University, United States

Copyright © 2021 Clément, Grinspon, Yankelevich, Martín Benítez, De La Ossa Salgado, Ropelato, Ballerini, Keselman, Braslavsky, Pennisi, Bergadá, Finkielstain and Rey. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rodolfo A. Rey, rodolforey@cedie.org.ar

^†ORCID: Romina P. Grinspon, orcid.org/0000-0002-6291-1518
Patricia Pennisi, orcid.org/0000-0001-5179-7353
Rodolfo A. Rey, orcid.org/0000-0002-1100-3843
Ignacio Bergadá, orcid.org/0000-0001-6546-1949
Ana Keselman, orcid.org/0000-0002-5612-0445
Debora Braslavsky, orcid.org/0000-0002-4501-7610

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.