Although many caregivers of children with autism spectrum disorder (ASD) report developmental concerns within the first two years of a child’s life (Sacrey et al. 2018; Zuckerman et al. 2015), only 44% of children receive a diagnosis before 36 months of age, and the median age of first ASD diagnosis in the United States exceeds 4 years (Maenner et al. 2020). Unfortunately, these delays create additional stress for families (Oswald et al. 2017) and may prevent children from accessing early intervention services, which are important predictors of child outcomes (Fuller and Kaiser 2019; Landa 2018). Multiple barriers contribute to diagnostic delays, including extensive wait lists (Gordon-Lipkin et al. 2016) and limited access to qualified providers (Bishop-Fitzpatrick and Kind 2017). These barriers are exacerbated by socioeconomic, geographic, and linguistic disparities (Antezana et al. 2017; Durkin et al. 2017; Khowaja et al. 2015). Together, these challenges highlight the need for novel approaches to the early identification of ASD that meet families’ needs and connect children with essential services.

Established best practices in the diagnosis of ASD include a clinical interview with primary caregivers, comprehensive assessment of a child’s cognitive or developmental functioning, and observation of a child’s play and social interactions using standardized, semi-structured assessments (Huerta and Lord 2012). Though valuable, such evaluations often involve multi-hour testing sessions and/or multiple appointments that can be taxing for children and families, particularly for those with geographic or transportation barriers. Further, there is evidence that some young children can be identified as having ASD based on a briefer evaluation (Juárez et al. 2018; Swanson et al. 2014). A tiered model that streamlines risk classification and early intervention access for those children with clear phenotypic profiles of ASD may, by reducing the need for comprehensive testing, simultaneously reduce wait times for those children whose complex presentations warrant additional evaluation (Zwaigenbaum and Warren 2020). At present, however, most phenotypic presentations are funneled into the same model of care, regardless of provider capacity or family preference.

Several models have been developed to increase access to evaluation and early intervention services within community settings. These include providing ASD diagnostic training and consultation to pediatric providers (Hine et al. 2020; Keehn et al. 2020; Mazurek et al. 2019), embedding psychologists within pediatric clinics (Hine et al. 2018), and leveraging partnerships with early intervention providers (Juárez et al. 2018; Stainbrook et al. 2019; Yingling 2019). Such models have demonstrated positive outcomes, including reduced wait times for families, reduced travel burden, and family satisfaction with alternate diagnostic processes. However, even these models are limited by reliance on multiple providers, use of existing ASD assessment tools that incur training and materials costs, and scheduling or personnel burdens that limit transportability into other community systems of care.

An alternate approach to the development of novel tools for ASD assessment has been the application of advanced computational strategies, such as machine learning, that attempt to distill extensive assessment measures into smaller sets of questions and behavioral observations that could be used for more efficient assessment (Wall et al. 2012). To date, such approaches have demonstrated limited clinical impact and have been thoughtfully critiqued (Bone et al. 2015). In particular, identifying a limited set of behavioral codes with strong predictive validity does not account for the evaluation processes, clinical judgment, and expertise of well-trained providers that ultimately result in the assignment of these codes and, in turn, of an ASD diagnosis or risk classification. While applying advanced computational strategies may distill large amounts of data into key observations, machine learning analysis alone does not yield a meaningful methodology for abbreviated ASD assessment.

Recognizing both the limitations of current tools for ASD evaluation and the shortcomings of prior work utilizing machine learning, the goal of the present work was to fuse computational and clinical expertise to develop a brief assessment tool for ASD symptoms in toddlers that can then be adapted for use across formats, settings, and providers. While a machine learning approach in isolation is not a sufficient strategy for realizing a viable, stand-alone assessment tool, machine learning represents an opportunity to elucidate patterns in clinical assessments and clinical decision-making in a way that can inform the development of novel tools (Sarkar et al. 2018). Below, we describe a computational approach to the identification of key behaviors to inform diagnostic tool development based on available clinical registry datasets, followed by the translation of these predictive models into guidelines for clinical observations of child behavior. The resulting tool, a basic framework for symptom identification, has subsequently been adapted into assessment platforms for use in telemedicine-based evaluation (Corona et al. 2020), intelligent applications for screening (Adiani et al. 2019; R43 MH115528, R44 MH115528), and enhanced training protocols for medical residents (Hine et al. 2019).

Methods

Participants

The analyses below were completed using a clinical research database housed within a university medical center. This database includes phenotypic data for individuals with and without ASD at the time of diagnosis, as evaluated by a team of over 20 research-reliable psychological providers (i.e., providers with certified expertise in standardized ADOS-2 administration) across autism-focused research studies and outpatient clinics. Within the targeted age range of this work, the database included scores from 737 toddlers (77% male, 23% female) between the ages of 14 and 33 months (M = 25.7, SD = 3.7) whose families had provided consent for inclusion at the time of a diagnostic evaluation. Included in this database were toddlers’ scores from diagnostic evaluations using the ADOS-2 (Lord et al. 2012; Luyster et al. 2009), the Mullen Scales of Early Learning (MSEL; Mullen 1995), and the Vineland Adaptive Behavior Scales, Second Edition-Interview Form (Vineland-II; Sparrow et al. 2005), as well as the clinician diagnostic impression from that visit (ASD, global developmental delay, and so on). Data span the years 2012–2016, with all participants administered the Toddler Module of the ADOS-2 (including two children ages 30–32 months, who are retained here so that the reported analyses match those actually used to create the algorithm). Within this sample, 70% of toddlers were classified as having ASD and 30% were classified as not meeting criteria for ASD (see Table 1). Of participants not diagnosed with ASD, approximately 62% received a diagnosis of global developmental delay, 30% received other unspecified diagnoses (such as language delay or behavioral disorders), and 8% received no diagnosis.

Table 1 Participant demographics and scores from diagnostic evaluations

Data Analytic Procedure

Analytic Approach

Machine Learning (ML), a branch of Artificial Intelligence, offers a powerful means by which to infer patterns within datasets (Bishop 2006). Feature selection techniques in particular can be used to reveal components of a dataset that most effectively distinguish between classes of data. In the current work, ML techniques were used to carry out an exploratory analysis of behavioral assessment variables (i.e., individual items from each of the assessment instruments; henceforth features) with the aim of identifying the most discriminative features (i.e., those best differentiating ASD from non-ASD cases). An ML approach, as opposed to a statistical approach (e.g., exploratory factor analysis; Furr 2017), was used (1) because our goal was not to identify specific discriminating codes of established instruments but instead to reveal broader discriminatory patterns in behavioral observations, and (2) because of the overtly ML-oriented nature of the dataset; specifically, the structured dataset was ideally suited for supervised classification methods given its binary labeling (ASD, non-ASD), rich feature set, and relatively large size.

Feature Space Exploration

Feature selection techniques, and feature engineering more broadly (Géron 2019), were used in the current work to identify a minimal subset of features that would yield clinically acceptable levels of model accuracy, sensitivity, and specificity. To achieve this goal, we applied both established and novel feature engineering methods with the aim of identifying the optimal feature set for use in model development. Table 2 summarizes and compares the feature engineering models applied to the data.

Table 2 Comparison of model performance on holdout set

Three established methods—χ2 goodness of fit, information gain, and Pearson correlation—were used to rank features according to their ability to reliably predict the class label (i.e., ASD or non-ASD) using the ML toolkits scikit-learn version 0.20.3 (Pedregosa et al. 2011) and WEKA version 3.8.4 (Hall et al. 2009).
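For readers less familiar with these techniques, the following is a minimal sketch of how such univariate rankings can be computed with scikit-learn. The data here are synthetic stand-ins for the item-level codes, and the feature count, seeds, and variable names are illustrative assumptions rather than the study’s actual pipeline:

```python
# Sketch: rank features by chi-squared score, information gain (estimated as
# mutual information), and absolute Pearson correlation with a binary label.
import numpy as np
from sklearn.feature_selection import chi2, mutual_info_classif

rng = np.random.default_rng(0)

# Synthetic stand-in: 737 children x 30 item-level codes scored 0-3.
n_samples, n_features = 737, 30
X = rng.integers(0, 4, size=(n_samples, n_features)).astype(float)
y = rng.integers(0, 2, size=n_samples)  # 1 = ASD, 0 = non-ASD

# Chi-squared statistic between each (non-negative) feature and the label.
chi2_scores, _ = chi2(X, y)

# Information gain, estimated as mutual information with the label.
ig_scores = mutual_info_classif(X, y, discrete_features=True, random_state=0)

# Absolute Pearson correlation between each feature and the label.
pearson_scores = np.array([abs(np.corrcoef(X[:, j], y)[0, 1])
                           for j in range(n_features)])

# Rank feature indices from most to least predictive under each criterion.
for name, scores in [("chi2", chi2_scores),
                     ("info gain", ig_scores),
                     ("pearson", pearson_scores)]:
    top10 = np.argsort(scores)[::-1][:10]
    print(name, top10)
```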

A fourth method, developed by the data analytic team, evaluated the predictive utility of aggregations of features by comparing the central tendencies of groupwise clusters within the clinical dataset. That is, rather than employing the three methods described above, we extracted new features based on the distance, in feature space, between an individual and a group (see Fig. 1). Here, “groups” refer to the children with a confirmed ASD diagnosis and the children who did not meet criteria for a diagnosis of ASD. We explored the space of all possible two- and three-component feature vectors, in which each vector comprised a unique combination of features from the ADOS-2 feature set. For example, a two-component feature vector ASD_{a2,b1} represents the central tendency, or centroid, composed of features a2 and b1 within the ASD sample. Similarly, a three-component feature vector NonASD_{a2,b1,b5} represents the centroid composed of features a2, b1, and b5 within the non-ASD sample. Only two- and three-component feature vector spaces were explored because distance metrics become increasingly unreliable in higher dimensions (Aggarwal et al. 2001). Moreover, the chosen spaces were amenable to brute force search, resulting in rapid computation and evaluation.

Fig. 1 The top-performing centroids resulting from the data analytic team’s novel feature selection method were the three-component centroids composed of ADOS-2 codes a2, b1, and b5. The figure depicts the distance between the identified centroids and new data points, represented by the dashed lines. In much the same way that a k-nearest neighbors classifier predicts class association, our method defines a feature based on a measure of proximity to clusters that may reveal class association
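A rough sketch of this centroid-distance construction follows, assuming Euclidean distance in the item-score space (the paper does not specify the distance measure) and synthetic stand-in data; all function and variable names are hypothetical:

```python
# Sketch: for each two- or three-item combination, compute the ASD and
# non-ASD group centroids and derive candidate features from each child's
# distance to those centroids, searching all combinations by brute force.
from itertools import combinations
import numpy as np

def centroid_distance_features(X_train, y_train, X, combo):
    """Distance of each row of X to the ASD and non-ASD centroids computed
    over the items in `combo` (a tuple of column indices)."""
    cols = list(combo)
    asd_centroid = X_train[y_train == 1][:, cols].mean(axis=0)
    non_centroid = X_train[y_train == 0][:, cols].mean(axis=0)
    d_asd = np.linalg.norm(X[:, cols] - asd_centroid, axis=1)
    d_non = np.linalg.norm(X[:, cols] - non_centroid, axis=1)
    return np.column_stack([d_asd, d_non])

def all_combos(n_features):
    # Only 2- and 3-component spaces are searched, per the rationale above.
    for k in (2, 3):
        yield from combinations(range(n_features), k)

# Example with synthetic data: score each combination by how well a simple
# nearest-centroid rule on the derived distances separates the two groups.
rng = np.random.default_rng(0)
X = rng.integers(0, 4, size=(737, 12)).astype(float)
y = rng.integers(0, 2, size=737)

best_combo, best_acc = None, 0.0
for combo in all_combos(X.shape[1]):
    feats = centroid_distance_features(X, y, X, combo)
    pred = (feats[:, 0] < feats[:, 1]).astype(int)  # closer to ASD centroid
    acc = (pred == y).mean()
    if acc > best_acc:
        best_combo, best_acc = combo, acc
print(best_combo, round(best_acc, 3))
```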

Model Selection

The set of 737 samples was randomly partitioned using a 70–30 split into a training set (N = 515) and a holdout set for validation (N = 222). A train-test split approach involves partitioning a dataset into two sets of different sizes—often 70–30 or 60–40 partitions—where the larger partition is typically used for model training and the smaller partition is held out exclusively for testing. This method is often used to account for the problem of overfitting in which a predictive model demonstrates excellent classification performance on a training dataset at the expense of generalizability to future or unseen data (Kuhn and Johnson 2013). In a train-test split approach, the performance of the model on the holdout set is used to determine the model’s overall reliability. In the current work, the training set was used to train and compare classifiers using the four feature engineering methods described above, while the holdout set was used to evaluate the final performance of the various models. Within the scope of the training set, cross-validation was used to gauge the preliminary performance of the classifier before final evaluation on the holdout set. Cross-validation involves iteratively dividing a dataset into k segments, training a model on k − 1 segments, and then evaluating the model on the kth segment (Bishop 2006). The accuracy of the model on each test segment is then averaged to yield a measure of performance that is expected to be robust to variations in the data. Consistent with typical practice, the value of k was set to 10 in the performed analyses (Géron 2019).
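As a concrete sketch of this protocol with scikit-learn, assuming a simple random (unstratified) partition that reproduces the reported 515/222 split sizes, and using a decision tree (the classifier described in the Results) as the estimator; seeds and settings are illustrative:

```python
# Sketch: random 70-30 train/holdout split followed by 10-fold
# cross-validation computed entirely within the training portion.
import numpy as np
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.integers(0, 4, size=(737, 10)).astype(float)  # synthetic item scores
y = rng.integers(0, 2, size=737)                      # 1 = ASD, 0 = non-ASD

# 70-30 partition: 515 samples for training, 222 held out for validation.
X_train, X_hold, y_train, y_hold = train_test_split(X, y, test_size=0.30,
                                                    random_state=0)

# Preliminary performance estimate via 10-fold cross-validation; the holdout
# set stays untouched until final model evaluation.
clf = DecisionTreeClassifier(random_state=0)
cv_scores = cross_val_score(clf, X_train, y_train, cv=10)
print("10-fold CV accuracy: %.3f +/- %.3f" % (cv_scores.mean(), cv_scores.std()))
```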

Results

Multiple predictive models were trained based on the four feature selection methods described above. All features identified through the feature selection methods were individual items from the ADOS-2. A decision tree classifier was selected for model training. Decision trees are widely used in practice and perform well on data whose classes occupy roughly cuboid, axis-aligned regions of the multidimensional feature space (Bishop 2006). Table 2 compares the performance of seven models evaluated on the holdout set.
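A minimal sketch of this final evaluation step, fitting a decision tree on the training partition and scoring it once on the holdout set (data are synthetic and hyperparameters such as tree depth are illustrative assumptions):

```python
# Sketch: train a decision tree on the 70% partition, evaluate on the holdout.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.integers(0, 4, size=(737, 10)).astype(float)  # synthetic item scores
y = rng.integers(0, 2, size=737)                      # 1 = ASD, 0 = non-ASD

X_train, X_hold, y_train, y_hold = train_test_split(X, y, test_size=0.30,
                                                    random_state=0)

# Each internal node tests one behavioral code against a threshold, so the
# learned tree partitions feature space into axis-aligned (cuboid) regions.
clf = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X_train, y_train)
print("holdout accuracy: %.3f" % clf.score(X_hold, y_hold))
```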

Model 1, which is included as a basis of comparison for each of the other models, is simply the ADOS-2 total score, representing a trivial classifier with only one feature. Model 2, another trivial classifier, includes only one feature, frequency of spontaneous vocalizations directed to others, which emerged as one of the highest-ranked features across all of the ranking methods. It is included to demonstrate the predictive accuracy of a single feature; however, the specificity of this model was inadequate (see Table 2).

Models 3–5 include the 10 highest-ranked features under the χ2, information gain, and Pearson correlation ranking methods, respectively. Table 3 shows the 10 highest-ranked features for each of these three feature selection methods. Features identified as among the most predictive of ASD diagnosis across Models 3–5 include ADOS-2 codes related to: frequency of child vocalizations to others, integration of eye contact with other behaviors in the context of social overtures, showing behaviors, and the overall number of social overtures to the examiner.

Table 3 The 10 highest-ranked features for each of three feature selection methods

Model 6 represents the highest performing aggregated feature from the previously described clustering method. The predictive features identified by this model focused on items related to a child’s use of eye contact, vocalizations directed to other people, and integration of eye contact with other forms of communication, such as sounds and gestures. Finally, Model 7 is an extension of Model 6 that includes four additional features based on a secondary feature selection analysis, conducted using an embedded feature ranking algorithm in WEKA. This method first assesses subsets of features and then selects features that are highly correlated with the class label while maintaining low correlation with one another (Hall 1999). The additional features identified in this final model included items focused on the intonation of a child’s vocalization, atypical sensory interests, stereotyped hand movements, and repetitive or stereotyped interests.
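Hall’s (1999) method scores a candidate subset S of k features by the merit k·r_cf / sqrt(k + k(k−1)·r_ff), where r_cf is the mean feature-class correlation and r_ff the mean feature-feature correlation. The following sketch pairs that heuristic with a greedy forward search; note that WEKA’s implementation uses symmetrical uncertainty as the correlation measure, for which absolute Pearson correlation is substituted here for brevity:

```python
# Sketch of correlation-based subset selection in the spirit of Hall (1999).
import numpy as np

def merit(X, y, subset):
    """Merit of a feature subset: reward feature-class correlation,
    penalize feature-feature redundancy."""
    k = len(subset)
    rcf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    if k == 1:
        return rcf
    rff = np.mean([abs(np.corrcoef(X[:, i], X[:, j])[0, 1])
                   for a, i in enumerate(subset) for j in subset[a + 1:]])
    return k * rcf / np.sqrt(k + k * (k - 1) * rff)

def greedy_cfs(X, y, max_features=5):
    """Forward search: repeatedly add the feature that most improves merit."""
    selected, best = [], -np.inf
    while len(selected) < max_features:
        scored = [(merit(X, y, selected + [j]), j)
                  for j in range(X.shape[1]) if j not in selected]
        score, j = max(scored)
        if score <= best:
            break
        selected, best = selected + [j], score
    return selected, best

rng = np.random.default_rng(0)
X = rng.integers(0, 4, size=(737, 12)).astype(float)  # synthetic item scores
y = rng.integers(0, 2, size=737)
print(greedy_cfs(X, y))
```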

The assessment of model performance focused on five metrics: accuracy, sensitivity, specificity, F-score, and unweighted average recall (UAR). Accuracy, F-score, and sensitivity (also known as “recall”) are commonly reported metrics in the ML literature, while specificity is often reported in the context of diagnostic testing. UAR has been suggested as a preferred performance metric for model assessment related to diagnostic testing, especially in the presence of unbalanced data (Bone et al. 2015). As such, UAR was selected as the preferred metric for comparing model performance. Based on this criterion, Model 7 was the highest performing non-trivial classifier, with a UAR of 0.844. When applied to the holdout set, Model 7 achieved a sensitivity of 0.90 and a specificity of 0.78.
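For binary labels, UAR is the unweighted mean of sensitivity and specificity, which scikit-learn exposes as balanced accuracy. A short sketch computing the five reported metrics from illustrative (not the study’s) predictions:

```python
# Sketch: accuracy, sensitivity, specificity, F-score, and UAR for a binary
# classifier, using small illustrative label vectors.
import numpy as np
from sklearn.metrics import (accuracy_score, balanced_accuracy_score,
                             f1_score, recall_score)

y_true = np.array([1, 1, 1, 1, 1, 0, 0, 0, 0, 0])  # 1 = ASD, 0 = non-ASD
y_pred = np.array([1, 1, 1, 1, 0, 0, 0, 1, 1, 0])

sensitivity = recall_score(y_true, y_pred)               # recall, ASD class
specificity = recall_score(y_true, y_pred, pos_label=0)  # recall, non-ASD class
uar = (sensitivity + specificity) / 2                    # unweighted avg recall

# For binary labels, UAR coincides with scikit-learn's balanced accuracy.
assert np.isclose(uar, balanced_accuracy_score(y_true, y_pred))
print(accuracy_score(y_true, y_pred), sensitivity, specificity,
      f1_score(y_true, y_pred), uar)
```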

Following feature selection and model comparison, a design team of six clinical experts in ASD reviewed the identified features, each of which represented a behavioral code from a standardized instrument included in the database. The team then aligned each feature (and its associated behavioral descriptor) with DSM-5 diagnostic criteria for ASD in young children to determine whether symptoms would be captured across each core diagnostic area. Features identified by the final model (Model 7) included child behaviors related to: vocalizations directed to other people, intonation of vocalizations, overall use of eye contact, integration of eye contact with other social and communicative behaviors, and restricted or repetitive behaviors including sensory interests, repetitive motor behaviors, and repetitive play or interests.

Through a process of collaborative consensus that included independent generation of content, cross-team review for shared and discrepant material, and preliminary finalization of items deemed most clinically representative, the design team developed core behavioral descriptors reflecting those features most predictive of diagnosis in our sample (see Table 4). These descriptors were then reviewed by an internal group of 16 behavioral providers (licensed clinical psychologists, licensed senior psychological examiners, developmental-behavioral pediatricians, and postdoctoral fellows) with varying levels of ASD expertise who read the text and replied with suggested edits or clarifications in order to simplify the language for a broader audience. After the descriptors were finalized, the design team operationalized these behaviors using anchors within a Likert-style scale. For each item, a rating of 1 indicates that the ASD-related symptom is not present, a rating of 2 indicates that the symptom is present but at subclinical levels, and a rating of 3 indicates that the symptom is present and clearly consistent with ASD. Similar to the development of the behavioral descriptors, the Likert anchors were reviewed by a secondary team of non-experts to improve clarity regarding targets for behavioral observation.
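As a purely hypothetical illustration of how such a three-point scale might be encoded in software (the item names are placeholders, not the tool’s actual descriptors, which appear in Table 4):

```python
# Hypothetical encoding of the 1-3 rating scale described above.
from enum import IntEnum

class SymptomRating(IntEnum):
    NOT_PRESENT = 1          # ASD-related symptom not present
    SUBCLINICAL = 2          # symptom present, but at subclinical levels
    CONSISTENT_WITH_ASD = 3  # symptom clearly consistent with ASD

# Placeholder item names for two of the rated behaviors.
ratings = {
    "vocalizations_to_others": SymptomRating.SUBCLINICAL,
    "eye_contact_integration": SymptomRating.CONSISTENT_WITH_ASD,
}
```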

Table 4 Translation of predictive features identified through computational approach to underlying constructs and behavioral ratings

After finalizing behavioral descriptions and anchors for Likert ratings, the team developed an assessment process designed to elicit observations tied to these seven key behaviors. The process was designed to (1) be administered within 20 min of interaction, in order to maximize transportability into community practice settings, (2) employ inexpensive and widely available materials to facilitate use and access to the tool with low cost burden, and (3) provide understandable instructions for community providers. Activities include opportunities for free play and partnered play, as well as social presses to provide opportunities for making requests, sharing enjoyment, and directing attention (see Table 5), all of which give children opportunities to display the communication, social interaction, and independent play skills related to the core discriminatory behaviors identified through the ML procedures described above. Together, these brief administration activities and providers’ ratings of children’s behaviors during the activities make up a novel tool for identification of ASD symptoms.

Table 5 Administration activities

Discussion

This work describes a computationally and clinically informed development process aimed at creating innovative measures for accurate, efficient identification of ASD in young children across a variety of clinical practice settings. This approach applied complex feature engineering to a rich phenotypic dataset of toddlers with ASD and other complex developmental concerns in order to elucidate potentially valuable targets for clinical assessment and observation that could be folded into scoring systems explicitly designed for use in varied settings. The result of this translational process is a novel, brief assessment tool that has the potential to provide clinical information regarding the presence of ASD symptoms in young children.

The development process described above represents an extension of past work that has focused solely on the identification of key items within more thorough assessment measures (Wall et al. 2012). By combining a computational approach with clinical expertise, this process identified elements within a comprehensive assessment process that are predictive of ASD diagnosis and then translated these elements into key, underlying behavioral constructs. Our approach further moved beyond past work by proposing a set of assessment activities designed to elicit behaviors of clinical interest, as well as an observation and scoring system to organize and quantify clinical impressions.

It is important to recognize that the features computationally identified as most predictive of ASD diagnosis were distilled from behavioral codes assigned as part of a longer, standardized assessment and scoring procedure. These behavioral codes in isolation do not represent an appropriate, stand-alone estimate of risk for or presence of an ASD diagnosis (Bone et al. 2015). Instead, these codes represent key clinical features of concern—particularly, features related to a child’s challenges with aspects of social communication (e.g., use and integration of verbal and nonverbal communication) and the presence of restricted, repetitive behaviors—that are characteristic of ASD. A brief assessment approach designed around these key clinical features, such as that described here, holds promise for identifying clear symptoms of ASD, within a short time period, in a variety of clinical settings.

The assessment activities and rating procedures developed through this work present a preliminary model and a starting point for further tool development, refinement, and investigation. Although based upon the computational selection of features most predictive of ASD within our clinical database, the activities eventually chosen to elicit these features were derived from clinical expertise with the goal of creating activities that could be easily implemented in community care settings with minimal time or financial burdens. The completion of predictive studies is an essential next step to understanding the performance of novel tools developed through this method. Translating machine learning methods into clinically meaningful assessment practices and tools is a promising approach; however, it is also critical that novel assessment tools and procedures undergo rigorous evaluation. In our ongoing work, this tool underlies four different assessment instruments under investigation. In one model, the administration instructions and behavioral ratings have been built into an interactive app designed to guide non-expert pediatric providers through the use of the tool (Adiani et al. 2019). In a second model, the tool is used to guide parents through administration activities via telehealth, while a clinician provides coaching, observes the administration, and assigns behavioral ratings (Corona et al. 2020). In a third model, the tool is integrated into an approach to meet the identified needs of pediatric residents regarding ASD evaluation (Hine et al. 2019). Finally, in a fourth model, the tool was modified slightly for use within telehealth-to-home evaluations, in direct response to disruptions in care caused by the COVID-19 pandemic (TELE-ASD-PEDS; Wagner et al. 2020). Within all of these models, evaluation of the clinical utility, user acceptability, and psychometric properties of these assessment measures is ongoing.

By definition, the preliminary nature of this work yields several limitations and essential future directions. Analysis of data regarding this tool’s accuracy in identifying children with ASD-related concerns is in progress. The clinical dataset upon which this tool is based does not provide information regarding child race and ethnicity, medical complexity, or family variables, all of which are factors important to understanding how and for whom this assessment tool works best. Additionally, because of the point-in-time nature of the clinical assessment process, information on diagnostic stability and accuracy for children within our preliminary dataset is not available, and it is unknown how this instrument would function across a range of phenotypic profiles. Future work should also explore a range of scoring methodologies; although a Likert-style scale was chosen for ease of clinical use and behavioral anchoring, other methods may provide a more fine-grained phenotypic profile across a range of risk levels. Finally, given that the tool has been adapted for applications across provider type and setting, ongoing work is necessary to examine potential differences in scoring cut-offs and tool functionality across these groups. Addressing these limitations in ongoing work by our group and others will be essential for creating models of care that accurately identify ASD-related concerns in young children while meeting families’ diverse needs (Zwaigenbaum and Warren 2020). In addition, continuing to apply machine learning approaches as our datasets grow can help clinicians identify and interpret these patterns on a large scale.

In conclusion, the present work describes the development of one preliminary approach to identifying behaviors strongly indicative of ASD in young children in a brief amount of time, using readily available materials and play-based assessment activities. By bringing together computational approaches and clinical knowledge, this work has identified several key child behaviors predictive of ASD and developed procedures for eliciting and observing these behaviors. Ultimately, an ASD diagnosis comes not from a specific assessment tool, but from a provider trained to use assessment tools, observations, and other available clinical information to make clinical decisions (De Marchena and Miller 2017; Sheldrick et al. 2019). This work attempts to provide a streamlined way of helping providers make key behavioral observations and organize these observations to inform diagnostic decision-making. In doing so, this tool may contribute to our collective ability to serve patients and families in timely, informed ways.