Next Article in Journal
Therapeutic Potential of Cannabis: A Comprehensive Review of Current and Future Applications
Previous Article in Journal
Reactive Oxygen Species-Dependent Activation of EGFR/Akt/p38 Mitogen-Activated Protein Kinase and JNK1/2/FoxO1 and AP-1 Pathways in Human Pulmonary Alveolar Epithelial Cells Leads to Up-Regulation of COX-2/PGE2 Induced by Silica Nanoparticles
Previous Article in Special Issue
Aberrantly Expressed Hsa_circ_0060762 and CSE1L as Potential Peripheral Blood Biomarkers for ALS
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Detection of Amyotrophic Lateral Sclerosis (ALS) Comorbidity Trajectories Based on Principal Tree Model Analytics

1
Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei 106, Taiwan
2
Department of Software Systems & Cybersecurity, Monash University, Melbourne, VIC 3800, Australia
3
Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3086, Australia
4
Department of Neurology and Stroke Center, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei 100, Taiwan
*
Author to whom correspondence should be addressed.
Biomedicines 2023, 11(10), 2629; https://doi.org/10.3390/biomedicines11102629
Submission received: 7 August 2023 / Revised: 11 September 2023 / Accepted: 22 September 2023 / Published: 25 September 2023

Abstract

:
The multifaceted nature and swift progression of Amyotrophic Lateral Sclerosis (ALS) pose considerable challenges to our understanding of its evolution and interplay with comorbid conditions. This study seeks to elucidate the temporal dynamics of ALS progression and its interaction with associated diseases. We employed a principal tree-based model to decipher patterns within clinical data derived from a population-based database in Taiwan. The disease progression was portrayed as branched trajectories, each path representing a series of distinct stages. Each stage embodied the cumulative occurrence of co-existing diseases, depicted as nodes on the tree, with edges symbolizing potential transitions between these linked nodes. Our model identified eight distinct ALS patient trajectories, unveiling unique patterns of disease associations at various stages of progression. These patterns may suggest underlying disease mechanisms or risk factors. This research re-conceptualizes ALS progression as a migration through diverse stages, instead of the perspective of a sequence of isolated events. This new approach illuminates patterns of disease association across different progression phases. The insights obtained from this study hold the potential to inform doctors regarding the development of personalized treatment strategies, ultimately enhancing patient prognosis and quality of life.

1. Introduction

Amyotrophic Lateral Sclerosis (ALS) is a rare but devastating neurodegenerative disease that affects upper and lower motor neurons, primarily observed in men. The disease is typically diagnosed around the age of 62, and the median survival time for patients with ALS is approximately three years [1]. Patients often succumb to respiratory complications such as bronchopneumonia or pneumonia [2,3,4] In Taiwan, the average annual incidence and prevalence rates were 0.51 and 1.97 per 100,000 individuals, respectively. Furthermore, the economic burden of ALS management was significantly higher than average healthcare costs [5]. Despite the diverse clinical phenotypes of the disease, the pathophysiology of ALS remains unknown [2]. Although substantial efforts have been made, viable interventions to halt or slow down the progression of the disease remain elusive, underscoring the inherent complexity of ALS [6].
Studies have found comorbidities in patients with ALS at frequencies that differ from the general population [7]. These comorbidities may modulate disease progression. For example, diabetes and hypertension correlated with delayed onset of ALS, while specific cardiovascular disorders could influence the course of the disease positively or negatively [7,8,9,10]. However, most of these potential risk factors related to ALS were often assessed in isolation, stressing the need for an integrated approach to understand the intricate relationships between comorbidities and ALS [11].
Given the progressive nature of ALS, understanding the comprehensive course of the disease is crucial for optimal clinical management [12]. However, many studies often restricted the prediction of the disease to a single progression event or categorize patients into static progression phenotypes [13,14]. Such methodologies fall short of capturing the dynamic narrative of disease progression and interactions with comorbidities. Furthermore, traditional analysis methods [15,16,17,18,19,20] often interpreted disease trajectories as a chain of disease events, potentially overlooking the simultaneous presence of comorbidities in ALS patients.
Recent developments in unsupervised machine learning algorithms such as Elastic Principal Graphs (ElPiGraph) have paved the way for a more comprehensive analysis of complex multidimensional data. These techniques, widely applicable in various fields including biology and medicine, offer opportunities to construct robust and scalable models of intrinsic dataset geometry. This enables the identification and analysis of high-dimensional data points and their respective relationships [21].
Our study leverages the capabilities of the ElPiGraph method to unravel the associations between ALS and its comorbidities. We introduce a novel perspective that views ALS trajectories as a succession of disease stages characterized by cumulative disease occurrences and transitions. By transforming patient records into the same multidimensional space, our approach aims to bridge the gap between individual patient experiences. It provides a more comprehensive and nuanced interpretation of ALS progression and extracts more insightful knowledge from the available data.

2. Materials and Methods

2.1. Taiwan’s ALS Dataset

This study utilized anonymized data from 83 patients diagnosed with Amyotrophic Lateral Sclerosis (ALS) obtained from a cohort dataset of one million enrollees of Taiwan’s National Health Insurance Research Database (NHIRD) over an 18-year span (1996–2013) [22]. Of these patients, 51 (61.45%) were male and 32 (38.55%) were female. The age of their first ALS diagnosis ranged from 34 to 70 years, with an interquartile range (IQR) between 44 and 62 years. The dataset includes 50,613 related medical records documenting hospital, outpatient, and emergency visits for the 83 ALS patients. Each record includes the date and diagnosis, as represented by the International Classification of Diseases, Ninth Revision (ICD-9) [23].
Given the heterogeneity of ALS, we focused on cases representing definitive diagnoses involving both upper and lower motor neurons (ICD-9 335.20). Alongside these cases, we assembled a control group consisting of four-fold subjects with age- and gender-matched individuals for each ALS patient within a three-year age range (1000 days). This control group consisted of 332 patients without ALS, with a total of 158,677 corresponding medical records.

2.2. Study Design

This study consisted of four key stages for trajectory analysis based on principle tree approaches, including the following:
  • transformation of patient medical records into a quantifiable multi-dimensional space;
  • application of principal trees to reveal the concealed data geometry and topology;
  • identification of diseases associated with ALS across different pathways and stages to highlight comorbidity interaction;
  • extraction of different trajectories from the ALS patient group progressing to various final states.

2.2.1. Transforming Patient Medical Records into a Multidimensional Space

Each medical record includes three essential elements: (1) patient identifier (PID), (2) visit date, and (3) diagnosis of the disease according to ICD-9. We transformed each diagnosis into a one-hot encoding format, which is a binary representation where each unique diagnosis was encoded as a separate feature. The evolution of diseases was then reflected based on each patient’s cumulative count of disease occurrences (we counted each diagnosis once per patient per day).
The process of transforming patient medical records into a multidimensional space is outlined in Figure 1. The original table of patient records was first converted into a one-hot table, where each unique diagnosis was represented by its own column. For example, the number “1” indicates the presence of the diagnosis for a given patient on a particular date, while a “0” indicates its absence. This one-hot table was then transformed into its corresponding cumulated version, accumulating the count of each diagnosis over time for each patient.
We standardized the features by scaling and centering the data to unit variance. We then conducted a principal component analysis (PCA) using 60 components, as determined by the elbow rule—a method used to choose the optimal number of components by locating the “elbow” in the plot of explained variance versus the number of components [24,25]. This analysis transformed 50,613 initial records for 83 ALS patients with 493 unique diagnoses into 22,981 records in 60 dimensions.

2.2.2. Application of Principal Tree to Reveal ALS Progression Pathway

A principal tree serves as an effective data approximator, capturing complex relationships within a data space that may not be attainable with linear segments alone [26]. It helps reveal the underlying structure in the disease progression data that might remain obscured and ignored. We utilized the ElPiGraph library [21] to construct the principal trees, aiming to grasp the intricate topology and progression patterns of ALS.
Once the principal tree was constructed, we analyzed the directionality of the edges connecting the nodes. This analysis enables us to map the trajectories representing the progression of ALS. We designated the root node as the node representing the state before any diagnosis, where all cumulative counts were initially zero. As we moved further away from the root node, the nodes represented progressively advanced stages of the disease for ALS subjects, and we could determine the directions of edges accordingly. The terminal states of the trajectories were then marked as the tree’s leaf nodes.

2.2.3. Exploring Disease Associations and Comorbidity Interaction in ALS

To differentiate between disease progression and various pathways, we adopted an approach that located diseases during state transitions. This strategy involved three main tasks: detection of potential diseases that either contribute to or arise from ALS, extraction of diseases that show increased occurrences during state transitions, and identification of unique diseases associated exclusively with each branch.
To identify diseases significantly associated with ALS, we used the odds ratio, a statistical measure widely employed in epidemiological studies. The odds ratio assesses the likelihood of an event occurring in one group compared to another, thereby indicating the effect size [27,28]. In our analysis, we set an odds ratio threshold of 2 and required a 95% confidence interval that did not include zero to ensure the clinical significance of the identified associations [29,30].
Subsequently, we identified diseases associated with each edge of the principal tree, specifically those where the number of diagnoses increased during a state transition. This approach does not only highlight disease dynamics but also helps characterize different progression pathways. Additionally, we pinpointed “branch-only” diseases, those unique to a particular branch and not inherited from ancestral branches or shared with others. Identifying these unique diseases enriches our understanding of distinct disease interactions within each trajectory.

2.2.4. Extraction and Classification of ALS Disease Trajectories

We constructed disease trajectories from previous principal branches, which are paths that extend from the root node to the leaf nodes in the multidimensional space defined by cumulative counts of disease occurrences. Each patient’s final medical record is transformed into a point within this space. The “nearest” node, determined by the shortest geometric distance, is used to map each patient’s final observed state within this network of disease trajectories. If a patient’s last observed state aligns with a single trajectory, we assigned them to that specific category. However, if the patient’s final state overlapped with multiple trajectories—potentially due to truncated observable trajectories—we assigned these patients to an uncategorized group for further analysis.

3. Results

3.1. Identification of Diseases Associated with ALS

Our analysis identified several diseases with a significant association with Amyotrophic Lateral Sclerosis (ALS), as indicated by odds ratios (ORs) and 95% confidence intervals (CI) displayed in Table 1. Each listed disease varies in ORs, highlighting the relative likelihood of simultaneous diagnosis in ALS patients compared to non-ALS individuals.
Of these diseases, 28 were present at one or more state transition edges, suggesting their potential involvement in disease progression. Additionally, 26 diseases were designated as “branch-only”, appearing exclusively within certain branches and not being traceable to ancestral lines. These “branch-only” diseases suggest distinctive ALS-related disease pathways. Furthermore, 23 diseases were linked exclusively to a specific edge, reinforcing the idea of specialized disease trajectories.
Seven diseases showed an infinite OR, implying a solid association with ALS. These included “dislocation of the jaw”, “chromosomal anomalies”, “late effects of acute poliomyelitis”, “acute poliomyelitis”, “encephalitis myelitis and encephalomyelitis”, “muscular dystrophies and other myopathies”, and “disorders of other cranial nerves”.
Other diseases, such as “other diseases of spinal cord” and “Parkinson’s disease”, demonstrated high ORs, indicative of a significant connection with ALS. “Malignant neoplasm of prostate” and “diseases of white blood cells” also suggested a meaningful relationship with ALS, despite broad 95% CIs.
Several neurological conditions, including “hereditary and idiopathic peripheral neuropathy”, “infantile cerebral palsy”, and “other conditions of brain”, showed strong associations with ALS, reinforcing its neurological nature. Interestingly, conditions not directly related to the nervous system, like “septicemia”, “infections of kidney”, and “cardiac dysrhythmias”, were also associated with ALS, pointing to the disease’s potential systemic impacts.

3.2. Decoding ALS Pathways through the Principal Tree

When exploring the pathways of progression of ALS, we used a principal tree to capture the courses of disease progression. As shown in Figure 2, the tree begins with a root node (0), representing the state of the disease prior to diagnosis. The tree then bifurcates into multiple pathways, ending in 12 leaf nodes, each representing a distinct potential path of disease progression.
All patients in our study started at root node 0 and advanced to the first bifurcation at node 1, marking the onset of ALS. This transition was characterized by an increased prevalence of nine diseases, indicating that these may be early indicators of the progression of ALS. Beyond node 1, the tree split into four distinct pathways, each leading to different outcomes in the progression of the disease. The distribution of diseases and patients across unique pathways in the ALS progression principal tree were shown in Table 2.
The paths from node 1 to node 9 and from node 1 to node 2 did not show “branch-only” diseases, suggesting a common progression route for many ALS patients without developing specific diseases. However, the routes from node 1 to nodes 4 and 3 displayed unique “branch-only” diseases, hinting at alternative pathogenetic pathways. These included “Dementia” (290) on the path to node 4, while “Acute poliomyelitis” (045), “Muscular dystrophies and other myopathies” (359), and “Other paralytic syndromes” (344) were on the path to node 3.
The branch from node 3 further bifurcates into multiple pathways, each marked by different “branch-only” diseases. For example, the path from node 3 to node 8 was distinguished by “infantile cerebral palsy” (343), while the route from node 3 to node 5 was identified by “Facial nerve disorders” (351). Furthermore, the transition from node 3 to node 6 was marked by “Other unspecified disorders of the nervous system” (349), “Mononeuritis of the lower extremity and an unspecified site” (355), and “Encephalitis myelitis and encephalomyelitis” (323).
Collectively, this principal tree model offers a visualization of the varied trajectories of progression of ALS, suggesting that, while some patients follow common progression routes with no unique “branch-only” diseases, others may experience specialized pathways with distinct diseases marking their journeys.

3.3. Delineation and Analysis of Patient Trajectories in ALS Progression

Our study yielded the delineation of eight distinct trajectories representing ALS progression patterns, each substantiated by at least three patients. The varied patient trajectories in the ALS progression principal tree were shown in Figure 3 respectively. These trajectories revealed the dynamic landscape of ALS progression.
We identified a distinctive “Uncategorized Group” comprising patients whose progression patterns do not fit within any of the eight defined trajectories. These patients’ disease pathways may be shorter, failing to exceed a bifurcation point that leads to only a terminal node, suggestive of a potentially truncated ALS progression pattern. This unique progression pattern, as observed in the initial ALS records for these patients, tends to cluster at the early stages of the disease. This divergence warrants further investigation and may provide valuable insights into the heterogeneity of ALS progression.
Apart from the Uncategorized Group, our analysis classified patients into seven distinct trajectories, each reflecting the unique progression dynamics of ALS. The diversity in these trajectories echoes the high heterogeneity in the progression of ALS, underscoring the potential of distinct underlying disease mechanisms or risk factors.

4. Discussion

Amyotrophic Lateral Sclerosis (ALS) is a rapidly progressing neurodegenerative condition characterized by complex and multifaceted dynamics, which pose challenges to our understanding of its progression. In order to address the complexities, our study employed a data-driven approach to unravel the intricate patterns of ALS progression and its interplay with concurrent diseases.
We utilized a principal tree algorithm to capture the underlying data geometry forming the multifaceted ALS progression patterns. In our framework, ALS progression is conceptualized as a journey through sequential stages of the disease, each represented by a node in the tree. Each node signifies an accumulation of diseases, while the edges connecting nodes symbolize potential transitions, thus depicting the dynamic nature of disease progression and the directional shifts that reflect ALS progression in relation to cumulative comorbidities.
While the traditional literature often portrays disease trajectories as sequences of discrete disease events, our model offers a fresh perspective by viewing trajectories as progressions through stages, each comprising multiple co-existing conditions. This holistic view fosters a more genuine and subtle understanding of ALS progression, revealing the intricate relationship between ALS and its comorbid conditions.
One significant aspect of our study is the exploration of ALS’s relationship with concurrent diseases. Conventional research methods typically analyze ALS-associated risk factors independently, frequently overlooking the complex interactions among comorbid conditions. In contrast, our study contextualizes ALS within a network of co-existing diseases, underscoring the potential influence of these conditions on ALS progression. Our methodology, which converts patient records into a multidimensional space, enables a deeper understanding of disease progression by visualizing all disease states as interconnected components of a complex system.
For instance, from node 0 through node 1 to node 3, our analysis elucidates the common pathway of early symptoms encountered during the initial stages of ALS.
The early diagnosis of ALS presents a challenge due to the disease’s heterogeneity, subtle onset, and overall rarity. This often results in misdiagnoses that can prompt unsuitable treatments, such as unneeded surgeries, leading to delays in proper diagnosis and exacerbating patient morbidity [31,32,33].
In the transition from node 0 to node 1, the majority of misdiagnoses are “disorders of the peripheral nervous system (350–359)”, “dorsopathies (720–724)”, and “disorder of soft tissue (725–729)”. These non-specific conditions, often manifesting with symptoms such as muscle weakness and fatigue akin to ALS, highlight musculoskeletal and peripheral nervous system disorders in early ALS misdiagnosis.
Interestingly, the prevalence of misdiagnoses such as “anxiety, dissociative and somatoform disorders (300)” suggests a tendency towards psychosomatic interpretations of early ALS symptoms.
The shift from node 1 to node 3 sees a transition towards diseases with more defined neurological characteristics, such as pure motor weakness of limbs, muscle atrophy, and fasciculations. These conditions often share overlapping neurological symptoms with ALS, such as muscle atrophy, fasciculations, and limb weakness, contributing to misdiagnoses.
Our analysis captures the complexities of diagnosing ALS, considering the myriad conditions that can mimic its early symptoms. It emphasizes the necessity for a more sophisticated and holistic diagnostic approach that encompasses a robust understanding of ALS’s varied presentations and promotes interdisciplinary collaboration among general practitioners, neurologists, and other specialists.
Building on the foundation of our computational findings, it becomes imperative to consider integrating complementary experimental techniques, for example, in the domain of soft matter nanomechanics. Techniques such as atomic force microscopy (AFM) and optical tweezers, highlighted by Magazzù et al. [34], stand out as invaluable tools in shedding light on the biomechanical intricacies at the single-molecule level. These methods, by quantifying nanomechanical parameters like Young’s modulus, hold the potential to enrich our understanding of cellular systems, especially within the scope of neurodegenerative pathologies. Likewise, Strijkova et al. [35] have intelligently leveraged AFM to unravel the distinctive mechanical and morphological attributes of platelets in various neurodegenerative conditions, including Parkinson’s disease, ALS, and Alzheimer’s disease. Such groundbreaking experimental works underscore the imperative of meshing computational analyses with tangible experimental data. Furthermore, Varga et al. [36] employed AFM to delve into the elasticity of ALS myotubes, attributing it to a specific ALS-inducing mutation in mice. These revelations can potentially layer our understanding of ALS, bridging its pathogenesis to cellular manifestations.
While our study offers novel insights into the complexities of ALS progression, it is crucial to validate the robustness and reliability of our findings further. Future research must verify the consistency of our model’s representation of disease trajectories across various datasets and diverse patient demographics. Despite these considerations, our study introduces a pioneering approach of incorporating comorbidity data into tree edges during disease stage transitions, offering a fresh perspective on understanding ALS. We suggest that this approach has the potential to considerably enhance our comprehension of ALS and provide a solid analytical framework for similar future research. Through these contributions, we aspire to promote the development of improved care strategies and therapeutic interventions for this debilitating disease.

Author Contributions

Conceptualization, D.T. and T.-W.P.; methodology, Y.-S.W.; software, Y.-S.W.; validation, Y.-S.W.; formal analysis, Y.-S.W.; investigation, D.T., K.A., L.-K.T., and T.-W.P.; resources, L.-K.T.; data curation, Y.-S.W. and L.-K.T.; writing—original draft preparation, Y.-S.W. and K.A.; writing—review and editing, D.T. and T.-W.P.; visualization, Y.-S.W.; supervision, D.T. and T.-W.P.; project administration, D.T. and T.-W.P.; funding acquisition, T.-W.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research and the APC were funded by National Science and Technology Council (Taiwan), grant number MOST 104-2321-B-019-009 and MOST 111-2221-E-027-113-MY2.

Institutional Review Board Statement

The Institutional Review Board (IRB) of Chang Gung Memorial Hospital and National Taiwan University Hospital in Taiwan approved the study and waived the need for informed consent because the identification of subjects in the National Health Insurance Research Database (NHIRD) had been de-identified prior to their release for research in order to ensure confidentiality (IRB No. 104-5430B and 202112090RINC). We conducted this study in accordance with the Code of Ethics of the World Medical Association Declaration of Helsinki.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets are from the NHIRD and were used only for this study under policy limitation. Therefore, the data are not publicly available. Data usage regulations can be checked from the website https://nhird.nhri.org.tw/en/ (accessed on 1 September 2023).

Conflicts of Interest

All authors declare no affiliations with or involvement in any organization or entity with any financial or non-financial interest in the subject matter or materials discussed in this manuscript.

References

  1. Mead, R.J.; Shan, N.; Reiser, H.J.; Marshall, F.; Shaw, P.J. Amyotrophic Lateral Sclerosis: A Neurodegenerative Disorder Poised for Successful Therapeutic Translation. Nat. Rev. Drug Discov. 2022, 22, 185–212. [Google Scholar] [CrossRef] [PubMed]
  2. Grad, L.I.; Rouleau, G.A.; Ravits, J.; Cashman, N.R. Clinical Spectrum of Amyotrophic Lateral Sclerosis (ALS). Cold Spring Harb. Perspect. Med. 2017, 7, a024117. [Google Scholar] [CrossRef] [PubMed]
  3. Corcia, P.; Pradat, P.F.; Salachas, F.; Bruneteau, G.; le Forestier, N.; Seilhean, D.; Hauw, J.J.; Meininger, V. Causes of Death in a Post-mortem Series of ALS Patients. Amyotroph. Lateral Scler. 2009, 9, 59–62. [Google Scholar] [CrossRef] [PubMed]
  4. Masrori, P.; Van Damme, P. Amyotrophic Lateral Sclerosis: A Clinical Review. Eur. J. Neurol. 2020, 27, 1918–1929. [Google Scholar] [CrossRef] [PubMed]
  5. Tsai, C.P.; Wang, K.C.; Hwang, C.S.; Lee, I.T.; Lee, C.T.C. Incidence, Prevalence, and Medical Expenditures of Classical Amyotrophic Lateral Sclerosis in Taiwan, 1999–2008. J. Formos. Med. Assoc. 2015, 114, 612–619. [Google Scholar] [CrossRef] [PubMed]
  6. Mitsumoto, H.; Brooks, B.R.; Silani, V. Clinical Trials in Amyotrophic Lateral Sclerosis: Why so Many Negative Trials and How Can Trials Be Improved? Lancet Neurol. 2014, 13, 1127–1138. [Google Scholar] [CrossRef] [PubMed]
  7. Diekmann, K.; Kuzma-Kozakiewicz, M.; Piotrkiewicz, M.; Gromicho, M.; Grosskreutz, J.; Andersen, P.M.; de Carvalho, M.; Uysal, H.; Osmanovic, A.; Schreiber-Katz, O.; et al. Impact of Comorbidities and Co-Medication on Disease Onset and Progression in a Large German ALS Patient Group. J. Neurol. 2020, 267, 2130–2141. [Google Scholar] [CrossRef] [PubMed]
  8. Körner, S.; Kollewe, K.; Ilsemann, J.; Müller-Heine, A.; Dengler, R.; Krampfl, K.; Petri, S. Prevalence and Prognostic Impact of Comorbidities in Amyotrophic Lateral Sclerosis. Eur. J. Neurol. 2013, 20, 647–654. [Google Scholar] [CrossRef]
  9. Chen, L.; Xu, L.; Tang, L.; Xia, K.; Tian, D.; Zhang, G.; Wang, Y.; Yu, Z.; Ma, J.; Zhang, Y.; et al. Trends in the Clinical Features of Amyotrophic Lateral Sclerosis: A 14-Year Chinese Cohort Study. Eur. J. Neurol. 2021, 28, 2893–2900. [Google Scholar] [CrossRef]
  10. Xu, K.; Ji, H.; Hu, N. Cardiovascular Comorbidities in Amyotrophic Lateral Sclerosis: A Systematic Review. J. Clin. Neurosci. 2022, 96, 43–49. [Google Scholar] [CrossRef]
  11. Bellavia, A.; Dickerson, A.S.; Rotem, R.S.; Hansen, J.; Gredal, O.; Weisskopf, M.G. Joint and Interactive Effects between Health Comorbidities and Environmental Exposures in Predicting Amyotrophic Lateral Sclerosis. Int. J. Hyg. Environ. Health 2021, 231, 113655. [Google Scholar] [CrossRef] [PubMed]
  12. Majmudar, S.; Wu, J.; Paganoni, S. Rehabilitation in Amyotrophic Lateral Sclerosis: Why It Matters. Muscle Nerve 2014, 50, 4–13. [Google Scholar] [CrossRef] [PubMed]
  13. Taylor, A.A.; Fournier, C.; Polak, M.; Wang, L.; Zach, N.; Keymer, M.; Glass, J.D.; Ennist, D.L. Predicting Disease Progression in Amyotrophic Lateral Sclerosis. Ann. Clin. Transl. Neurol. 2016, 3, 866–875. [Google Scholar] [CrossRef] [PubMed]
  14. Huang, C.H.; Yip, B.S.; Taniar, D.; Hwang, C.S.; Pai, T.W. Comorbidity Pattern Analysis for Predicting Amyotrophic Lateral Sclerosis. Appl. Sci. 2021, 11, 1289. [Google Scholar] [CrossRef]
  15. Jensen, A.B.; Moseley, P.L.; Oprea, T.I.; Ellesøe, S.G.; Eriksson, R.; Schmock, H.; Jensen, P.B.; Jensen, L.J.; Brunak, S. Temporal Disease Trajectories Condensed from Population-Wide Registry Data Covering 6.2 Million Patients. Nat. Commun. 2014, 5, 4022. [Google Scholar] [CrossRef] [PubMed]
  16. Siggaard, T.; Reguant, R.; Jørgensen, I.F.; Haue, A.D.; Lademann, M.; Aguayo-Orozco, A.; Hjaltelin, J.X.; Jensen, A.B.; Banasik, K.; Brunak, S. Disease Trajectory Browser for Exploring Temporal, Population-Wide Disease Progression Patterns in 7.2 Million Danish Patients. Nat. Commun. 2020, 11, 4952. [Google Scholar] [CrossRef] [PubMed]
  17. Pinaire, J.; Aze, J.; Bringay, S.; Poncelet, P.; Genolini, C.; Landais, P. Hospital Healthcare Flows: A Longitudinal Clustering Approach of Acute Coronary Syndrome in Women over 45 Years. Health Inform. J. 2021, 27, 14604582211033020. [Google Scholar] [CrossRef] [PubMed]
  18. Wu, Y.-S.; Taniar, D.; Adhinugraha, K.; Wang, C.-H.; Gung, C.; Pai, W. Progression to Myocardial Infarction Short-Term Death Based on Interval Sequential Pattern Mining. BMC Cardiovasc. Disord. 2023, 23, 394. [Google Scholar] [CrossRef]
  19. Tavazzi, E.; Daberdaku, S.; Zandonà, A.; Vasta, R.; Nefussy, B.; Lunetta, C.; Mora, G.; Mandrioli, J.; Grisan, E.; Tarlarini, C.; et al. Predicting Functional Impairment Trajectories in Amyotrophic Lateral Sclerosis: A Probabilistic, Multifactorial Model of Disease Progression. J. Neurol. 2022, 269, 3858–3878. [Google Scholar] [CrossRef]
  20. Tavazzi, E.; Gatta, R.; Vallati, M.; Cotti Piccinelli, S.; Filosto, M.; Padovani, A.; Castellano, M.; Di Camillo, B. Leveraging Process Mining for Modeling Progression Trajectories in Amyotrophic Lateral Sclerosis. BMC Med. Inform. Decis. Mak. 2022, 22, 346. [Google Scholar] [CrossRef]
  21. Albergante, L.; Mirkes, E.; Bac, J.; Chen, H.; Martin, A.; Faure, L.; Barillot, E.; Pinello, L.; Gorban, A.; Zinovyev, A. Robust and Scalable Learning of Complex Intrinsic Dataset Geometry via ElPiGraph. Entropy 2020, 22, 296. [Google Scholar] [CrossRef] [PubMed]
  22. Lin, L.Y.; Warren-Gash, C.; Smeeth, L.; Chen, P.C. Data Resource Profile: The National Health Insurance Research Database (NHIRD). Epidemiol. Health 2018, 40, e2018062. [Google Scholar] [CrossRef] [PubMed]
  23. ICD. ICD-9-CM—International Classification of Diseases, Ninth Revision, Clinical Modification. Available online: https://www.cdc.gov/nchs/icd/icd9cm.htm (accessed on 23 September 2023).
  24. Nguyen, L.H.; Holmes, S. Ten Quick Tips for Effective Dimensionality Reduction. PLoS Comput. Biol. 2019, 15, e1006907. [Google Scholar] [CrossRef] [PubMed]
  25. Pedregosa, F.; Michel, V.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Vanderplas, J.; Cournapeau, D.; Pedregosa, F.; Varoquaux, G.; et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  26. Gorban, A.N.; Sumner, N.R.; Zinovyev, A.Y. Topological Grammars for Data Approximation. Appl. Math. Lett. 2007, 20, 382–386. [Google Scholar] [CrossRef]
  27. Szumilas, M. Explaining Odds Ratios. J. Can. Acad. Child Adolesc. Psychiatry 2010, 19, 227. [Google Scholar] [PubMed]
  28. Simon, S.D.; Biostatistician, R. Understanding the Odds Ratio and the Relative Risk. J. Androl. 2001, 22, 533–536. [Google Scholar] [CrossRef]
  29. Andrade, C. Understanding Relative Risk, Odds Ratio, and Related Terms: As Simple as It Can Get. J. Clin. Psychiatry 2015, 76, 21865. [Google Scholar] [CrossRef]
  30. Andrade, C. How to Understand the 95% Confidence Interval Around the Relative Risk, Odds Ratio, and Hazard Ratio: As Simple as It Gets. J. Clin. Psychiatry 2023, 84, 47304. [Google Scholar] [CrossRef]
  31. Srinivasan, J.; Scala, S.; Jones, H.R.; Saleh, F.; Russell, J.A. Inappropriate Surgeries Resulting from Misdiagnosis of Early Amyotrophic Lateral Sclerosis. Muscle Nerve 2006, 34, 359–360. [Google Scholar] [CrossRef]
  32. Richards, D.; Morren, J.A.; Pioro, E.P. Time to Diagnosis and Factors Affecting Diagnostic Delay in Amyotrophic Lateral Sclerosis. In Amyotrophic Lateral Sclerosis; Exon Publications: Brisbane, Australia, 2021; pp. 15–34. [Google Scholar] [CrossRef]
  33. Turner, M.R.; Talbot, K. Mimics and Chameleons in Motor Neurone Disease. Pract. Neurol. 2013, 13, 153–164. [Google Scholar] [CrossRef]
  34. Magazzù, A.; Marcuello, C. Investigation of Soft Matter Nanomechanics by Atomic Force Microscopy and Optical Tweezers: A Comprehensive Review. Nanomaterials 2023, 13, 963. [Google Scholar] [CrossRef]
  35. Strijkova, V.; Todinova, S.; Andreeva, T.; Langari, A.; Bogdanova, D.; Zlatareva, E.; Kalaydzhiev, N.; Milanov, I.; Taneva, S.G. Platelets’ Nanomechanics and Morphology in Neurodegenerative Pathologies. Biomedicines 2022, 10, 2239. [Google Scholar] [CrossRef]
  36. Varga, B.; Martin-Fernandez, M.; Hilaire, C.; Sanchez-Vicente, A.; Areias, J.; Salsac, C.; Cuisinier, F.J.G.; Raoul, C.; Scamps, F.; Gergely, C. Myotube Elasticity of an Amyotrophic Lateral Sclerosis Mouse Model. Sci. Rep. 2018, 8, 5917. [Google Scholar] [CrossRef]
Figure 1. Schematic representation of transforming patient medical records into a multidimensional space. The process consists of three steps. (A) The original table of patient records, with each record featuring the patient identifier (PID), the hospital visiting date, and the disease diagnosis. (B) Conversion of the original table into a one-hot table, with each unique diagnosis represented as a separate column. In this format, a ‘1′ signifies the presence of the diagnosis for a given patient on a particular date, while a ‘0′ denotes its absence. (C) Transformation of the one-hot table into a cumulative table, aggregating the occurrence count of each diagnosis over time for each patient. This cumulative representation forms the basis for further multidimensional data analysis.
Figure 1. Schematic representation of transforming patient medical records into a multidimensional space. The process consists of three steps. (A) The original table of patient records, with each record featuring the patient identifier (PID), the hospital visiting date, and the disease diagnosis. (B) Conversion of the original table into a one-hot table, with each unique diagnosis represented as a separate column. In this format, a ‘1′ signifies the presence of the diagnosis for a given patient on a particular date, while a ‘0′ denotes its absence. (C) Transformation of the one-hot table into a cumulative table, aggregating the occurrence count of each diagnosis over time for each patient. This cumulative representation forms the basis for further multidimensional data analysis.
Biomedicines 11 02629 g001
Figure 2. Principal tree illustrating heterogeneous pathways of progression of ALS disease. The tree originates from a root node labeled as “0”, and it is split into multiple pathways that end at twelve leaf nodes. Each leaf node represents a unique trajectory of progression of ALS. The tree captures the presence of “branch-only” diseases along specific routes, highlighting distinct disease progression pathways. The numbers shown within the nodes serve as identifiers and possess no intrinsic meaning. The nodes are linked by edges, the width of edge corresponding to the number of patients following a particular pathway. The presence of a disease is indicated by its ICD-9 code. Points scattered throughout the figure represent individual patient medical visits, color-coded for patient differentiation.
Figure 2. Principal tree illustrating heterogeneous pathways of progression of ALS disease. The tree originates from a root node labeled as “0”, and it is split into multiple pathways that end at twelve leaf nodes. Each leaf node represents a unique trajectory of progression of ALS. The tree captures the presence of “branch-only” diseases along specific routes, highlighting distinct disease progression pathways. The numbers shown within the nodes serve as identifiers and possess no intrinsic meaning. The nodes are linked by edges, the width of edge corresponding to the number of patients following a particular pathway. The presence of a disease is indicated by its ICD-9 code. Points scattered throughout the figure represent individual patient medical visits, color-coded for patient differentiation.
Biomedicines 11 02629 g002
Figure 3. Detailed illustration of varied patient trajectories in the ALS progression principal tree. The figure consists of eight subgraphs, each representing a unique disease progression pathway undertaken by a specific patient group, originating from the initial onset (node 0) to differing terminal stages. Subgraphs are sorted based on the number of patients in each group. The “Uncategorized Group” encompasses patients whose progression patterns do not fit into any trajectory. The trajectories of these patients may be too short, failing to surpass a bifurcation point leading to a terminal node, indicative of a potentially truncated ALS progression pattern. Each of the subsequent subgraphs elucidates distinct ALS progression dynamics, characterized by changes in associated diseases (denoted by ICD-9 codes) along each pathway. The red points across the pathways denote the location of the first ALS onset for each patient, highlighting the varied progression patterns within ALS. The title of each subgraph specifies the particular trajectory along with the respective patient count.
Figure 3. Detailed illustration of varied patient trajectories in the ALS progression principal tree. The figure consists of eight subgraphs, each representing a unique disease progression pathway undertaken by a specific patient group, originating from the initial onset (node 0) to differing terminal stages. Subgraphs are sorted based on the number of patients in each group. The “Uncategorized Group” encompasses patients whose progression patterns do not fit into any trajectory. The trajectories of these patients may be too short, failing to surpass a bifurcation point leading to a terminal node, indicative of a potentially truncated ALS progression pattern. Each of the subsequent subgraphs elucidates distinct ALS progression dynamics, characterized by changes in associated diseases (denoted by ICD-9 codes) along each pathway. The red points across the pathways denote the location of the first ALS onset for each patient, highlighting the varied progression patterns within ALS. The title of each subgraph specifies the particular trajectory along with the respective patient count.
Biomedicines 11 02629 g003
Table 1. Diseases significantly associated with Amyotrophic Lateral Sclerosis (ALS). Diseases are denoted by the International Classification of Diseases (ICD-9) codes and listed alongside their respective odds ratios (ORs) and 95% confidence intervals (CIs).
Table 1. Diseases significantly associated with Amyotrophic Lateral Sclerosis (ALS). Diseases are denoted by the International Classification of Diseases (ICD-9) codes and listed alongside their respective odds ratios (ORs) and 95% confidence intervals (CIs).
ICD-9Disease NameOR95% CI
038Septicemia2.851.28–6.17
045Acute poliomyelitisinf1.68–inf
138Late effects of acute poliomyelitisinf6.10–inf
185Malignant neoplasm of prostate16.61.61–827.73
242Thyrotoxicosis with or without goiter3.351.26–8.64
276Disorders of fluid electrolyte and acid–base balance41.86–8.54
288Diseases of white blood cells16.61.61–827.73
290Dementias4.391.58–12.24
300Anxiety, dissociative, and somatoform disorders21.20–3.37
311Depressive disorder, not elsewhere classified3.231.16–8.71
323Encephalitis myelitis and encephalomyelitisinf1.68–inf
331Other cerebral degenerations5.071.25–21.58
332Parkinson’s disease18.354.78–103.91
333Other extrapyramidal disease and abnormal movement disorders4.441.67–11.78
336Other diseases of spinal cord24.935.28–236.30
343Infantile cerebral palsy8.481.77–53.57
344Other paralytic syndromes5.12.34–11.17
345Epilepsy and recurrent seizures3.291.01–10.30
348Other conditions of brain7.92.30–30.90
349Other and unspecified disorders of the nervous system3.661.85–7.18
351Facial nerve disorders6.981.33–45.92
352Disorders of other cranial nervesinf2.70–inf
353Nerve root and plexus disorders5.42.97–9.83
354Mononeuritis of upper limb and mononeuritis multiplex3.331.76–6.23
355Mononeuritis of lower limb and unspecified site3.821.54–9.34
356Hereditary and idiopathic peripheral neuropathy15.366.77–37.64
357Inflammatory and toxic neuropathy6.822.92–16.47
358Myoneural disorders7.863.08–21.27
359Muscular dystrophies and other myopathiesinf16.82–inf
360Disorders of the globe4.931.51–16.50
427Cardiac dysrhythmias2.111.17–3.73
434Occlusion of cerebral arteries3.151.64–5.99
436Acute, but ill-defined, cerebrovascular disease5.062.28–11.32
438Late effects of cerebrovascular disease2.591.21–5.39
459Other disorders of circulatory system4.981.39–18.48
518Other diseases of lung6.282.73–14.81
590Infections of kidney2.751.01–7.14
714Rheumatoid arthritis and other inflammatory polyarthropathies2.491.10–5.42
718Other derangement of joint3.221.28–7.86
721Spondylosis and allied disorders2.351.40–3.99
722Intervertebral disc disorders2.181.28–3.71
728Disorders of muscle ligament and fascia2.491.46–4.23
729Other disorders of soft tissues3.591.70–8.47
758Chromosomal anomaliesinf1.68–inf
813Fracture of radius and ulna3.421.13–10.00
830Dislocation of jawinf1.68–inf
891Open wound of knee, leg (except thigh), and ankle2.181.08–4.29
952Spinal cord injury without evidence of spinal bone injury8.481.77–53.57
Table 2. Distribution of diseases and patients across unique pathways in the ALS progression principal tree. Each row represents an edge (transition between nodes) within the ALS progression tree model. Diseases are marked by ICD-9 codes and categorized into “Associated Diseases” and “Branch-Only Diseases”. “Associated Diseases” represent all illnesses observed along a specific edge, whereas “Branch-Only Diseases” specify those unique to a specific edge, exclusive of those shared with other branches originating from the same parent node or inherited from ancestral nodes. The “Number of Patients” column quantifies the patients traversing each pathway, presented as both a numerical count and a percentage relative to the total patient cohort. This table underscores the complex intertwining of disease states across varying ALS progression paths and emphasizes the disease’s heterogeneity.
Table 2. Distribution of diseases and patients across unique pathways in the ALS progression principal tree. Each row represents an edge (transition between nodes) within the ALS progression tree model. Diseases are marked by ICD-9 codes and categorized into “Associated Diseases” and “Branch-Only Diseases”. “Associated Diseases” represent all illnesses observed along a specific edge, whereas “Branch-Only Diseases” specify those unique to a specific edge, exclusive of those shared with other branches originating from the same parent node or inherited from ancestral nodes. The “Number of Patients” column quantifies the patients traversing each pathway, presented as both a numerical count and a percentage relative to the total patient cohort. This table underscores the complex intertwining of disease states across varying ALS progression paths and emphasizes the disease’s heterogeneity.
EdgeAssociated Diseases (ICD-9)Branch-Only Diseases (ICD-9)Number of Patients
1–3045, 729, 722, 721, 359, 353, 438, 427, 344, 300, 728, 138, 357045, 359, 438,
427, 344, 138
31 (37.35%)
1–42902908 (9.64%)
1–9721, 729, 722, 436 1 (1.20%)
1–2335 10 (12.05%)
3–11332, 729, 722, 721, 352, 813, 353, 344, 300, 728, 311352, 813, 332, 3113 (3.61%)
3–7729, 356, 438, 427, 300, 434, 436 5 (6.02%)
3–8343, 045, 1383436 (7.23%)
3–5434, 351, 4383519 (10.84%)
3–10722, 359, 353, 427, 300, 1851853 (3.61%)
3–6335, 721, 356, 355,
300, 323, 349, 357
349, 355, 3234 (4.82%)
11–17332, 718, 300, 290, 813290, 7181 (1.20%)
11–14729, 722, 721, 353, 344,
355, 300, 728, 311
3551 (1.20%)
11–16352 1 (1.20%)
7–12434, 356, 436, 438 3 (3.61%)
7–13335, 332, 729, 359, 438, 427, 3003322 (2.41%)
0–1335, 729, 722, 721, 353,
300, 728, 357, 436
335, 729, 722, 721, 353, 300, 728, 357, 43655 (66.27%)
17–22332, 718, 300, 290, 813 1 (1.20%)
22–24332, 718, 300, 290, 813 1 (1.20%)
14–20729, 722, 721, 353, 344,
355, 300, 728, 311
1 (1.20%)
16–21352 1 (1.20%)
10–18729, 722, 359, 353, 427, 300, 185 3 (3.61%)
21–23352 1 (1.20%)
6–15335, 721, 356, 355,
300, 323, 349, 357
4 (4.82%)
13–19335, 332, 729, 359,
438, 427, 300, 434
2 (2.41%)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, Y.-S.; Taniar, D.; Adhinugraha, K.; Tsai, L.-K.; Pai, T.-W. Detection of Amyotrophic Lateral Sclerosis (ALS) Comorbidity Trajectories Based on Principal Tree Model Analytics. Biomedicines 2023, 11, 2629. https://doi.org/10.3390/biomedicines11102629

AMA Style

Wu Y-S, Taniar D, Adhinugraha K, Tsai L-K, Pai T-W. Detection of Amyotrophic Lateral Sclerosis (ALS) Comorbidity Trajectories Based on Principal Tree Model Analytics. Biomedicines. 2023; 11(10):2629. https://doi.org/10.3390/biomedicines11102629

Chicago/Turabian Style

Wu, Yang-Sheng, David Taniar, Kiki Adhinugraha, Li-Kai Tsai, and Tun-Wen Pai. 2023. "Detection of Amyotrophic Lateral Sclerosis (ALS) Comorbidity Trajectories Based on Principal Tree Model Analytics" Biomedicines 11, no. 10: 2629. https://doi.org/10.3390/biomedicines11102629

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop