Introduction

During the past decade, novel assessments focusing on different aspects of spinal cord injury (SCI) were developed to obtain good psychometric properties for clinical trials. These multidimensional functional scores such as Spinal Cord Independence Measure (SCIM)1 or Graded Redefined Assessment of Strength Sensibility and Prehension2 have been developed as outcome measures specifically for the SCI population to be used to quantify the effects of novel therapeutic interventions.2 Although fulfilling the needs of researchers in terms of having sound psychometric properties, they have gained less acceptance among clinical practitioners because of their complexity, time required to perform the assessments and nonintuitive interpretation. Examples of clinically well-accepted scores are the Frankel scale3 and its successor the American Spinal Injury Association (ASIA) Impairment scale (AIS),4 which both classify patients on a five-point scale in an ascending order according to the extent of the neurological impairment caused by the SCI. Their regular clinical use is mainly based on their good clinical interpretability and easiness of being communicated to others. Each grade represents a distinct clinical impression of a patient; for example, patients with a complete lesion are unable to walk and have a poor prognosis to recover substantial motor and sensory function. Although distinct categories exist for the classification of the neurological impairment, their counterparts for categorization of the overall functional status of a patient are missing. The need for a functional classification scheme based on a limited number of grades was addressed recently by the introduction of the SCIM grades.5, 6 The SCIM grades consist of eight clusters describing distinct patterns in SCIM’s 17 subitems, which were derived by a purely data-driven approach (k-medoides cluster analysis). The respective grades represent rather abstract values without a direct clinical relevance and are difficult to communicate to professionals and patients. The Walking Index for Spinal Cord Injury Version II (WISCI)7 is another frequently used capacity measure of walking function for use in clinical trials, which captures the extent and nature of assistance that persons with SCI require to walk.8 By design, it captures only walking function and not overall functional status, and it accordingly has floor effects for nonambulating subjects, as there is only one level available for grading nonambulating subjects.

A different approach to setting up a grading scheme for describing pathological motor function is based on specific locomotor stages (LSs) developed for infants suffering from cerebral palsy (CP) by Vojta.9 Respective stages capture in an ascending order: initiation of a targeted movement, locomotion requiring the use of the arms and legs, bipedal ambulation, and a one-legged stand for >3 s (Stork test).10 They represent a hierarchical 10-point scale, with every stage having a clear, clinically meaningful description of specific locomotor patterns.11 The progress of an individual with SCI during rehabilitation is to some extent comparable to the typical development of ontogenic locomotion in children. Normative studies in the first half of the twentieth century provided detailed, qualitative descriptions of ordered sequences of stages for prone and upright locomotion.12, 13 The recovery of ambulation after SCI14 also represents a hierarchical process and follows to some degree distinct stages15—for example, a certain degree of trunk stability is a prerequisite for the ability to stand, which is a precondition for achieving independent ambulation.

Therefore, the aim of this project was to apply and adapt LSs derived from CP to SCI and to quantify the inter-rater reliability and validity of this rating scale. We hypothesized that despite the different underlying pathophysiological mechanisms of CP and SCI the LSs originally designed for classification of CP can be adapted to SCI with sound (validity correlation coefficient >0.7 (Fitzpatrick et al.16) and reliability correlation coefficient >0.7 (Aaronson et al.17)) psychometric properties.

Materials and methods

As a first step, the 10 LSs (Table 1) originally developed for the assessment of children with CP were assessed in an SCI cohort. The aim of this initial pilot study was twofold: (1) to examine the basic applicability and the distribution of the LSs for CP in a typical SCI cohort and to (2) test the inter-rater reliability in individuals with SCI. On the basis of the results of this pilot study, an expert-based consensus process was initiated to adapt the CP LSs for consistent classification of the motor skills of individuals with SCI.

Table 1 Locomotor Stages in Spinal Cord Injury (LOSSCI) transformation table

At the same time, a validity trial was conducted in a separate SCI cohort at a different SCI center. In the framework of this trial, study participants were assessed with the original LSs for CP. However, the validity analysis was performed with the SCI-adapted LSs.

Pilot study on applicability and reliability in SCI

The inter-rater reliability of the original LSs was conducted in 2006 at the Spinal Cord Injury Center, Werner Wicker Clinic, Bad Wildungen, Germany by two trained (1 day of training in theoretical and practical issues, including supervision) physical therapists. Sample size was determined before study onset on the basis of reliability study correlation coefficients of the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI),4 the Functional Independence Measure18 and the Barthel Index. The planned sample size of approximately 70 patients was calculated on the basis of an estimated correlation between raters of at least 0.8±0.1 of the above-mentioned established scores. Inclusion criteria were subjects at least 8 weeks after SCI (without any maximal time since injury constraint), aged between 18 and 80 years and two successive assessments by two raters within 1–5 days. The study protocol was approved by the local ethics committee (application number: 15/06). All participants gave written informed consent.

Validity trial

Two databases were combined for the evaluation of the construct validity. The local Heidelberg data sets of the European Multicenter Study on Human Spinal Cord Injury (EMSCI)14 provided the data source for the SCIM.1 EMSCI uses a fixed time schedule referenced to the date of injury. It is tested within 2, 4±1.7, 12±2, 24±2.6 and 48±7.1 weeks after injury (WAI). The SCIM assessment was chosen as a gold standard and reference for the validity determination, because it is thoroughly investigated and it documents functional scores covering many functional aspects of daily living in SCI subjects.

LSs were assessed by 10 trained physical therapists from 2002 to 2011 in Heidelberg. LSs were documented for each of the study participants every 4 weeks. Data sets from both sources were matched based on the rule that respective assessment dates were not allowed to differ by >15 days. This threshold was chosen as a compromise between a high sample size and a tolerable probability of functional or neurological changes that can occur in the acute and subacute rehabilitation phase time window. The EMSCI protocol was approved (S-188/2003) by the ethics commission of the medical faculty of the Heidelberg University.

Spinal Cord Independence Measure

The SCIM is a comprehensive ability rating scale specifically for SCI individuals. It captures independence on performing everyday tasks in 17 items categorized into self-care, respiration and sphincter management, mobility (indoors and outdoors, on even surface) and mobility outdoors.

The SCIM was revised in 2001 (SCIM II1) and in 2007 (SCIM III5). In SCIM III, single items were changed or deleted, but the category and total sum scores remained unchanged. Accordingly, EMSCI switched from SCIM II to SCIM III in 2007. The majority of subjects in the validity cohort were assessed with the SCIM II (76.4%); 23.6% were scored with the SCIM III.

Statistical analysis

Accuracy, defined as the percentage of raw agreement between raters and Cohen’s weighted κ coefficients, was used to quantify inter-rater reliability. Validity was evaluated by Spearman’s rank correlation coefficients, which are interpreted as weak (0.2–0.4), moderate (0.4–0.7), strong (0.7–0.9) and excellent (0.9–1.0).19 The threshold for significance was α<0.05. Statistical analysis was performed with R 3.1.2.20

Results

Applicability

The original LSs were assessed in 65 patients (77% male, 43.1% tetraplegia, 49% motor and sensory incomplete lesion). Mean age was 44.9±16.0 years, and the time between injury and testing was 10.8±7.6 years. The two raters conducted most (35.4%) of the assessments per individual within 1 day, with a median of 2 days.

The LSs for CP have a nonuniform distribution in the assessed SCI population (Figure 1a). Grades 0 and 4 are missing and the upper stages 6 are sparsely represented.

Figure 1
figure 1

Applicability study results. (a) Distribution of the original LSs developed for CP. (b) Distribution after transformation of the data set (a) into the transformed LSs for SCI. The transformation is (1) color-coded and (2) indicated by arrows between panels (a) and (b). Stage 4 of the original LSs was not found in the SCI cohort, and it was accordingly omitted in panel (a).

Reliability of the original LSs

Weighted Cohen’s kappa was 0.985 (P<0.0001, confidence interval (CI): 0.968 1.000). Only four mismatches between raters were found, resulting in an accuracy of 93.4% (CI: 84.1%, 98.2%).

CP LS adaptation to SCI

On the basis of expert consensus, the 10-point scale of the LSs as described by Vojta for CP11, 21 were adjusted for the SCI population. The consensus process was twofold. First, an expert panel consisting of all authors provided the conceptual basis for the adaptation of the LSs. Among the seven experts were five experienced physical therapists (BM-B >25 years in SCI medicine, Imke Smoor >5 years, AvR >20 years; MSc in physical therapy, Irma Stierle >30 years, GD>10 years; MSc in physical therapy) and two experienced researchers (RR >15 years; PhD in Biomedical Engineering, CS >10 years; MSc in Medical Informatics). The main outcome of the expert panel discussions was the need to adapt the CP LSs to properly reflect functional deficits related to a spinal cord lesion. As a result, stages mainly determined by impairments of higher cerebral function—particularly relevant in children with CP—were omitted or pooled, with adjacent stages primarily focusing on locomotor function. In a second step, the distribution of the original LSs for CP (refer to ‘Applicability’ section) of the applicability cohort was analyzed for the final transformation. The composition and description of the Locomotor Stages in Spinal Cord Injury (LOSSCI) stages were iteratively modified to achieve a distribution as uniform as possible with the constraint of avoiding too many subjects classified with the lowest and highest LOSSCI stages (ceiling and floor effects). A statistician from the Medical Biometry Department of Heidelberg University Hospital supported this step. Individuals with SCI were not consulted in the decision-making process because of their lack of expert knowledge in the field of locomotion assessment.

The adapted five-point scale was named LOSSCI (Figure 1b and Table 1) and subsequently analyzed in terms of reliability and validity. The original stage 0 (Vojta11) tests the ability to initiate face-to-face contact. As this ability is generally not impaired in individuals with SCI, stage 0 was omitted and stage 1 was transferred to LOSSCI I.

Stage 4 was omitted, because this LS for CP does not exist in the SCI cohort. Patients of stages 5 and 6 need the help of upper-limb function to achieve locomotion and were therefore fused into one group (LOSSCI IV). With respect to activities of daily living and independent locomotion, the ability to perform a one-legged stand was not considered relevant enough to define a separate LS. Therefore, stages 7–9, which are defined by the ability to walk independently and to perform a one-legged stand, were combined with LOSSCI V.

Functional description of the LOSSCI stages

LOSSCI I tests the ability to control the position of the body so that it is possible to carry out isolated goal-directed movements of the arms. LOSSCI II tests the ability to upright the trunk against gravity in prone position and to perform isolated goal-directed movement with one arm. LOSSCI III tests locomotion in prone position using either the arms or the arms and legs (creeping—stomach remains in contact with the ground). LOSSCI IV tests the ability to move the body forward either by crawling (support on the hands and knees—stomach not on the ground) or by a verticalized bipedal gait with support through the arms using a walking aid, such as crutches or a wheeled walker. Substantial arm function is mandatory in LOSSCI IV, and its presence is the main discriminating factor between this stage and the highest stage. LOSSCI V represents the ability to walk bipedally without walking aids and the ability to perform a one-legged stand (this is considered a higher skill important for walking up and down the stairs independently).

Assessment procedure

Each LOSSCI stage should be evaluated in ascending order. The LOSSCI grading finally assigned to an examined subject is determined by the highest stage this subject can accomplish. This means that for testing LOSSCI I–III the subject is placed in the prone position on the treatment table. To qualify for LOSSCI III or IV, the subject has to creep/crawl for one full cycle using both arms, thereby fulfilling the criteria for quadrupedal locomotion. Physical assistance is not allowed during assessment. The need for walking aids such as braces (except ankle-foot orthosis), crutches or walkers for bipedal locomotion discriminates between LOSSCI IV and V. Someone who needs walking aids to achieve assisted bipedal ambulation is graded as LOSSCI IV. If no walking aids beside an ankle foot orthosis are needed for independent bipedal locomotion, a subject is graded as LOSSCI stage V. Experienced therapists may start the assessment testing the most probable stage. However, it is still recommended to test and document the examination results of all stages and not just test for the highest possible stage, which in the end determines the LOSSCI stage. With this approach, a subject with central cord syndrome and the ability to walk has to be graded as LOSSCI IV. For better comprehensibility, it is recommended to document that the LOSSCI I and II criteria are fulfilled but not the LOSSCI III criterion.

Example 1: Neurological Level of Injury (NLI) C5, AIS A.

A 31-year-old man with a complete cervical SCI caused by a car accident was assessed at all five EMSCI time frames during the first year after injury using the ISNCSCI, SCIM and LOSSCI assessments. The NLI fluctuated between C4 and C5 (C5, C5, C4, C5, C4). The AIS grade was A at all assessments. The upper-extremity motor scores (UEMS) were 14, 14, 18, 20 and 20. The lower-extremity motor score (LEMS) remained at 0. The total SCIM score was successively 0, 13, 17, 18 and 12 (deterioration was because of loss in bladder and self-care function). LOSSCI was graded as I at all assessments, because the subject was not able to upright the trunk in prone position through propping on arms (LOSSCI II) or to move forward using just his limited arm function (LOSSCI III).

Example 2: Central Cord Syndrome incomplete C4 lesion.

The course of recovery of a 64-year-old man with an incomplete cervical lesion for the five EMSCI time frames was as follows: NLI: C4, C4, C5, C5, C4; AIS: C, D, D, D, D; UEMS: 12, 20, 37, 37, 41; LEMS: 10, 30, 47, 47, 47; SCIM: 10, 19, missing, 69, 81. Besides the first assessment, the LEMS is approximately 10 points higher than the UEMS, indicating a central cord syndrome lesion.22 Three LOSSCI assessments are available for the first (LOSSCI I), the second (LOSSCI II) and the fourth (LOSSCI IV) time frame. LOSSCI V was not reached, because a one-legged stand was not possible for >3 s.

Reliability of LOSSCI

Weighted Cohen’s kappa was 0.976 (P<0.0001, CI: 0.948, 1.000). Only three mismatches between raters were found, resulting in an overall accuracy of 95.1% (CI: 86.3%, 99.0%). Two mismatches were found for LOSSCI stage II, which were classified one time as LOSSCI I and as LOSSCI III, respectively, by the second rater. The third discrepancy was found for LOSSCI IV, which was rated as LOSSCI III by the other rater.

Validity

The validity data sets consist of 448 SCIM records from 161 patients (2.8±0.9 records per patient) obtained in the EMSCI time frames within the first year after injury (2 WAI: 20%; 4 WAI: 31%; 12 WAI: 31%; 24 WAI: 16%; 48 WAI: 1%). Of these 161 patients, 65.8% were male, 54.7% were tetraplegic and 64.8% had an incomplete lesion. ISNCSCI data sets were available for 398 of the 448 SCIM records. Distributions of the AIS and NLI are shown in Figure 2. The correlation (Figure 3) between LOSSCI and AIS (coded as AIS A=1, AIS B=2, AIS C=3 and AIS D=4) is weak (R=0.30, P<0.05). Mean age at the time of injury was 48.3±20.2 years, and the time between injury and testing was 66.9±56.5 days. The mean difference of assessment dates (SCIM versus LOSSCI) was 5.7±4.4 days (range 0–15).

Figure 2
figure 2

(a) Distributions of the NLI and (b) AIS in the validity cohort. ND, not determinable.

Figure 3
figure 3

Box plot of the correlation between AIS and LOSSCI. Spearman's rank correlation coefficient R is 0.30 (P<0.05).

The distribution of LOSSCI stages is shown in Figure 4. LOSSCI stages are not uniformly distributed, because more than half of the records were determined as LOSSCI I. Spearman’s correlation coefficients (all P<0.0001) are listed in Table 2. Box plots (Figure 5) additionally depict the correlation between LOSSCI and all four SCIM subitems. The highest correlation was found between LOSSCI and SCIM indoor mobility (room and toilet; R=0.82). The lowest correlation was found between the SCIM respiration and sphincter management subscore and LOSSCI (R=0.68).

Figure 4
figure 4

Distribution of LOSSCI stages of the validity cohort.

Table 2 Spearman’s R of LOSSCI and SCIM items
Figure 5
figure 5

Box plots of the SCIM items: (a) self-care, (b) respiration and sphincter management, (c) mobility (room and toilet), and (d) mobility (indoor/outdoor) grouped by LOSSCI. Circles and crosses depict outliers and extremes, respectively.

Discussion

The aim of this study was to apply and to adapt a rating scale, which was originally developed to categorize locomotor function in CP, for the assessment of locomotor function in SCI and to quantify its reliability and validity. The adaption was necessary to reflect the different underlying pathologies of SCI. The original 10 LSs for CP were transformed into 5 stages to describe locomotor function in SCI. We found LOSSCI to have excellent reliability and moderate (SCIM subitem respiration and sphincter management) to strong (SCIM subitem self-care, mobility) validity. As expected, the SCIM mobility categories showed the best correlation. Indoor mobility displayed a slightly higher correlation (0.82) in comparison with outdoor mobility (0.77), indicating that LOSSCI better reflects indoor mobility aspects. The strong correlation with SCIM self-care category suggests that LOSSCI is not only suitable to categorize basic locomotor skills but also specific motor functions that are necessary for other everyday skills.

A complete LOSSCI assessment can be conducted by one trained physical therapist. It takes approximately 10 min and requires no technical aids besides a treatment table. LOSSCI is comparable to the WISCI assessment,23 which shows a high validity across multiple dimensions.8 However, testing the first five WISCI levels requires 10- m-long parallel bars, which are not available everywhere. The Spinal Cord Injury Functional Ambulation Profile (SCI-FAP)24 needs equipment that might not always be available, including a carpet, an armchair, two bricks, a trash can and four stairs with handrailing on both sides. Although designed for ambulatory individuals, the timed test can be applied in nonambulatory subjects by assigning the predefined maximum times for each of the seven individual timed items. Because of its nature, SCI-FAP is ideally suited as an outcome measure in the framework of interventional studies. However, it is not intended to be used for functional categorization of individuals with SCI. The Spinal Cord Injury Functional Ambulation Inventory (SCI-FAI)25 needs well-trained examiners and can only be assessed in already ambulating patients. This holds also true for the 10-m walk test, the 6-min walk test and the timed up and go test. Besides these timed measures of ambulation, only SCIM, SCI-FAI, WISCI,26 SCI-FAP and the SCIM grades6 as categorical measures of ambulation are validated in the SCI population. The SCIM grades are constructed on a purely data-driven algorithm. In contrast, LOSSCI combines an initial data-driven approach with a final pathology- and expert-consensus-based approach to define and clinically describe five meaningful stages, which may help gain faster acceptance among clinicians. The AIS is an example of a coarse but clinically meaningful and well-accepted tool in spinal cord medicine. The found distributions of the ISNCSCI variables AIS and NLI in the validity trial reflect the analysis of a typical SCI cohort,27 which reinforces the generalizability of the obtained results. Although the distributions of LOSSCI and AIS look similar (Figures 2b and 3), the weak correlation (R=0.30) together with the strong correlation between SCIM and LOSSCI suggests that LOSSCI assesses—as intended—the functional capabilities of an individual with SCI. These are of course somehow related to the neurological impairment, but LOSSCI essentially assesses a different domain than the AIS.

Study limitations

The sample for the reliability part of this work represents predominately chronic SCI subjects, whereas acute SCI patients served as the basis for the validity study. The excellent reliability found in the chronic SCI patients might be generalized for an acute sample. However, the reliability will most likely be lower in acute SCI patients, because chronic, as opposed to acute, SCI patients have a rather stable locomotor pattern.23 The LOSSCI distribution in the validity cohort is skewed toward LOSSCI grade I (53%, no trunk uprighting in prone position; Figure 3), which might reflect the contingent of tetraplegic subjects (55%) in this study. However, the proportion of individuals with tetraplegia was not higher compared with other SCI cohorts.20 The low frequency (5%) of ambulating patients might decrease the statistical power of the validity trial. This is most probably caused by an earlier discharge of ambulating subjects with incomplete SCI from the primary rehabilitation center, where all assessments were performed. Most of the assessments (51%) were conducted within 4 WAI. During this period, the general functional skills of acute SCI patients are rather low compared with their neurological status.14 A second validation in a chronic SCI cohort would sort out this issue and would additionally address the different properties of acute and chronic SCI populations.

As LOSSCI is, by design, a coarse functional rating scale, some floor and ceiling effects are inevitable similar to in other locomotion assessments: for example, SCIM mobility subscale or WISCI. These effects are most noticeable in the box plots of Figure 5c, where the box of the lowest LOSSCI stage I collapsed (median=25% quartile =75% quartile) at the lowest possible SCIM scores and vice versa for the highest LOSSCI stage V. However, this collapsing effect is not present for the adjacent stages LOSSCI II and IV, indicating that there is no severe saturation effect. The goal of the iterative consensus process was to achieve a relatively uniform distribution between the LOSSCI stages, and Figure 5c indicates that this aim is fulfilled.

Conclusion and future work

LOSSCI provides a reliable and valid clinical tool to assess locomotor function in SCI. Locomotor function is reflected by a wide range of motor skills, not only bipedal walking as in most other assessments in SCI. The prognostic value of an early LOSSCI assessment with respect to neurological recovery (ISNCSCI), functional outcome (SCIM) and ambulation (walking tests) will be investigated in the future.

Data archiving

There were no data to deposit.