Introduction

Lower limb amputation is a life changing event. Approximately 185,000 amputations of the lower and upper limb are carried out each year in the USA [1]. The number of people living with an amputation is expected to increase given that life expectancy is increasing as is the rate of diabetes mellitus [2]. The level at which a limb is amputated impacts the potential for successful prosthetic rehabilitation. The Scottish Physiotherapy Amputee Rehabilitation Group (SPARG) found that of those undergoing an amputation only 66.9% at the transtibial level and 20.9% at the transfemoral level advance to prosthetic limb usage [3]. The use of a prosthesis is physically [4,5,6] and cognitively demanding [7], requiring a period of rehabilitation [8] and gait training to become a proficient prosthetic user.

Early rehabilitation following amputation varies depending on an individuals’ general health, physical fitness, and other comorbidities [9]. Rehabilitation aims to return the prosthetic user to their pre-amputation activity level, where this is physically possible. Finding valid and reliable ways to communicate this activity level both intradisciplinary and interdisciplinary has become increasingly important for practitioners. While the more simplistic options such as the Special Interest Group in Amputee Medicine (SIGAM) mobility grades provide a number of categories, this is not always representative of the individuals function and capabilities beyond basic locomotion [10]. Tools such as the Amputee Mobility Predictor (AMP) [11] can be used both with and without a prosthesis to try to quantify activity level. For the more active prosthetic user, the Comprehensive High Level Amputee Mobility Predictor (CHAMP) can be used [12]. Following amputation, rehabilitation can take many forms, and may be appropriate at a variety of intervals. In the early stages, post-surgery general conditioning and early weight bearing are key. However, as confidence and mobility increase, more advanced components may be appropriate. Additional gait training to improve gait symmetry or cosmesis, or additional therapy if there is a prescription change, or a change in health status may be required. The ability to quantify any change helps to support the ongoing use of interventions and provides evidence of improved outcomes [13].

The use of outcome measures (OMs) is becoming more prevalent in the field of prosthetics and orthotics with at least 43 different measuring tools being used [14]. Clinicians administer self-report measures and ask prosthetic users to record trips and falls, pain levels, and complete other validated outcomes such as the Prosthesis Evaluation Questionnaire (PEQ) [15]. With the use of smart watch activity monitors and “in device” step counters, such as that of the Ottobock C-Leg (Ottobock Healthcare, Duderstadt, Germany), remote activity monitoring is becoming increasingly possible. This may be useful for prosthetic users who are comfortable with technology but less applicable to the elderly, lower activity individual. Outcome measures are used to assess quality of life [16], depression [17], mobility prediction [11], and physical activity levels [18]. While some of these outcome measures are self-reported, others are measured within a clinical or community setting. Some clinicians report that outcome measures are not of benefit in clinical practice and do not use them regularly [19]. Many studies use quality of life (QoL) outcomes to demonstrate the importance of appropriate prosthetic prescription and the psychological benefits [20]; they may provide information beyond demonstrating an improvement in walking. Many QoL outcomes take the form of questionnaires. Physical outcome measures play an important role in assessing the functional outcome of a prosthetic intervention. The use of performance-based measures such as the 6-min walk test [21] has been proven to provide good reliability in a lower limb amputee population [22]. One of the main challenges is determining the difference between a statistically significant outcome and a clinically significant change. For some outcome measures, the minimum detectable change gives an indication of the improvement required to be classed as a higher activity level [18, 23••]. While steps are being taken to establish this, there are also the challenges of the reported practice effects in performance-based tests [24].

Recent reviews have focused on the use of outcomes in clinical practice, the barriers to their use [25•], physical activity measurement [26], and activity following lower limb amputation in free-living conditions rather than within a clinical or laboratory setting [27]. This review aims to assess the current use of physical performance outcomes in rehabilitation with established lower limb amputees through a systematic review of the literature.

Methodology

A systematic literature review was completed in September 2022 in the following databases: Cochrane Library, Embase, PubMed, and Science Direct. Relevant keywords (and combinations) were selected including “physical activity” and “outcome measure” or “tool” and “lower limb prosthesis” or “lower limb amputee”. Included articles had to have been published after January 1, 2017, be available in English, be peer reviewed, accessible in full text, and the studies to have been completed on humans. All included articles had to have a focus on physical rehabilitation outcomes and have a baseline or control to compare with the intervention. Articles were excluded if they were published outside of this time frame, focused on outcome measure validation, phantom limb pain, spinal cord injuries, trial products, or computational models.

The two authors ran the searches independently and applied the agreed inclusion and exclusion criteria. The Cohen’s Kappa level of rater agreement was calculated as 0.7978 showing a substantial agreement between raters. A third reviewer was not required for any articles as agreement was achieved between the two authors. The initial search returned 1177 results, and after review of title and abstract, this was reduced to 32. Once articles had been read in full and exclusion and inclusion criteria were applied, this left 15 articles included in the final review. Articles were assessed against the Credibility, Accuracy, Reasonableness, Support (CARS) checklist [28] to evaluate each information source. No further articles were removed on the basis of quality or credibility.

Results

A total of 15 articles were included in the final review. Eight of these used physical performance measures to assess the impact of a change in component prescription as part of ongoing rehabilitation. The remaining seven used physical outcomes measures to assess the suitability of rehabilitation interventions for established amputees.

Across the papers, the outcomes measured were not generally limited to only physical outcome tools. Researchers used a combination of self-reported and measured outcomes often in combination with other quality of life (QoL) measures. QoL was assessed using many outcomes including the Short Form 36- QoL [29] and the Prosthesis Evaluation Questionnaire [15]. The physical outcomes measured used across the included studies are presented in Table 1.

Table 1 Frequency of physical performance outcome measure use

Several of the papers used physical performance outcomes as an assessment tool for comparing different prosthetic limb components. Çalışkan Uçkun et al. [30] used physical outcomes to compare pin locking vs vacuum socket suspension systems. Davie-Smith et al. [31], Jayaraman et al. [32], and Knight et al. [33] used physical outcome measures when comparing prosthetic knee prescriptions, specifically microprocessor (MPK) vs non-microprocessor (NMPK) or powered knees. Morgan et al. [34], Kim et al. [35], and Kaufman and Bernhardt [36] compared different designs of foot and ankle components, and Barnett et al. [37] compared MPK vs NMPK in combination with rigid and hydraulic ankles. Five of the studies used timed walk tests, two studies looked at the L-test, and two used the AMP.

Timed walk tests measure speed and distance and are a commonly used and easily administered outcome measure. The test only requires a stopwatch and a straight walking area of known length. The 2-min walk test (2MWT) is a reliable measure for use with unilateral lower limb amputees showing good interrater and intrarater reliability [38]. Similar results have been found with the 6-min walk test (6MWT) [22]. The outcomes of the 2MWT and the 6MWT have been shown to be comparable. The 2WMT was used in two studies [31, 37] and the 6MWT in three studies [32,33,34]. In the study by Davie- Smith et al. [31], the 2MWT was used to assess lower activity level, unilateral, transfemoral amputees comparing outcomes with a NMPK and an MPK 6 months post-delivery. A statistical improvement was found (p < 0.001) with an average improvement of 8 m when using the MPK. Barnett et al. [37] also found a statistically significant result from this test (p = 0.001) when comparing the combinations of NMPK, MPK, rigid ankle, and hydraulic ankle. The optimal prescription was found to be the MPK with a hydraulic ankle. Jayaraman et al. [32] also compared NMPK and MPK prescriptions in a group of dysvascular amputees. The 6MWT showed no statistically significant improvement between groups. Similarly, Knight et al. [33] found that veterans walked a shorter distance with the powered knee than the standard microprocessor knee. No significant improvement (p = 0.29) was found, in the distance walked, by Morgan et al. [34] in the 6MWT when comparing a standard energy storing foot and a cross-over style foot.

The L-test is an advancement of the timed up and go (TUG) test [39, 40]. It is easy to administer with the patient being asked to rise from a chair, walk 5 m, turn left and walk 5 m, turn right and walk 5 m, and walk a further 5 m to return to the chair. Given the additional complexity, when compared to the standard TUG, it is more appropriate for higher activity prosthetic users. When assessing performance 6 months post-delivery of a MPK, Davie-Smith et al. [31] found a significant improvement (p = 0.001) manifested by an average reduction in the time to perform the L-test by 5 s. When comparing combinations of knees and feet, Barnett et al. [37] found that the MPK with hydraulic ankle was on the threshold of significance (p = 0.05).

One of the other outcomes commonly selected by researchers was the Amputee Mobility Predictor (AMP) [11]. There are versions of the AMP available for assessment with and without a prosthesis. The AMP assesses prosthetic use across a number of tasks, and a higher score indicates a greater level of function. The AMP was used to determine current mobility, and only users who scored between 37 and 40 were selected for inclusion in the MPK assessment [31]; these would be deemed high K2 or low K3 on the Medicare activity classification scale. The participants achieved better outcomes when provided with an MPK. When comparing outcomes after randomized prescription of an NMPK and an MPK, Jayaraman et al. [32] found a statistically significant improvement (p = 0.008) following MPK prescription.

In the qualifying studies, balance was assessed using a number of outcomes, including the Activity Balance Confidence Scale (ABC) [41], Berg Balance Score (BBS) [42], and Four Square Step Test (FSST) [43]. Davie-Smith et al. [31] found a 61.9–77% improvement (p < 0.001) in ABC scores when participants were provided an MPK. Jayaraman et al. [32] looked at results in the BBS and FSST when comparing C-leg with NMPK. No significant differences were found between groups in either outcome.

The 7 other papers included in this review consider physical outcome measures to review the effectiveness of focused exercise programs. Many of these provide remote instruction and monitoring to the participants. Gaunaurd et al. [44] used mobile device outcome-based rehabilitation with the aim to improve mobility, and gait. Similarly, Christiansen et al. [45] considered the impact of a telehealth program on physical activity behaviour. Tao et al. [46] focused on balance and exercise therapy to impact walking capacity through interactive adaptive games. The Wii Fit® games and WiiNWalk-fit balance board is an exergaming platform where participants are able to complete games to enhance balance and exercise therapy. The Big Brain Academy: Wii Degree (Nintendo, Kyoto, Japan) was used as a control where participants complete activities focused on cognitive function. The WiiNWalk intervention did not improve balance or walking capacity. Godlwana, Stewart, and Musenge [47•] compared the effect of a home-based exercise programme with the standard outpatient follow up and used physical outcome measures to consider the effectiveness. Participants who completed the home exercise program had improved results. Wong and Gibbs [48] used physical outcome measures to study the impact of a “committed wellness-walking program.” Miller et al. [49] explored the effects of a supervised community-based exercise program on balance, confidence and gait. In a case study by Rosenblatt et al. [50], physical outcome measures were used to demonstrate the effect of non-limb wearing over a period of 4 weeks and limited use over a further 12 weeks. Four of these studies used the ABC, Activities-Specific Balance Confidence Scale; three studies made use of the 2MWT, and a further three made use of the Timed Up and Go (TUG).

The Activities-Specific Balance Confidence (ABC) Scale [41] is a self-reported outcome measure of balance confidence assessing vestibular, non-vestibular, and functional mobility. Participants rate balance confidence in performing 16 specific activities, with zero representing no confidence and 100 representing completed confidence in carrying out the task. In a population of older adults, Myers et al. [51] reported that results lower than 50% on the ABC scale are considered a low level of functioning and over 80% a high level of physical functioning. When analysed for relative reliability and internal consistency, they found “excellent relative reliability” within the population of people with lower extremity amputations [52].

The ABC was used in four of the papers reviewing exercise interventions [46, 48, 50, 53]. Tao et al. [46] reported a statistically significant group effect with the mean and (standard deviation) ABC improving from 83.3 (10.8) at baseline to 84.6 (12.4) at 12-month follow-up. This was using the WiiNWalk compared to the control group, using Big Brain Academy, who’s baseline was 82.7 (15.7) and reduced to 77.1 (18.2) at 12-month follow-up. The focus on balance and physical activity over cognitive training provided a positive improvement in balance. An increase from 2.4 to 2.8 in balance confidence was found by Wong and Gibbs [48] on the 4-point ABC scale (p = 0.074) when participants were noted to be committed to the wellness walking program (n = 30 participants). Participants were classified as committed when attending more than 3 sessions over the 3-year time period of the study. Miller et al. [53] demonstrated an increase in group mean score for ABC from 63.4 to 73.7% when taking part in a supervised community-based exercise program. Transtibial participants, as a group, improved by 14.3% and transfemoral participants by 18.7%. When considering the total group, 37.5% of the participants increased their mean score by more than 10%. Two participants had a reduction in mean score of 5%. Rosenblatt, Stachowiak, and Reddin [50] discuss the effects of non-limb wearing over short periods of time in a case study. Although objective balance measures were relatively unaffected, the self-reported ABC dropped from 79.4 at baseline to 60.9 at follow-up.

The 2-min walk test, as noted previously, is a reliable outcome measure for use with lower limb amputees. It was used in three of the papers [45, 46, 48]. The WiiNWalk group showed an improvement over the control. At baseline, the mean distance walked recorded as 125.6 m (42.6 standard deviation) and 130.8 m (39.8) at 12-month follow-up. The control group using Big Brain Academy demonstrated a reduction in walking distance from 126.5 m (30.4) at baseline and 125.5 m (30.8) at 12-month follow-up [46]. The wellness walking program reported by Wong and Gibbs [48] found that for one time, only participants (n = 220) walking speed was reported as 0.74 m/s ± 0.39 whereas for committed participants (n = 29) was reported as 0.88 m/s ± 0.27. The 2MWT was unable to prove a statistically significant change to average walking speed for participants of the wellness walking programme. Christiansen et al. [45] reviewed the effect of twelve, completed weekly, 20-min telehealth sessions of physical behaviour interventions. Baseline measures were carried out for both groups; Group 1 participated in the intervention during weeks 1–12 with no further intervention for the following 12 weeks. Group 2 had attention control during weeks 1–12 during which a therapist delivered a short health and safety talk related to age followed by a brief period of light range of motion tasks for both upper and lower extremities. This was followed by the 12-week intervention. Group 1 baseline was recorded as 97.7 m compared to 85.5 m at 12 weeks and 103.4 m at 24 weeks. Group 2 with the additional attention control input demonstrated a baseline of 101.8 m, 101.3 m at 12 weeks, and 103.3 m at 24 weeks.

The Timed Up and Go (TUG) was included in three papers [44, 45, 47•] and is applicable to participants with lower extremity amputations and has demonstrated validity in assessing functional mobility [40]. It has demonstrated high interreliability and intrareliability when used to examine elderly adults including amputees [54]. The Timed Up and Go measures functional mobility through a performance-based task which includes rising from a chair, walking 3 m at a comfortable pace, turning at the 3 m mark and returning to sitting in the chair. Participants who take longer than 30 s to complete the task have been shown to require more assistance, whereas if the test is completed in less than 20 s, the participant is likely to be independently mobile. Gaunaurd et al. [44] used a Mobile Device Outcomes-Based Rehabilitation Program (MDORP) to improve participants strength, mobility, and gait quality. To assess the outcomes of this, the researchers examined two course variants of the Timed Up and Go with the turn at 3 m both towards and away from the prosthetic side. Turning towards the prosthesis produced a baseline of 9.4 s (± 1.4 s) (range 5.9–11.2 s). Following intervention this reduced to 8.8 s ± 1.4 s (range 7.0–12.1). Turning away from the prosthesis at baseline was 9.6 s ± 1.6 s (range 7.0–12.6 s), and at 8 weeks post intervention, this reduced to 8.7 s ± 1.1 s (range 7.1–10.7 s). Neither of these showed a statistically significant difference. Christiansen et al. [45] reviewed TUG results in two groups following the use of the telehealth intervention, as previously described. The baseline of Group 1 was recorded as 19.4 s, at 12 weeks 18.0 s, and at 24 weeks 15.3 s. The Group 2 baseline was 14.9 s, at 12 weeks 14.8 s, and at 24 weeks 14.5 s. The authors noted that the intervention carried out with group 1 between weeks 1 and 12 did improve their time; however, the biggest improvement occurred between weeks 12 and 24 where there was no intervention. Group 2 remained consistent in their timings with the attention control during weeks 1 and 12 and intervention between weeks 12 and 24. No statistical analysis of the results was reported. Godlwana, Stewart, and Musenge [47•] did not report baseline measures for their control or intervention groups; however, there is a comparison of TUG results between post intervention at 3 months and then at 6 months. Both groups received the normal standard care; however, the intervention group received a home education and exercise programme alongside the standard care for 3 months. The control group at 3 months was reported as 45.078 s ± 41.516 s and at 6 months 36.077 s ± 36.186 s. Compared with the intervention group at 3 months 35.392 s ± 32.484 s and at 6 months 28.224 s ± 20.963 s, the intervention group at both 3 and 6 months is quicker than the control group; however, this was not statistically significant.

Discussion

In the reviewed papers, over thirty different outcome measures were used. Two-thirds of these related to physical performance outcome measures. The use of outcome measures has not routinely been integrated into clinical care for lower extremity amputees. Many prosthetic and orthotic clinicians report barriers such as lack of time, lack of understanding, or perceived clinical value [19]. Gaunaurd et al. [55] and Young, Rowley, and Lalor [56] reported “38% of prosthetic practitioners in the United States and 28.4% of prosthetics and orthotics (P&O) practitioners in the United Kingdom use OM’s routinely.” These low numbers exist despite the fact that there is clear evidence for the use of outcome measures in improving patient satisfaction, ensuring patient-centred decisions, recording patient progress, and ensuring evidence-based practice. Rapaport et al. [57••] provided an education program to prosthetic and orthotic practitioners and demonstrated an increase in confidence in the use of these measures and their monthly use doubled after this. Despite this, selecting an appropriate measure can be challenging.

While the use of OMs is considered to be important, it is also vital to recognize and understand the results of these. For primary amputees or those early in rehabilitation, there will be less consideration for the ceiling effect exhibited. Some authors have noted that established amputees may have achieved their optimal gait speed and physical outcome measures would be less appropriate than QoL measures [20]. Objective, quantifiable results are only meaningful if there is evidence of a minimal clinically important difference (MCID) for the outcome measure you are working with. MCID is “…defined as the smallest change in an outcome that a patient would perceive as clinically meaningful” [58] equating to a change in quantifiable results that is large enough to make a difference to patient prescription or care. In the papers reviewed often, the selected outcomes produced a statistically significant result but this was not classed as a MCID [31]. The TUG test has been shown to have a ceiling effect for more active amputees and is likely not suitable for established walkers who are of moderate activity or above [39].

While physical outcome measures provide an important insight into amputee physical functioning, they do not provide a complete metric of rehabilitation success. Quality of life, social adaptation, and mental health are all relevant factors to consider. Multiple outcome measures have subscales specific to these elements. These should be administered in conjunction with physical performance outcomes to give a more rounded picture of achievement. Balance and falls incidence were considered by a number of researchers: these factors also relate to device user confidence [31, 44, 46]. MPKs have been well documented to reduce falls incidence and as such are an important outcome when reviewing these components [59, 60]. In many of the papers, physical performance was not greatly impacted by the rehabilitation intervention but patients still reported personal improvement.

While this research was completed by two authors, a select number of databases were used so some literature may have been omitted.

Conclusion

Across the included papers, over 30 different outcome measures were included. One of the challenges for researchers is identifying the correct outcome measure to use. The measure should, ideally, be validated for use with the desired cohort, be sensitive enough to measure the required outcome, and be reliable. Practitioners should be provided with continuous professional development opportunities, including training sessions and the opportunity for hands on practice to establish outcomes measures in daily use as a standard of care.

From the literature published in the past 5 years, the most used physical outcome assessment tools used to assess rehabilitation intervention are timed walk tests and the Activity Balance Confidence Scale. All of these have been validated for use with lower limb amputees and as such represent a good methodological approach to research in the field.