Introduction

Individuals who use wheelchairs as their main means of mobility have a high incidence of shoulder pain owing to overuse and repetitive propulsion movement. The most commonly used assessment tools for shoulder pain in wheelchair users (WCUs) are questionnaires (mainly the Wheelchair Users Shoulder Pain Index (WUSPI) and Shoulder Pain and Disability Index (SPADI)), physical examination, inclinometer or dynamometer, electromyography and ultrasound. In addition, there are various diagnostic imaging methods widely used for the detection of shoulder pathologies (i.e. radiography, arthroscopy, computed tomography and magnetic resonance imaging). Thurston et al.1 argued the need for a noninvasive but accurate test to appropriately assess shoulder pain, suggesting thermography as a well-established technique for joint pain evaluation.

Infrared thermography (IRT) provides information about physiological processes by examining, recording and interpreting skin temperature (Tsk) distributions.2 There is a high correlation between both sides of the body; therefore thermal asymmetry in body areas indicates dysfunction, whereas increased heat is mainly associated with inflammation or infection.3 IRT has been applied successfully in the field of industry, construction and in a wide range of clinical applications,3 like the diagnosis of sports injuries.4 IRT is commonly used to estimate surface body temperature, and its reliability has been studied in healthy subjects.5, 6, 7, 8 Burnham et al.8 compared the reliability of three thermometers, finding that the infrared skin device was the most responsive (intraclass correlation coefficient (ICC) 0.97) for the hand, forearm, shoulder, thigh, shin and foot. The intra-examiner reproducibility of a study by Zaproudina et al.5 varied from poor in the fingertips to moderately high in the core areas, with a mean ICC of 0.47. However, no study has analyzed the reliability of IRT in the upper body of WCUs. The special characteristics of people with disabilities, such as sweating and thermoregulation problems, blood distribution or drug intake, make it necessary to study the factors affecting the application of IRT in WCUs.

Thermography requires standardized protocols and caution with the interpretation of thermograms, to avoid confusing past problems with current ones.2 Environmental factors (such as ambient temperature, humidity, atmospheric pressure or source radiation), individual factors (both intrinsic: such as gender, age, anthropometry, or skin emissivity, and extrinsic: such as intake factors, therapies or physical activity) and technical factors (such as protocol or camera features)9 have to be considered. In order to know if the technique is reproducible in standard conditions, it is necessary to analyze the reliability. Therefore, the aim of this article is to examine the reliability of IRT in WCUs.

Materials and methods

Subjects

A total of 24 manual WCUs, 22 men and 2 women (mean±s.d.; age, 39.2±10.9 years; height, 174±9 cm; weight, 72.9±10.9 kg; body mass index, 24.2±3.3 kg m−2), participated voluntarily in this study. In all, 19 were manual-wheelchair dependent and 5 were wheelchair independent (use of wheelchair only to practice sport), 14 were subjects with paraplegia, 3 with tetraplegia and 7 presented other disabilities (amputation, multiple sclerosis and poliomyelitis). Prior to the investigation, subjects were fully familiarized with testing procedures, were informed about the risks and benefits of the study, and signed an institutionally approved informed consent document. The protocol was approved by the Ethical Committee of the Technical University of Madrid and the Research Center of Physical Disability (CIDIF) of ASPAYM Castilla y León Foundation. Inclusion criteria were manual-wheelchair use for a period of at least 1 year and age >18 years.

IRT analysis

Infrared thermal imaging of the upper extremities and the trunk was recorded using a thermal camera (FLIR T335, FLIR Systems, Danderyd, Sweden), with a thermal sensitivity of 50 mK, a range of temperature from −20°C to +120°C, a length spectrum range of 7.5–13 μm, a resolution of 320 × 240 pixels and an accuracy of ±2%. The camera was calibrated before each trial using an external temperature reference source (thermistor PT-100, Telemeter Electronic, Donauworth, Germany) and was turned on 20' before to avoid calibration problems. The IRT images were analyzed with specific software (ThermaCAM Reporter, version 8, FLIR Systems). A white ‘Roll-up’ 125 × 206 cm was used to create a homogeneous background behind body figures and to avoid the radiation emitted by other objects. A ‘step’ was placed for slightly raising the subject from the floor surface. The environmental conditions were monitored by a portable weather station (BAR-908-HG model, Oregon Scientific, Portland, OR, USA).

Experimental procedure

Two thermographic photographs corresponding to the anterior and posterior upper body were taken over 2 days with an interval of 1 day. All images were taken and analyzed by the same observer. Images were taken with the subject’s own daily wheelchair in anatomical position. The camera was positioned perpendicular to the ground, and the distance between the subject and the infrared camera was fixed at 2–3 m, depending on the height of subjects, as the aim was to match the center of the image with the geometric center of the area to be evaluated. A transparent template helped the technician for the placement of the camera. Each image was manually divided into different body areas (Figure 1) on the basis of previous studies.7, 10, 11 Several body areas were eliminated from the analysis where: the lumbar area was hidden by the wheelchair, the abdomen and both flanks were not considered a relevant muscular area as many subjects with spinal cord injuries do not have muscle tone in that area, elbows and cubital fosses areas are not important for the propulsion of the wheelchair, the pectoral data of women were discarded owing to the influence of their bra.

Figure 1
figure 1

Distribution of the 36 regions of Interest in the anterior (left) and posterior (right) sides of the body.

The circadian-rhythm effect over Tsk12 was avoided by always testing each subject at the same hour of the day. During the first session, the subject responded verbally to a demographic and medical questionnaire. Before every thermographic examination subjects were instructed to refrain from the following for 24 h: applying creams or perfume, smoking, drinking alcohol or coffee, sunbathing, showering, receiving treatment, therapy, massage or undertaking exercise. All subjects postponed their weekly therapies after the thermographic tests, and they were asked not to push their wheelchairs before the test more than was strictly necessary. Prior to the assessment, subjects were asked to remain at rest, without clothes and jewelry on the upper body for at least 10 min before being photographed. This acclimatization period allowed the subjects’ Tsk to stabilize.13 Every test was performed under equal conditions and in the same room, in which the temperature was fixed at 23°C and drafts were eliminated.

Statement of ethics

We certify that all applicable institutional and governmental regulations concerning the ethical use of human volunteers were followed during the course of this research.

Statistical analysis

After applying the Kolmogorov–Smirnov tests with the Lilliefors correction, asymmetry and kurtosis, ±1 and ±2, respectively,14, 15 the normality of the dependent variables was verified and parametric statistics were therefore applied.

ICC was used to analyze the reliability and coefficient of variation (CV; s.d.*100 per mean) to analyze the dispersion data. ICC was characterized as follows:16 poor reproducibility, 0–0.39; fair reproducibility, 0.40–0.59; good reproducibility, 0.60–0.74; excellent reproducibility, 0.75–1.0. Descriptive statistics were used (mean±s.d.) to describe, organize and summarize the data for the number of assessments (day 1 and day 2). Side-to-side skin temperature differences (δTsk) were calculated for each area by subtracting the mean temperature of the maximum values of the right side from that of the left side. The minimum difference was calculated for all the body areas to study the relevancy of day-to-day Tsk. Student's t-test for related samples was used to ascertain whether there were any differences across the two measures. The statistical treatment of data was performed using SPSS 19.0 software for Windows (SPSS Inc., Chicago, IL, USA). The level of significance was set at P05 for all statistical analyses.

Results

Reliability of measurements

A total of 1728 images for the trunk and upper extremities were recorded from 24 subjects. By comparing the assessment of day 1 and day 2 for the temperature profile, no significant differences were observed in any of the 36 zones, or in the mean or maximum values. The only differences were observed in the average temperature of the right posterior arm (P=0.041), with a difference of 2.05°C between day 1 and day 2.

The best maximum values of ICC were found in the left anterior forearm, right posterior forearm, left posterior arm and left anterior trapezius, with 0.76, 0.77, 0.77, 0.79, respectively (see Table 1). For the average values the highest ICC was the following: left posterior forearm, 0.80; right posterior arm, 0.88; right dorsal, 0.79; right pectoral, 0.95; left supraspinatus, 0.76 (see Table 2). Some zones presented a poor reproducibility: right anterior forearm, 0.22; left anterior forearm, 0.24; left anterior arm, 0.18; left posterior arm, 0.23; left anterior shoulder, 0.15; left pectoral, 0.39; LptX left posterior trapezius, 0.38. Hence, IRT in WCUs showed a poor to excellent reliability for the left posterior arm and right posterior arm (ICC=0.15 and 0.95, respectively). For a graphical view of the day-to-day analysis of reliability see Figure 2.

Table 1 Descriptive maximum values of the temperature profile on day 1, day 2 and between both the assessments of the analyzed area
Table 2 Descriptive average values of the temperature profile on day 1, day 2 and between both the assessments for each analyzed area
Figure 2
figure 2

Day-to-day analysis of the reliability in all body areas measured.

The CV between day 1 and day 2 ranged between 1.05% (RpM) and 6.18% (LptX). Table 1 shows the variability and the reliability of each variable (maximum values) and Table 2 presents the same descriptive values for variables that describe average values. The mean of the average values of all the areas studied was: 31.83±2.04; CV, 2.80%; ICC, 0.56, with a δTsk between day 1 and 2 of 0.08°C. The analysis of the main body areas (see Table 3) allows appreciating a lower Tsk in the limbs (31.63±1.97) than in the trunk (31.98±1.79), and a lower Tsk in the distal extremities (forearms colder than arms, arms colder than shoulders).

Table 3 Interrater-reliability measures in terms of ICC and CV organized by general areas

The maximum temperatures were taken as references to calculate the δTsk between symmetrical sides of the body. Table 4 shows the absolute values of mean±s.d.; the first and second columns are referred as δTsk between both sides of the body on day 1 and day 2; the third column is related to δTsk between the days. The δTsk did not show any consistent pattern related to the handedness of the subjects. The minimum difference of all the body areas was 0.576.

Table 4 The mean maximal values of regional skin temperature (°C) and absolute values of δT (side-to-side temperature differences) with the standard deviations for the different skin areas of the study subjects

Discussion

Before establishing the efficacy of IRT as a tool for assessing the evolution of pain, it is necessary to evaluate its reliability in order for it to be implemented on populations with diverse characteristics. The main contribution of this study is that its reliability is highly dependent on the area to be analyzed. IRT in WCUs had a variable ICC and CV, and it demonstrated a poor to excellent reliability (ICC: LasX=0.15; RpX=0.95). This range of reliabilities must be taken into account when interpreting thermographic data.

To our knowledge, there is only one study investigating IRT and Tsk evaluation of WCUs.17 It found that the temperature in the palm, forearm, pectoral major and shoulder tended to increase during wheelchair propulsion compared with other parts of the trunk. In the posterior trunk the warmest areas corresponded to the trapezius and forearms.17 Our results are similar, as the trunk was warmer than the limbs, and the forearm and shoulder were warmer than the anterior trunk, but it is not possible to compare both investigations as our measurements were undertaken while the subjects were at rest.

There are no previous studies about the reliability of IRT as a means of assessing body temperature in WCUs, although some researchers have studied the IRT reliability of non-WCUs. Our study is focused on works that studied similar areas as ours (the upper body except the hands, wrists and lumbar areas). In a study by Littlejohn et al.18 thermal ICC values for the forearm ranged from 0.19 to 0.85. Zaproudina et al.5 analyzed the whole body and found a mean ICC of 0.47. Most of the areas with low ICC that they analyzed are coincident with our study, whereupon it is necessary to pay attention to those relevant areas (trapezius, forearm and shoulder). Burnham et al.8 found a high reliability (ICC 0.97) in the hand, forearm, shoulder, thigh, shin and foot. Several studies6, 19, 20, 21 about paraspinal Tsk discovered a fair to excellent intra-examiner reproducibility (ICC 0.51–0.98). However, the surface of some analyzed areas differs slightly between studies, and, consequently, some areas with the same name in each study5, 8 corresponded to different areas. Some studies only measured specific skin spots, because they used an infrared skin thermometer,8 a handheld thermographic scanner,19 or a handheld infrared paraspinal instrument21 instead of an infrared camera. Furthermore, the research by Plaugher et al.6 was undertaken with contact thermography, whereas IRT is noncontact thermography. Their research6 relied on the examiner’s interpretation instead of a computerized reading of data, whereas other studies, including ours, manually measured the body areas and compared one graph with another.

Injury processes of some body areas are the main explanation for the fluctuations in the reliability, as the temperature varies depending on the type and moment of the injury (post-traumatic pain syndrome and sympathetic nerve involvement result in vasoconstriction and a colder area; acute and inflammatory injury increase metabolism, blood flow and dermal temperatures).22 The particular characteristics of people with disabilities may have also influenced the reliability, such as the thermoregulation problems presented by the subjects with SCI23 and sweating characteristics of multiple sclerosis subjects.24 People with SCI have a loss of autonomic nervous system control for vasomotor and sudomotor responses below the level of SCI, and a reduced thermoregulatory effector response for a given core temperature.25, 26 As a consequence of their poikilothermic behavior, they have a reduced ability to tolerate thermal extremes,26 they have increased heat storage within the lower body25 and they tend to sweat less.25 This imbalance in temperature regulation (most pronounced in individuals with tetraplegia rather than paraplegia) makes Tsk very changeable from one day to another, influencing the reliability of our data. Other factors that may locally change the blood flow and the Tsk are the variation of the peripheral circulation in the distal parts of the body,5 obliterating artery disease, malignancies, varicose veins or inflammatory processes.27

The distribution of the upper body temperature in our study was similar to the literature.8, 28 Central sites were warmer than peripheral sites, and this may be explained by blood distribution.28 Unlike Zhu and Xin28 the subjects in our study had a slightly higher ventral body temperature (31.92±2.22) than dorsal body temperature (31.77±1.92). Commonly the ventral area has greater skin thickness than the dorsal area, and subcutaneous fat in the area is known to influence the thermal readings.18

The mean ICC for average values (Table 1) is 0.56 (fair). When areas with lower ICC values are removed (the forearm, arm, shoulder and trapezius) the mean ICC augments to 0.64 (good). For the maximum values (Table 2) the mean ICC is 0.59 (fair) and without the most troubled area, the pectoral, the mean ICC becomes 0.60 (good). These areas that are sensitive to error are involved in wheelchair propulsion; we suggest the participant is carried to the room to avoid doing any exercise before the testing. The average values have a greater number of areas with poor ICC than the maximum values. When regions of interest are manually drawn, certain limits can go out of the boundary and encompass part of the background picture, and hence the average temperature of this area would be lower. The maximum values are not affected by extreme values, and thus they are more congruent; therefore, they were chosen as references to calculate the δTsk between symmetrical sides of the body.

Burnham et al.8 affirmed that side-to-side temperature comparisons could be used as a measure of an instrument’s reliability. Most of the articles found δTsk or thermal asymmetry of around 0.5°C,8, 29 0.4°C5 and 0.3°C.11, 30 Our data showed a δTsk for maximum values of: 0.17±0.53°C and 0.15±0.49°C on days 1 and 2, respectively, with a δTsk between both days of 0.09±0.44°C. s.d. values over 2.5–3 indicate a Tsk abnormality in certain part of their bodies.29 There were no significant differences between day 1 and day 2. The correlation was good for the anterior and posterior arm, posterior forearm, dorsal, supraspinatus and central trapezius, and was excellent for the anterior and posterior shoulder, pectoral and infraspinatus. The average correlation was moderate (r=0.583). Low correlation was found in the anterior forearm, anterior and posterior trapezius. Previous literature proposes different reasons that could cause these variations, such as Owens et al.,19 who suggested that changes could be explained by physiological variations rather than equipment error.

Study limitations

A manual system was used to draw the limits of each area. Consequently, the demarcation lines of the examined area could vary slightly from one picture to another. The data may be affected by temporary injuries in the absence of a prior physical examination. The time interval between both measurements was reduced to one day to minimize the thermal evolution of potential lesions, given that the initial warm areas may quickly return to normal temperatures.7 However, this time interval is still wide and gives the highest error of reliability possible, because it measures many factors that may affect the temperature. We propose taking pictures on the same day to isolate the reliability from other factors.

Conclusions

This study showed that the reliability varies by anatomic regions of the body. The infrared technique evaluated in this study is fast, easy to use and it seems to be adequately reliable to monitor Tsk in the population studied. Particular considerations of each subject that affect Tsk values should be weighed, such as physiological variability of the blood flow, thermoregulation problems, hormonal changes, medication intake, scars or body fat. We recommend assessing the Tsk of WCUs in a standard wheelchair to avoid different backrest heights. Moreover, other technical factors (for instance, equipment accuracy, measurement environment or processing and interpretation of data) should be considered.

Further investigation is necessary to reproduce this study with different time intervals and to examine the validity and reliability of these temperature techniques in other indoor settings and exercise modes. Longitudinal prospective studies of IRT reliability on specific health conditions with larger and more homogeneous samples could help to explore the clinical application of the IRT on those pathologies. Future research should aim to associate the Tsk of the muscles involved in wheelchair propulsion with the pain in those areas, to prevent situations of injury risk.

DATA ARCHIVING

There were no data to deposit.