FormalPara Key Summary Points

Why carry out this study?

The assessment of outcomes in the field of pain management is rapidly evolving.

Chronic pain is a global health concern, with extensive economic and societal impacts, afflicting nearly 120 million Americans, with estimates of over 40% of adults experiencing pain for over 3–6 months.

Linear and one-dimensional pain assessments are outdated and inaccurate, whereas multidimensional validated patient-reported outcome assessments allow for a more complete view of the patient’s pain experience.

To our knowledge, there has never been an investigation into the normative values for the chronic pain population so that we can better understand and benchmark our treatment algorithms. The purpose of this study is to survey and report on PROMIS-29, the pain impact score, and estimated HUI-3 scores of patients entering into chronic pain treatment in the United States.

What was learned from the study?

This data set is the first published normative data set describing the PROMIS-29 assessment in the chronic pain population.

The patient population is more homogeneous than expected, and females were found to have higher levels of dysfunction.

Digital Features

This article is published with digital features, including a summary slide, to facilitate understanding of the article. To view digital features for this article go to https://doi.org/10.6084/m9.figshare.13643255.

Introduction

The assessment of outcomes in the field of pain management is rapidly evolving. Once dependent on linear numerical rating scales (NRS) and visual analog scales (VAS), emphasis within the specialty has shifted toward multidimensional patient-reported outcomes (PRO) [1]. This strategy of assessing pain from a multidimensional perspective in the context of the biopsychosocial model is recommended by every recent major consensus panel, including the US Department of Health and Human Services (HHS) [2], the National Institutes of Health (NIH) Research Task Force (RTF) for Chronic Low Back Pain [3], the Institute of Medicine (IOM) [4], the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials (IMMPACT) [5], and more.

A common measurement language is critical to galvanizing innovation, research, and care improvement. The Patient-Reported Outcomes Measurement Information System (PROMIS) was created by the NIH in 2004, and several studies were initiated, focusing on heart failure, chronic obstructive pulmonary disease, rheumatoid arthritis, cancer, chronic low back pain, and major depression [6]. The goal of PROMIS was to update longer-form legacy instruments with validated outcomes measures using fewer questions with greater accuracy. Many familiar legacy measures were used to create the PROMIS item banks, and crosswalk tables are available to allow for conversion to legacy measures on the NIH-funded PROsetta Stone® website (www.prosettastone.org, NIH 1RC4CA157236, PI: David Cella, PhD). The PROMIS-29 battery assesses mental health, physical health, and social health through seven 4-question instruments, including assessments of fatigue, pain intensity, pain interference, physical function, sleep disturbance, anxiety, depression, and ability to participate in social roles and responsibilities [7]. The PROMIS-29 and its subdomains have been validated and/or cross-walked to a host of legacy measures in a variety of populations, including the Oswestry Disability Index [8], Roland-Morris Disability Questionnaire [9], Brief Pain Inventory-Pain Interference domain [10], the EuroQol Research Foundation five-dimension (EQ-5D) instrument [11], the Health Utilities Index Mark 3 (HUI-3) [12], and quality-adjusted life year [13] calculations.

The pain impact score (PIS) was defined and validated in 2014 by the NIH Research Task Force for Low Back Pain [3, 14]. The PIS is a calculated measure derived from the PROMIS-29 Pain Interference (PI), Physical Function (PF), and Pain Intensity (NRS) subdomain raw scores. The score for PF is inverted so that higher scores indicate more severe pain impact and physical dysfunction. The PIS is scored as follows:

PIS = NRS + PROMIS-PI + PROMIS-PF.

Score range is 8–50 points: NRS (0–10), PROMIS-PI (4–20), PROMIS-PF (4–20).

Estimated minimal clinically important difference (MCID) = 3 points [14].

The use of the PROMIS-29 and PIS as a multidimensional assessment and outcomes tool was previously compared to subjective percent pain relief (PPR) after a trial of spinal cord stimulation in 127 subjects [15]. PIS was found to correlate more strongly with PPR than did the change in NRS pain score or any PROMIS-29 subdomain alone [15].

The HUI-3 is a widely used measure in health-related quality of life (HRQOL) studies that can be used to estimate quality-adjusted life years (QALYs) [16]. The HUI-3 assesses eight health status dimensions on a 5- or 6-point scale (normal to highly impaired): vision, hearing, speech, ambulation, dexterity, emotion, and cognition [16, 17]. Hays et al. [12] described regression models on data from the original PROMIS cohort to accurately estimate HUI-3 scores across a representative US national sample of 3000 subjects. This same methodology was applied in this study to calculate HUI-3 preference scores in this sample of patients presenting to a chronic pain clinic.

The PROMIS-29 has been used in thousands of studies, published in over 150 journals, and is validated with cross-talks to other instruments. To our knowledge, there has never been an investigation into the normative values for the chronic pain population so that we can better understand and benchmark our treatment algorithms. The purpose of this study was to survey and report on PROMIS-29, the pain impact score, and estimated HUI-3 scores of patients entering into chronic pain treatment in the United States.

Methods

Study Design and Ethics Statement

Subjects were enrolled in a consecutive fashion throughout all participating sites from March 2019 to May 2020, after obtaining a waiver of consent and a full waiver of HIPAA authorization through the Western Institutional Review Board (WIRB) under Common Rule 45 CFR 46.116. The institutional review board (IRB) was happy to accept the decision of WIRB without any further determinations. Inclusion criteria included new patients entering into pain practice, patients > 18 years of age, and completion of the PROMIS-29. The PROMIS-29 has evolved into its third generation, v2.1, and comprises the following assessments: PROMIS Short Form (SF) v2.0—Physical Function 4a, PROMIS SF v1.0—Anxiety 4a, PROMIS SF v1.0—Depression 4a, PROMIS SF v1.0—Sleep Disturbance 4a, PROMIS SF v1.0—Ability to Participate in Social Roles and Activities 4a, PROMIS SF v1.0—Pain Interference 4a, and PROMIS Pain Intensity item (Global07).

PROMIS-29 data, along with patient demographic data including age and gender, was captured at all sites using a digital outcomes capture system (Real World Outcomes™, Celéri Health, Wilmington, DE, USA) within 3 weeks of the initial consultation for pain care. Statistical analysis was performed, calculating descriptive statistics of the median, mean, and mode of the reported T-scores of the PROMIS-29, its seven independent domains, and the calculated pain impact score.

Statistics

When comparing different age groups across the seven subdomains, along with the gender assessment, Welch’s t-test was employed to test the hypothesis that the two populations would have equal means.

Results

Between 2018 and 2020, 19,546 patients were enrolled across the 12 participating sites, obtaining baseline PROMIS-29 v2.1 upon entry into chronic pain practices across the United States. Patients were categorized by age group (less than 40 years, 41–60 years, 61–80 years, 81 years and older), along with an assessment of the effects of gender on the measured sample. Of the 19,546 patients assessed, 9165 were between the ages of 61 and 80, followed by the 40–60 group with 6840. In Fig. 1, we demonstrate PROMIS-29 score distributions in all subjects who presented to an outpatient chronic pain management clinic during the period 2018–2020, along with gender analysis (Fig. 1). Spanning all age groups, the average PROMIS-29 T-scores for the chronic pain population are as follows: pain interference, sleep disturbance, fatigue, anxiety, depression, social participation, and physical function are 64.61, 57.19, 58.50, 53.94, 54.45, 40.06, 36.23, respectively, with a pain impact score of 33.19 and pain intensity of 6.38 on an 11-point NRS scale (Fig. 1). The PROMIS-29 v2.1 battery consists of seven instruments sorted by subject age, assessing the mean, median, and mode for each age group. These groups were analyzed to detect differences amongst these cohorts in terms of their population profile related to mental, physical, and social health. A summary of this data is presented in Fig. 2. Figures 1 and 2 depict the proportion of all subjects achieving the calculated raw scores and T-scores in the respective domains. The color coding scheme represents the categorization of score ranges in the respective domains as severe, moderate, mild, and normal (Figs. 1, 2). Figure 3 shows the proportion of all subjects achieving the calculated impact scores and HUI-3 scores, for the total cohort as well as subcategories, based on age group (< 40, 41–60, 61–80, > 81) and gender. The mean score for each respective domain and subgroup is shown (Figs. 1, 2, 3). This was used to determine normative values for the chronic pain population and to demonstrate the heterogeneity of the population.

Fig. 1
figure 1

a Demonstrates PROMIS-29 score distributions in all subjects presenting to an outpatient chronic pain management clinic between 2018 and 2020. Spanning all age groups, the average PROMIS-29 T-scores for the chronic pain population are as follows: pain interference, sleep disturbance, fatigue, anxiety, depression, social participation, and physical function are 64.61, 57.19, 58.50, 53.94, 54.45, 40.06, 36.23, respectively, with a pain impact score of 33.19 and pain intensity of 6.38 on an 11-point NRS scale. b PROMIS-29 score distributions in female subjects presenting to an outpatient chronic pain management clinic between 2018 and 2020. The profile of these subjects (n = 11,310) indicates that on average, this population displays mild–moderate dysfunction across all PROMIS-29 domains, as noted above. c PROMIS-29 score distributions in male subjects presenting to an outpatient chronic pain management clinic between 2018 and 2020. The profile of these subjects (n = 8235) indicates that on average, this population displays mild–moderate dysfunction across all PROMIS-29 domains, as noted above

Fig. 2
figure 2

a The patient cohort aged 0–40 was evaluated by age to determine normative values for the chronic pain population to detect significant differences among those referred and seen in the chronic pain population. b The patient cohort aged 41–60 was evaluated by age to determine normative values for the chronic pain population to detect significant differences among those referred and seen in the chronic pain population. c The patient cohort aged 61–80 was evaluated by age to determine normative values for the chronic pain population to detect significant differences among those referred and seen in the chronic pain population. d The patient cohort aged 81 and older was evaluated by age to determine normative values for the chronic pain population to detect significant differences among those referred and seen in the chronic pain population

Fig. 3
figure 3

a Pain impact score, calculated for total cohort, age groups, and gender. b HUI-3 scores, based on age group and gender

Mental Health

The depression assessment from the PROMIS SF v1.0 Depression scale demonstrates T-scores averages of 54.92, 54.75, 54.13, and 54.39 for those aged less than 40, 41–60, 61–80, and above 80 years, respectively. Interestingly, the scores are not statistically different among age groups or gender. The mean anxiety assessment scores from the PROMIS SF v1.0 Anxiety 4a are 55.45, 54.61, 53.29, and 52.72 for those aged less than 40, 41–60, 61–80, and above 80 years, respectively. Again, interestingly, among the age and gender assessed, there is no statistically significant difference, although females tended to have higher dysfunction and greater pain described.

Physical Health

The pain interference assessment from the PROMIS SF v1.0 Pain Interference 4a scale demonstrates T-scores of 64.73, 65.23, 64.26, and 63.71 for those aged less than 40, 41–60, 61–80, and above 80 years, respectively. Mean gender differences among the respondents were 64.95 versus 64.14. Fatigue scores assessed via the PROMIS SF v1.0 Fatigue 4a demonstrated in the chronic pain population average scores of 59.23, 59.26, 57.84, and 57.92 for those aged less than 40, 41–60, 61–80, and above 80 years, respectively. Gender differences were not statistically significant, as demonstrated with average scores of 59.49 and 56.99 for females and males, respectively. Physical function, measured from the PROMIS SF v1.0 Physical Function 4a, with high scores being supportive of care and low scores representing more dysfunction, are 37.96, 36.70, 35.85, and 34.00 for those aged less than 40, 41–60, 61–80, and above 80 years, respectively. Gender differences were not realized, with scores 35.61 for women and 36.96 for men. Mean sleep disturbance scores, as measured by the PROMIS-29 SF v1.0 Sleep Disturbance 4a, are 59.42, 59.11, 55.99, and 52.73 for those aged less than 40, 41–60, 61–80, and above 80 years, respectively. Gender differences for the aforementioned average scores are 57.64 and 56.63 for women and men, respectively.

Social Function

The mean scores for PROMIS-29 SF v2.0 4a Ability to Participate in Social Roles and Activities for the age groups less than 40, 41–60, 61–80, and above 80 years are 40.72, 39.55, 40.19, and 40.69, respectively, and for females and males are 39.90 and 40.24, respectively, with lower scores representing greater dysfunction.

Pain Intensity

The mean global pain impact scores, calculated as pain intensity + PROMIS Pain Interference + inverse of Physical Function, are 32.25, 33.41, 33.08, and 34.11, for those aged less than 40, 41–60, 61–80, and above 80 years. Pain intensity, based on a numerical rating scale of 0–10, is 6.50, 6.56, 6.24, and 6.24 for age groups of less than 40, 41–60, 61–80, and above 80 years, and gender grouping mean scores of 6.53 and 6.19 for females and males, respectively.

Gender Difference Analysis

Gender influence on PROMIS-29 assessment across the seven domains for patients entering into a chronic pain practice in the United States was evaluated as described above.

Discussion

This study represents the first time that this calculation has been performed, and has significant implications in measuring treatment success. This then serves as benchmark normative data on the chronic pain population entering into chronic pain and neurosurgery practices in the real world. The PROMIS-29 is generalized to the entire US population, and as such can serve as a common measurement language that is universally relevant across disease states. By creating a normative data set for the chronic pain population at multiple sites, this study serves to evaluate the typical PROMIS-29 presentation in this population so as to aid in the identification of outliers and to allow for more translational research. This has never been performed in a community-based, multicenter fashion for the chronic pain population.

For most PROMIS instruments, a T-score of 50 is the average for the US general population, with a standard deviation of 10. The T-score is reported with a standard error. A larger T-score represents more of the domain being measured, such that measures for negatively worded concepts represent worse than average. Larger scores of positively worded attributes represent a score better than average. As compared to the general population, all the PROMIS-29 T-scores measured in the population of chronic pain patients presenting to clinics seeking treatment are worse by a standard of deviation of 0.5–1.4 in general. It is interesting to note that although we assessed median and mean for all scores and instruments within the PROMIS-29, along with gender influence, the data is strikingly similar, spanning multiple different diagnoses, with little difference detected amongst age groups. We, however, did not specifically compare diagnosis or medication usage to the cohort studied, but follow-up studies are pending. Chronic pain patients demonstrate statistically higher reported dysfunction than the general population across the multiple domains assessed by the PROMIS-29 (Table 1).

Table 1 Age groups across the statistically significant subdomain using Welch’s t-test

When comparing different age groups across the seven subdomains, we found that there were significant differences between the < 40 and 80+ age groups in both sleep disturbance and physical function. For sleep disturbance, the mean T-score for < 40 was 59.59, and the mean T-score for 80+ was 52.93, demonstrating an average difference of 6.66 (0.67 standard deviation [SD]) between the two age groups. Likewise, for physical function, the mean T-score for < 40 was 37.99, and the mean T-score for 80+ was 34.06, meaning on average a difference of 3.93 (0.40 SD). This resulted in an MCID greater than 3. In addition, Welch’s t-test was performed to compare the age groups. In Table 2, we show a 95% confidence interval between the < 40 and 80+ age groups in sleep disturbance of between 5.95 and 7.94, with a p value of 5.58E–78. For physical function, the 95% confidence interval between the two age groups is 3.73 to 4.75, with a p value of 1.77E–54.

Table 2 (a) Age group comparison of the pain impact score using Welch’s t-test; (b) Age group comparison of the HUI-3 using Welch’s t-test

Overall, the PROMIS-29 showed great generalizability across a diverse cohort, with fairly homogeneous findings. Patients were examined in clinics across the United States, spanning previous treatments, medications, surgeries, and diagnoses. The HUI-3 assessment of the eight health status dimensions on a 5- or 6-point scale (normal to highly impaired), namely vision, hearing, speech, ambulation, dexterity, emotion, and cognition, was assessed based on age and gender and demonstrated no statistically significant differences, with mean scores of 0.67, based on assessment from Welch’s t-test.

There are several limitations to this study. As these assessments were collected prospectively for those new patients entering into pain practice, there may have been selection bias regarding those new patients that completed the assessment. Further, it must be appreciated that there are many diagnoses and pain qualifiers that differ for pain therapies, including duration of pain, location of pain, and previous treatments that are typically representative of those patients seeking care in these centers, although interestingly, the results are very homogeneous.

Conclusion

Patient outcome assessment is changing, focused on the complete survey of patient well-being using universally relevant measures rather than linear scores. This data set is the first published normative data set describing the PROMIS-29 chronic pain population referred to chronic pain centers in the United States in a multicenter, retrospective fashion spanning over 2 years, defining a fairly homogeneous chronic pain population as measured by the PROMIS-29 and its calculated representations, across the seven domains. Future study is needed to fully appreciate the implications of the norms of this population, which is descriptively more homogeneous than expected.