BACKGROUND

High-need patients are defined as those who represent a considerable challenge for healthcare delivery organizations. This population is considerably heterogeneous due to the different ways this population can be defined. “High-need” may refer to the complexity created by medical conditions interacting with comorbid psychiatric and/or substance use disorder.1 It may also refer to frailty and multimorbidity, for example elderly, patients with multiple medical conditions or those who require higher levels of nursing assistance.2 Furthermore, social determinants of health concurrently exacerbate the health of these patients and contribute to the difficulties in their care management.3 Often, these complexities lead to frequent, high utilization of the healthcare system, often in ways that are considered low value from the standpoint of the healthcare system or insurance payer.4 Furthermore, this heterogeneity contributes to unclear goals and outcomes related to the high-needs patient as well as lack of knowledge regarding effective processes or models of care.

Much of the difficulty in developing models of care for high-needs patients is that the typical biomedical model of disease and treatment is by itself inadequate to address their comorbidities, as it fails to capture not only the synergistic interplay of medical comorbidities but also the contextual factors of a patient.5 High-needs patients by definition do not fit into the traditional model due to their comorbidity across multiple types of determinants, which can include medical, psychiatric, social, behavioral and functional limitations. Health problems across these domains do not function in a vacuum but instead can be compounding and serve to complicate the care of other illnesses, an obvious example of this being the patient with depression and diabetes who experiences difficulty with diabetic medication compliance due to their psychiatric symptoms.

The ongoing failure to develop effective models of care for high-needs patients is in part due to a lack of evidence-based guidelines regarding the optimal approach to this heterogeneous population. While quality indicators are well established and readily available for individual medical conditions, those for complex (and high-need patients) are notably scarce and, moreover, have a poor evidence-base. In order to reduce healthcare expenditures of high-cost, high-need patients and to simultaneously increase quality of care for this population, first we need to establish proven indicators that signal quality care of complex, high-needs patients.6 Thus, the purpose of our systematic review was to examine the literature to describe quality indicators for the care of complex, high-needs patients, which could in turn be used as a reference for physicians seeking to improve assessment and delivery of care for this heterogeneous population.

METHODS

The systematic review followed a detailed protocol, reporting follows PRISMA guidance, and the review is registered in PROSPERO (CRD42020215917). The results of the evidence review informed a RAND Appropriateness Panel to establish quality indicators for high-need patients.

Searches

The systematic review required a carefully designed search strategy given the diverse terminology and definitions used to describe complex, high-need patients in the literature. The search was conducted by a systematic review reference librarian with input from content experts to identify appropriate search terms to identify complex, high-needs patients. The search strategy is documented in the Appendix.

We searched PubMed, CINAHL, EMBASE, Cochrane Database of Systematic Reviews, and PROSPERO from database inception through July 2020. In addition, we searched Google, Google Scholar, and the guideline clearing houses ECRI and GIN for published practice guidelines for high-need patients using the search terms high-needs, complex, high-cost, high utilizers (services or cost), super users, high-risk, and multi-morbid patients.

Eligibility Criteria

Two independent reviewers (MB, KS, SH, TS) screened the search output. All citations deemed relevant by at least one reviewer were obtained as full text. Full-text publications were screened by two reviewers against the eligibility criteria and discrepancies were resolved through discussion (MB, KS, SH, TS). Studies meeting these criteria were eligible for inclusion in the review:

  • Population:

    • We accepted the authors’ definition of high-needs patients. Given that there is no consensus for the nomenclature, we included studies that described frequent user, high attenders, high utilizers, complex patients, high cost patients, and multi-morbid patients/patients with comorbidities or multiple chronic conditions. We included high-need patients of all ages. Older age was not used as an indicator of high-need but qualifiers such as frail elderly were included.

  • Quality indicators

    • We included potential quality indicators as suggested by the authors. Quality indicators could encompass care processes-related measures (e.g., follow-up post discharge, continuity of care, medication errors), heath services utilization measures (e.g., hospital readmission, emergency department visit), care satisfaction (e.g., patient satisfaction, care needs met, trust in care provider), and health outcomes (e.g., mortality, physical functional status, mental functioning, quality of life).

    • Indicators specific to only one clinical condition were excluded (e.g., a set of quality indicators for depression treatment was not eligible). We also excluded quality indicators specific to antenatal care and those exclusively addressing end-of-life care.

  • Study design

    • We included publications that explicitly suggested quality of care indicators. These encompassed publications suggesting quality indicators based on expert consensus identified in a consensus finding process; suggestions from individual authors were excluded. We also included publications reporting empirical data on quality indicators (e.g., studies assessing whether a set of quality of care indicators has been met). We also included publications that set out to evaluate quality of care for patients and that used indicators designed for the study or previously developed indicators. In addition, publications that evaluated the validity of potential quality indicators, e.g., by documenting the presence or absence of evidence that the indicator was useable, or by providing other measurement characteristics (e.g., reliability or rater agreement for the indicator) were eligible.

    • Studies reporting on the effect of an intervention on patients without clear mention of evaluating the quality of care were excluded. Publications only reporting examples of quality indicators were also excluded. Care guidelines without explicit quality of care indicators and systematic reviews were retained as background and used for reference mining.

  • Other limiters

    Publications were limited to English language publications and full-text publications. Quality indicators were collected to support an Appropriateness Panel and the language restriction was implemented to ensure timely delivery of the review and transparency for panelists evaluating the indicators. Publications in abbreviated format were excluded (e.g., conference abstracts).

All inclusion screening decisions were documented in citation management software.

Data Abstraction

The data abstraction used detailed, pilot-tested forms in an online software for systematic reviews. Data abstraction focused on study characteristics necessary to understand and evaluate the publications’ results. The data extraction included author and year, multiple publications with additional information on the specific set of quality indicators (where available), type of study (consensus guideline or empirical study to suggest indicators, empirical study to evaluate a proposed indicator), the number and type of stakeholders involved in development of indicators, consensus method, all suggested indicators (label and brief description), number and type of high-need patients in empirical studies testing quality indicators, and evidence for suggested indicators (presence and absence of supporting evidence). Data were abstracted by one reviewer (KS, JS, MB) and checked by an experienced systematic reviewer and a content expert (SH, TS).

We standardized the indicators to the extent possible by translating the indicators into labels followed by a description. Each label started with the care process of action (e.g., “Complex health care needs documentation”). More details about the process were added in parentheses, and timing information was moved to the end of the indicator (e.g., “annual foot exam” changed to “foot exam annually”) to facilitate comparisons across indicator sets and author groups.

Analysis

We summarized quality indicators in a narrative synthesis. All included studies were documented in a detailed evidence table to allow a concise overview. We analyzed the frequency of indicators across the different indicator sets. Studies were categorized by setting, age group, and patient characteristics to illustrate differential populations among those included in the literature review.

Critical Appraisal

We used a published tool relevant to quality of care indicators.7,8 Critical appraisal domains included stakeholder involvement and use of empirical evidence in establishing the quality indicators. Stakeholder involvement was operationalized as three items (Item 1.1: The group developing the indicator includes individuals from relevant professional groups; Item 1.2: Considering the purpose of the indicator, all relevant stakeholders have been involved at some stage of the development process; Item 1.3: The indicator has been formally endorsed). The domain Scientific evidence was also operationalized as three items (Item 2.1: Systematic methods were used to search for scientific evidence; Item 2.2: The indicator is based on recommendations from an evidence-based guideline; Item 2.3: The supporting evidence has been critically appraised). The third domain; Additional evidence, formulation, and usage; included nine items addressing the feasibility, reliability, and validity of the indicators (Item 3.1: The numerator and denominator are described in detail; Item 3.2: The target patient population of the indicator is defined clearly; Item 3.3: A strategy for risk adjustment has been considered and described; Item 3.4: The indicator measures what it is intended to measure (validity); Item 3.5: The indicator measures accurately and consistently (reliability); Item 3.6: The indicator has sufficient discriminative power; Item 3.7: The indicator has been piloted in practice; Item 3.8: The efforts needed for data collection have been considered; Item 3.9: Specific instructions for presenting and interpreting the indicator results are provided).

Synthesis and Quality of Evidence

The quality of evidence assessment for each condition-unspecific indicator is documented in a summary of findings table. To be included in the summary table, indicators had to have been described in at least five identified studies given the large number of suggested indictors. The assessment took into account the following: the number of studies that used the indicator, the critical appraisal results regarding developing the indicator, applying the indicator in practice, and the evidence supporting the indicator. We used the GRADE categories High, Moderate, Low, and Very low to describe the quality of evidence for the indicators across studies.9High quality of evidence would indicate the indicator has been shown to reliably identify outcomes for the care of high-need patients (e.g., differentiate from other patients). Moderate quality of evidence was reserved for indicators have been tested in practice and that have shown some discriminant validity in so far that there was variation in the study sample (not everyone met the criterion, the indicator can distinguish between high and low need). Low quality of evidence was given when one of the criteria was not met (e.g., criteria to establish the indicator, to apply the indicator, or evidence for the content of the indicator). Very low quality of evidence was reserved for indicators where multiple criteria were not met.

RESULTS

The search identified 6964 citations. Of these, 1382 publications were obtained as full text, and 53 studies reported in 70 publications met inclusion criteria.10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79 Although we searched without date restriction, the earliest included study was published in 2004.28 Of these, 39 (74%) were based in the USA, 14 (26%) were international. The list of excluded studies and the reasons for exclusion are documented in Appendix 2. In addition, Appendix 3 shows the list of background publications that were used to identify additional studies and publications. The diagram (Fig. 1) documents the literature flow.

Figure 1
figure 1

Literature flow diagram.

The evidence table in the appendix provides an overview of all included studies (Supplementary Appendix 4). The branch diagram in Figure 2 displays the distribution of the 53 included studies across population subgroups as defined by types and combinations of comorbidities. Figure 2 illustrates the differential amount of evidence between different quality indicators based on how specifically the population was defined. The studies were categorized by (1) care setting: community dwelling (48 Citations), hospitalized (3 Citations), and home-bound (2 Citations) as well as by (2) age: children (5 Citations), adults (29 Citations), and elderly (19 Citations). In addition, we categorized the studies by the theme of the complexity defined. These themes included medical comorbidity (45 Citations); psychological/mental comorbidity (15 Citations); behavioral comorbidity (6 Citations), (including addiction); social needs (8 Citations), (including poverty, housing insecurity, food insecurity), functional limitation/decline (21 Citations), (including dementia, loss of mobility) and those only definable by utilization as high utilization (14 Citations). These latter patient characteristics were only applied to the community-dwelling populations, as specific characteristic descriptions were not available for the home-bound and hospitalized population manuscripts.

Figure 2
figure 2

Distribution of literature among subpopulations.

In total, 1703 individual quality indicators have been suggested across identified studies. The indicators are displayed in alphabetical order in Appendix 6. The scope of the selected indicators ranged from detailed criteria (e.g., HgA1c < 6.5%) to very broad categories (e.g., “care coordination”). Identified indicators also ranged from treatment recommendations to explicit scoring criteria to determine whether quality standards were met (Fig. 3). Some studies described condition-specific indicators relevant to patient subgroups (“in patients with diabetes, assess…”), some used exclusively condition-independent criteria (e.g., “documentation of the medication list in the medical record available to all care agencies”), and some sets of indicators used a mixture of indicator types. Some studies documented broad domains under which they categorized individual quality indicators. Figure 4 provides an overview of the eight domains this review found in the included studies, organized by broad groupings. Distinct domains included disease and symptoms, delivery of care, outcomes, prevention, collaboration/coordination, patient engagement, access to care, and patient safety.

Figure 3
figure 3

Critical appraisal.

Figure 4
figure 4

Quality indicator domains.

The summary of findings table (Table 1) provides an overview of the quality of evidence supporting the most frequently suggested quality indicators. The table displays the indicators, supporting evidence, and the GRADE score. All identified indicators ranged between Moderate and Low quality of evidence. Hospital continuity of care (including discharge planning and follow-up) and hospitalization-related indicators such as readmission rates were included in most studies (16 and 15, respectively). Furthermore, other indicators involved starting or monitoring medications, patient assessments and screenings, new symptom or disease follow-up and treatment, safety and preventive medicine, and documentation.

Table 1 Summary of Findings and Quality of Evidence Table

Figure 3 provides an overview of the critical appraisal assessment across studies.

Studies varied in their methodological rigor. Half (49%) of the studies provided clear definitions of the target populations of high-need patients. Very few studies included any indicators that had been formally endorsed (16%) or were based on recommendations from evidence-based guidelines (11%), although some used systematic methods to search for scientific evidence (29%). Some studies included indicators that had already been piloted (31%). Few studies documented risk adjustments for indicators. Ratings of the individual studies are shown in the table in the appendix (Supplementary Appendix 5: Critical Appraisal).

The evidence table (Supplementary Appendix 4) describes the difference in approaches to quality indicator development. Approaches included quality indicator proposals (e.g., following a consensus process), evaluation of existing quality indicators (e.g., evaluating which quality indicators could be assessed in chart review), or use of quality indicators to document quality of care (e.g., to test the effect of an intervention on care quality). Some studies provided detailed information about the development process of the quality of care indicators, while other studies included indicators that had never been used in practice. Studies varied widely in their reporting of validity and reliability of proposed indicators and not all provided evidence that the indicators are measuring what they are supposed to measure, i.e., care quality.

DISCUSSION

We identified and evaluated existing quality indicators in an effort to support healthcare organizations, clinicians, and patients to ensure that quality of care is assessed and quantified in this heterogeneous patient group. Many quality indicators were specific, identifiable, and externally valid as demonstrated in multiple pilot and validity studies. However, overall, the large quantity of widely varied quality indicators in this review clearly demonstrates a lack of consensus to guide studies that attempt to measure care delivered to high-needs patients.

Our review of the studies that suggested, used, or evaluated quality indicators in high-needs populations produced an astounding 1703 indicators. Even more astonishing is that while there was conceptual overlap across studies and quality indicators, there were notably very few true duplicate quality indicators.

Overall, our findings in this systematic review reflect the challenging heterogeneity of this population and the extraordinary amount of unique indicators found through our review appears to represent a failure of consensus of key quality of care measures that can be applied specifically to all high-needs patients. This failure is important in its own right in that it demonstrates not only the state of the literature, but also likely demonstrates the need for a novel approach to forming expert consensus of quality indicators to be used in measuring care delivered to this population. Several observations can be made that point out uniquely neglected subset populations that represent potential areas to advance quality indicators. While we found no sets of criteria from organizations such as CMS and the National Quality Forum for this patient group, there are select examples of successful quality indicator frameworks (e.g., ACOVE) applied to well-defined subset populations within the larger, more varied cohort of high-needs patients.

Among age- and setting-based populations displayed in Figure 2, community-dwelling adults and elderly constituted the majority of the literature, representing 28 and 15 studies, respectively. There were few home-bound (2 total, 1 adult, 1 elderly) and hospitalized (3 elderly) patient population articles. The literature pertaining to complex children was small (5 Citations) and represented a myriad of quality indicators among diverse populations within children, yielding little congruence among indicators. This was partially due to the way the high-needs child was defined, with several of the studies focused on complexity as defined by experts, others with complexity based on tertiary and quaternary care (e.g., congenital diseases), another focused on high utilization among asthma with depression/anxiety, and the last manuscript looking broadly at high utilization as compared to the general pediatric populations.

Across the various age and setting subpopulations, further subcategorizations were found to define high-needs patients. Among the community-dwelling population, some were specified by high utilization (14 total, 4 adult, 4 children, 6 elderly) and others used comorbidity-based definitions. These overlapping comorbidity subgroups included categories of medical (26 adult, 5 children, 14 elderly), psychological/mental (12 adult, 3 children, 0 elderly), behavioral (5 adult, 1 children, 0 elderly), social (7 adult, 0 children, 1 elderly), and functional limitation/decline (9 adult, 4 children, 8 elderly). These various categories not only delineate different quality indicators, but also differ in the level of evidence supporting the measures of quality. The frequency of manuscripts pertaining to these subgroupings illustrates some of the disparity in the quantity of evidence among the different populations—especially the lack of focus on patient characteristics that are not medical comorbidities.

The most frequently represented and rigorously supported quality indicators exist in the medically comorbid adult and elderly populations, centered on several iterations of the Assessing Care of Vulnerable Elderly Study (ACOVE) and their subsequent application in other populations. This original study developed quality measures specifically devoted to the vulnerable elderly defined as those most likely to die or become severely disabled in the next two years. This study created an impetus for development of substantial further examination into quality of care among the vulnerable elderly. This development is evidenced by the predominance of quality indicators from ACOVE studies in the quality of evidence table with basic care coordination indicators, such as “hospital continuity of care, discharge planning and follow-up after inpatient stay” and “hospitalization (appropriate vs inappropriate including readmissions)” (Table 1). Nonetheless, despite some concordance among the most common indicators in the quality of evidence table, most indicators reached only moderate quality of evidence based on GRADE criteria, with several of low quality. This embodies the opportunity to improve measurement of quality within the broader high-needs population other than the frail elderly quality indicators developed by ACOVE.

Quality indicators that had enough concordance to meet the criteria for the quality of evidence table (Table 1) included condition-specific screening measures (vision screening, blood pressure screening), prevention measures (fall prevention, stroke prevention), treatment measures (aspirin use, beta-blocker offered in heart failure,), behavioral interventions (smoking cessation counseling), and documentation measures (weight documented, alcohol/substance use intake documentation, constipation documentation). These represented consistent primary care–related quality indicators that substantially overlap with United States Prevention and Screening Task Force items, particularly regarding prevention and screening.80,81

While there was substantial evidence for quality indicators specific to the frail elderly (ACOVE) and for disease-specific measures, there was an absence of evidence surrounding quality indicators for patients with serious mental illness (e.g., schizophrenia), social, and behavioral comorbidities and those defined solely by high utilization. These patients remain extremely challenging for the healthcare system, and yet there are few attempts to evaluate care delivery in the quality indicators literature.

There is a lack of consensus across the literature as to how the high-needs population should be defined, and this creates a lack of comparability, lowering the external validity of each of the quality indicators due to unclear population concordance.6 The fundamental challenge of defining the high-needs population exists in the great diversity of comorbidities and contextual contributors that lead them to be high utilizers of the healthcare system. The comorbidity subpopulations in Figure 2 illustrate this diversity and the lack of robust evidence, apart from the adult and elderly population with medical comorbidities. The majority of the literature furthermore tends to focus on a single or few disease conditions (e.g., diabetes and hypertension) rather than patients with comorbidities across many subgroups (e.g., diabetes with bipolar disorder, homelessness, and substance use). This bias leads to quality indicators addressing patients with several medical conditions, which does not adapt well to patients with complex biopsychosocial needs.

Care models to address social and behavioral determinants of health have developed over the last decade in other literature, utilizing programs that involve interprofessional teams, intensive care coordination, and necessary restructuring of care delivery.82,83,84,85,86 These programs share the recognition that the high-need patients, though belonging to a wide range of etiologies of their utilization, are similar in that they do not fit into the current medical infrastructure. Despite forays into treatment regimens and new models of care to approach this population, lack of development of quality indicators continues to hamstring providers and health systems in improvement of outcomes among the most difficult to treat high utilizers.

Our review had several limitations. First, our search strategy was purposefully broad in order to capture all quality indicators being suggested, used, or evaluated in patients who could be considered high-needs to avoid reliance on authors’ use of identical terminology. This approach led to a highly sensitive catchment of both quality indicators and high-needs populations and provides a comprehensive analysis of the quality indicators currently utilized internationally in the delivery of medical care to high-needs patients. However, this methodology results in many of the quality indicators being disease-specific (e.g., glucose monitoring for diabetes) and not necessarily exclusive to the high-needs patient. In addition, we restricted to English language studies, which may have missed important indicators, in particular those in other cultures. Furthermore, we also note that our quality of evidence assessment was not based on standard GRADE criteria. We designed criteria relevant to assess the quality of evidence for the proposed quality indicators, which future research should replicate or develop further.

To improve quality indicators and their application, we need to better define and more specifically apply quality indicators according to the varying profiles of high-need patients. Development of quality indicators among high-needs patients may benefit from following the success of the ACOVE literature, which succeeded in large part due to its focus on a specific sub population. Other subpopulations of high-needs patients need a similarly focused quality indicator infrastructure to be rigorously developed in order to maximize efficacy and clinician relevance. Finally, patients with social and behavioral comorbidities require prompt study, as they lag behind the other subpopulations of high-needs patients in the quality indicator literature.