Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Understanding the characteristics of high users of hospital services in Singapore and their associations with healthcare utilisation and mortality: A cluster analysis

  • Mimaika Luluina Ginting ,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing

    ginting.mimaika.luluina@geri.com.sg

    Affiliation Geriatric Education and Research Institute Singapore, Singapore, Singapore

  • Yan Hoon Ang,

    Roles Conceptualization, Funding acquisition, Investigation, Writing – review & editing

    Affiliation Department of Geriatric Medicine, Khoo Teck Puat Hospital, Singapore, Singapore

  • Soon Hoe Ho,

    Roles Data curation, Formal analysis, Writing – review & editing

    Affiliation Geriatric Education and Research Institute Singapore, Singapore, Singapore

  • Grace Sum,

    Roles Writing – review & editing

    Affiliation Geriatric Education and Research Institute Singapore, Singapore, Singapore

  • Chek Hooi Wong

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Supervision, Writing – review & editing

    Affiliations Geriatric Education and Research Institute Singapore, Singapore, Singapore, Health Services & Systems Research, Duke-NUS, Singapore, Singapore

Abstract

Introduction

High users of hospital services require targeted healthcare services planning for effective resource allocation due to their high costs. This study aims to segmentize the population in the “Ageing In Place-Community Care Team” (AIP-CCT), a programme for complex patients with high inpatient service use, and examine the association of segment membership and healthcare utilisation and mortality.

Methods

We analysed 1,012 patients enrolled between June 2016 and February 2017. To identify patient segments, a cluster analysis was performed based on medical complexity and psychosocial needs. Next, multivariable negative binomial regression was performed using patient segments as the predictor, with healthcare and programme utilisation over the 180-day follow-up as outcomes. Multivariate cox proportional hazard regression was applied to assess the time to first hospital admission and mortality between segments within the 180-day follow-up. All models were adjusted for age, gender, ethnicity, ward class, and baseline healthcare utilisation.

Results

Three distinct segments were identified (Segment 1 (n = 236), Segment 2 (n = 331), and Segment 3 (n = 445)). Medical, functional, and psychosocial needs of individuals were significantly different between segments (p-value<0.001). The rates of hospitalisation in Segments 1 (IRR = 1.63, 95%CI:1.3–2.1) and 2 (IRR = 2.11, 95%CI:1.7–2.6) were significantly higher than in Segment 3 on follow-up. Similarly, both Segments 1 (IRR = 1.76, 95%CI:1.6–2.0) and 2 (IRR = 1.25, 95%CI:1.1–1.4) had higher rates of programme utilisation compared to Segment 3. Patients in Segments 1 (HR = 2.48, 95%CI:1.5–4.1) and 2 (HR = 2.25, 95%CI:1.3–3.6) also had higher mortality on follow-up.

Conclusions

This study provided a data-based approach to understanding healthcare needs among complex patients with high inpatient services utilisation. Resources and interventions can be tailored according to the differences in needs among segments, to facilitate better allocation.

Introduction

Globally, there is an increasing trend of population ageing with multiple chronic conditions, mental health and medication-related problems, and social vulnerability. This has contributed to a shift towards more complex needs of the aging population [1]. Healthcare delivery systems and organisations that were initially developed in response to acute needs have been facing challenges in integrating care to address the complex needs of patients [2]. Older adults are often high users of inpatient services with repeated hospitalisations, and increasing challenges in care transition have resulted in hospital readmission, mortality, and higher healthcare cost [3]. Previous studies reported high healthcare costs associated with multiple comorbidities, mental health problems, increasing age, and end-of-life care [46].

The strain on healthcare resources has increased calls for more efficient and improved strategies in healthcare services, as the Organization for Economic Cooperation and Development reported high users of hospital resources with estimates ranging from 10% to 34% of healthcare expenditures [7]. Optimal management with a more targeted, efficient, and coordinated healthcare services by having a more person-centered approach in understanding these “high user” patients should be sought to facilitate better care to address their complex needs and to facilitate better resource allocation [8].

While it is impossible to develop care models for every individual at a population level, understanding the needs around groups of people with similar characteristics could aid in resource allocation and programme planning. Identifying these groups, known as population segments, can allow optimisation of healthcare service planning at population level and develop integrated healthcare system that is more targeted and efficient [911].

Population segmentation uses variables or characteristics used to assign population to homogeneous groups at various levels including–macro (e.g., general population), meso (e.g., specific population with certain disease or conditions, i.e., diabetic patients), and micro (e.g., individual adverse event risk stratification) [10,12]. Derivation of segments could be done through expert inputs (a priori) or post-hoc using statistical methods applied to empirical data [13,14]. Most population segmentation studies often use healthcare utilisation, medical, and socio-demographic characteristics as the basis of segmentation and fewer studies used functional and social variables [2,15,16]. There is increasing evidence that using medical complexity alone is insufficient in explaining healthcare utilisation [3]. A more comprehensive look to include other domains of psychosocial needs is also considered important in understanding the factors driving healthcare utilisation and guiding risk stratification and segmentation to improve care delivery for these patients [3].

This study segmentizes a high user patient population in the Ageing In Place-Community Care Team (AIP-CCT) programme, a programme for complex patients with high inpatient service use, into segments with distinct characteristics, based on their medical complexity and psychosocial needs. We defined high users as complex patients with high inpatient service usage, specifically, those with three or more hospital admissions in the past 12 months since the enrolment in AIP-CCT programme. We also explore the association between segment membership and healthcare utilisation (hospital admissions, length of hospital stay (LOS), emergency department (ED) visits, AIP-CCT programme utilisation) and mortality at 180-day follow-up.

Materials and methods

Sample and data source

We used the National Healthcare Group (NHG) Regional Health System (RHS) database. The database included data on socio-demographics, primary and secondary diagnosis (based on International Classification of Diseases 10th Revisions), public healthcare utilisation (hospital admissions, day surgeries, ED visit), hospital length of stay, and mortality. We also used the AIP-CCT programme administrative database, which included data on the programme’s home visit assessment. These include socio-demographic profile, medical, functional and psychosocial assessment, and programme utilisation. We included all patients enrolled into AIP-CCT between June 2016 and February 2017 (n = 1,326). After excluding 312 patients with incomplete data, our study analysed a total of 1,012 AIP-CCT patients. For each patient, the first AIP-CCT home visit was identified as the index time for analysis. The baseline period was defined as the 180 days prior to the index time. The follow-up period was the 180 days immediately following the first home visit assessment.

Programme description

The AIP-CCT programme was a homecare service that delivers multi-disciplinary care to a high user patient population. This population had complex needs with progressive or life-limiting conditions, with high inpatient services use (Table 1). The high service use refers to three or more hospital admissions in the past 12 months from programme enrolment. The delivery of care was guided by a comprehensive needs assessment of the medical, functional, nursing, and psychosocial profiles of the patient in Singapore Northern region.

thumbnail
Table 1. AIP-CCT programme inclusion and exclusion criteria.

https://doi.org/10.1371/journal.pone.0288441.t001

The AIP-CCT is a hospital-led programme which offered home visits and tele-consultations to support patients and caregivers in managing chronic diseases for 3 months after being enrolled into the programme. The programme aimed to address the unmet needs of patients by giving support to the patients/caregivers in managing chronic disease to reduce patients’ acute hospital utilisation. It also provided case management to link patients to community services for their social needs. Potential patients referred from hospital inpatient wards, specialist outpatient clinics (SOC), and the ED were triaged and linked to a community team based on their assessed needs. The team comprised of nurses, doctors, physiotherapists, occupational therapists, speech therapists, pharmacists, medical social workers, and healthcare assistants [17]. If there was continuing needs beyond the 3 months period, patients would be referred to community long term Home Medical and Home Nursing Services ran by Social Service Agencies, which are non-profit organisations that provide welfare services and/or services that benefit the community at large [18].

Cluster analysis approach

We performed a cluster analysis to generate non-overlapping population segments. Clustering aimed to partition a set of data points into segments (clusters), so that the data points were more similar to each other than data points in different segments. Medical complexity and psychosocial needs were used as input variables to assign observations into homogeneous segments in the clustering process.

Medical complexity was measured using the Charlson Comorbidity Index (CCI). CCI is widely used as a measure of comorbidity, with higher score indicating higher mortality risk and more severe comorbid conditions [19]. It provides a valid assessment of individual’s unique clinical condition [19]. The chronic conditions used to compute CCI included myocardial infarction, congestive heart failure, peripheral vascular disease, cerebrovascular disease, dementia, chronic pulmonary disease, rheumatic disease, mild/moderate/severe liver disease, diabetes with and without chronic complication, hemiplegia or paraplegia, renal disease, any malignancy, metastatic solid tumor, and HIV/AIDS [20].

Psychosocial need was measured using Social Triage (ST) score, which was a composite variable on patient/family social support, patient’s mental health, treatment compliance, and patient/family coping response. Patient/family social support was assessed by examining the availability of existing caregiver or formal/informal social support for patient. Patient’s mental health was assessed by examining the existence of mental condition e.g., dementia, depression, psychiatric/substance abuse, alcoholism, or other behaviour problem that affected care. Treatment compliance was based on patient’s adherence to treatment/care plans. Patient/family coping response was assessed by rating their ability to cope with diseases. Each component of the composite variable has a 3-point rating score, with higher score indicating poorer support, more mental health issues, poor treatment compliance, and poor coping response, respectively. S1 Table describes each component within the ST score.

We conducted the cluster analysis using the Partitioning Around Medoids (PAM) approach. PAM is a simple unsupervised machine learning clustering algorithm that groups data into a specified number (k) of clusters (segments). This approach searches for representative observations in the dataset called medoids that are centrally located in clusters, and then assigns all other observations to the closest medoid, in order to create clusters [21,22].

Determining the optimal number of segments

We applied a post-hoc approach to define the optimal number of segments presented within the data set, using the R package NbClust [23]. NbClust provides 30 indices which estimate the optimal number of segments in a data set and proposes the best clustering scheme from different results obtained by varying all combinations of number of segments, distance measures, and clustering methods [23]. We selected the optimal number of segments with a majority rule, which is the number of segments derived from majority of the indices. The final segmentation outcome was assessed by its clinical relevance and interpretability, to evaluate the goodness of clustering algorithm results.

Statistical analyses

Sociodemographic, clinical, functional, and psychosocial variables, and healthcare utilisation pattern were collected at baseline to describe the characteristics of each segment of patients. Chi-square/Fisher exact test and one-way ANOVA/Kruskal-Wallis H tests were used to determine whether there were statistically significant differences in the baseline characteristics across the patient segments for categorical and continuous variables (parametric and non-parametric), respectively.

Next, we examined the association of segment membership and prospective healthcare utilisation and mortality. We performed multivariable negative binomial regression model using the patient segments as the predictor and the number of hospital admissions, LOS, ED visits, and programme utilisation over the 180-day follow-up period as outcomes. In addition, we applied multivariable cox proportional hazard regression model to assess and compare the time to the first hospital admission and mortality between segments within the 180-day follow-up period. All models were adjusted for age, gender, ethnicity, ward class, and baseline healthcare utilisation. Ward class refers to the highest tier of hospital ward where the patient had stayed among his/her all hospital admissions. Different ward classes have different facilities, charges and level of subsidy for hospitalisation costs, but the quality of medical care remains the same in all wards. All analyses were performed in R version 3.6.1 (R Core Team, 2019).

Results

Baseline characteristics of the three patient segments

Among all indices within the NbClust package, majority (9 of 30 indices) proposed 3 as the best number of segments. Following the majority rule, we ran the PAM algorithm with k = 3 in the study population.

There were a total of 1,012 patients across all three segments. The mean age of patients was 75.8 years (Standard Deviation (SD): 12.5). Majority of the patients were female (55.5%), married (48.2%), and of Chinese ethnicity (61.7%). Majority lived in 4 rooms public housing apartment (43%). Table 2 presents the baseline characteristics of the three patient segments.

thumbnail
Table 2. Baseline characteristics of the patient segments.

https://doi.org/10.1371/journal.pone.0288441.t002

There were no significant differences in age among segments. The three segments had significantly different medical complexity and psychosocial needs, as reflected by the CCI (p-value = <0.001) and ST score (p-value = <0.001). This reflects the central aim of cluster analysis which is to maximise the distance between clustering variables. Segment 2 had the highest medical complexity with highest mean CCI score 4.5 (SD: 2.0), followed by Segment 1 mean CCI score 2.3 (SD: 2.3) and Segment 3 mean CCI score 0.9 (SD: 0.8). Segment 1 had the highest psychosocial needs as reflected by the highest mean ST score 7.4 (SD: 1.2) with poorer patient/family social support, mental health, treatment compliance and coping response, followed by Segment 2 (mean ST score 4.3 (SD: 0.6)) and Segment 1 (mean ST score 4.2 (SD: 0.4)).

In addition, the non-clustering variables including functional and baseline healthcare utilisation differed significantly. This demonstrated that each segment was largely distinct. The Barthel Activities of Daily Living (ADL) (p-value = <0.001) and Instrumental Activities of Daily Living (IADL) (p-value = <0.001) score were significantly different across segments. Segment 1 had the highest functional limitation with the lowest mean ADL score of 49.1 (SD: 37.4) and lowest mean IADL score 2.2 (SD: 2.8), followed by Segment 2 (mean ADL score 60.8 (SD: 34.2) and mean IADL score 2.3 (SD: 2.5)), and Segment 3 (mean ADL score 67.3 (SD: 34.9) and mean IADL score 3.1 (SD: 2.9)). Baseline healthcare utilisation were also significantly different among the three segments. Segment 2 had the highest number of ED visits (mean = 2.3, SD: 3.0), hospital admissions (mean = 2.1, SD: 1.2), and SOC visits (mean = 3.1, SD: 3.5) at baseline, followed by Segment 1 (ED visits (mean = 2, SD: 2.1), hospital admissions (mean = 1.7, SD: 1.4), and SOC visits (mean = 2.2, SD: 3.1)), and Segment 3 (ED visits (mean = 1.9, SD: 1.1), hospital admissions (mean = 1.7, SD: 0.9), and SOC visits (mean = 2.1, SD: 2.5)).

Healthcare utilisation in the follow-up period

We examined the number of hospital admissions, LOS, ED visits, and SOC visits in the 180-day follow-up period using the multivariable negative binomial model, adjusting for age, gender, ethnicity, ward class, and all baseline healthcare utilisation (i.e., number of ED visits at baseline, number of hospital admissions at baseline, and number of SOC visits at baseline) (Table 3).

thumbnail
Table 3. Hospital admission, LOS, ED visit, SOC visit, and Programme utilisation in the 180-day follow-up across patient segments.

https://doi.org/10.1371/journal.pone.0288441.t003

Compared to those in Segment 3, the rate of getting hospitalised for patients in Segment 2 was approximately two-fold higher (Incidence Rate Ratio (IRR) = 2.11, 95% CI: 1.7 to 2.6) and 1.6 times higher in Segment 1 (IRR = 1.63, 95% CI: 1.3 to 2.1). Compared to Segment 3, patients in Segment 2 had a rate 2.6 times higher for LOS (in days) (IRR = 2.63, 95% CI: 1.9 to 3.65), and those in Segment 1 had a rate 2.5 times higher for LOS (in days) (IRR = 2.45, 95% CI: 1.71 to 3.5). Compared to Segment 3, the rate of having ED visits was greater in Segment 2 (IRR = 1.86, 95% CI: 1.5 to 2.3) and Segment 1 (IRR = 1.56, 95% CI: 1.3 to 1.9). However, compared to Segment 3, the rate of having SOC visits was lower in Segment 1 (IRR = 0.78, 95% CI: 0.67 to 0.9).

Programme utilisation in the follow-up period

We examined programme utilisation in the 180-day follow-up period using the multivariable negative binomial model adjusting for age, gender, ethnicity, ward class, and baseline healthcare utilisation (Table 3). Compared to Segment 3, the rate of having home visits was 76% more in Segment 1 (IRR = 1.76, 95% CI: 1.6–2.0) and 25% more in Segment 2 (IRR = 1.25, 95% CI: 1.1–1.4).

Time to first hospital admission and mortality

We examined time to first hospital admission and time to mortality during the 180-day follow-up, using cox proportional hazard regression models adjusting for age, gender, ethnicity, ward class, and all baseline healthcare utilisation (i.e., number of ED visits at baseline, number of hospital admissions at baseline, and number of SOC visits at baseline) (Table 4).

thumbnail
Table 4. Time to first hospital admission and mortality during the 180-day follow-up across patient segments.

https://doi.org/10.1371/journal.pone.0288441.t004

Compared to Segment 3, there was a higher risk of having their first hospital admission for those in Segment 2 (Hazard Ratio (HR) = 2.13, 95% CI: 1.7 to 2.6) and Segment 1 (HR = 1.62, 95% CI: 1.3 to 2.1). Compared to Segment 3, there was an approximately two-fold higher risk of dying among those in Segment 1 (HR = 2.48, 95% CI: 1.5 to 4.1) and Segment 2 (HR = 2.25, 95% CI: 1.4 to 3.6).

Discussion

The cluster analysis demonstrated in this study identified three patient segments based on medical complexity and psychosocial needs variables. The three segments had significantly different medical complexity and psychosocial needs, as clustering analysis maximised the distance between the segmentation variables.[24] In addition, other non-clustering variables i.e., ADL, IADL, and baseline healthcare utilisation, were also found to differ significantly, demonstrating that each segment was largely distinct. Fig 1 illustrates the differences among segments.

Population characteristics

Of the three segments, Segment 1 had the highest functional dependency and psychosocial needs with 35% of the segment having a total dependence based on ADL score category, lowest IADL score and the highest risk of having psychosocial needs based on the ST score. Segment 2 had the highest medical complexity, reflected by the highest CCI score. Segment 3 had the lowest medical complexity, functional dependency and psychosocial needs, hence this segment was used as a reference group for the multivariable analysis.

Population utilisation and mortality

Our study examined the association between membership across different segments with healthcare utilisation and mortality at 180-day follow-up. We found that the rate of having hospital admission, LOS (in days), ED visit, SOC visit, programme utilisation, and risk of hospital admission and mortality in the follow-up period differed across the three patient segments, suggesting that our segmentation approach could arrive at population segments with varying risk of healthcare utilisation and mortality.

Patients in Segment 1 with the highest functional dependency and psychosocial needs had the highest programme utilisation with moderate rate of hospitalisation and ED visit, and the highest risk of dying in the follow-up period. It is noteworthy that this Segment also had moderate medical complexity, with an average CCI score of 2.3 (SD: 2.3). Another study reported that individuals who were physically dependent had higher odds of mortality in the follow-up period as compared to those who were largely independent, even when both groups had similar level of comorbidity [25]. This suggests that physical dependency might serve as a differentiating factor that contributes to higher odds of mortality in the former group. A study amongst older adults with multimorbidity reported that physical functioning acted as a mediator of the multimorbidity and mortality association [26]. Literature also reported that psychosocial factors, e.g., social support, mental health, coping ability, are important predictors of mortality beyond severity of comorbidity. This might relate to the effect of these factors towards a negative influence in physical health. It is plausible that this might involve mechanisms that induced stress, which may affect health in various ways, such as via endocrine and / or immune system [27].

Patients in Segment 2 with the highest medical complexity reflecting more medical needs had the highest risk of healthcare utilisation in the follow-up period. This finding reinforced previous studies on the high medical needs impact on hospitalisations and ED visits [4]. It is noteworthy for this Segment 2 to receive moderate programme utilisation during the follow-up period. Although the study did not look into the cause, it could be postulated from the high SOC visits that the segment had more unstable medical conditions.

Segment 3 as the reference group, had the lowest programme utilisation, healthcare utilisation and mortality in the follow-up period.

Policy and research implications

Previous systematic review on predisposing, enabling and need characteristics of high cost patients included age, ethnicity, socioeconomic conditions, organisational factor (e.g., supply of health services and providers), medical conditions, health, and functional status. In these high-cost patients, inpatient services were most often reported as the main cost driver [4]. Furthermore, it reported that high-cost patients were more likely to die [4]. Our study reinforced the association between the characteristics of high users with healthcare utilisation and mortality.

Previous studies were mostly conducted in the United States and Canada, with a limited studies done in Asia [46]. It is plausible that there will be a diversity in patients’ characteristics and utilisation across different countries due to the difference in the epidemiological and health system factors. In addition, different countries might have different healthcare financing system, that could involve single payer or multi-payer framework.

Our study gave a more in-depth look on profiling the high users of inpatient services in Singapore. Moreover, it utilised the segmentation analysis approach to identify homogeneous population segments with similar characteristics and needs, to inform population health management and to allow for a tailored policy based on the local context.

This study reinforced the need to look at medical complexity, functional, and psychosocial needs as the distinguishing characteristics in segmenting high users of inpatient services into different population segments. In this study, Segment 1 with more functional dependency and psychosocial needs was found to have higher rate of programme utilisation, which included home visits done by either a doctor, nurse, physiotherapist, speech therapist, or occupational therapist. This suggests more needs for healthcare to support them with comprehensive assessment and care from the multi-disciplinary team. Segment 2 with more medical needs had the most need for hospital services. Studies reported that medically complex patients might benefit from more intensive medical models, including ambulatory intensive care units [4,28].

In this study of high users, the ones with the lowest medical, functional and psychosocial needs (Segment 3) comprised of almost half of the total population (44%) despite their relatively lesser demand on both programme and hospital services as compared to the Segments 1 and 2. Further profiling and differentiation of this group might aid in tailoring appropriate care to support them.

This study demonstrated that cluster analysis using administrative data could help identify patient segments with distinct healthcare characteristics and utilisation among patients with complex needs and who are high users of inpatient services. Majority of the studies on population segmentation included general population as the target population for segmentation, while others restricted to those with specific diseases or conditions [29]. This study further defined the target population at risk of adverse events i.e. complex high users of inpatient services, at risk of frequent hospital admission and mortality.

Most population segmentation systems adopt a healthcare utilisation risk-based rather than healthcare needs-based segmentation approach [2,30,31]. There is an increasing sentiment that segmentation based on healthcare utilisation risk is deemed inadequate to inform the development and allocation of healthcare services, as needs for specific healthcare services can occur due to few underlying phenomena, including the presence of morbidity risk, pain/discomfort, dysfunction, risk of mortality, patient’s subjective impression, or providers’ normative assessment [2]. Healthcare needs-based segmentation approach proposes a more comprehensive approach to individuals as it could provide indications for healthcare interventions that may lessen morbidity risk and address their needs [2]. Policy makers can optimise population health planning by applying healthcare needs-based population segmentation approach as an integral component in assessing population healthcare needs in relation to available services and resource allocation.

This study used CCI and ST as the segmentation variables, which are able to capture a comprehensive view of the individuals’ medical and psychosocial conditions, as a proxy of their needs. These variables not only covered the individuals’ risk of morbidity, but also their health-related behavior such as treatment compliance, and the wider determinants of health including their support system and coping response.

Most clustering algorithms depend on some assumptions in order to define the number of subgroups present in a data set. Similarly, the PAM approach required the number of clusters (k) to be pre-determined as input. As a consequence, the resulting clustering scheme requires some sort of evaluation as regards its validity and the goodness of clustering algorithm results [23]. Criteria to assess market segmentation have been widely adopted to evaluate the quality of population segmentation in healthcare context [13,32]. These include identifiability/interpretability (segments should be recognised and interpreted easily), substantiality (each segment should have sufficient size), stability (each segment should be relatively stable over time), and actionability/accessibility (each segments should be easily addressed and targeted with distinctive health intervention strategies) [13,32]. In this study, the segmentation outcomes demonstrated meaningful interpretation and clinical relevance as each segments were distinct in relation to its medical, functional and psychosocial needs pattern. In addition, each segments also differed in their rate and risk of future healthcare utilisation and mortality. These indicate a potential application for health service planning at population level.

There are a few study limitations. First, our study included older patients with high hospital service utilisation and complex chronic conditions, which may limit the generalisability of our findings to a more diverse and general population. Second, as the ST was not widely used and involved subjectivity from provider-rated assessment, its applicability to other populations requires further testing. Further studies might consider assessing the validity and reliability of the tool in different context and population. Third, we did not assess the segments performance against different data set. Hence, external validation and generalisability of the segmentation outcome requires further testing. Fourth, we did not include healthcare utilisation from other RHSs. However, majority of patients in Singapore utilise services within one regional health system [33]. Fifth, we did not conduct any analysis to compare healthcare utilisation and mortality in the excluded high users group, i.e., those discharged to institutional care or other community support services. Future studies might want to look into this population segment, to understand its characteristics, healthcare utilisation pattern, and risk of mortality. Lastly, this study is limited by the relatively short follow-up time. Future studies might consider having longer follow-up period to assess the stability of the segmentation outcome over time and also to track progression of each segment and transition between segments, thus allowing for the evaluation of targeted healthcare interventions and development of preventive healthcare services for each segment.

It is important to align the segmentation approach with the population segmentation objectives, as segmentation outcomes may differ with varied input variables and analytical approaches. Selection of variables for segmentation purposes requires iterative processes with contextual knowledge and expertise in data mining and clinical applicability.

Conclusion

A cluster analysis approach in segmenting the population of complex high users of inpatient services using administrative data could identify segments with distinct characteristics and care needs. This study demonstrated the value in capturing a more comprehensive view on factors that influence the healthcare utilisation among complex high users of inpatient services. It provides evidence on healthcare needs-based segments that might potentially help policy makers in allocating resources efficiently by tailoring intervention to meet these needs. It also informs future research on the potential benefit of including data on psychosocial and nursing assessment to characterise complex high users of inpatient services.

Supporting information

Acknowledgments

We would like to thank Lydia Chin Lee Hoon and Michael Yi Jing for helping with data acquirement, and Robin Choo Wai Munn for providing feedback for the manuscript.

References

  1. 1. Nguyen H, Manolova G, Daskalopoulou C, Vitoratou S, Prince M, Prina AM. Prevalence of multimorbidity in community settings: A systematic review and meta-analysis of observational studies. Journal of comorbidity. 2019;9(1–15). PubMed Central PMCID: PMC671708. pmid:31489279
  2. 2. Chong JL, Lim KK, Matchar DB. Population segmentation based on healthcare needs: a systematic review. Systematic Reviews 2019;8(202). PubMed Central PMCID: PMC6693177. pmid:31409423
  3. 3. Lafortune L, Beland F, Bergman H, Ankri J. Health status transitions in community-living elderly with complex care needs: a latent class approach BMC Geriatrics 2009;9:6. PubMed Central PMCID: PMC2645408. pmid:19192295
  4. 4. Wammes JJG, Wees PJvd, Tanke MAC, Westert GP, Jeurissen PPT. Systematic review of high-cost patients’ characteristics and healthcare utilisation BMJ Open 2018;8 e023113. PubMed Central PMCID: PMC6129088. pmid:30196269
  5. 5. Anderson M, Revie CW, Stryhn H, Neudorf C, Rosehart Y, Li W, et al. Defining ’actionable’ high- costhealth care use: results using the Canadian Institute for Health Information population grouping methodology International Journal for Equity in Health 2019;18(1):171. PubMed Central PMCID: PMC6842471. pmid:31707981
  6. 6. Fouayzi H, Ash A. Should High-Frequency Hospital Users be Excluded from 30-Day Readmission Quality Measures? Health Services Research. 2020;55(S1):92–3. PubMed Central PMCID: PMC7440506.
  7. 7. Lehtonen L, Wild C, Ricciardi W, Barros PP, Brouwer WBF, Maeseneer JD, et al. Defining Value in "Value-Based Healthcare": Report of the Expert Panel on effective ways of investing in Health (EXPH). Luxembourg: Publications Office of the European Union: European Union 2019.
  8. 8. Lynn J, Straube BM, Bell KM, Jencks SF, Kambic RT. Using population segmentation to provide better health care for all: the “Bridges to Health” model. The Milbank Quarterly. 2007;85(2):185–208. PubMed Central PMCID: PMC2690331. pmid:17517112
  9. 9. Porter ME, Pabo EA, Lee TH. Redesigning primary care: a strategic vision to improve value by organizing around patients’ needs. Health Affairs. 2013;32(3):516–25. pmid:23459730
  10. 10. Chong JL, Matchar DB. Benefits of population segmentation analysis for developing health policy to promote patient-centred care. Annals Academy of Medicine 2017;46(7):287–9. pmid:28821893
  11. 11. Berwick DM, Nolan TW, Whittington J. The triple aim: care, health, and cost. Health affairs. 2008;27(3):759–69. pmid:18474969
  12. 12. Vuik SI, Mayer EK, Darzi A. Patient segmentation analysis offers significant benefits for integrated care and support. Health Affairs. 2016;35(5):769–75. pmid:27140981
  13. 13. Wedel M, Kamakura WA. Market segmentation: Conceptual and methodological foundations. 2nd ed. Eliashberg J, editor. New York: Springer Science+Business Media, LLC 2000.
  14. 14. Low LL, Kwan YH, Liu N, Jing X, Low ECT, Thumboo J. Evaluation of a practical expert defined approach to patient population segmentation: a case study in Singapore BMC Health Services Research. 2017;17(771). pmid:29169359
  15. 15. Yoon S, Goh H, Kwan YH, Thumboo J, Low LL. Identifying optimal indicators and purposes of population segmentation through engagement of key stakeholders: a qualitative study. Health Research Policy and Systems. 2020;18(26). PubMed Central PMCID: PMC7035731. pmid:32085714
  16. 16. Low LL, Yan S, Kwan YH, Tan CS, Thumboo J. Assessing the validity of a data driven segmentation approach: A 4 year longitudinal study of healthcare utilization and mortality. PLoS ONE. 2018;13(4):e0195243. pmid:29621280
  17. 17. Khoo Teck Puat Hospital. Ageing-In-Place [updated 14 Jul 2022]. Available from: https://www.ktph.com.sg/community-resources/be-supported.
  18. 18. National Council of Social Service. About Social Service Agencies, Charities & IPCs Singapore National Council of Social Service 2023 [cited 2023 13 April]. Available from: https://www.ncss.gov.sg/social-service-agencies/about-social-service-agencies-charities-ipcs.
  19. 19. Charlson ME, Carrozzino D, Guidi J, Patierno C. Charlson Comorbidity Index: A Critical Review of Clinimetric Properties Psychotherapy and Psychosomatics 2022;91:8–35. pmid:34991091
  20. 20. Quan H, Sundararajan V, Halfon P, Fong A, Burnand B, Luthi J-C, et al. Coding Algorithms for Defining Comorbidities in ICD-9-CM and ICD-10 Administrative Data Medical Care 2005;43(11):1130–9. pmid:16224307
  21. 21. Laan MVd, Pollard K, Bryan J. A new partitioning around medoids algorithm. Journal of Statistical Computation and Simulation. 2003;73(8):575–84.
  22. 22. Kaufman L, Rousseeuw PJ. Finding Groups in Data: An Introduction to Cluster Analysis New Jersey John Wiley & Sons, Inc.; 1990.
  23. 23. Charrad M, Ghazzali N, Boiteau V, Niknafs A. NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set Journal of Statistical Software 2014;61(6):1–36.
  24. 24. Nnoaham KE, Cann KF. Can cluster analyses of linked healthcare data identify unique population segments in a general practice-registered population? BMC Public Health. 2020;20(798). PubMed Central PMCID: PMC7254635. pmid:32460753
  25. 25. Ng SCW, Kwan YH, Yan S, Tan CS, Low LL. The heterogeneous health state profiles of high-risk healthcare utilizers and their longitudinal hospital readmission and mortality patterns BMC Health Services Research. 2019;19:931. PubMed Central PMCID: PMC6894210. pmid:31801537
  26. 26. Wei MY, Kabeto MU, Galecki AT, Langa KM. Physical Functioning Decline and Mortality in Older Adults With Multimorbidity: Joint Modeling of Longitudinal and Survival Data The Journals of Gerontology: Series A. 2019;74(2):226–32. PubMed Central PMCID: PMC6599282. pmid:29529179
  27. 27. Dalgard OS, Haheim LL. Psychosocial risk factors and mortality: A prospective study with special focus on social support, social participation, and locus of control in Norway Journal of Epidemiology and Community Health. 1998;52(8):476–81. PubMed Central PMCID: PMC1756754. pmid:9876357
  28. 28. Brown RS, Peikes D, Peterson G, Schore J, Razafindrakoto CM. Six Features of Medicare Coordinated Care Demonstration Programs That Cut Hospital Admissions Of High-Risk Patients Health Affairs. 2012;31(6). pmid:22665827
  29. 29. Yan S, Kwan YH, Tan CS, Thumboo J, Low LL. A systematic review of the clinical application of data-driven population segmentation analysis. BMC Medical Research Methodology. 2018;18(121). PubMed Central PMCID: PMC6215625. pmid:30390641
  30. 30. Kansagara D, Englander H, Salanitro A, Kagen D, Theobald C, Freeman M, et al. Risk prediction models for hospital readmission: a systematic review. JAMA. 2011;306(15):1688–98. PubMed Central PMCID: PMC3603349. pmid:22009101
  31. 31. Vuik SI, Mayer E, Darzi A. A quantitative evidence base for population health: applying utilization based cluster analysis to segment a patient population. Population Health Metrics 2016;14(44). PubMed Central PMCID: PMC5124281. pmid:27906004
  32. 32. Jadczaková V. Review of segmentation process in consumer markets. Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis. 2013;61(4):1215–24.
  33. 33. Low LL, Tay WY, Ng MJM, Tan SY, Liu N, Lee KH. Frequent hospital admissions in Singapore: clinical risk factors and impact of socioeconomic status Singapore Medical Journal. 2018;59(1):39–43. PubMed Central PMCID: PMC5778257. pmid:27311740