Introduction

Waiting time in emergency departments (EDs) is a major policy concern internationally [1,2,3]. Long waits can adversely affect patients, in terms of longer inpatient stays [4], higher mortality rates [5, 6], and an increase in the costs of care. In England, a target that 98% of patients should be assessed and admitted, transferred or discharged within 4 h was introduced in 2004 and was initially followed by a significant reduction in long (over 4 h) waits [7, 8]. In 2010, the target was reduced to 95%, but this revised goal has not been met nationally since July 2015 [9].

From 2011/12 to 2015/16, there were on average 1.1 million waits over 4 h each year, breaching the national target by around 400,000.Footnote 1 In part, the pattern of recent breaches of ED waiting time targets reveals mounting pressure on EDs: attendances to English EDs have risen steadily from 21.5 million in 2011/12 to 22.9 million in 2015/16 [10]. Population ageing means this trend is expected to continue as people aged 65 or over (65 +) constitute one-fifth of ED attendances [9] and have higher rates of attendance [11].

In March 2019, NHS England published its plans to overhaul targets across the NHS, including the 4-h waiting time target in EDs [12]. The Powis review of access standards at English EDs underscored continued policy concerns with long waits [13]. The review proposed four new ED standards, including mean waiting time in EDs and a 1-h maximum wait for patients in need of urgent care, to replace the 4-h target. These proposed new standards aim to align incentives faced by hospital managers and clinicians with clinical best practice. It is currently unclear how the proposed measures would be converted into targets for EDs and, hence, to what extent incentives will be aligned. A stronger evidence base for understanding drivers of long waits can assist policy makers in the ongoing process of updating and refining incentives to maximise the quality of care for patients [14].

Commentators have pointed to a range of factors to explain long waits. Higher volumes of non-urgent visits, influenza outbreaks, inadequate staffing and hospital bed shortages can give rise to overcrowding in EDs [15], leading to longer waits for attendees to be assessed, prioritised and treated [6, 15, 16]. With one exception, previous studies have examined the impact of single factors on breaches of the 4-h target, but not explored the effects of these factors in combination [2, 11] or have been limited to a single ED [7]. In a more comprehensive analysis, Keogh et al. explore the relationship between breaches of the 4-h waiting time target and other performance measures, notably cancelled operations, using quarterly data from 2011 to 2016 [17].

We employ a retrospective observational study to examine the effects of a range of factors on all attendances to major EDs in England from 2011/12 to 2015/16. We contribute to the evidence base in four ways. First, we exploit weekly rather than quarterly data, allowing us to identify smaller effects that may still be important from a clinical or policy perspective. Second, we test for effects on two measures of long waits: ‘long overall waits’, which capture the number of waits exceeding 4 h in the ED, and ‘long trolley waits’, which capture the number of patients waiting over 4 h on a trolley (gurney) between the decision to admit and admission to an inpatient ward. As trolley waits affect a subset of attending patients and part of their journey through EDs, they might be affected by different factors to the overall measure. For example, the availability of inpatient beds might be expected to have a more direct impact on the waiting time of patients in need of admission. Third, we investigate a range of demand and supply factors in combination to explain these long waits. Fourth, we explore residual variation in long waits among EDs after accounting for observable demand and supply factors. This allows us to identify EDs with more frequent long waits than expected. These EDs would repay further investigation.

Data

We use information from all Hospital Trusts with a major ED in England, excluding EDs in specialist children’s hospitals,Footnote 2 from April 2011 to March 2016. A Trust is an administrative unit managing the provision of hospital care and responsible for one or multiple EDs within the local vicinity. Our analysis is by Trust because the data we use to measure attendances and 4-h waits are reported for Trusts and the Trust is the level at which policy makers generally interact with providers. A major ED is open 24 h a day on all days of the year, has full resuscitation facilities and is led by a senior physician, referred to in England as a ‘hospital consultant’. Major EDs are the primary providers of emergency medicine in England, accounting for 79% of ED attendances in 2015/16, excluding attendances to specialist children’s hospitals.Footnote 3

Outcome measures

We consider two measures of long (over 4 h) waits: long overall waits and long trolley waits. Counts of these events are constructed from the A&EFootnote 4 Attendances and Emergency Admissions statistical collection, published by NHS England and analysed at the weekly level [18].Footnote 5 The first measure, ‘long overall waits’, is the number of attendances to the ED in a week that last more than 4 h, from arrival to the patient being dealt with. Here, ‘dealt with’ covers discharge alive or dead, inpatient admission or transfer elsewhere [19]. The second measure comprises the number of ED attendances lasting more than 4 h after a decision has been made to admit the patient as an inpatient. We refer to this as ‘long trolley waits’, because it reflects the time spent waiting on a trolley (sometimes in the ED or a hospital corridor) before an appropriate inpatient bed becomes available. For example, a patient might arrive at A&E, wait 3 h for initial assessment, at the end of which a decision is made to admit them. They might wait a further 2 h to be admitted. In this case, the overall wait would be 5 h and the trolley wait would be 2 h.

As a patient must attend an ED to experience a long wait and because EDs differ in the number of attendances they receive, we include an exposure term of number of ED attendances. That is, the logarithm of the number of ED attendances is included as a variable with a coefficient constrained to 1. This means the models explain variation in rates of long waits (numerator) over attendances (denominator).

Explanatory variables

We include three groups of explanatory variables: (1) demand variables capturing features of the local population and ED attendees; (2) supply variables in terms of characteristics of the ED, the hospital in which the ED is based, and characteristics of long-term care (LTC) supply in the local area; and (3) time factors (years, and months of the year to capture seasonal effects).

Demand variables

We include attendee demographic and clinical characteristics of age, gender and number of diagnoses.Footnote 6 These characteristics are taken from the Hospital Episode Statistics (HES), a patient-level dataset of all ED attendances to NHS Trusts. [20] We employ variables that capture the weekly average characteristics of attendees to each ED.Footnote 7

We also include the percentage of attendees who live in the lowest quintile of income deprivation, assessed using the English index of multiple deprivation 2010 [21]. Jones and Wildman [22] found patients with lower income had poorer self-assessed health. Such patients may take longer to assess as a consequence.

We include three mutually exclusive methods of leaving EDs, namely the percentage of attendees who leave an ED without being treated; the percentage admitted to an inpatient ward; and the death rate per 1000 ED attendances.

Finally, we include the percentage of people aged 65 + living within 10 km, to capture demand of the local population. People aged 65 + are more frequent users of ED services [11].

Supply variables

We include variables to measure characteristics of the ED or Trust in which it is based. These factors are, in principle, more likely to be under the control of the ED than demand side factors. We capture staffing arrangements as the ratio of full time equivalent (FTE) ED physicians over number of attendances. The numerator is from the Electronic Staff Record (ESR), a census taken at the end of each quarter in 2011/12 and each month from April 2012 [23].Footnote 8 The denominator is a count of weekly attendances, in thousands. All else being equal, we would expect a higher ratio of physicians to attendances would mean patients are seen more quickly, thereby reducing the frequency of long waits.

We also include a dummy variable taking the value 1 if the ED is in a Trust that also has a Minor Injury Unit (MIU), Urgent Treatment Centre or Walk-in Centre and 0 otherwise [24].Footnote 9 These units, hereafter jointly referred to as MIUs, were created to relieve pressure on EDs by caring for less severely ill patients. We would, therefore, expect fewer long waits in EDs with co-located MIUs.

We account for Trust inpatient bed occupancy rates, published quarterly by NHS England [25]. For each quarter, we calculate the average percentage of available overnight beds that are occupied, after excluding maternity beds.Footnote 10 The higher the occupancy rate, the lower the capacity to admit new patients from EDs [17, 26, 27]. This might result directly in a wait over 4 h for patients to be admitted, or indirectly affect all ED patients by contributing to crowding within the ED.

We account for delayed transfers of care (DTOCs), which are drawn from monthly situation reports published by NHS England [28]. We use the percentage of available bed-days ‘lost’ in each month to capture discharge delays from a hospital.Footnote 11 A higher percentage may be associated with more long waits because a delayed discharge from hospital reduces bed availability for patients waiting to be admitted from an ED. Inpatient delays may also capture unobserved aspects of the quality of patient flows, perhaps due to internal bed management arrangements or to the availability of post-discharge care.

We also account for the local supply of long-term care (LTC), captured as the rate of care homeFootnote 12 beds per head of population aged 65 + in the local area. A higher rate of care home beds may reduce delayed transfers out of hospital and therefore reduce hospital bed occupancy. To calculate the rate, we use as a numerator the number of registered care home beds for people aged 65 + (including beds for people with dementia) within a 10-km radius of the ED,Footnote 13 sourced from monthly snapshots published by the sector regulator, the care quality commission (CQC) [29].

Time trends and seasonality

The demand for and supply of ED care, hospital and long-term care are all subject to temporal variation. A set of dummy variables for month of the year represents cyclical patterns of need, most notably the influenza season during winter months. We also include a set of year dummy variables, which capture general changes in need over time as well as national policy changes that might affect waiting time.

Methods

We use a count model to investigate variation across EDs and over time in the number of long overall waits and long trolley waits. This approach explicitly recognises the distributional characteristics of a dependent variable with integer values. A Poisson model with ED fixed effects (FEs) [30, 31] is employed for this analysis. The conditional mean of the model is presented in Eq. 1

$$\lambda_{it} = \exp (D_{it} \beta_{1} + S_{it} \beta_{2} + T_{t} \beta_{3} + \log ({\text{ATT}}_{it} ) + \delta_{i} ),$$
(1)

where \(\lambda_{it}\) is the conditional expected number of long waits in ED i and week t, \(D_{it}\) and \(S_{it}\) are vectors of demand and supply variables as described in the Data section and \(T_{t}\) is the set of dummy variables for months and years, where April and 2011 are the base month and year, respectively. \({\text{ATT}}_{it}\) is the number of attendances to the ED. This variable enters the model as an exposure term, and is given a constrained value of 1. The ED-specific FEs \(\delta_{i}\) capture unobserved time-invariant effects. We calculate standard errors clustered at the ED level.

We preferred the Poisson fixed effects model for several reasons. First, unlike random effects models, it allows for the unobserved time-invariant variables to be correlated with the observed variables and therefore it provides complete adjustment for these potential correlations.

Second, a well-known limitation of the cross-sectional Poisson model is that it specifies the conditional variance to be equal to the conditional mean (equidispersion). In many applications, the dependent variable is overdispersed, even after conditioning on covariates to allow the Poisson rate to vary across units. This leads to deflated standard errors and low P values. By including department fixed effects in the Poisson model, we account for time-invariant unobserved heterogeneity. This is expected to reduce the conditional variance. If there is unobserved heterogeneity that is specific to particular points in time, overdispersion may still exist. For this reason, we calculate standard errors clustered at the ED level.

Third, in the panel data context, the Poisson fixed effects model is more robust than the Negative Binomial fixed effects model. Estimating the NB fixed effects model with unconditional maximum likelihood is subject to potential incidental parameter problems. Estimating by conditional maximum likelihood is feasible with a specific parameterisation of the model [30] but it has been shown that this is not a true fixed effects model controlling for all time-invariant covariates [32].

Fourth, when investigating variation in hospital-specific effects, detailed below, employing fixed effects places no restrictions on the values which can be taken for each Trust level effect, which is not the case if random effects are applied.

Employing fixed effects does not in itself account for serial correlation. However, the implications of serial correlation are limited to inference and we adjust for potential underestimation of standard errors by employing robust standard errors clustered at the department level.

We also present an alternative specification where we consider attendances to Minor Injury Units as well as to EDs. The dependent variable, exposure term and patient characteristics are calculated from the set of attendances to an ED (as in our main specification), plus those attending an MIU. In this way, we account for a wider set of patients receiving treatment, including those with more minor conditions. However, this is not our preferred specification as there is a risk of double counting patients who attend an MIU and then are sent to a co-located ED, or vice versa. These cases cannot be identified from the aggregated data on waits, from which our dependent variables are calculated.

To undertake comparisons across EDs, we derive the indirectly risk-standardised waits ratio (SWR). The SWR is a ratio of observed long waits over the counterfactual expected number of long waits had the given ED been at a hypothetical benchmark. The benchmark is defined as the ED with the average estimated FE. The denominator of the SWR is predicted from the Poisson model using the demand and supply characteristics of the ED and the FE of the hypothetical benchmark ED. Therefore, the SWR captures variation in long waits attributable to the ED-specific FE, not to the demand and supply factors accounted for in the model. An SWR above 1 implies that the ED has a higher rate of long waits than expected, given the demand and supply factors it faces, and might, therefore, repay further investigation to identify ED-specific explanatory characteristics that cannot be observed in routine administrative data.

Results

Descriptive statistics

Table 1 presents descriptive statistics for long overall and long trolley waits and the various explanatory variables. Information for two samples is presented: (i) Attendances to Emergency Departments; (ii) Attendances to Emergency Departments and Minor Injury Units. Our analysis includes 139 departmentsFootnote 14 followed over 260 weeks. Of attendances to EDs alone, around 8% (155/1941) of weekly attendances to the average ED lasted more than 4-h, increasing from 5% in April 2011 to 19% in March 2016. Over the same period, the percentage of long trolley waits increased from 16 to 21% of long overall waits.Footnote 15

Table 1 Descriptive statistics

Figure 1 presents a line plot of the weekly average rate of long overall waits/ED attendances and long trolley waits/long overall waits. The figure shows these rates have increased over time but there are large week-by-week and seasonal variations that dominate this general trend.

Fig. 1
figure 1

Mean % of weekly waits for ED attendees

Descriptive statistics indicate that patients attending EDs are more likely to live in more deprived areas, with 23% of patients living in the lowest quintile of deprivation. On average, less than one diagnosis is recorded for each patient, likely to reflect limited diagnostic activity in the ED. Of attendees, 3% leave untreated, 27% are admitted to a hospital ward and 0.15% die while in the ED.

Around 17% of people living within 10 km of an ED are aged 65 +, slightly lower than the national rate. This suggests some relative concentration of younger populations around hospitals, which are more likely to be based in urban areas. Inpatient bed occupancy averages 86%, just above the recommended rate of 85% [2].

Descriptive statistics for the combined sample of attendances to EDs and MIUs are very similar to that of attendances to EDs alone because the vast majority of attendances are to EDs. Information for the combined sample indicates attendances to MIUs are slightly less complex, with a slightly lower average age and lower rate of death. This is to be expected as MIUs were introduced to provide treatment for a less severely ill population. The variable ‘% of ED attendances ending in admission’ is identical in the two samples because it is not possible to calculate this variable separately for ED and MIU attendances. The other variables reporting identical information for the two samples relate to wider populations or services such as inpatient care or long-term care, calculated from the wider Trust or geographical area (see the Data section for details).

Regression results

In the regression analysis, we model the count of overall or trolley waits lasting more than 4 h (long waits) and report incident rate ratios (IRRs). Results are presented in Table 2.

Table 2 Regression results

Results for the analysis of long overall waits are reported in columns 2 (IRR) and 3 (ci95) of Table 2. For the demand variables, we find that an increase in mean patient age by 1 year is associated with a 4.2% increase in the rate of long overall waits, a 1 percentage point increase in male patients is associated with reductions in long overall waits by 1.3%, a 1% increase in the percentage of patients leaving without being treated (‘untreated’) is associated with an increase in the rate of long overall waits of 15.1%, and an increase in the death rate by one in 1000 ED attendances is associated with an increase in long overall waits by 3.3%.

In terms of supply variables, the introduction of an MIU and an increase by 1 percentage point in the inpatient bed occupancy rate are associated with increases in the rate of long overall waits by 19.9% and 1.5%, respectively.

We also find seasonal and annual patterns, with 10 months significantly different from the base month (April). The rate of long overall waits is higher in winter months and lower during the summer. We also observe a significant increase in the rate of long waits over time, captured by year dummy variables.

In general, long trolley waits are affected by similar factors to long overall waits (as reported in Table 2, columns 4 and 5) except that gender has no significant effect on the rate of long trolley waits, nor does the presence of an MIU. The rate of trolley waits is also increasing over time, but is not significantly different from the 2011 base year until 2013/14, instead of 2012/13 for overall waits.

In columns 6 and 7 of Table 2, we report results after replacing the dependent variable, exposure term and patient characteristics from EDs alone with measures that combine ED and MIU attendances. In general, the qualitative findings of this analysis are the same as for overall long waits among attendances to EDs alone. However, we find that a 1 percentage point higher concentration of people in the most deprived quintile is associated with a 1.3% lower rate of long overall waits and an increase of one in the rate of physicians per 1000 attendances is associated with a reduction in long overall waits by 0.8%.

Variation across EDs

In Fig. 2 we plot the SWRs for long overall and long trolley waits separately. The correlation between the two sets of SWRs is quite low (r = 0.34), implying that the two wait measures provide different insights into the nature of ED performance. Moreover, for both measures, there is considerable variation in performance: in some EDs the number of long waits is half the predicted level, in others it is twice what would be expected based on the demand and supply characteristics we have accounted for. These differences could be related to differential staffing arrangements or management practice between EDs. To improve performance, attention might be directed at the small cluster of EDs at the right-hand tail of the distribution, with more frequent long overall waits than expected (Fig. 2a) and at the long tail of EDs in which patients experience many more long trolley waits than predicted (Fig. 2b).

Fig. 2
figure 2

Risk-standardised waits ratios for long overall and trolley waits

Discussion

We investigate the determinants of long overall waits and trolley waits by exploiting variation over time and across EDs within the English NHS.

The rate of long overall waits is higher in EDs serving older patients and/or those with a higher proportion of females, and in EDs where higher proportions of attendees die or leave before receiving treatment. Long overall waits are also higher at EDs in Trusts with greater bed occupancy or where a co-located MIU is introduced. With the exception of gender and the introduction of an MIU, the same factors are significantly associated with long trolley waits.

As has previously been found [17], our study confirms that long waits are less common in Trusts with lower bed occupancy rates, so efforts to reduce bed occupancy might also help reduce long ED waits by allowing patients to be admitted more quickly to an inpatient bed. Depending on the drivers of inpatient occupancy rates, this might be achieved through an increase in the supply of inpatient beds or by changes to admission and discharge policies.

Both older age and the death of the attendee may be markers of greater case-mix complexity, which might increase waits if complexity prolongs treatment time or demands a more intensive use of resources. The relationship may operate in two directions. In one way, these markers of complexity might lead to longer treatment times for these patients and knock-on delays in care for other ED patients. In the other, people facing longer ED waits may be at greater risk of death, as highlighted in other studies [5, 6].

When more attendances end without treatment, long waits are more frequent. This might reflect a range of scenarios including patients leaving untreated if their expected waiting time is long [19]. EDs also act as a safety net for people suffering a mental health crisis: poor access to specialist mental health services can cause delays [33], and these individuals are more likely to leave EDs without being seen [34]. If this mechanism is the primary one, untreated attendances might be a proxy for additional pressure placed on EDs when mental health services are not available.

Trusts that established MIUs were those with ED waiting time problems. EDs in Trusts with an MIU typically treat a more complex case-mix on average, because less complex patients are directed to the MIU. This probably explains the significant positive association between the introduction of an MIU and long waits.

Previous evidence is mixed regarding the impact of staffing levels on ED waiting times [35, 36] and our study shows that the effect of the ratio of physicians to attendances is statistically insignificant in our main analyses with a small significant negative effect only in our analysis of combined attendances to ED and MIU. Possible reasons for the lack of significance in the main analysis include: (i) The rate of physicians per attendance at EDs might be close to optimal given the production function of each ED and levels of other inputs; (ii) The data on physicians, which are limited to monthly or quarterly snapshots of employees, do not sufficiently capture fluctuations in the ED at shorter intervals. That said, in the analysis of combined ED and MIU attendances, we found that a higher rate of physicians to attendances is associated with fewer long waits, which would accord with expectations.

In all our specifications, neither the percentage of bed days lost due to delays in discharging patients nor the rate of care home beds per 1000 persons have a significant impact on waits. As with data on ED physicians, these data are limited to capturing monthly or quarterly snapshots. The lack of significance may also reflect limited variation in these data, which may arise if the local supply of long-term care is largely determined by need assessment.

Compared to the separate analysis of ED attendances, the combined analysis of ED and MIU attendances found that rates of long waits are lower in areas where levels of deprivation are higher. This might reflect hospital attendances at a lower threshold as a substitute for other forms of care such as GPs in more deprived areas. As less complex patients are treated in MIUs and might be dealt with more promptly in that setting than an ED, the strength of such an effect might be more strongly observed when MIU attendances are included.

While there is wide variation across EDs in their rates of long overall and trolley waits, our analysis detects only a handful of factors that significantly explain this variation. Some of these factors are the ones over which neither EDs nor their host Trusts have any control, in particular the demographic characteristics of patients attending the service.

Although we have not identified a predominant factor that explains why long waits are more frequent in some EDs than others, our analysis is able to identify those EDs with more frequent long overall and trolley waits that cannot be explained by a range of observable demand and supply features. There are several potential mechanisms which might lead to this observation. For example, some EDs might face higher variation in the number of attendances within a week, including more frequent spikes in demand. Alternatively, the variation might reflect differences in policies of assessment, admission and disposal across hospitals. The EDs at the right side of each SWR distribution would repay closer study to ascertain how they might improve and to determine what information is needed to apply a waiting time target as a fair measure of performance.

This work has five main limitations that could be addressed by future research. First, the scope of the study is restricted to factors that affect ED waits conditional upon attendance. We therefore do not consider factors that drive attendances, although these could also influence waits. That said, we allow for staffing and organisational features over which the ED may have the greatest control.

Second, our unit of analysis is the Hospital Trust. Some Trusts have more than one major ED (each located on a separate site within the Trust), so considering multiple departments as a single organisation restricts our ability to control for site-specific staffing or locational characteristics and constraints within Trusts. In addition, whilst we control for average patient characteristics within a Trust, whether an attendee experiences a long wait is likely to depend partly on their individual patient characteristics. As datasets improve in their coverage and granularity [37], future research should consider multilevel modelling to take account of the clustering of patients within EDs, and the presence of multiple EDs within Trusts.

Third, our unit of time is aggregated to the weekly level. Spikes in pressure on EDs can occur over a few hours [36], due to a specific event like a road traffic accident. Our weekly measure is not sensitive to the timing of such spikes in demand but captures persistent high demand from week to week. We are also unable to identify features of weekly attendances which require information at shorter intervals, such as variance in attendances within the week. Daily datasets currently cover only the winter months, which are not a representative portion of a year, but if these are extended to the whole year, it might be possible to gain insights into day-to-day variations in activity and performance [38].

Fourth, we measure staffing levels by the number of physicians in an ED. Waiting times could also be influenced by the availability of other staff, notably ED nurses [35], but these data are not recorded routinely in England.

Finally, there might be additional constraints in the process of assessing and providing care for patients in an ED, leading to longer waits. This could include the level of crowding already present in an ED, though this is partially captured by the number of attendances. There might also be constraints in the speed with which tests can be carried out, due to the availability of equipment needed to perform tests or of the technical staff required to prepare and analyse them. Unfortunately, there are no national data that capture these processes, but could form part of the review of those EDs at the tail of the SWR distribution.

Conclusion

There is substantial unexplained variation across EDs, with the rate of long waits at a handful of EDs being substantially higher than predicted by the demand and supply characteristics of the ED. The reasons for these differences may not be observable from currently available routine data but insights might be gained by further investigation of those EDs with a high rate of unexplained long waits, to ascertain why their performance is worse than expected.