econ- ct7

profilelolo1339
10198_2017_Article_891.pdf

ORIGINAL PAPER

Hospital heterogeneity: what drives the quality of health care

Manhal Ali1 • Reza Salehnejad1 • Mohaimen Mansur1

Received: 8 July 2016 / Accepted: 28 March 2017 / Published online: 24 April 2017

� The Author(s) 2017. This article is an open access publication

Abstract A major feature of health care systems is sub-

stantial variation in health care quality across hospitals.

The quality of stroke care widely varies across NHS hos-

pitals. We investigate factors that may explain variations in

health care quality using measures of quality of stroke care.

We combine NHS trust data from the National Sentinel

Stroke Audit with other data sets from the Office for

National Statistics, NHS and census data to capture hos-

pitals’ human and physical assets and organisational

characteristics. We employ a class of non-parametric

methods to explore the complex structure of the data and a

set of correlated random effects models to identify key

determinants of the quality of stroke care. The organisa-

tional quality of the process of stroke care appears as a

fundamental driver of clinical quality of stroke care. There

are rich complementarities amongst drivers of quality of

stroke care. The findings strengthen previous research on

managerial and organisational determinants of health care

quality.

Keywords Health care quality � Stroke � NHS � Machine learning � Regression trees � Panel data � Prediction � Mixed effects model

JEL Classification C23 � C40 � C52 � C53 � D23 � I10 � ILL � L22

Introduction

A central observation of the health care systems is the

existence of substantial heterogeneity in the quality of

health care services across hospitals. The quality of health

care measured along dimensions such as mortality or the

process of care for medical conditions such as cancer,

stroke and pneumonia varies widely across hospitals

[3, 42]. Figure 1 captures the distribution of the quality of

the process of care for stroke across NHS hospitals or trusts

using the data from the Royal College of Physicians for the

years 2004–2010.

It is evident that the quality of the process of stroke care

varies across hospitals. The figure also reveals a

notable positive shift in the distribution of the quality of

stroke care over the years. Notwithstanding this, variations

remain, with some hospitals still performing above the

median quality whereas others are performing below the

median quality. Mooney [55] and Rudd et al. [69] reveal

wide variations in the quality of stroke care across the

regions in the UK and England. Mohammed et al. [54] also

document similar variations in the case of NHS hospitals.

And Ayanian and Weissman [3] and Kupersmith [42]

provide a systematic review on the differences in health

care quality across teaching and non-teaching hospitals.

Hospitals are major providers of health care services and

account for a major expenditure of the overall health care

budget. Any effort at reducing the heterogeneity and

enhancing the quality of care across hospitals can bring

substantial benefits to society. Such efforts demand

understanding why some hospitals perform better than

other hospitals, providing a higher quality of care.

The literature on the drivers of hospital quality is vast,

offering a rich list of explanations for the observed

heterogeneity in the quality of care. We borrow a

& Reza Salehnejad [email protected]

Manhal Ali

[email protected]

Mohaimen Mansur

[email protected]

1 University of Manchester, Manchester, UK

123

Eur J Health Econ (2018) 19:385–408

https://doi.org/10.1007/s10198-017-0891-9

theoretical framework from the economic literature on

productivity [75] to shape our analysis. Using the frame-

work, we distinguish between internal and environmental

drivers of the quality of health care. Internal drivers refer to

factors that are under the control of the hospital’s man-

agement team. The literature studying internal drivers has

paid most attention to hospital size, physical assets, human

capital, financial constraints, IT investment, R&D and

organisation of decisions. External drivers refer to com-

petition in the output market, regulatory environment and

general socio-economic conditions.

Starting with the empirical literature on the internal

drivers of hospital performance, a series of papers studies

impacts of hospital size, physical assets, human capital and

information technology on the quality of health care.

Raerty et al. [64] and Needleman et al. [57] study the role

of human capital and find a positive relationship between

nurse staffing and health care quality in NHS and US

hospitals. Athey and Stern [2] and Menachemi et al. [51]

provide evidence on a positive relationship between health

information technology and outcomes. Information tech-

nology reduces communication costs, facilitates coordina-

tion across diverse and complex activities and assists

information processing and decision making. All these can

reduce errors and enhance quality. Gaynor et al. [30],

O’Brien et al. [58] and Hentschker and Mennicken [34]

document a positive relationship between volume and

outcome and process measures of quality of care. The

association is likely due to the ability of larger hospitals to

capitalise on the economies of scale and benefit from

specialisation that can enhance the quality of care.

An emerging finding in the literature is that the above

factors do not fully account for the observed heterogeneity.

Variations in care measured in mortality for disease con-

ditions or the process of care measures persist even after

controlling for case mix, human capital (medical and non-

medical staff), information technology, hospital

characteristics such as bed size, teaching status or foun-

dational trust status, cost–contributing factors, post-opera-

tive complications and statistical noise

[5, 10, 11, 31, 35, 47, 68].

The finding has led researchers to turn attention towards

management practices and organisational factors. Bloom

et al. [8] collect survey data on management styles and

practices from 2000 hospitals across the UK, USA,

Canada, France, Germany, Sweden, Italy, Brazil and India

and join the survey data with hospital performance data.

The researchers find that the quality of management prac-

tices and hospital performance are highly correlated, sug-

gesting a possible role of management practices in driving

hospital outcomes. Bray et al. [12] study the possible

impact of the organisation of the provision of care on the

quality of stroke care, using the data on English NHS trusts

for the year 2010. The authors employ a composite mea-

sure of the quality of the organisation of stroke services by

taking into account staffing (e.g., training level), facilities

(e.g., provision of continuous physiological monitoring)

and service level (e.g., access to round-the-clock emer-

gency imaging and thrombolysis). Adjusting for patient

characteristics, the researchers find that hospitals with

higher organisational scores are more likely to achieve

higher clinical quality measured by stroke process of care

measures. Flood [25], West [77], McConnell et al. [48, 49]

and Ghaferi et al. [31] also highlight the importance of

managerial and organisational practices as an underlying

cause of the variations in the quality of care across hos-

pitals. The increasing evidence on the critical role of

managerial and organisational factors points to the view

that it is the organisation of resources and management

routines that eventually link hospital inputs to outputs.

There is no production function independent of the

organisation of the firm or, in our case, hospital [46]. As the

organisational characteristics of a hospital change, so does

the hospital’s production function.

0

5

10

20 40 60 80 score

co un

t

2004

0

5

10

40 60 80 score

co un

t

2006

0 5

10 15

40 60 80 100 score

co un

t

2008

0 5

10 15

50 60 70 80 90 100 score

co un

t

2010

Fig. 1 Histogram of the total process score

386 M. Ali et al.

123

Another substantial body of literature examines the role

of external factors in shaping hospital performance. A key

result in this literature is that competition, provided prices

are regulated, has a positive impact on health care quality.

NHS hospitals in more competitive regions provide higher

health care quality compared to hospitals located in regions

where there is less competition [61]. In the same line,

Bijlsma et al. [4] study the impact of competition intro-

duced in The Netherlands from 2004 to 2008 on a wide

range of quality measures. The results indicate that hos-

pitals in areas with stronger competition show higher

improvement in several process measures of quality but the

impact of competition on outcome measures appears neg-

ligible. Other researchers have looked, among other fac-

tors, at the impact of government policies on mergers [29],

public reporting of performance [44] and provision of

information [38].

This study seeks to contribute to the literature in health

economics that argues for the impact of management

practices and organisational factors in driving hospital

performance and the quality of clinical care, where by

organisational factors we also mean organisation of

resources [18, 75]. Building on the literature, our general

hypothesis is that hospital performance is not only

dependent on the overall availability of resources but,

more critically, depends also on the organisation and

management of the resources and the fit between the

resources and management practices [18, 28]. Variations

in the organisation and management of resources partly

account for the observed heterogeneity in the quality of

care across hospitals [19]. The significance of such factors

is likely to be higher for some of the services a hospital

provides, such as provision of care for stroke patients

[12]. Stroke is a medical emergency that requires imme-

diate medical intervention as the benefit to the patient is

time dependent. To achieve this, a high level of coordi-

nation is required between various steps of care from the

onset of stroke to rehabilitation. And this is only feasible

if relevant complementary resources are in place and well

organised and stroke specialists and trained staff collab-

orate in a multidisciplinary and timely manner. As an

attempt to further investigate the general hypothesis, we

focus on the provision of stroke care. Our specific

hypothesis, thus, is that variations in the management and

organisational quality of the process of hospital services

such as stroke care partly drive variations in the clinical

quality of stroke care.

Our emphasis on the organisation of resources and

management practices is by no means new. A substantial

body of literature, as mentioned above, has already docu-

mented evidence on the possible impact of organisational

and management factors in driving hospital performance.

The literature, however, has so far mainly sought to

identify factors that drive the quality of care and has paid

little systematic attention to possible interactions among

the factors driving hospital performance. This is partly

because the theory is silent on complex interactions that

may exist among key variables driving performance and

partly because common econometric techniques are not

equipped to reveal interactive structures based on the data

alone. The theory, though, emphasises the importance of

complementarity among assets and skills that drive pro-

ductivity and, in our case, health care quality. In other

words, it points to the importance of critical interactions

among factors driving quality, not simply the mere pres-

ence or absence of the factors or the level of the factors

[14]. We aim to take a first step towards systematically

exploring interactive structures (complementarities) among

the variables that drive hospital quality in a data-driven

manner. With this in mind, our research hypothesis runs as:

Research hypothesis: Variations in the organisation and

management of resources involved in the process of hos-

pital services such as stroke care partly drive variations in

the clinical quality of the services (stroke care). Moreover,

it is the joint presence of complementary resources and

management practices that critically affects the quality of

the clinical process of the services (stroke care).

To assess this hypothesis, we put together a wide panel

data set on a rich list of candidate variables that could

possibly drive hospital quality in stroke care for which the

organisation of resources and management of the process

of care matter critically. The panel spans from year

2004–2010. To our knowledge the only significant policy

change that took place during this period is the introduction

of Department of Health’s National Stroke Strategy

implemented in England in December 2007. We appro-

priately account for this change in our analysis in order to

capture the potential impact this policy may have on the

quality of stroke care in English hospitals. As our measure

of quality, we employ an index that captures the quality of

the process of stroke care as opposed to an outcome

measure. Process of care measures illuminate the compli-

cated process of delivering health care and describe the

specific actions associated with health care delivery. It

describes the care that patients actually receive and

assesses the extent to which hospitals perform health care

processes to achieve the desired aims. An important

advantage of process measures over outcomes is that it

does not suffer from the case-mix adjustment bias [12, 22].

Outcome measures such as mortality, in contrast, may be

confounded by variations in demographics or case mix and

may be susceptible to coding or measurement errors.

However, the validity of process measures depends on

whether they are a useful proxy of subsequent outcomes

and that the interventions being measured are related to the

expected effects [76].

Hospital heterogeneity: what drives the quality of health care 387

123

We combine our measure of the clinical quality of

stroke care with a measure of the organisational quality of

the process of care and other data drawn from several

sources. We will employ a new class of unbiased panel

regression tree estimator from the machine learning liter-

ature to study the data. A reason behind the choice of the

method is the intuitive interpretability of the results. The

non-parametric method can reveal potential interactions

among the variables, which could offer valuable informa-

tion about the processes that drive variations in quality of

stroke care across NHS hospitals. We use an out of sample

prediction error estimator to calculate the predictive

accuracy of the models.

Our results point to the significant role of organisational

features of the process of stroke care in driving the clinical

quality of the process of stroke care. Including a composite

measure of organisational design for the provision of stroke

care improves the predictive accuracy of the models. The

predictive results are robust to the inclusion of year as an

additional variable to capture trends in the data. Using year

as an additional predictor, our measure of organisational

design for the provision of stroke care still has a significant

predictive role in that it reduces both the in-sample and

out-of-sample prediction error rate. Furthermore, the

results of the unbiased panel regression tree estimator are

consistent with our benchmark linear mixed effects model.

The coefficients in the linear mixed effects model are

consistent with the variables entering in the tree models.

Specifically, the coefficient of the organisation score is

substantial and statistically significant. The results are

robust to the inclusion of fixed hospital and year effects.

The key result also remains unchanged when we rescale the

clinical and organisational measures of quality by taking

out the within-hospital means of the variables to eliminate

potential hospital-specific unobserved time-invariant

factors.

Our findings are consistent with the theories from the

organisational economics and productivity literature when

extended to the health care setting. The results are partic-

ularly in line with the findings of Bloom and Van Reenen

[9], Bloom et al. [6] and Bloom et al. [7] who find that

capital intensity or technology cannot alone fully account

for large differences in total factor productivity (TFP)

across firms. Management practices and organisational

design are most fundamental in driving hospital perfor-

mance and quality. High organisational quality enhances

performance, productivity and quality. At the heart of any

policy effort to eliminate the observed heterogeneity in the

quality of care across hospital there should be a well-

founded effort to reduce heterogeneity in organisational

quality and management practices.

The rest of the article is organised as follows. ‘‘Data and

descriptive statistics’’ introduces the data. ‘‘Panel

regression trees’’ outlines the predictive approach and

describes a class of non-parametric regression methods for

the study of panel data. ‘‘Empirical analysis’’ presents our

basic empirical results by assessing the predictive accuracy

of the hypotheses. ‘‘Robustness analysis’’ carries out

robustness checks. ‘‘Conclusion’’ concludes the analysis.

Data and descriptive statistics

The study has gathered hospital and regional level panel

data from several administrative sources. Our two key

variables include composite measures of the clinical and

organisational quality of the process of stroke care. We

obtain the data on these measures from the Royal College

of Physician’s National Sentinel Stroke Audit from 2004 to

2010 where the data have been collected at the hospital

level. The Royal College of Physicians carried out rounds

of the National Sentinel Stroke Audit (NSSA) in every

2-year cycle from 1998 to 2010. The NSSA monitored the

progress of stroke care in England, Wales and Northern

Ireland. Between 1998 and 2010, the NSSA achieved 100%

voluntary participation of hospitals, collecting data on

more than 60,000 patients from England, Wales and

Northern Ireland. The focus of the audit was centred on two

components: clinical and organisational. The former mea-

sured the clinical quality of the process of stroke care

whereas the latter measured the organisational quality of

stroke care. NSSA was eventually replaced by Sentinel

Stroke National Audit Programme (SSNAP). As a result of

the replacement, some of the questions used to construct

the organisational and clinical components of the NSSA

data underwent changes, making it difficult to combine the

data from the two audits. For consistency, we have

restricted the sample to the NSSA data that are in public

domain and left the SSNAP data for a separate research.

The audit data have been used for research studies, for

instance, in Bray et al. [12] and McNaughton et al. [50].

Clinical process of care

In assessing the quality of the process of care for stroke, the

NSSA collected data on the practice of a set of clinical

standards relating to patients hospitalised with a primary

diagnosis of stroke with the ICD10 codes of I61 (intrac-

erebral haemorrhage), I63 (cerebral infarction) or I64

(stroke, not specified as haemorrhage or infarction). Data

were collected on 8697, 13,625, 11,369 and 11,353 patients

respectively for the years 2004, 2006, 2008 and 2010

across the NHS trusts or hospitals in England, Wales and

Northern Ireland. The clinical standards reflect scientifi-

cally based practices that are set by the National Institute

for Health and Clinical Excellence (NICE). The clinical

388 M. Ali et al.

123

standards are organised into six domains of the process of

stroke care, including initial patient assessment, multidis-

ciplinary assessment, screening and functional assessment,

care planning, communication with patients and carers, and

acute care. Table 4 lists these domains and their respective

clinical standards. A score of 0–100 is assigned to each

domain. An average of the scores of the six domains for

each NHS hospital is calculated to arrive at a composite

measure of the clinical quality of the process of care for the

hospital. We use the data for the 4-year period from the

year 2004 to 2010 for English NHS trusts. It is clear from

Fig. 1 that there is notable variation in the scores of hos-

pitals that also changed over time. We conducted an

analysis of variance, which shows that between-hospital

variance (70.06) and within-hospital variance (42.77)

account for 62 and 38% of the total variation in the data,

respectively. Further information about the data, organi-

sation, methods, proforma and questionnaire are available

on https://www.rcplondon.ac.uk/.

Organised stroke care

Stroke is a medical emergency that requires immediate

medical intervention as the benefits are critically time

dependent. To achieve this, a high level of coordination is

required between various steps of care from the onset of

stroke to rehabilitation. And this is only feasible if relevant

complementary resources are in place and stroke specialists

and trained staff collaborate with each other in a multi-

disciplinary and timely manner. To capture the organisa-

tional quality of the process of stroke care, the audit

included a set of questions on the availability of stroke

units, neurovascular facilities, TIA clinics and specialist

stroke teams and their features, on whether the hospital

provided education and engaged in clinical research, and

on whether the hospital sought patient’s views and had

produced reports within the last 12 months. The audit

further collected data on average waiting time for scanning.

Table 1 describes some of the hospital features about

which the audit collected data. The NSSA summarises

these features into eight organisational domains. They

consist of acute care organisation, organisation of care,

consultant sessions (overall service), interdisciplinary ser-

vices (stroke unit), TIA/neurovascular service, education

and research, team working and communication with

patient and carers. On this basis, a composite measure of

organisational quality is developed on a scale of 0 to 100 to

assess the organisational quality of the stroke care process.

The composite measure captures a mixture of organisa-

tional features and management practices critical for stroke

care. Audit queries regarding the presence of a specialist

stroke team or TIA clinic, for example, relate to the

availability of resources. In contrast, audit queries relating

to Stroke Unit Trialists’ Collaboration (STUC) character-

istics, e.g., multidisciplinary meetings at least weekly to

plan patient care, provision of information to patients about

stroke or continuing education programmes for staff, or

queries on whether patients’ views are sought refer to a

management practices.

Over the years 2006–2010, the NSSA organisational

audit went through changes in the form of inclusion of new

and exclusion of old questions to collect data on a hospi-

tal’s organisational quality for stroke care. Despite the

changes, a significant number of questions within the above

domains remained the same. Particularly, the question-

naires used to collect data between 2006 and 2008 organ-

isational audits were very similar or overlapping. We

expect any impact of the organisational component on

quality of stroke care to be slow. To capture these delayed

effects we use 1-year lagged values of predictors when

predicting the quality of stroke care instead of considering

contemporaneous information. This leaves us with 2006

and 2008 organisational scores. Given the strong resem-

blance between 2006 and 2008 audits, we do not expect

that the changes in the audits would distort our analysis.

Further information on the data, questionnaire proforma

Table 1 Disaggregated variables for organisational design of stroke care

Variables Description Measure

ASU Acute stroke unit 1 = yes; 0 otherwise

CSU Combined stroke unit 1 = yes; 0 otherwise

RSU Rehab stroke unit 1 = yes; 0 otherwise

Specialist Presence of specialist stroke team 1 = yes; 0 otherwise

Report Report produced with the last 12 months 1 = yes; 0 otherwise

PatientViews Whether patient views were sought 1 = yes; 0 otherwise

SUTC 5 key features of all stroke units by Stroke Unit

Trialists Collaboration

Integer scale from 1 to 5 where

5 if all features met

ESD Presence of early supported discharge team 1 = yes; 0 otherwise

NeuroClinic Presence of neurovascular/TIA clinic 1 = yes; 0 otherwsie

Hospital heterogeneity: what drives the quality of health care 389

123

and the method for constructing the composite organisa-

tional measure are available at https://www.rcplondon.ac.

uk/ and in https://www.rcplondon.ac.uk/projects/national-

sentinel-stroke-audit.

In addition to an aggregate measure, we supplement the

analysis using disaggregated measures or factors of stroke

organisation that are common across 2006 and 2008 and

are listed in Table 1 above. These factors included in the

audit are important markers for quality of care and capture

aspects of organisational design for stroke care for each

NHS hospital, i.e., their settings or configuration.

Table 2 below gives the descriptive statistics for the

clinical and organisational quality of the process of stroke

care for the NHS hospitals. Both the mean and median

scores for clinical and organisational measures of the

hospitals have been monotonically increasing over their

respective sample periods. Conversely, the measure of

variations captured by standard deviation for both the

clinical process score and organisational performance has

been decreasing except for an increase for process scores in

2006. Despite the gradual decline, variations in both clin-

ical quality and organisational performance across the

hospitals still persist. In other words, considerable hetero-

geneity exists with some hospitals providing better quality

services than others.

Hospital and regional characteristics

We add several explanatory variables to our data to control

for other factors that may affect health care quality. We add

Teaching and Foundation Trust (FT) status for each hos-

pital. Teaching hospitals tend to treat patients with the most

severe and complex illnesses and have a higher chance of

learning how to deal with complex cases. This may lead to

a higher quality of care. Conversely, teaching hospitals can

introduce a delay in the treatment process due to consul-

tants’ role in training medical students [1].

Foundation trust hospitals are non-profit public organ-

isations that enjoy greater managerial and financial

autonomy from the central government control. FT hos-

pitals are allowed to retain their surpluses, which they can

invest in staff salaries or capital equipment. They are also

allowed to borrow money to improve services. Hospitals

with FT status are likely to have higher quality of care for

stroke. Farrar et al. [23] find that FT hospitals provide a

better quality of care measured using lower in-hospital

mortality.

Larger NHS hospitals can lead to higher quality of care

because of economies of scale and the ‘‘volume-outcome’’

hypothesis. Or, they might lead to lower quality of care due

to diseconomies of scale from the greater complexity of

their organisational structure. A number of studies include

hospital size measured by the number of hospital beds

among factors affecting hospital quality [40, 49, 62, 65].

We control for hospital size and case-load capacity using

the number of hospital beds and the number of operating

day-case theatres.

A rich literature highlights the importance of adequate

specialists and neurologists in the prevention and man-

agement of stroke patients and improved stroke outcomes

of care [17, 20, 53, 59, 66, 70, 73, 74, 78]. Hospitals with

more skilled staff and specialists are likely to enjoy higher

medical knowledge and experience and be capable of

providing a relatively higher quality of stroke care. We

include the number of neurologists, neurosurgeons and

neurophysiology staff to control for the number of spe-

cialists. Park et al. [60] and Ketcham et al. [39] find that

larger size physician groups are likely to improve health

care quality. We add general measures of medical and non-

medical staffs including professionally qualified staff and

health care scientists.

A number of studies have found a positive relationship

between nurse staffing and health care quality. Fine et al.

[24] study the relationship between hospital characteristics

such as nurse staffing and organisation type and measures

of care for pneumonia. Raerty et al. [64] and Needleman

et al. [56, 57] find a positive relationship between nurse

staffing and health care quality in NHS and US hospitals.

Cho and Yun [20] find that adequate nurse staffing mea-

sured by the bed-to-nurse ratio is positively associated with

stroke care quality. We control for the number of nurses

and the nurse-to-bed ratio.

Table 2 Descriptive statistics for clinical process score and

organisational score

Clinical process score Organisational score

2004 2006 2008 2010 2006 2008 2010

Minimum 25.00 31.00 40.00 52.00 23.00 32.00 47.00

First quartile 51.75 58.25 63.00 73.00 57.25 63.75 62.00

Median 61.00 67.00 71.00 79.00 64.00 71.00 69.00

Mean 60.48 66.13 69.74 78.75 63.74 70.59 69.65

Third quartile 68.25 75.75 77.00 85.0 72.00 79.00 76.75

Maximum 93.00 93.00 96.00 97.00 89.00 95.00 96.00

Standard deviation 12.36 13.33 11.32 8.57 11.96 11.37 10.20

390 M. Ali et al.

123

Studies have also documented considerable regional

variations in health care quality [32, 37, 45, 71]. Mooney

[55] and Rudd et al. [69] find considerable variability in the

quality of care for stroke across England. We consider

median inflation-adjusted wage, inequality, the proportion

of regional population without any qualifications, the

number of stroke admissions, all-cause standardised mor-

tality rate and the stroke mortality rate to control for socio-

economic and regional-health factors. Gaynor et al. [29]

control for regional measures including the median wage to

proxy demand and need for care in a given area. Table 3 in

the ‘‘Appendix’’ provides the full list of the variables used

in the study, their definitions and sources.

Panel regression trees

While the main objective of this article is to determine

what predicts the quality of hospital care, our aim is to

portray any predictor-response relationship in a simple,

easy-to-understand and intuitive way. We are interested in

making meaningful sense of how several predictors may be

involved in complex interactions with one another when

helping to explain quality. For example, we would like to

understand whether the standard of care at hospitals that

are located in a wealthy area and have a low nurse-to-bed

ratio is different from the quality of care offered in hos-

pitals with a higher nurse-to-bed ratio situated in a rela-

tively lower income region. Traditional parametric

methods such as regression models do not always offer

straightforward interpretation of such intricate interplay of

variables. We, therefore, resort to a class of non-parametric

techniques, commonly known in machine learning litera-

ture as regression trees, that serve the purpose well. The

tree mechanism involves recursively partitioning the pre-

dictor space into a number of small regions based on

simple rules and then using the mean or median of the

realised values (e.g., quality of care) of observations (e.g.,

hospitals) belonging to a region as the predicted value for a

new observation that falls in that particular region. Most

importantly, the splitting decision rules, order of impor-

tance of selected predictors and their interactions are

summarised in a visually attractive and intuitive way. To

our knowledge, this is one of the very few studies in health

economics that exploit regression trees with an aim to find

drivers of quality of care.

Several tree growing algorithms are proposed in the

statistics and machine learning literature. The most widely

used are ‘CART’ [13] and ‘C4.5’ [63], which function by

maximising a statistical criterion over all possible predic-

tors and split points simultaneously. These methods are

often criticised for biased selection of variables, which

have many possible splits and missing values [36]. In this

article we opt to use a conditional inference framework

proposed in Hothorn et al. [36] that rectifies the problem of

selection bias by choosing predictors for splitting based on

a series of tests identifying statistically significant associ-

ation between the responses and predictors.

Since we observe a number of hospitals over several

years the data used in our study are longitudinal or panel.

Such a data structure requires careful accounting of pos-

sible variations across subjects that cannot be captured by

observed predictors and also autocorrelation across obser-

vations from the same subject. If we observe subjects (e.g.,

hospitals) i ¼ 1; 2; . . .; I at times t (years) ¼ 1; 2; . . .; T a general additive model can be defined as:

yit ¼ fðxitÞ þ ai þ uit ð1Þ

ui1

..

.

uiT

0 BB@

1 CCA � Nð0; RiÞ ð2Þ

ai � Nð0; DÞ ð3Þ

where for each subject i and time period t, yit denotes the

response of interest (e.g., quality of care),

xit ¼ ðxit1; . . .; xitKÞ 0 denotes a vector of K predictors (e.g.,

hospital characteristics, regional features, etc.) and uit denote the errors. The ai are the time-invariant, subject-

specific unobserved heterogeneity component. We allow

for serial correlation in errors uit from a particular subject

by defining their covariance matrix Ri to be non-diagonal.

The errors are, however, assumed to be independent across

subjects and uncorrelated with ai.

We seek to apply flexible tree-based methods for

approximating the unknown relationship f, which may well

be non-linear. It is only over the last decade that devel-

opments have been made in generalising trees for panel

data applications. This article uses a method called an

unbiased random effect expectation maximisation (RE-

EM) tree, only recently developed in Fu and Simono [27].

Unlike many existing methods including a standard RE-

EM tree of Sela and Simono [72], which rely on CART-

type tree building algorithms, the unbiased RE-EM tree

incorporates the conditional inference framework of

Hothorn et al. [36] and is, therefore, unbiased in nomi-

nating predictors for splitting. The unbiased RE-EM tree

operates by alternating between two principal steps: one

estimating f by using a tree method and the other esti-

mating unobserved heterogeneities ai by utilising a linear

panel regression model. The process is initialised by setting

the starting value of ai to zero.

As any non-parametric estimator, regression trees are

subject to over-fitting. To achieve an optimal trade-off

between the bias and variance (over-fitting), we rely on the

Hospital heterogeneity: what drives the quality of health care 391

123

out-of-sample predictive accuracy of the model, estimated

using cross validation. We select the regression tree model

with the lowest prediction error to identify variables that

potentially contribute to the quality of stroke care.

By adopting the regression tree method, we complement

the dominant tradition in econometrics, where the emphasis

is on parameter estimation and statistical significance

testing. There are no coefficients in regression trees.

Instead, the idea is whether the variable of interest or the

interactions implied by the background domain-specific

information or theory appear in the tree that best predicts

the outcome variable.

Our methodological stance comes close to the seminal

work of Milton Friedman [26] and the practice in science: a

minimum check for the empirical adequacy of a hypothesis

is whether the predictions of the hypothesis are consistent

with the empirical model that yields the best out-of-sample

predictions. Such an approach does not establish the

hypothesis. It can help narrow down the set of plausible

hypotheses that are consistent with our data. Predictive

consistency invites us to take a hypothesis seriously, assess

its compatibility with our background knowledge and

devise empirical studies that can support drawing causal

conclusions.

Empirical analysis

We employ an unbiased random regression tree estimator

for panel data to develop a series of predictive models to

identify key predictors of the quality of the process of

stroke care in NHS hospitals. In building the tree models,

we rely on the literature to decide on potential explanatory

variables that may drive the quality of the process of stroke

care across hospitals. Some of the potential control vari-

ables are highly correlated. The panel regression tree

estimator arbitrarily selects from among highly correlated

variables to build a model [41]. To gain insight into the

possible impact of each variable, we use a data-driven

method proposed in Kuhn and Johnson [41] to a priori

select from among the correlated variables to build a model

and further examine the effect of replacing the variables

with excluded variables to assess the robustness of the

results. (The supplementary results are available on

request.) Two variables are considered as highly correlated

when the correlation coefficient exceeds the threshold 0.75.

All unbiased REEM-Tree models are fitted using a random

intercept model to allow for variations in hospitals’ quality

due to unobserved attributes.

Among the variables measuring physical capital, the

variables operating theatres and day case theatres are

highly correlated (0.88). We a priori exclude the number of

operating theatres from the base model. The correlation

among nurses, general medicine group (gmg) staff, science

and allied professionals staff exceeds the threshold of 0.75.

We keep nurses in the base model, partly because this

choice will lead to models with overall lower out-of-sam-

ple prediction error. The number of nurses in a hospital is

highly correlated with the number of beds. We replace

nurses with the nurse-to-bed ratio in order to be able to

identify possible distinct effects of physical capital (or

hospital size) and non-specialist human capital. Among the

regional health and socio-economic variables, the weekly

median wage and inequality are highly correlated (0.845).

We exclude inequality from the base models. Also, stroke

mortality and the all-age standardised mortality ratio

(SMR) are highly correlated (0.87). We keep stroke

mortality.

We consider four unbiased regression tree models to

gain insights into the structure of the data. The first three

models regress the measure of the quality of the clinical

process of stroke care on the individual variables that

capture organisational features of the process of stroke

services and relevant control variables. The fourth regres-

sion tree model replaces the individual variables with the

composite measure of organisational quality of the process

of stroke care to better capture possible complementarity.

We next compare the results of the unbiased REEM-

Tree with linear mixed effects models (LME) to check the

robustness of the results. We rely on the in-sample and out-

of-sample predictive performance of the models to select a

model from among the trees that best fits the data.

Disaggregate models

Recall we categorised possible drivers of the quality of

stroke care into internal and external drivers of hospital

performance. Internal drivers refer to factors such as

physical assets, human capital, the structure of decisions

and the internal organisation of the hospital. External dri-

vers refer to regional health and socio-economic features.

We start with a model of internal drivers and next adds

variables capturing potential external drivers. Our under-

lying methodological principle is that if any of these factors

drive the quality of stroke care, variables measuring the

factors will appear in a model that best predicts the quality

of stroke care [26]. With these preliminaries, the first panel

regression tree model includes the primary organisational

variables and the variables measuring physical assets,

human capital and hospital characteristics. This means the

variables entering the formula underlying the model consist

of:

Model 1: Score � beds ? day case theatres ? nurses ? neurology ? neurophysiology ? neurosurgeons ?

teaching ? ft ? ASU ? RSU ? CSU ? SUTC ? Specialist

? ESD ? NeuroClinic ? PatientViews ? Report.

392 M. Ali et al.

123

The dependent variable score measures the quality of the

clinical process of stroke care. Table 3 defines the rest of the

variables. Applying the unbiased panel regression tree esti-

mator to the data yields the tree in Fig. 2. For the current

analysis, a significance threshold of 0.10 for unbiased

REEM-Trees has been set (i.e., a 10% false-positive rate for

statistical significance is set). The variable ‘specialist’

appears in the initial node of the tree, suggesting that the

presence of a specialist stroke team is the most important

predictor of the quality of stroke care. Hospitals with spe-

cialist stroke teams enjoy an overall higher quality score,

which is in line with the findings in Xian et al. [80]. Indeed,

the highest quality score belongs to hospitals with a specialist

stroke team and a higher nurse-to-bed ratio (with mean

quality score 84.94), confirming the findings in Cho and Yun

[20]. Also, as found in Farrar et al. [23], among hospitals with

a specialist stroke team, foundation trust hospitals yield a

higher quality score. The lowest quality score appears in

hospitals that lack a specialist stroke team, have a low nurse-

to-bed ratio and lack a combined stroke unit (CSU) (with the

mean quality being 65.61) [16]. Overall, the model points to

the significance of specialist human capital and organisa-

tional variables. The variables representing physical capital

do not appear in the unbiased panel tree. The model points to

critical complementarities between the presence of a spe-

cialist stroke team and a high nurse-to-bed ratio. It is the joint

presence of these features that predicts a higher quality.

Adding the variables representing potential external

drivers of quality will give rise to our second model:

Model 2: Score � beds ? day case theatres ? nurses ? neurology ? neurophysiology ? neurosurgeons ?

teaching hospital ? foundation trust ? ASU ? RSU ?

CSU ? SUTC ? Specialist ? ESD ? NeuroClinic ?

PatientViews ? Report ? stroke admissions ? stroke

mortality ? median wage ? no qualifications.

From the tree results in Fig. 3, median wage appears at

the initial node of the tree, suggesting that economic con-

ditions are among important predictors of the quality of

stroke care [37]. The mean quality of stroke care in areas

with a higher weekly median wage ([486.5) is on average

Model 1

Specialist p < 0.001

1

≤ 0 > 0

nursesBeds p < 0.001

2

≤ 1.899 > 1.899

CSU p = 0.013

3

≤ 0 > 0

n = 41 y = 65.61

4 n = 29

y = 71.196

5

RSU p = 0.012

6

≤ 0 > 0

n = 28 y = 71.18

7 n = 43

y = 76.472

8

nursesBeds p < 0.001

9

≤ 2.478 > 2.478

ft p = 0.022

10

≤ 0 > 0

nursesBeds p = 0.014

11

≤ 1.556 > 1.556

n = 11 y = 69.331

12 n = 68

y = 76.655

13

CSU p = 0.031

14

≤ 0 > 0

n = 35 y = 76.88

15 n = 28

y = 80.324

16

n = 20 y = 84.94

17

Fig. 2 The tree includes variables capturing hospital characteristics, human capital factors, teaching and foundation trust status, and

primary variables reflecting the quality of the organisation of the

process of stroke care. The higher a variable appears in the tree

structure, the more predictively significant the variable will be. The

organisational variable ‘Specialist’ appears at the initial node of the

tree as the predictively most important variable. Each box in the

terminal nodes show two figures, the first (n) stating the number of

observations falling in the branch and the second (y) giving the mean

value of the observations in the branch. Hospitals with the highest

quality score fall in the branch where the value of the dummy

‘‘specialist’’ equals one and the nurse-to-bed ratio exceeds 2.478

Hospital heterogeneity: what drives the quality of health care 393

123

higher. On the right-hand-side branch, the second predic-

tively most significant variable is the nurse-to-bed ratio.

The equality of the process of stroke care is the highest in

hospitals in affluent areas where the nurse-to-bed ratio

exceeds (2.512) [12]. On the left-hand side, specialist is the

second predictively most important variable. The quality of

the process of stroke care is overall higher in hospitals with

a specialist stroke team in place. The variables appearing in

the third layer of the tree include those that capture the

organisation of the process of stroke care. The presence of

a stroke specialist team, SUTC, and the presence of an

early support discharge team (ESD) occupy prominent

positions in the tree [43, 67]. The tree also reveals

important interactions (complementarities) among the

variables. The interaction between a comparatively lower

median wage (\486.5) and the absence of a stroke spe- cialist team where there is no early support discharge team

(ESD) yields the lowest mean average quality score.

Overall, the model points to several results. To begin

with, the model reveals variations in the quality of stroke

care across regions, consistent with the findings in Grimaud

et al. [33]. An explanation for the regional variation is that

wealthier regions have more access to higher quality

human capital resources, which is likely to enhance the

quality of care a hospital provides. Another explanation,

which receives support from our data, is that hospitals in

richer regions devote more resources to health care [21]. In

the sample, the median wage is reasonably highly corre-

lated with the nurse staff ratio (0.437), clinical staff ratio

(0.47) and gmg staff ratio (0.404). Median wage is also

negatively correlated with regional stroke mortality. A

third explanation is that a higher median wage is associated

with higher education [37]. Patients with higher levels of

education are likely to be aware of their rights and the

quality of services they receive. And this can translate into

pressures on hospitals to provide higher quality services.

Model 2

median_wage p < 0.001

1

≤ 486.5 > 486.5

Specialist p < 0.001

2

≤ 0 > 0

ESD p = 0.001

3

≤ 0 > 0

n = 85 y = 67.842

4 noqual

p = 0.005

5

≤ 14.3 > 14.3

n = 10 y = 68.59

6 n = 17

y = 76.698

7

neurosurRegion p = 0.002

8

≤ 17 > 17

n = 10 y = 68.177

9 median_wage

p = 0.006

10

≤ 420.5 > 420.5

n = 31 y = 73.081

11 n = 73

y = 78.552

12

nursesBeds p < 0.001

13

≤ 2.512 > 2.512

SUTC p = 0.003

14

≤ 4 > 4

n = 19 y = 74.325

15 nursesBeds p = 0.018

16

≤ 1.842 > 1.842

n = 11 y = 75.888

17 admissions p = 0.006

18

≤ 5176 > 5176

n = 13 y = 79.696

19 n = 12

y = 84.767

20

n = 22 y = 86.713

21

Fig. 3 The tree adds external variables such as weekly median wage to the variables underlying model 1. Weekly median wage appears at

the start of the tree as the predictively most significant variable,

followed by the nurse-to-bed ratio and specialist. Other variables

measuring the organisational feature of the process of stroke care

occupy important positions in the tree, supporting the idea that

organisational factors are among the most significant predictors of the

clinical quality of care

394 M. Ali et al.

123

The model also points to the pivotal role of the organisa-

tional features of the stroke care process in driving the

quality of stroke care. The organisational variables occupy

prominent positions in the regression tree. Finally, the

model casts doubt on whether hospital size affects the

quality of the process of stroke care. The number of beds,

which measures hospital size, does not appear in the tree.

Any claim for the causal validity of these results call for

further analysis.

Figure 1 previously revealed a shift in the distribution of

the stroke quality measure over the sample period. The

monotonic increase in the mean quality score could be due

to a change in the underlying joint probability distribution

of the variables driving the quality of stroke care. We next

include year dummies in the last model to capture the time

trend and examine whether the key patterns observed in the

trees remain unchanged. Our sample spans over four

periods from year 2004 to year 2010. However, the data on

the organisational variables are not available for year 2004.

Further, the predictor variables have been lagged to deal

with contemporaneous endogeneity. This means there are

effectively data for two periods in the unbiased tree esti-

mation. We take the year 2008 as the base year and

introduce a dummy variable ‘yr2010’ that takes value 1 if

an observation belongs to year 2010 and zero otherwise.

Adding the dummy variable to the list of variables will lead

to our third model:

Model 3: Score � beds ? day case theatres ? nurses ? neurology ? neurophysiology ? neurosurgeons ?

teaching hospital ? ft ? ASU ? RSU ? CSU ? SUTC ?

Specialist ? ESD ? NeuroClinic ? PatientViews ? Report

? stroke admissions ? stroke mortality ? median wage ?

no qualifications ? yr2010.

In Fig. 4 the year dummy appears in the initial node of

the tree, indicating a higher mean quality of stroke care in

year 2010. A reason for the split of the initial node at 2010

Model 3

yr2010 p < 0.001

1

≤ 0 > 0

neurosurRegion p < 0.001

2

≤ 71 > 71

SUTC p = 0.002

3

≤ 4 > 4

n = 38 y = 66.507

4 Report

p = 0.008

5

≤ 0 > 0

beds p = 0.004

6

≤ 631 > 631

n = 26 y = 64.129

7 n = 17

y = 73.663

8

n = 51 y = 71.764

9

n = 18 y = 79.587

10

median_wage p < 0.001

11

≤ 524.8 > 524.8

Specialist p = 0.002

12

≤ 0 > 0

teaching p = 0.007

13

≤ 0 > 0

neurology p = 0.003

14

≤ 1 > 1

n = 15 y = 76.639

15 n = 11

y = 70.261

16

n = 12 y = 79.676

17

n = 95 y = 78.928

18

n = 20 y = 86.1

19

Fig. 4 The tree adds a year dummy to the variables in tree model 2. The dummy variable for year 2010 appears at the initial node of the

tree as the most predictively significant factor, followed by median

weekly wage and neurosurgeon per region. The mean quality score is

higher in year 2010, consistent with the descriptive results in Fig. 1.

The variables reflecting the organisational quality of stroke care

occupy prominent positions. A number of variables including ‘day

case theatres’, ‘neurology’ and FT fails to appear in the tree

Hospital heterogeneity: what drives the quality of health care 395

123

can be the implementation of the National Stroke Strategy

Policy in NHS England, which was implemented in

December 2007. The policy measures included monitoring

and assessing the quality of stroke care across NHS hos-

pitals, which might have shifted the distribution of the

overall measure of the quality of stroke care. The second

set of most predictively significant variables are weekly

median wage and neurosurgeons per region. The average

clinical quality of stroke care is higher in wealthier regions

or regions with a higher number of neurosurgeons. Con-

sidering the fact that only a small proportion of stroke

patients requires surgery we take caution in interpreting the

relevance of the number of neurosurgeons by identifying it

as an indicator of broader specialist capacity rather than an

absolute/direct driver of quality of stroke care in hospitals.

Primary organisational variables occupy the next promi-

nent positions in the tree. Consistent with the previous

results, higher clinical quality of stroke care is correlated

with better organisation of stroke care. The presence of the

organisational score variable is robust to including year as

a predictor.

Composite measure

A fundamental insight of economic theory is the critical

complementarity among diverse elements that shape the

organisation of the firm and its performance [14]. In Fig. 2,

in the left-hand-side branch, the absence of a specialist

stroke team, a comparatively lower nurse-to-bed ratio and

the absence of a combined stroke unit jointly give rise to a

lower mean average quality of stroke care, or having a

stroke specialist team present in a foundation trust hospital

leads to higher mean stroke care quality. To capture

complementarities among organisational factors con-

tributing to the process of stroke care, we adopt a com-

posite measure of overall organisation of the process of

stroke care constructed in the National Sentinel Stroke

Audit (NSSA). Replacing the disaggregate indicators of

organisational quality with the composite measure, which

is derived from the primary indicators, while maintaining

other variables in the model, will yield the model:

Model 4: Score � beds ? day case theatres ? nurses ? neurology ? neurophysiology ? neurosurgery ?

teaching ? ft ? admissions ? stroke mortality ? median

wage ? no qualifications ? organisation score ? yr2010.

Figure 5 represents the unbiased panel regression tree

built from these variables. The composite measure of the

organisational quality appears at the initial node of the tree,

indicating the organisational quality of the process of

stroke care as the most prominent predictor of the clinical

quality of stroke care. Hospitals with high organisational

quality ([67) located in comparatively affluent areas enjoy

the highest quality of stroke care. The mean predicted

scores for year 2010 wherever the year dummy appears are

higher than the mean predicted scores for year 2008. Fur-

ther, but interestingly, hospitals with a low composite

organisation score (\67) but a larger number of beds ([591) reveal a relatively lower mean quality of stroke care. In the absence of proper organisation of resources,

larger hospitals might be at a disadvantage. Finally, the

lowest mean quality of stroke care belongs to hospitals in

regions with a comparatively lower number of neurosur-

geons (\61) and low composite organisation score (\64). The reason for the relative simplicity of the model is

manifold: Human capital variables strongly predict the

organisation score; weekly median wage is also highly

correlated with other variables capturing the health and

socio-economic regional characteristics; human capital

variables are highly correlated among themselves and with

variables such as (the number of) beds. Overall, once the

composite measure of organisation quality of the process of

care is included among the explanatory variables, it

appears as the predictively most significant variable, con-

sistent with the recent economic research on productivity

that identifies organisational features and management

practices and style as the deepest drivers of firm perfor-

mance. The figures highlight the importance of comple-

mentarity as in Milgrom and Roberts [52] and Brynjolfsson

and Milgrom [14]. The results of the tree models confirm

the insights of the wider economics productivity literature

on the importance of the organisation of resources,

organisational design and management practices.

Table 5 reports the in-sample goodness of fit measures

AIC and BIC for the four unbiased regression tree models.

The in-sample fit measure AIC and BIC scores monoton-

ically decrease from model 1 (2195.83) to model 4

(2096.64), suggesting that the unbiased regression tree

model 4, with the composite organisation measure, fits the

data best. According to Burnham and Anderson [15], dif-

ferences in the values of AIC of around 4–7 correspond to

roughly 95% significance. The table also reports the leave-

one-out and k-fold out-of-sample-predictive performance

measures for the unbiased tree models. Consistent with the

in-sample fit measures, the final model yields the lowest

out-of-sample prediction error among the models—(8.531)

and (8.268) respectively.

Figure 6 provides the diagnostic plots for unbiased

REEM-Tree model 4. The residuals from the model are

approximately normally distributed, as indicated from the

Normal Q–Q plot in the left panel of the figure. The right

panel of the figure gives the residuals versus fitted values

plot to check for constant variance or homoscedasticity

assumption of errors. The residuals are scattered around the

horizontal line and the width of the data points are equal

396 M. Ali et al.

123

throughout, confirming homoscedasticity. The model

enjoys a satisfactory in-sample goodness of fit.

Robustness analysis

The regression trees presented above do not take care of

hospital fixed effects features. In the absence of strong

theory, one requires alternative techniques to capture pos-

sible causal connections. We build a series of linear panel

data models to investigate whether the coefficients of the

variables appearing in the trees are statistically significant

and whether the coefficient estimates are robust to the

inclusion of hospital individual fixed effects. Further, we

report whether the results are robust to the choice of

alternative measures of physical capital, human capital and

regional factors. We also consider replacing our human

capital variables with relevant ratios.

Correlated random effects models

We first check the empirical validity of the patterns

observed in trees by comparing them to a class of linear

mixed effects models (LME). The LME results come quite

close to traditional random-effects results. Both assume

that ai in Eq. (1) uncorrelated with explanatory variables.

We have no theoretical ground to warrant such an

assumption. A prime reason for choosing LME as a

benchmark is its consistency with the REEMtree. The

unbiased panel tree estimator is built around LME. Further,

our interest is to estimate possible impacts of time invariant

variables such as the teaching or foundation trust status of

hospitals on quality of care, which makes a standard fixed

effect (FE) estimator unsuitable. Next we estimate a cor-

related random effects (CRE) model that is a flexible

extension to RE and is comparable to FE [79]. Consider the

simple case of the panel model in Eq. (1), with a single

Model 4

orgscore p < 0.001

1

≤ 67 > 67

yr2010 p < 0.001

2

≤ 0 > 0

orgscore p = 0.009

3

≤ 60 > 60

n = 47 y = 64.243

4 neurosurRegion

p = 0.011

5

≤ 61 > 61

orgscore p = 0.008

6

≤ 63 > 63

n = 14 y = 64.004

7 n = 13

y = 70.063

8

n = 12 y = 75.617

9

beds p < 0.001

10

≤ 591 > 591

n = 30 y = 78.9

11 n = 20

y = 71.924

12

median_wage p < 0.001

13

≤ 524.8 > 524.8

yr2010 p < 0.001

14

≤ 0 > 0

n = 55 y = 73.076

15 n = 89

y = 79.298

16

neurophysiology p = 0.004

17

≤ 0 > 0

n = 13 y = 85.649

18 n = 10

y = 90.902

19

Fig. 5 The unbiased tree replaces the composite organisational measure for the primary features of the organisation of the process of

stroke care. The variable appears as the most predictively significant

factor, followed by median weekly wage and the year dummy. The

highest quality score belongs to hospitals with a high organisational

score and with the neurophysiology variable taking a value located in

wealthier regions. Variables day case theatres, nurse-to-bed, neurol-

ogy, neurosurgery, teaching, FT, admissions, stork mortality and ‘no

qualifications’ fail to appear in the tree

Hospital heterogeneity: what drives the quality of health care 397

123

time-varying explanatory variable xit. ai may be correlated

with fxit : t ¼ 1; 2; . . .; Tg. yit ¼ bixit þ ai þ uit; t ¼ 1; 2; . . .; T: ð4Þ

In CRE, we model the correlation between ai and

fxit : t ¼ 1; 2; . . .; Tg. Since ai is, by definition, constant over time, it can be correlated with the average level of the

xit. More specifically, let �xi ¼ T�1Rti¼1xit be the time aver- age. It is plausible to assume the simple linear relationship

ai ¼ a þ k�xi þ ri ð5Þ

where ri, by assumption, is uncorrelated with each xit. The

correlated random effects approach uses (5) in conjunction

with (4). Substituting the former into the latter gives:

yit ¼ bixit þ a þ k�xi þ ri þ uit ¼ a þ bixit þ k�xi þ ri þ uit ð6Þ

The equation has a composite error term ri þ uit, consisting of a time-invariant unobservable ri and the idiosyncratic

shocks, uit. Further, because uit is assumed to be uncorre-

lated with xit, all s and t, uit is uncorrelated with �xi. With

these assumptions, the estimation problem becomes iden-

tical to the random effects estimation of

yit ¼ a þ bixit þ k�xi þ ri þ uit: ð7Þ

The addition of �xi controls for the correlation between ai and the sequence fxit : t ¼ 1; 2; . . .; Tg. Wooldridge (2010,

Chapter 10) shows that b̂CRE ¼ b̂FE, where b̂FE stands for the fixed effects estimator. The CRE approach, thus, pro-

vides a way to include time-invariant explanatory variables

in what is effectively a fixed effects analysis.

We assess the robustness of the four models introduced

earlier. In each case, we consider two LME models: (1) a

model with all the variables entering the corresponding tree

and (2) a model with the within-group means of the time-

varying variables to capture fixed effects.

Table 6 presents the first set of the LME results. Each

column belongs to the four models respectively. The first

column includes the internal drivers of quality of stroke

care entering unbiased regression tree model 1 along with

the dis-aggregate measures of organisational performance.

Consistent with the tree model 1, variables nurse-to-bed

ratio (5.490), specialist stroke team (5.172), foundation

trust (2.487) and CSU (3.795) all are statistically signifi-

cant at less than the 5% critical level. In addition, the

variable SUTC (2.461) is also statistically significant at less

than the 5% level. These predictors have material effects

on the quality of stroke care. For example, a unit change in

the nurse-to-bed ratio results in a 5.49-unit change in the

quality of care index, given other variables are held con-

stant. The quality of care in a hospital that has a Specialist

Stroke Team is 5.172 points higher than a hospital that

does not have such a specialist team. Presence of a Com-

bined Stroke Unit (CSU) is associated with a 3.795-point

Model (4): Residuals vs Fitted Values

60 70 80 90

−1 5

−1 0

−5 0

5 10

15

Fitted Score Values

R es

id ua

ls

−3 −2 −1 0 1 2 3 −1

5 −1

0 −5

0 5

10 15

Normal Q−Q Plot

Theoretical Quantiles

S am

pl e

Q ua

nt ile

s

Fig. 6 In-sample fit plot for the unbiased tree for model 5

398 M. Ali et al.

123

improvement in quality while a foundation trust status

accounts for an additional 2.461 points. Column 2 corre-

sponds to regression tree model 2, which adds potential

external drivers of the quality of stroke care to the variables

in the first model. Variables median wage (0.084), Neu-

roClinic (4.709), STUC (2.639), specialist (3.935) and

foundation trust (3.332) are statistically significant at less

than the 5% level. Presence of a Neuroclinic improves the

quality score by 4.709 units. Foundation trust status, pro-

vision for a specialist stroke team and availability of well-

equipped stroke units (SUTC) all maintain sizeable effects

on quality as before with coefficients of 3.332, 3.935 and

2.639, respectively, but the impact of median wage is small

(0.084). There are discrepancies between the regression

tree results and the LME results—notwithstanding, the

organisational variables appear prominent in both LME

models. Variables Specialist and SUTC, which are signif-

icant in both regression models, occupy prominent posi-

tions in the trees too. Column 3 adds the year dummy to the

list of the variables. With the inclusion of the year dummy,

median wage is no longer statistically significant. In

addition to the year dummy (6.574), SUTC (2.748), neu-

roclinc (3.437) and neurosurgeon per region (0.047) are all

statistically significant at less than the 5% level. The

quality score was on average 6.574 points higher in the

year 2010 compared to the year 2008. This implies that the

Department of Health’s National Stroke Strategy, which

was implemented across English NHS trusts in December

2007, resulted in profound and significant improvement in

hospital performance. Importantly, though, even after

controlling for the year effect, the effects of some stroke

care resources, such as SUTC and Neuroclinic, remain

considerably large (2.748 and 3.437, respectively). As for

regression tree results, we define neurosurgeons per region

as a wider measure of specialist input when interpreting its

importance in predicting quality of stroke care. Column 4

replaces the primary organisational features of the process

of stroke care with the composite measure of organisational

quality of the process of stroke care. With the composite

measure included, variables FT (foundation trust status)

(1.824), stroke mortality (0.230), median wage (0.057), the

year dummy (4.831) and organisation score (0.331) are all

statistically significant at less than the 5% level. The fact

that the composite organisational score has a small effect

(0.331) on stroke quality possibly suggests that individual

components have disproportionate, and in certain cases

opposite, effects on quality. Overall, if any lesson can be

drawn from these results, it is the conclusion that organi-

sation of the process of stroke care matters greatly. The

organisational variables in one form or another appear

prominent in both estimation approaches.

The Hausman test indicates the presence of fixed effects.

Table 7 represents the LME results when, in line with the

correlated random effects approach, we add the within-group

means of the time-varying variables to the list of the variables

to capture potential fixed effects. In column 1, we add the

within-group means of the time-varying variables to the list of

the variables in model 1 that only includes potential primary

internal drivers of the clinical quality of the process of stroke

care. The fixed effects coefficients of the key organisational

variables CSU (3.716), SUTC (2.389), specialist (5.394) and

foundationtrust (2.614) are all statistically significantat5%or

below. No other variables appear as statistically significant.

Column 2 adds the within-group means of the time-varying

external drivers of the quality score. The organisational vari-

ables SUTC (2.559), specialist (2.575) and NeuroClinic

(4.069) are statistically significant at the 5% level or below.

The variable stroke admissions is significant at below the 5%

level with fixed effect coefficient 0.030. Mean admissions

(�0:030) andstrokemortality(1.715) are significantatthe 5% level.Column 3addsthe yeardummy variable. Thevariableis

significant at the 5% level, with a high magnitude (11.414), emphasising again the important impact of the policy change

mentioned before. Importantly, the organisational variables

SUTC (2.58) and specialist (2.308) remain statistically sig-

nificant at the 5% level or below. Column 4 replaces the pri-

mary organisational variables in model 3 with the composite

organisationalmeasure.Besidestheyeardummy(10.570),the

mean of the composite measure appears statistically signifi-

cant (0.449) at below the 1% level. All in all, even when we

consider fixed effects coefficient estimates, the variables

capturing aspects of the organisational quality of the process

of stroke care such as SUTC remain significant. The fixed

effects coefficients of the rest of the variables are not statis-

tically significant.

In the above, we did not separate hospital fixed effects

from time fixed effects. Testing for hospital and time effects,

there is strong evidence for the presence of time effects,

which is consistent with the observed pattern in Fig. 1. We

turn to conventional panel regression to examine time fixed

effects separately. Table 8 shows three models with time

effects. Column 1 presents the results for the model with the

internal drivers of clinical quality of the process of stroke

care, using disaggregated organisational measures. Column

2 adds potential external drivers. Column 3 replaces the

disaggregate organisational indicators with the composite

measure of organisational quality of the process of stroke

care. The disaggregate organisational variables SUTC and

specialist are statistically significant in the first two columns.

The variable neurology in the first model and NeuroClinc

(the presence of a neuro clinic) in the second model are

statistically significant too. And the composite measure of

organisational quality of the process of stroke care appears

stronglystatistically significant in the third model too. All the

three models point to the statistical significance of organi-

sational features of the process of stroke care. [The presence

Hospital heterogeneity: what drives the quality of health care 399

123

of time effects points to the possibility that changes in the

environment (say policies) may have affected the quality of

stroke care, but in the same way for all the hospitals.]

Further, the nurse-to-bed ratio variable appears statisti-

cally significant in the first two models, with a positive sign

across all the models, suggesting complementarity between

the number of nurses and beds. The coefficient for beds, as a

measure of hospital size, has a negative sign across the

models and is significant only in the first model. The mea-

sures indicating external factors are statistically significant

in one form or another. The number of neurosurgeons per

region is statistically and positively significant in the second

model and weekly median wage in the third model. The

variable stroke mortality is statistically significant in both

models—suggesting that in areas where stroke mortality

increased over time the quality of the process of stroke care

has improved too. It is possible that extra resources are

devoted to the process of stroke care in areas with high stroke

mortality, leading to a higher correlation between stroke

mortality and the clinical quality of stroke care.

To further support the robustness of our key results, we

rescale the measures of the quality of the clinical and

organisation of the process of stroke care by taking out the

within-hospital means of the variables, while including the

control variables in their level form. Table 9 reports the

results with the transformed variables. Column 2 adds

median wage to the list of control variables in column 1,

column 3 excludes the data on London hospitals, and column

4 adds London to the list of control variables. In all the

columns, the coefficient estimate of the composite organi-

sational score, in its transformed form, is significant at 1%,

with the magnitude ranging from 0.297 to 0.326. Changes in

the quality of the clinical process of stroke care are strongly

positively associated with changes in the composite organ-

isational score. And the coefficient of the year dummy is no

longer statistically significant. The results are consistent with

the theoretically plausible conjecture that variations in the

quality of the organisation of the process of care drives

changes in the quality of the process of stroke care. While the

panel structure of the short sample supports our hypothesis,

there is no claim for causation. There could be unobserved

time-varying factors that drive changes in both clinical and

organisational measures of quality.

The fact that the organisational quality of the process of

stroke care appears among critical drivers of the clinical

quality of stroke care raises a deeper question as to what

factors drive organisational quality. Our sample is inade-

quate for fully shedding light on the subject. Nonetheless,

we have regressed the composite measure of organisational

quality on several theoretically plausible drivers of organ-

isational quality. Columns 1 and 2 in Table 10 include

several internal drivers. Column 1 controls for individual

fixed effects and column 2 controls for both individual and

time fixed effects. Columns 3 and 4 add weekly median

wage. The columns respectively control for individual

fixed effects and both individual and time fixed effects. In

all the columns, the nurse-to-bed ratio is statistically and

quantitatively significant at the (10%) level or below. The

median weekly wage appears as statistically significant in

the last column, suggesting that hospitals located in more

affluent areas have higher organisational quality.

The results from the traditional econometric approaches

corroborate the findings from the unbiased REEM-Trees.

All the analyses indicate the importance of the organisa-

tional quality of stroke care in determining the clinical

quality of stroke care. The tree models point to critical

interactions among various variables driving stroke care

quality. It is a complex interaction among hospital

resources that shapes quality. Our key results are robust to

the use of ratio variables or alternative measures of human

or physical capital left out in constructing the trees because

of the high correlation. Our results are not driven by hos-

pitals in London. Excluding data relating to London’s

hospitals will not change our results, as shown in Table 9.

Conclusion

A feature of the health care systems is substantial variation in

health care quality across hospitals. This study has focused on

the quality of the process of stroke care as an indicator of the

overallqualityofNHShospitals.Drawingonthe literature,we

conjectured several explanations for the observed cross-hos-

pital heterogeneity and tested their predictive implications

using a short panel spanning 2004 to 2010. Both the para-

metric and non-parametric methods identify several features

of the organisation of stroke care as key predictors of hospital

quality. Presences of specialist stroke teams, well-equipped

stroke units and neurovascular clinics have all been found to

improve thequalityofstrokecareandthe improvementsrange

between 2 to 5 points in a quality score measured on a 0–100

scale. When a composite score as a summary of such organ-

isation of care has been used separately it consistently

appeared to be a significantly important driver of quality. The

result persists when potential confounders/variables are con-

trolled for and is particularly robust to the inclusion of a time

trend, which accounted for a major policy change during the

study period. The non-parametric methods identify the mea-

sure of organisation score as the key predictor of hospital

quality. The result persists when potential confounders/vari-

ables are controlled for and is particularly robust to the

inclusion of a time trend. The result supports an emerging

literature that emphasises the critical role of organisation of

resources in productivity. The non-parametric methods also

reveal critical interactions among potential determinants of

the quality of the process of stroke care.

400 M. Ali et al.

123

Several limitations offer scope for further research. The

predictive results narrow down candidate hypotheses on the

sources of hospital heterogeneity but are only indicative of

the causal impact of organisational factors on the quality of

stroke care; they do not establish causation. A thorough

analysis will consider further control variables such as

stroke admissions, examine alternative measures of quality

and attempt to obtain a more comprehensive measure of

organisation quality. As another limitation, our study does

not control for clinical networks. Under these clinical net-

works, hospitals combine and integrate resources with other

hospitals in the networks to provide high quality of care to

the patients. As mentioned in [61], for example, cardiac

networks in the English NHS have led to an increase in the

rate of specialist interventions and faster access time. The

results trace differences in the quality of the process of care

to differences in the organisation of resources and man-

agement practices across NHS hospitals. Ultimately, it is

vital to explain organisational heterogeneity—why some

hospitals enjoy a higher level of organisational design than

others. Future research will benefit from using a longer

longitudinal framework to better identify trends in perfor-

mance. Equally, it will prove insightful to compare the

effects of the National Stroke Strategy that was imple-

mented in England and compare the quality for stroke care

with Wales and Northern Ireland. This allows one to draw

causal effects of such policy changes (using quasi-experi-

mental techniques such as difference in difference) since

health care systems across England and in Wales or

Northern Ireland are broadly comparable. Finally, future

research needs to take into account detailed staff data

related to stroke care including rehabilitation and geriatric

staff and control for cardiac networks.

Acknowledgements We would like to thank participants at the 18th International Conference on Econometrics, Operations Research and

Statistics, the Health e-Research Centre seminar at University of

Manchester and the BES seminar at the Alliance Manchester Business

School. We are grateful to Peter Kawalek, Nathan Proudlove and

Irene Roele for their very helpful comments.

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://crea

tivecommons.org/licenses/by/4.0/), which permits unrestricted use,

distribution, and reproduction in any medium, provided you give

appropriate credit to the original author(s) and the source, provide a

link to the Creative Commons license, and indicate if changes were

made.

Appendix

See Tables 3, 4, 5, 6, 7, 8, 9 and 10

Table 3 Definition and source of variables used in the analysis

Variable Definition Source

Clinical process of care

score Total clinical process score The National Sentinel

Stroke Audit; RCP

Organisational variables

Organisation score Total organisational score The National Sentinel

Stroke Audit; RCP

Hospital characteristics

Teaching hospital Whether the hospital is a teaching hospital Self-coded from hospital’s

website

Foundation trust Whether the hospital is a foundation trust Self-coded from Monitor

website

Human capital

Non-medical staff Headcount of non-medical staff as of 30 September NHS workforce; statistics,

(HCHS)

Scientific staff All qualified scientific, therapeutic and technical staff as of 30 September NHS workforce; statistics

(HCHS)

Allied professionals Qualified allied health professionals as of 30 September NHS workforce; statistics,

(HCHS)

Clinical staff Total number of medical and dental staff as of 30 September NHS workforce; statistics,

(HCHS)

Qualified clinical staff Headcount of professionally qualified clinical staff as of 30 September NHS workforce statistics

(HCHS)

Nurses Headcount of qualified nursing midwifery and health visiting staff as of 30

September

NHS workforce statistics

(HCHS)

Hospital heterogeneity: what drives the quality of health care 401

123

Table 4 Clinical process score domains

Clinical domains Clinical standards

Initial patient assessment Screening for swallowing 24 h

Visual fields

Sensory testing

Brain scan within 24 h of stroke

Multidisciplinary assessment Swallowing assessment speech and language therapist

Physiotherapy assessment within 72 h

Initial assessment of communication 7 days

Occupational therapy assessment within 4 working days

Social work assessment within 7 days of referral

Screening and functional assessment Patient weighed at least once during admission

Evidence mood assessed

Cognitive status assessed

Screening for malnutrition

Care planning Evidence of rehabilitation goals

Plan to promote urinary continence

Receiving nutrition within 72 h

Communication with patients and carers Discussion with patient about diagnosis

Carer needs for support assessed separately

Skills taught to care for patient at home

Follow-up appointment at 6 weeks

Driving advice

Acute care Aspirin within 48 h of stroke

90% of stay in a sroke unit

Admitted to an acute or combined Stroke Unit within 4 h

Receiving fluids within 24 h

Receiving thrombolysis

Table 3 continued

Variable Definition Source

General Medicine

Group

Total medical staff at General Medicine Group as of 30 September NHS workforce statistics

(HCHS)

Neurology Total medical staff at Neurology group as of 30 September NHS workforce; statistics,

(HCHS)

Neurophysiology Number of medical staff at clinical neurophysiology group as of 30 September NHS workforce statistics

(HCHS)

Neurosurgeons Number of neurosurgeons as of 30 September NHS workforce statistics

(HCHS)

Physical capital

Beds Total number of available beds Hospital Estate and facilities

data

Operating theatres Number of operating theatres, quarter ending September Dept. of Health QMCO

Day case theatres Number of dedicated day case theatres quarter ending September Dept. of Health QMCO

Regional characteristics

Stroke admissions Regional emergency stroke admissions standardised by age and gender HSCIC

Stroke mortality Regional stroke mortality standardised by age and gender HSCIC

All SMR All age standardised mortality ratio (SMR) HSCIC

Median wage Regional full-time weekly median wage ONS; ASHE

Inequality Ratio of 10th and 90th percentile full time weekly wage ONS; ASHE

No qualifications Regional population with no qualifications (%) NOMIS

402 M. Ali et al.

123

Table 5 In- and out-sample fit measures

In-sample fit Out-sample fit

AIC BIC LOOCV k-fold

(1) (2) (3) (4)

Models

Model 1 2195.83 2236.35 9.314 9.319

Model 2 2147.44 2195.23 9.204 9.091

Model 3 2131.69 2175.86 8.562 8.405

Model 4 2096.64 2140.80 8.531 8.268

Table 6 Linear mixed effects regression results

Models

(1) (2) (3) (4)

(Intercept) 44.361 (5.706)*** -7.571 (23.684) 14.724 (23.185) -4.762 (21.197)

Beds -0.003 (0.002) -0.000 (0.002) -0.002 (0.002) -0.002 (0.002)

Day case theatres -0.117 (0.263) -0.371 (0.264) -0.172 (0.262) -0.231 (0.236)

Nurse-to-bed ratio 5.490 (1.567)*** 2.175 (1.701) 2.259 (1.648) 1.916 (1.525)

Neurology 0.168 (0.108) 0.015 (0.110) 0.067 (0.109) 0.036 (0.097)

Neurophysiology 0.311 (0.650) 0.340 (0.649) 0.392 (0.638) 0.291 (0.579)

Neurosurgery -0.087 (0.147) 0.001 (0.145) -0.019 (0.143) -0.026 (0.128)

Teaching 1.849 (1.549) 1.580 (1.533) 1.555 (1.504) 1.164 (1.331)

Foundation trust 2.487 (1.226)** 3.332 (1.209)*** 1.445 (1.223) 1.884 (1.117)*

ASU 1.524 (1.937) 1.576 (1.875) -0.370 (1.830)

RSU 1.066 (1.761) 0.388 (1.739) 0.625 (1.676)

CSU 3.795 (2.181)* 2.172 (2.145) 0.129 (2.100)

SUTC 2.461 (1.011)** 2.639 (0.977)*** 2.748 (0.935)***

Specialist 5.172 (1.136)*** 3.935 (1.132)*** 1.899 (1.150)

ESD 1.876 (1.314) 1.842 (1.275) 0.925 (1.233)

NeuroClinic 3.149 (2.128) 4.709 (2.071)** 3.437 (1.996)*

PatientViews -1.408 (1.904) -2.614 (1.897) -2.639 (1.817)

Report 0.909 (1.174) 1.088 (1.139) 1.115 (1.092)

Stroke admissions -0.000 (0.000) -0.000 (0.000) -0.000 (0.000)

Stroke mortality 0.141 (0.129) 0.200 (0.124) 0.230 (0.116)**

No qualifications 0.086 (0.332) -0.355 (0.336) -0.177 (0.302)

Median wage 0.084 (0.024)*** 0.036 (0.025) 0.057 (0.023)**

Neurosurgeons per region 0.019 (0.026) 0.047 (0.026)* 0.021 (0.023)

Year dummy 6.574 (1.250)*** 4.831 (1.178)***

Organisation score 0.331 (0.050)***

AIC 2222.923 2231.788 2205.433 2199.928

BIC 2295.973 2322.657 2299.844 2265.798

Num. obs. 303 303 303 303

*** p\0:001, ** p\0:01, * p\0:05, � p\0:1. Table 6 shows linear mixed effects results corresponding to the four tree models. The dependent variable is the quality score and explanatory variables are lagged 1

year. Column 1 includes hospital characteristics, human capital factors, teaching and foundation trust status,

and primary variables reflecting the quality of the organisation of the process of stroke care. Column 2 adds

a set of external variables and column 3 adds the year dummy. Column 4 substitutes the composite

organisational measure for the primary organisation features of the process of stroke care

Hospital heterogeneity: what drives the quality of health care 403

123

Table 7 Correlated random effects results

Models

(1) (2) (3) (4)

(Intercept) 42.975 (5.950)*** 6.059 (30.758) 11.222 (30.751) -19.008 (26.751)

Beds 0.017 (0.017) 0.009 (0.015) 0.008 (0.015) -0.000 (0.014)

Day case theatres -0.439 (1.207) -0.102 (1.047) -0.230 (1.037) -0.647 (1.005)

Nurse-to-bed ratio 4.794 (3.921) 0.669 (3.601) 0.552 (3.562) -1.196 (3.400)

Neurology -0.648 (0.736) -1.001 (0.680) -0.913 (0.674) -0.994 (0.655)

Neurophysiology 1.669 (3.573) -0.294 (3.128) -0.367 (3.094) -0.085 (3.017)

Neurosurgery 0.794 (1.129) 0.207 (1.041) 0.189 (1.029) 0.053 (1.002)

Teaching 1.761 (1.556) 1.032 (1.538) 1.467 (1.547) 1.110 (1.342)

Foundation trust 2.614 (1.238)** 1.600 (1.257) 1.716 (1.252) 1.675 (1.122)

ASU 1.599 (1.957) -0.140 (1.862) -0.198 (1.849)

RSU 1.107 (1.772) 0.809 (1.708) 0.664 (1.699)

CSU 3.716 (2.197)* 0.143 (2.123) 0.026 (2.111)

SUTC 2.389 (1.026)** 2.559 (0.963)*** 2.580 (0.956)***

Specialist 5.394 (1.161)*** 2.575 (1.172)** 2.308 (1.170)*

ESD 2.000 (1.323) 1.216 (1.244) 1.152 (1.235)

NeuroClinic 3.364 (2.155) 4.069 (2.020)** 3.302 (2.035)

PatientViews -1.099 (1.920) -2.051 (1.851) -2.378 (1.844)

Report 0.683 (1.187) 0.674 (1.121) 0.839 (1.116)

Beds (mean) -0.021 (0.017) -0.011 (0.015) -0.010 (0.015) -0.003 (0.014)

Day case theatres (mean) 0.349 (1.241) -0.112 (1.085) 0.072 (1.077) 0.409 (1.033)

Nurse-to-bed ratio (mean) 1.431 (4.314) 3.191 (4.091) 3.294 (4.054) 3.770 (3.824)

Neurology (mean) 0.825 (0.743) 1.099 (0.691) 1.004 (0.684) 1.035 (0.663)

Neurophysiology (mean) -1.424 (3.634) 0.653 (3.197) 0.785 (3.163) 0.203 (3.073)

Neurosurgery (mean) -0.881 (1.139) -0.255 (1.049) -0.238 (1.038) -0.094 (1.010)

Stroke admissions 0.030 (0.016)* 0.022 (0.016) 0.015 (0.015)

Stroke mortality -1.470 (0.958) -1.209 (0.955) -0.855 (0.918)

No qualifications -0.288 (0.366) -0.322 (0.365) -0.029 (0.320)

Median wage 0.067 (0.105) -0.225 (0.169) -0.133 (0.158)

Neurosurgeons per region -0.038 (0.201) -0.224 (0.216) -0.197 (0.208)

Stroke admissions (mean) -0.030 (0.016)* -0.022 (0.016) -0.015 (0.015)

Stroke mortality (mean) 1.715 (0.974)* 1.416 (0.973) 1.064 (0.933)

Median wage (mean) -0.025 (0.109) 0.256 (0.168) 0.194 (0.157)

Neurosurgeons per region (mean) 0.077 (0.202) 0.268 (0.218) 0.206 (0.209)

Year dummy 11.414 (5.220)** 10.570 (4.746)**

Organisation score 0.029 (0.083)

Organisation score (mean) 0.449 (0.104)***

AIC 2222.916 2226.488 2218.599 2202.447

BIC 2317.328 2352.433 2348.008 2307.439

Num. obs. 303 303 303 303

*** p\0:001, ** p\0:01, * p\0:05, � p\0:1. Table 7 shows correlated random effects results for the four models. Each time-varying explanatory variable appears once in its original form and once as a within-group (hospital) average. The coefficients of the variables reflect fixed

effects estimates. We control for both hospital and time fixed effects in column 3 and column 4 by adding the year dummy

404 M. Ali et al.

123

Table 8 Fixed effects model (time)

Models

(1) (2) (3)

Beds -0.004 (0.002)** -0.002 (0.002) -0.002 (0.002)

Day case theatres -0.045 (0.235) -0.189 (0.237) -0.246 (0.221)

Nurse-to-bed ratio 4.267 (1.446)*** 2.944 (1.567)* 2.201 (1.472)

Neurology 0.189 (0.096)** 0.072 (0.097) 0.037 (0.091)

Neurophysiology 0.302 (0.577) 0.359 (0.570) 0.267 (0.539)

Neurosurgery -0.069 (0.130) -0.020 (0.128) -0.031 (0.119)

Teaching 2.193 (1.381) 1.710 (1.359) 1.181 (1.242)

Foundation trust 1.405 (1.138) 1.772 (1.160) 2.042 (1.074)*

ASU -0.755 (1.857) -0.245 (1.836)

RSU 1.605 (1.628) 0.632 (1.627)

CSU 0.958 (2.073) 0.668 (2.033)

SUTC 2.902 (0.952)*** 3.261 (0.936)***

Specialist 2.381 (1.158)** 2.047 (1.147)*

ESD 1.248 (1.236) 1.248 (1.225)

NeuroClinic 2.893 (1.998) 4.110 (1.985)**

PatientViews -1.171 (1.779) -2.386 (1.808)

Report 1.338 (1.100) 1.197 (1.084)

Stroke admissions -0.000 (0.000) -0.000 (0.000)

Stroke mortality 0.216 (0.125)* 0.238 (0.115)**

No qualifications -0.347 (0.306) -0.154 (0.284)

Median wage 0.036 (0.023) 0.059 (0.021)***

Neurosurgeon per region 0.045 (0.023)* 0.018 (0.021)

Organisation score 0.359 (0.049)***

R2 0.191 0.244 0.297

Adj. R2 0.179 0.225 0.281

Num. obs. 303 303 303

*** p\0:001, ** p\0:01, * p\0:05, � p\0:1. Column 1 includes the internal drivers of clinical quality of the process of stroke care. Column 2 adds potential external drivers. Column 3 replaces the individual

organisational variables with the composite measure of organisational quality. All the three models control

for time fixed effects

Hospital heterogeneity: what drives the quality of health care 405

123

Table 9 Correlated random effects results: rescaled clinical and organisational quality scores

Models

(1) (2) (3) (4)

(Intercept) -9.850 (11.11) -53.01 (20.19)*** -50.61 (27.51)* -55.67 (26.95)**

Beds -0.003 (0.002)* -0.002 (0.002) -0.001 (0.002) -0.002 (0.002)

Day case theatres -0.125 (0.204) -0.228 (0.207) -0.352 (0.266) -0.228 (0.207)

Nurse-to-bed ratio 2.358 (1.475) 1.853 (1.408) 1.116 (1.538) 1.864 (1.415)

Neurology 0.070 (0.079) 0.035 (0.079) -0.023 (0.137) 0.037 (0.079)

Neurophysiology 0.320 (0.453) 0.297 (0.419) -0.067 (0.617) 0.294 (0.420)

Neurosurgery -0.044 (0.116) -0.024 (0.100) 0.029 (0.112) -0.026 (0.101)

Teaching 1.414 (1.330) 1.163 (1.300) 1.514 (1.518) 1.186 (1.322)

Foundation trust 1.248 (0.994) 1.849 (1.015)* 1.811 (1.115) 1.841 (1.019)*

Stroke admissions -0.000 (0.000) -0.000 (0.000) -0.000 (0.000) -0.000 (0.000)

Stroke mortality 0.084 (0.105) 0.228 (0.122)* 0.204 (0.138) 0.237 (0.136)*

No qualifications -0.422 (0.321) -0.182 (0.301) -0.173 (0.309) -0.189 (0.306)

Neurosurgeons per region 0.060 (0.015)*** 0.022 (0.020) 0.028 (0.038) 0.028 (0.038)

Median wage 0.056 (0.020)*** 0.060 (0.032)* 0.060 (0.031)*

Organisation score 0.297 (0.048)*** 0.325 (0.049)*** 0.313 (0.053)*** 0.326 (0.049)***

London -1.042 (5.961)

Year dummy -0.502 (0.965) -1.668 (1.020) -1.540 (1.323) -1.782 (1.246)

R 2 0.230 0.250 0.143 0.250

Adj. R 2 0.218 0.237 0.134 0.236

Num. obs. 303 303 265 303

*** p\0:01, ** p\0:05, * p\0:1. The dependent variable in all the models is the change in the quality of the process of stroke care obtained by taking out the within-hospital mean of the variable from the variable. Similarly, the table replaces the composite organisational score with the

change in the variable obtained by taking out the within-hospital mean of the variable. Column 3 excludes data relating to London hospitals.

Column 4 includes the London dummy, using the full data

Table 10 Fixed effects models: organisation score

Models

(1) (2) (3) (4)

Beds 0.017 (0.012) 0.018 (0.012) 0.018 (0.012) 0.016 (0.012)

Day case theatres 1.112 (0.790) 0.930 (0.795) 0.963 (0.798) 1.004 (0.788)

Nurse-to-bed ratio 5.238 (2.987)* 6.486 (3.079)** 6.301 (3.106)** 5.519 (3.086)*

Neurology -0.260 (0.576) -0.084 (0.585) -0.112 (0.588) -0.202 (0.582)

Neurophysiology 1.554 (2.918) 1.857 (2.912) 1.851 (2.924) 1.209 (2.901)

Neurosurgery -0.558 (0.904) -0.423 (0.904) -0.483 (0.904) -0.099 (0.909)

Teaching 0.540 (10.373) 0.490 (10.327) 0.466 (10.357) 0.865 (10.227)

Foundation trust -2.747 (1.847) -1.354 (2.044) -1.718 (2.026) -0.764 (2.045)

Median wage -0.034 (0.028) 0.358 (0.176)**

R 2 0.055 0.051 0.064 0.076

Adj. R 2 0.026 0.024 0.029 0.035

Num. obs. 342 342 342 342

*** p\0:001, ** p\0:01, * p\0:05, � p\0:1. The dependent variable is the composite organisational measure, all explanatory variables are lagged for 1 year, and the coefficients are all fixed effects estimates.

Column 1 controls for hospital fixed effects while column 2 controls for hospital and time fixed effects.

Columns 3 and 4 add weekly median wage. The columns respectively control for individual fixed effects

and both individual and time fixed effects. Column 5 adds the lag of the dependent variable and controls for

individual and time fixed effects

406 M. Ali et al.

123

References

1. Aragon, M.J.A., Castelli, A., Gaughan, J.: Hospital trusts pro-

ductivity in the english nhs: uncovering possible drivers of pro-

ductivity variations. CHE University of York Research Paper 117

(2015)

2. Athey, S., Stern, S.: An empirical framework for testing theories

about complimentarity in organizational design. NBER Working

Paper, National Bureau of Economic Research (1998)

3. Ayanian, J.Z., Weissman, J.S.: Teaching hospitals and quality of

care: a review of the literature. Milbank Q. 80(3), 569–593 (2002) 4. Bijlsma, M.J., Koning, P.W., Shestalova, V.: The effect of

competition on process and outcome quality of hospital care in

the Netherlands. De Econ. 161(2), 121–155 (2013) 5. Birkmeyer, J.D., Dimick, J.B.: Understanding and reducing

variation in surgical mortality. Annu. Rev. Med. 60, 405–415 (2009)

6. Bloom, N., Lemos, R., Sadun, R., Scur, D., Reenen, J.V.: The

new empirical economics of management. CEP Occasional

Papers 41, Centre for Economic Performance, LSE (2014)

7. Bloom, N., Propper, C., Seiler, S., Van Reenen, J.: The impact of

competition on management quality: evidence from public hos-

pitals. Rev. Econ. Stud. 82(2), 457–489 (2015) 8. Bloom, N., Sadun, R., Van Reenen, J.: Does management matter

in healthcare? Stanford Mimeo (2014)

9. Bloom, N., Van Reenen, J.: Measuring and explaining manage-

ment practices across firms and countries. Q. J. Econ. 122(4), 1351–1408 (2007)

10. Bradley, E.H., Curry, L.A., Spatz, E.S., Herrin, J., Cherlin, E.J.,

Curtis, J.P., Thompson, J.W., Ting, H.H., Wang, Y., Krumholz,

H.M.: Hospital strategies for reducing risk-standardized mortality

rates in acute myocardial infarction. Ann. Intern. Med. 156(9), 618–626 (2012)

11. Bradley, E.H., Herrin, J., Curry, L., Cherlin, E.J., Wang, Y.,

Webster, T.R., Drye, E.E., Normand, S.-L.T., Krumholz, H.M.:

Variation in hospital mortality rates for patients with acute

myocardial infarction. Am. J. Cardiol. 106(8), 1108–1112 (2010) 12. Bray, B.D., Ayis, S., Campbell, J., Hoffman, A., Roughton, M.,

Tyrrell, P.J., Wolfe, C.D., Rudd, A.G.: Associations between the

organisation of stroke services, process of care, and mortality in

England: prospective cohort study. BMJ 346, f2827 (2013) 13. Breiman, L., Friedman, J., Stone, C.J., Olshen, R.A.: Classifica-

tion and Regression Trees. CRC Press, Boca Raton (1984)

14. Brynjolfsson, E., Milgrom, P.: Complementarity in organization.

In: Gibbons, R., Roberts, J. (eds.) The Handbook of Organiza-

tional Economics. Princeton University Press, Princeton (2013)

15. Burnham, K.P., Anderson, D.R.: Model Selection and Multi-

model Inference: A Practical Information-theoretic Approach.

Springer, Berlin (2002)

16. Candelise, L., Gattinoni, M., Bersano, A., Micieli, G., Sterzi, R.,

Morabito, A., the PROSIT Study Group, et al.: Lancet stroke-unit

care for acute stroke patients: an observational follow-up study.

369(9558), 299–305 (2007) 17. Caplan, L.: Stroke is best managed by neurologists. Stroke

34(11), 2763–2763 (2003) 18. Chandra, A., Finkelstein, A., Sacarny, A., Syverson, C.: Pro-

ductivity dispersion in medicine and manufacturing. Am. Econ.

Rev. 106(5), 99–103 (2016) 19. Chen, L.M., Nallamothu, B.K., Krumholz, H.M., Spertus, J.A.,

Tang, F., Chan, P.S.: Association between a hospital’s quality

performance for in-hospital cardiac arrest and common medical

conditions. Circ. Cardiovasc. Qual. Outcomes 6(6), 700–707 (2013)

20. Cho, S.-H., Yun, S.-C.: Bed-to-nurse ratios, provision of basic

nursing care, and in-hospital and 30-day mortality among acute

stroke patients admitted to an intensive care unit: cross-sectional

analysis of survey and administrative data. Int. J. Nurs. Stud.

46(8), 1092–1101 (2009) 21. Cooper, R.A.: Regional variation and the affluence-poverty

nexus. JAMA 302(10), 1113–1114 (2009) 22. Crombie, I., Davies, H.: Beyond health outcomes: the advantages

of measuring process. J. Eval. Clin. Pract. 4(1), 31–38 (1998) 23. Farrar, S., Yi, D., Sutton, M., Chalkley, M., Sussex, J., Scott, A.: Has

payment by results affected the way that English Hospitals provide

care? Difference–in–differences analysis. BMJ 339, b3047 (2013) 24. Fine, J.M., Fine, M.J., Galusha, D., Petrillo, M., Meehan, T.P.:

Patient and hospital characteristics associated with recommended

processes of care for elderly patients hospitalized with pneumo-

nia: results from the medicare quality indicator system pneumo-

nia module. Arch. Intern. Med. 162(7), 827–833 (2002) 25. Flood, A.B.: The impact of organizational and managerial factors

on the quality of care in health care organizations. Med. Care

Res. Rev. 51(4), 381–428 (1994) 26. Friedman, M.: Essays in positive economics. University of Chi-

cago Press (1953)

27. Fu, W., Simonoff, J.S.: Unbiased regression trees for longitudinal

and clustered data. Comput. Stat. Data Anal. 88, 53–74 (2015) 28. Garicano, L., Rayo, L.: Why organizations fail: models and cases.

J. Econ. Lit. 54(1), 137–192 (2016) 29. Gaynor, M., Laudicella, M., Propper, C.: Can governments do it

better? Merger mania and hospital outcomes in the english nhs.

J. Health Econ. 31(3), 528–543 (2012) 30. Gaynor, M., Seider, H., Vogt, W.B.: The volume-outcome effect,

scale economies, and learning-by-doing. Am. Econ. Rev. 95, 243–247 (2005)

31. Ghaferi, A.A., Birkmeyer, J.D., Dimick, J.B.: Variation in hos-

pital mortality associated with inpatient surgery. New Engl.

J. Med. 361(14), 1368–1375 (2009) 32. Gobillon, L., Milcent, C.: Spatial disparities in hospital perfor-

mance. J. Econ. Geogr. 13(6), 1013–1040 (2013) 33. Grimaud, O., Béjot, Y., Heritage, Z., Vallée, J., Durier, J., Cadot,

E., Giroud, M., Chauvin, P.: Incidence of stroke and socioeco-

nomic neighborhood characteristics an ecological analysis of

dijon stroke registry. Stroke 42(5), 1201–1206 (2011) 34. Hentschker, C., Mennicken, R.: The volume–outcome relation-

ship and minimum volume standards–empirical evidence for

Germany. Health Econ. (2014)

35. Hoeks, S., Scholte op Reimer, W., Lingsma, H., van Gestel, Y.,

van Urk, H., Bax, J., Simoons, M., Poldermans, D.: Process of

care partly explains the variation in mortality between hospitals

after peripheral vascular surgery. Eur. J. Vasc. Endovasc. Surg.

40(2), 147–154 (2010) 36. Hothorn, T., Hornik, K., Zeileis, A.: Unbiased recursive parti-

tioning: a conditional inference framework. J. Comput. Graph.

Stat. 15(3), 651–674 (2006) 37. Kapral, M.K., Fang, J., Chan, C., Alter, D.A., Bronskill, S.E.,

Hill, M.D., Manuel, D.G., Tu, J.V., Anderson, G.M.: Neighbor-

hood income and stroke care and outcomes. Neurology 79(12), 1200–1207 (2012)

38. Katz, M.L.: Provider competition and healthcare quality: more

bang for the buck? Int. J. Indus. Organ. 31(5), 612–625 (2013) 39. Ketcham, J.D., Baker, L.C., MacIsaac, D.: Physician practice size

and variations in treatments and outcomes: evidence from

medicare patients with ami. Health Affairs 26(1), 195–205 (2007) 40. Kolstad, J.T., Kowalski, A.E.: The impact of health care reform

on hospital and preventive care: evidence from massachusetts.

J. Public Econ. 96(11), 909–929 (2012) 41. Kuhn, M., Johnson, K.: Applied Predictive Modeling. Springer,

Berlin (2013)

42. Kupersmith, J.: Quality of care in teaching hospitals: a literature

review. Acad. Med. 80(5), 458–466 (2005)

Hospital heterogeneity: what drives the quality of health care 407

123

43. Langhorne, P., Taylor, G., Murray, G., Dennis, M., Anderson, C.,

Bautz-Holter, E., Dey, P., Indredavik, B., Mayo, N., Power, M.,

et al.: Early supported discharge services for stroke patients: a

meta-analysis of individual patients’ data. Lancet 365(9458), 501–506 (2005)

44. Lindenauer, P.K., Remus, D., Roman, S., Rothberg, M.B., Ben-

jamin, E.M., Ma, A., Bratzler, D.W.: Public reporting and pay for

performance in hospital quality improvement. New Engl. J. Med.

356(5), 486–496 (2007) 45. Manheim, L.M., Feinglass, J., Shortell, S.M., Hughes, E.F.:

Regional variation in medicare hospital mortality. Inquiry 29(1), 55–66 (1992)

46. May, K.: Technological change and aggregation. Econometrica,

15(1), 51–63 (1947) 47. McClellan, M.B., Staiger, D.O.: Comparing hospital quality at

for-profit and not-for-profit hospitals. In: The changing hospital

industry: comparing for-profit and Not-for-profit institutions,

pp. 93–112. University of Chicago Press (2000)

48. McConnell, K.J., Chang, A.M., Maddox, T.M., Wholey, D.R.,

Lindrooth, R.C.: An exploration of management practices in

hospitals. Healthcare 2(2), 121–129 (2014) 49. McConnell, K.J., Lindrooth, R.C., Wholey, D.R., Maddox, T.M.,

Bloom, N.: Management practices and the quality of care in

cardiac units. JAMA Intern. Med. 173(8), 684–692 (2013) 50. McNaughton, H., McPherson, K., Taylor, W., Weatherall, M.:

Relationship between process and outcome in stroke care. Stroke

34(3), 713–717 (2003) 51. Menachemi, N., Chukmaitov, A., Saunders, C., Brooks, R.G.:

Hospital quality of care: does information technology matter?

The relationship between information technology adoption and

quality of care. Health Care Manag. Rev. 33(1), 51–59 (2008) 52. Milgrom, P., Roberts, J.: Complementarities and fit strategy,

structure, and organizational change in manufacturing. J. Ac-

count. Econ. 19(2), 179–208 (1995) 53. Mitchell, J.B., Ballard, D.J., Whisnant, J.P., Ammering, C.J.,

Samsa, G.P., Matchar, D.B.: What role do neurologists play in

determining the costs and outcomes of stroke patients? Stroke

27(11), 1937–1943 (1996) 54. Mohammed, M.A., Mant, J., Bentham, L., Raftery, J.: Comparing

processes of stroke care in high-and low-mortality hospitals in the

West Midlands, UK. Int. J. Qual. Health Care 17(1), 31–36 (2005)

55. Mooney, H.: Quality of stroke care varies widely across England.

BMJ 340, c1816 (2010) 56. Needleman, J., Buerhaus, P., Mattke, S., Stewart, M., Zelevinsky,

K.: Nurse-staffing levels and the quality of care in hospitals. New

Engl. J. Med. 346(22), 1715–1722 (2002) 57. Needleman, J., Buerhaus, P., Pankratz, V.S., Leibson, C.L., Ste-

vens, S.R., Harris, M.: Nurse staffing and inpatient hospital

mortality. New Engl. J. Med. 364(11), 1037–1045 (2011) 58. O’Brien, S.M., DeLong, E.R., Peterson, E.D.: Impact of case

volume on hospital performance assessment. Arch. Intern. Med.

168(12), 1277–1284 (2008) 59. Ogbu, U.C., Slobbe, L.C., Arah, O.A., de Bruin, A., Stronks, K.,

Westert, G.P.: Hospital stroke volume and case-fatality revisited.

Med. Care 48(2), 149–156 (2010) 60. Park, S., Lee, J., Ikai, H., Otsubo, T., Imanaka, Y.: Decentral-

ization and centralization of healthcare resources: investigating

the associations of hospital competition and number of cardiol-

ogists per hospital with mortality and resource utilization in

japan. Health Policy 113(1), 100–109 (2013) 61. Propper, C.: Competition, incentives and the english nhs. Health

Econ. 21(1), 33–40 (2012)

62. Propper, C., Burgess, S., Green, K.: Does competition between

hospitals improve the quality of care? Hospital death rates and the

nhs internal market. J. Public Econ. 88(7), 1247–1272 (2004) 63. Quinlan, J.R.: C4.5: Programs for Machine Learning. Elsevier,

Amsterdam (2014)

64. Rafferty, A.M., Clarke, S.P., Coles, J., Ball, J., James, P., McKee,

M., Aiken, L.H.: Outcomes of variation in hospital nurse staffing

in english hospitals: cross-sectional analysis of survey data and

discharge records. Int. J. Nurs. Stud. 44(2), 175–182 (2007) 65. Reeves, M.J., Gargano, J., Maier, K.S., Broderick, J.P., Frankel,

M., LaBresh, K.A., Moomaw, C.J., Schwamm, L.: Patient-level

and hospital-level determinants of the quality of acute stroke care

a multilevel modeling approach. Stroke 41(12), 2924–2931 (2010)

66. Ringel, S.P.: The neurologist’s role in stroke management. Stroke

27(11), 1935–1936 (1996) 67. Ringelstein, E.B., Chamorro, A., Kaste, M., Langhorne, P., Leys,

D., Lyrer, P., Thijs, V., Thomassen, L., Toni, D.: European stroke

organisation recommendations to establish a stroke unit and

stroke center. Stroke 44(3), 828–840 (2013) 68. Roland, M., Rosen, R.: English nhs embarks on controversial and

risky market-style reforms in health care. New Engl. J. Med.

364(14), 1360–1366 (2011) 69. Rudd, A.G., Irwin, P., Rutledge, Z., Lowe, D., Wade, D., Pearson,

M.: Regional variations in stroke care in England, wales and

Northern Ireland: results from the national sentinel audit of

stroke. Clin. Rehabil. 15(5), 562–572 (2001) 70. Saposnik, G., Baibergenova, A., O’Donnell, M., Hill, M., Kapral,

M., Hachinski, V., et al.: Hospital volume and stroke outcome

does it matter? Neurology 69(11), 1142–1151 (2007) 71. Schootman, M., Lian, M., Pruitt, S.L., Deshpande, A.D., Hen-

dren, S., Mutch, M., Jeffe, D.B., Davidson, N.: Hospital and

geographic variability in thirty-day all-cause mortality following

colorectal cancer surgery. Health Serv. Res. 49(4), 1145–1164 (2014)

72. Sela, R.J., Simonoff, J.S.: Re-em trees: a data mining approach

for longitudinal and clustered data. Mach. Learn. 86(2), 169–207 (2012)

73. Svendsen, M.L., Ehlers, L.H., Frydenberg, M., Ingeman, A.,

Johnsen, S.P.: Quality of care and patient outcome in stroke units:

is medical specialty of importance? Med. Care 49(8), 693–700 (2011)

74. Svendsen, M.L., Ehlers, L.H., Ingeman, A., Johnsen, S.P.: Higher

stroke unit volume associated with improved quality of early

stroke care and reduced length of stay. Stroke 43(11), 3041–3045 (2012)

75. Syverson, C.: What determines productivity? J. Econ. Literat.

49(2), 326–365 (2011) 76. Ukawa, N., Ikai, H., Imanaka, Y.: Trends in hospital performance

in acute myocardial infarction care: a retrospective longitudinal

study in japan. Int. J. Qual. Health Care 26(5), 516–523 (2014) 77. West, E.: Management matters: the link between hospital

organisation and quality of patient care. Qual. Health Care 10(1), 40–48 (2001)

78. Whisnant, J.P.: The role of the neurologist in the decline of

stroke. Ann. Neurol. 14(1), 1–7 (1983) 79. Wooldridge, J.: Introductory Econometrics: A Modern Approach,

5th edn. Nelson Education, Toronto (2012)

80. Xian, Y., Holloway, R.G., Chan, P.S., Noyes, K., Shah, M.N.,

Ting, H.H., Chappel, A.R., Peterson, E.D., Friedman, B.: Asso-

ciation between stroke center hospitalization for acute ischemic

stroke and mortality. JAMA 305(4), 373–380 (2011)

408 M. Ali et al.

123

  • Hospital heterogeneity: what drives the quality of health care
    • Abstract
    • Introduction
    • Data and descriptive statistics
      • Clinical process of care
      • Organised stroke care
      • Hospital and regional characteristics
    • Panel regression trees
    • Empirical analysis
      • Disaggregate models
      • Composite measure
    • Robustness analysis
      • Correlated random effects models
    • Conclusion
    • Acknowledgements
    • Appendix
    • References