BUSINESS MANAGEMENT A+ WORK, ON TIME, NO PLAGARIZING; ON TIME

profilePelicans!!322
Improving_decision_making_in_hecono600.pdf

Improving Decision Making in Healthcare Operations

Matthew D. Dean, Ph.D.

University of Connecticut, 2010

This dissertation examines three distinct problems in the healthcare operations field and

demonstrates how decision making can be improved in each setting. The first problem

addresses how to make dynamic, real-time allocation decisions of testing time slots for a

Cardiac Diagnostic Testing Center (CDTC) in the presence of multiple competing patient

classes. Using historical data from the CDTC at a partner hospital, we explore the

performance of a dynamic model which is based on a multi-commodity network flow

formulation of the CDTC's scheduling problem. It incorporates the possibility of

"bumping" different patient classes in order to accommodate requests from other patient

classes. Our results indicate that this approach will result in drastically improved quality

of service for inpatients with little or no reduction in the quality of service to the

outpatients, relative to existing policy at our partner hospital. We then shift to exploring

another important problem involving healthcare resources: during a Mass Casualty

Incident (MCI), to which one of several area hospitals should each victim be sent? These

decisions depend on resource availability (both transport and care) and the survival

probabilities of patients. We focus on the critical time period immediately following the

onset of an MCI and are concerned with how to effectively transport victims to the

different area hospitals in order to provide the greatest good to the greatest number of

patients while not overwhelming any single hospital. We explore two settings. First, we

examine the deterministic version of the problem. We compare our proposed model with

Matthew D. Dean - University of Connecticut, 2010

a model in the extant literature and also against several current policies commonly used

by the so-called incident commander. Finally, we explore a dynamic and stochastic

setting for the MCI problem, looking at how our decisions affect the expected number of

survivors as events unfold over time. To address the infamous "curse of dimensionality",

we develop a heuristic methodology that combines the use of a stochastic dynamic

program (SDP) and Monte Carlo simulation to arrive at a final decision policy. We call

this methodology "scenario iteration." Managerial insights are also discussed for each of

the MCI models.

Improving Decision Making in Healthcare Operations

Matthew D. Dean

B.S., The College of William and Mary, 1997

A Dissertation

Submitted in Partial Fulfillment of the

Requirements for the Degree of

Doctor of Philosophy

at the

University of Connecticut

2010

UMI Number: 3411477

All rights reserved

INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted.

In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed,

a note will indicate the deletion.

UMT Dissertation Publishing

UMI 3411477 Copyright 2010 by ProQuest LLC.

All rights reserved. This edition of the work is protected against unauthorized copying under Title 17, United States Code.

A ® uest ProQuest LLC

789 East Eisenhower Parkway P.O. Box 1346

Ann Arbor, Ml 48106-1346

Copyright by

Matthew D. Dean

2010

APPROVAL PAGE

Doctor of Philosophy Dissertation

Improving Decision Making in Healthcare Operations

Presented by

Matthew D. Dean, B.S.

Major Advisor

Associate Advisor

Robert Day

Associate Advisor

Robert Garfinkel

Associate Advisor

Steven Thompson

University of Connecticut

2010

II

Table of Contents APPROVAL PAGE ii

Acknowledgements vi

Chapter 1 - Introduction 1

Chapter 2 - Dynamic Allocation of Cardiac Diagnostic Testing Time Slots .5

2.1 Introduction 5

2.2 Literature Review 9

2.3 Problem Setting 10

2.3.1 The State of the Hospital 12

2.4 An Overview of the Markov Decision Process 14

2.4.1 Decisions 14

2.4.2 Rewards to the hospital 14

2.4.3 Costs to patients 15

2.5 Details of the MDP 15

2.5.1 Notation 15

2.5.2 Process State 19

2.5.3 Constraints 20

2.5.4 Stage Reward 20

2.5.5 State Transition 22

2.5.6 Objective Function 23

2.6 A DSS Based on a Multi-commodity Flow Snapshot Model (MFSM) 23

2.6.1 Notation 25

2.6.2 Derivation of Inpatient Delay Costs under MFSM 26

2.6.3 Formulation 27

2.6.4 Size of the Model 28

2.7 Experimental Analysis 29

2.7.1 The MFSM versus the MDP 29

2.7.2 The MFSM versus the Current Approach 32

2.7.3 Results 34

2.8 Conclusions 37

CHAPTER 3 - Mass-Casualty Triage: Distribution of Victims to Multiple

Hospitals 39

iii

3.1. Introduction 39

3.2. Literature Review 43

3.3. Resource-Constrained Triage 44

3.3.1 Brief introduction to STM 45

3.3.2 Limitations of STM 46

3.3.3 Motivational Example 46

3.3.4 Our Enhanced Model 50

3.4. Formulation of the Resource-Constrained Triage Problem 52

3.4.1 Notation 52

3.4.2 Formulation 53

3.4.3 A Small Example 54

3.4.4 Enhanced Model's Performance vs. STM Model 55

3.5. Experiments 56

3.5.1 Experimental Setup 58

3.5.2 Low Score First Not Always Best Approach 61

3.5.3 Comparison of Enhanced Model to Heuristics 62

3.6. Managerial Implications and Conclusion 64

CHAPTER 4 - Investigating the Distribution of Victims to Multiple Hospitals in a Dynamic and Stochastic Setting 66

4.1. Introduction 66

4.2. Review of the Relevant Literature 68

4.3. Problem Description 70 4.3.1 Data Elements 71

4.3.2 The Stochastic and Dynamic Problem 73

4.4. Scenario Iteration 74

4.4.1 General Heuristic Algorithm 76

4.4.2 The Stochastic Dynamic Programming Model 77

4.4.3 The Monte Carlo Simulation 80

4.4.3 Convergence 83

4.5. Experimental Analysis 85

4.5.1 Experimental Setup 85

4.5.2 Model Performance 88

4.5.3 Performance of the Heuristic Methodology 90

4.5.4 Ability to Scale 90

4.6. Conclusion 93

IV

Chapter 5 - Conclusion and Future Directions 94

References 98

v

Acknowledgements

I would like to thank each member of my committee for their guidance and

encouragement. Their help along my winding journey are appreciated far more than they

know. My major advisor, Professor Suresh Nair, has reminded me of what a rewarding

experience it is to have the opportunity to work with a team of truly intelligent individuals.

He renewed my vigor to attack research problems head-on. His constant pressure to

provide results quickly made completing this dissertation a reality. Also, his insights and

ability to explain things clearly and simply have renewed my aspirations to be able to do

the same thing for all of my future students. The days when Bob Day and I would

brainstorm ideas on his whiteboard were treasured by me. His high energy and desire to

solve problems are contagious. Rob Garfinkel always expected my best effort and

wanted me to push myself as far as possible. Steve Thompson has been a steady and

valuable mentor to me both during our shared time at UConn and beyond.

I would also like to thank my fellow PhD students in the OPIM department for all

the times we shared together. During my early years, Steve helped ground my belief that

"real" research can be practical. Towards the latter part, Travis Maynard proved an

invaluable sounding board during our runs. His continued friendship is cherished.

Last, and most importantly, I thank my wife Jennifer for believing in me. Her

stead-fast love and patience throughout this process warrants, in my mind, her own

special "PhD." How she manages to raise two small boys while I am always at work, I do

not know. Cole and Evan have reminded me of the true definition of patience, even if I

do not always adhere to it. They always bring me joy at the end of the day. Without

Jennifer, I would not have succeeded. I cannot thank her enough.

Chapter 1 - Introduction

This dissertation examines three distinct problems in the healthcare operations field and

demonstrates how decision making can be improved in each setting. The first problem

addresses how to make dynamic, real-time allocation decisions of testing time slots for a

Cardiac Diagnostic Testing Center (CDTC) in the presence of multiple competing patient

classes. The second and third parts of the dissertation investigate how to determine

which of several area hospitals to send victims of a Mass Casualty Incident (MCI). In the

two MCI settings, first, we explore the problem in a deterministic setting. We compare

our proposed model with a model in the extant literature and also against several current

policies commonly used by the so-called incident commander. Finally, we explore a

dynamic and stochastic setting for the MCI problem, looking at how our decisions affect

the expected number of survivors as events unfold over time.

A CDTC makes real-time scheduling decisions that impact the use of its

equipment and rooms. Both inpatients and outpatients are frequent users of CDTC

resources, and physicians may prescribe one of several test protocols. The result is a

highly complex online decision-making environment, in which the decisions made by the

CDTC affect both direct revenues and costs associated with the patients undergoing

cardiac diagnosis, and the costs associated with other hospital resources, such as bed

availability. Using historical data from the CDTC at a local partner hospital, we explore

the performance of a dynamic model (instead of the currently used static policies), based

on a multi-commodity network flow formulation of the CDTC's scheduling problem that

incorporates the possibility of "bumping" or delaying of different patient classes in order

to accommodate requests from other patient classes. To demonstrate how this can be

1

done, we provide a descriptive model of the CDTC's scheduling problem in the form of a

Markov decision process (MDP). We then contrast the results obtained by using a

tractable, state-aggregated version of the MDP to the results obtained from using our

suggested protocol. Solving the full MDP representing the problem is impractical in the

online setting of this study and motivates the introduction of the proposed dynamic

network approach.

Our results indicate that this approach will result in drastically improved quality of

service for inpatients with little or no reduction in the quality of service to the outpatients,

relative to existing policy at our partner hospital. Our methods provide both an online

decision support tool and a framework by which simulation results may be used as

guidance for more profitable long-term capacity planning. In addition, hospital

administrators can use the results of the simulations to make strategic decisions about

how much capacity to reserve for inpatient tests in order to achieve their desired service

levels.

We then shift to exploring another important problem involving healthcare

resources: during an MCI, to which one of several area hospitals should each victim be

sent? These decisions depend on resource availability (both transport and care) and the

survival probabilities of patients. Simple heuristics used at present (such as, send the

most critical patient out first, to the closest hospital) may not result in the best utilization

of resources or the best overall outcome. In the first part of this problem, we focus on a

deterministic setting. Triaging victims of an MCI and subsequently transporting them to

area hospitals provides a challenge to the disaster response team. Because time is of

the essence, and there are multiple variables to consider in quick time, simple heuristics

are usually used. We describe an enhanced model that takes into account multiple

hospitals and their capacities and capabilities, victim-specific care times, degradation of

2

survival due to delays, increase of care time due to delays, and transport availability

depending on when and where these resources were deployed and dispatched.

We focus on the critical time period immediately following the onset of an MCI

and are concerned with how to effectively transport victims to the different area hospitals

in order to provide the greatest good to the greatest number of patients while not

overwhelming any single hospital. The Sacco Treatment Method (STM) model (Sacco et

al., 2005) is the only model that quantifies the term "greatest good to the greatest

number." They develop a "score-based" mathematical programming model that

maximizes the expected number of survivors from the MCI. Though this model is an

improvement over simple heuristics, there are many assumptions in the model which

may not be realistic in real time settings.

To address these issues, we propose three major enhancements to the STM

model, making our model more realistic and useful in practice. The enhancements we

make are: the time needed to treat/process a victim at a hospital varies depending on

the severity of the injury to the victim; care times for a particular victim will increase as

his score decreases (gets worse); we explicitly consider the treatment capacities and

capabilities of the area hospitals; and the availability of resources for transporting victims

depends upon where and when it was sent in the previous trip.

This resource-constrained triage problem is formulated as a mixed-integer

program. We compare our model to the STM model and show that, in our experiments,

our model has an average improvement of 36.5% and 69.0% in the expected number of

survivors, depending on how the STM model is implemented.

We then design a comprehensive experiment to compare our enhanced model

with three commonly used policies for allocating victims to hospitals during an MCI. Our

results indicate a marked improvement over all of these currently used heuristics. We

also discuss under which circumstances it would be advantageous to have more

3

transport resources available versus more hospital capacity, providing guidelines for

implementation of our model to incident commanders of mass casualty incidents.

The final part of the dissertation investigates the evacuation of victims of an MCI

in a dynamic setting. This problem is extremely challenging because of the large state

space that needs to be explored due to the number and dimension of state variables that

need to be modeled. Even very small instances of the problem can result in several

million states that need to be considered. To address this issue, we develop a heuristic

methodology that combines the use of a stochastic dynamic program (SDP) and Monte

Carlo simulation to arrive at a final decision policy. The simulation helps predict the

probability of occurrence of status of several state variables so they do not have to be

explicitly modeled in the SDP, thus reducing the size of the state space dramatically.

The SDP optimal policy table is then used in the simulation to determine decisions. The

simulation then generates an updated probability transition matrix to be used in the SDP.

We find that, using the scaling methodology proposed, this iterative solving of the SDP

and simulation converges to a stable policy table fairly quickly. We call this methodology

"scenario iteration."

We compare the performance of scenario iteration over several scaling methods,

and over several test data sets. We also compare the results and applicability of the

dynamic model compared to the static model developed in the second part of the

dissertation. Managerial insights are discussed for each of the MCI models.

4

Chapter 2 - Dynamic Allocation of Cardiac Diagnostic

Testing Time Slots

2.1 Introduction

Many hospitals in the United States provide a wide range of inpatient and outpatient

healthcare services to the communities they serve. Outpatient services tend to be more

lucrative because they require less infrastructural investment and have lower ancillary

support costs. The profits obtained from outpatient services are crucial, because they

enable the hospital to offset possible losses on certain inpatient services and invest in

new medical technologies. Thus, the units that provide outpatient services are often

treated as profit centers, each of which strives to maximize revenue. Inpatient services,

with the exception of some surgical services, are typically not highly lucrative. For these

services, the hospital typically receives a lump sum payment determined by the patient's

classification to a particular Diagnostic Related Group (DRG) at the time of admission.

However inpatient care requires a great deal of infrastructure and support personnel to

be constantly in place. This results in very high fixed costs, so that hospital units that

provide inpatient services must operate at high levels of utilization in order to break

even. The problem of optimally balancing the levels of inpatient versus outpatient

services is complicated by the fact that the demand for inpatient services is variable. The

need to maintain high levels of utilization and meet uncertain demand makes the

management of inpatient services challenging.

While low demand that results in poor utilization is a concern, high demand and

the associated congestion can also present a significant problem. For example, if a

5

given inpatient unit in a hospital does not have enough beds to accommodate all

patients in need, then some of them will be "held" in the emergency department (ED)

until a bed becomes available. When the ED becomes too crowded, it can no longer

function effectively. In the extreme case the hospital may have to deny admission to

potential inpatients. Often this takes the form of diversion of ambulances to other

hospitals, since this is a common manner of inpatient arrival. The hospital then loses the

revenue that would have resulted from treating those patients that were diverted, and its

reputation as a reliable provider is also damaged.

Some hospital facilities are commonly termed "outpatient units", although that is

not entirely accurate, as these units often provide services to inpatients as well. One

typical example is the cardiac diagnostic testing center (CDTC) at Windham Community

Memorial Hospital (WCMH) in Willimantic, CT. Like the CDTC at most hospitals, its

primary role is to provide a range of cardiac screening tests to outpatients. These

screenings are used for preventative care as opposed to the diagnosis of urgent,

potentially life threatening conditions. Patients are typically referred to the CDTC by their

physicians, and schedule appointments for prescribed tests, well in advance.

The other role of the CDTC is to perform cardiac screenings on inpatients. For

them, the stress test is generally the last test prior to discharge because it is the most

aggressive in terms of how much strain is placed on the patient's heart. When a patient

is admitted with chest pain a number of tests are done that do not place any strain on

the heart (blood tests, EKGs, etc.). The purpose of those tests is to find evidence that

the heart muscle is not receiving enough blood flow, causing the patient to experience

chest pain. When these tests are negative, the final test is a stress test to push the heart

and see if any symptoms arise. If the heart behaves normally under exercise conditions,

then it is safe to send the patient home. Only if symptoms appear during this final

screening, which in practice is uncommon, must the patient remain hospitalized. At

6

WCMH in particular, even if a stress test is not normal, patients still leave the hospital.

Rather than going home they are typically transferred to a larger facility that is able to

perform complex cardiovascular procedures such as angioplasty, coronary artery

stenting, coronary artery by-pass grafting, etc.

Providing screenings to inpatients presents a challenge because, unlike those for

outpatients, the requests for screening often come with little or no notice. If at the time of

the request there is no immediate opening, a scheduling decision must be made. In

essence, this involves choosing which among various classes of patients, some of whom

are inpatients and others outpatients, to service at a given time period. A serious area of

concern observed at WCMH prior to our study was that some outpatients do indeed

balk, presumably choosing to be screened elsewhere, if they are rescheduled from their

prearranged time slot. Further, in many cases physicians are affiliated with multiple

hospitals in a given area, and as a result, hospitals face competition from other hospitals

when the physicians feel that one hospital is providing better service. Hospitals can also

face competition from the physicians that are affiliated with the hospital. For example,

one of the cardiologists affiliated with WCMH made the decision a few years prior to our

study to stop working with the CDTC and open her own screening clinic. It follows that,

considering only the profit to the CDTC, part of an intuitively dominant decision rule

would be to give priority to outpatients. Thus an ad hoc strategy used by many hospitals,

including WCMH, is to give priority to outpatients except in some time slots that are

reserved for inpatients.

On the other hand, the CDTC can play a key role in maintaining patient flow and

helping the hospital avoid the negative consequences of an overcrowded ED. By

providing an inpatient screening the CDTC effectively "opens a bed" because that

patient can now almost always be discharged, either to home or to another hospital,

after its completion. This provides the hospital with the ability to accommodate an

7

additional patient, should one arrive. Note that any patient whose screening is delayed

must also be assigned a subsequent time slot. Thus, because there are two potential

classes of costs, we choose to follow many authors (e.g., Green et al. (2006)) and

optimize a composite "social welfare" function of the profits to the hospital from providing

patient services, costs to the hospital of lost opportunity, and "inconvenience" costs to

the patients of delays in treatment. It should be stressed that the resulting algorithm is a

decision support system, so that its users can certainly input any costs that they deem

appropriate, including the possibility of not considering patient inconvenience costs by

setting those values to zero.

The scheduling problem is complicated by the fact that there are different types

of cardiac screenings. Some have multiple phases, some span a two-day period, and

the phases in a multiple-phase test must be done in lockstep fashion over time. That is,

once a screening begins, or resumes on its second day, subsequent steps in that day

must be done in the immediately following time periods. The problem can be modeled as

a finite-horizon, discrete-time Markov decision process (MDP) as described in Section

2.8. However, one quickly finds that the computational methods based on this model

tend to be impractically slow and memory intensive when compared to the pace of the

actual decision making environment, motivating the introduction of a computationally

tractable scheduling tool.

Therefore, we introduce a decision support system based on a Multi-commodity

Flow Snapshot Model (MFSM), grounded in the theory of dynamic network optimization.

This approach is both flexible and robust, making it applicable to the various

configurations of patient and test-protocol types that may be found in practically any

CDTC. Based on historical data from WCMH, our experiments indicate that the MFSM

approach results in drastically improved quality of service for inpatients, with little or no

reduction in the quality of service to outpatients relative to the existing policy at WCMH.

8

The bottom line of the hospital will also show a healthy improvement. Our methods

provide both an online decision support tool and a framework by which simulation results

may be used as guidance for more profitable long-term capacity planning.

The rest of this chapter is organized as follows. The related literature is reviewed

in Section 2.2 and in Section 2.3 the problem is formally defined. In Section 2.4 we

provide an overview of the MDP approach with details following in Section 2.5. The

dynamic-network model is described in Section 2.6. In Section 2.7 we use simulation

and real data from WCMH to evaluate the results of the network model relative to the

MDP and to the prior decision making policy utilized by WCMH. Conclusions, with

managerial implications and directions for future research, are addressed in Section 2.8.

2.2 Literature Review

Efficiency of healthcare operations has been widely studied in the management science

literature, and detailed histories of the application of operations research models to

problems in the healthcare industry can be found in (Flagle, 2002) and (Brandeau et al.,

2004). A number of techniques have been employed to help improve patient flow

including dynamic allocation of patients to the inpatient units (Thompson et al., 2009),

bedside registration (Parker, 2004), and applications of simulation and queuing theory to

identify and remove patient flow bottlenecks (Litvak et al., 2001).

The problem of allocating CDTC time slots shares some traits with a number of

stochastic control applications. Allocating service capacity over time among several

competing customer classes has been studied in various settings outside of healthcare,

including hotel management (Lieberman and Yechiali, 1978; Bitran and Gilbert, 1996),

car rentals (Carrol and Grimes, 1995; Geraghty and Johnson, 1997), and airline yield

management (Belobaba, 1989). These articles fall into the category of revenue

management. One distinguishing feature between revenue management and our

diagnostic allocation problem is that in many instances of revenue management,

9

customers can choose which "fare class" they belong to, whereas in our setting, the

screening class is determined by a physician.

The more closely related problem of appointment scheduling and real-time

capacity allocation in an MRI setting was addressed by (Green et al., 2006). The use of

approximate dynamic programming to solve the problem of dynamically allocating

diagnostic imaging resources to multiple patient priority classes, in order to achieve

targeted wait times was investigated in (Patrick et al, 2008). While Patrick et al (2008)

and Green et al. (2006) consider allocation of time slots among multiple patient classes,

the problem addressed here is different for two reasons. First, we expand the state

space definition to include the number of available inpatient beds for different services.

This increases the size and complexity of the problem, but thereby more accurately

captures the costs that CDTC decisions impose on the rest of the hospital. Second, we

consider time slot allocation decisions for both multiple-phase and single-phase tests.

The distinction is that a multiple-phase test involves performing a set of procedures at

separate times. To the best of our knowledge a setting involving a mixture of single- and

multiple-phase tests with a mixture of patient classes in the operations research

healthcare literature, has not been investigated prior to this study. We utilize a dynamic

network flow model within our proposed approach to solving the problem. A recent

example utilizing dynamic network models to schedule and dispatch concrete trucks was

described by (Durbin and Hoffman, 2008).

2.3 Problem Setting

Cardiologists can typically choose from among a number of different types of screenings

for each patient. For example, at WCMH there were three options. All consist of one or

more of the stages listed below. Each of the three stages takes about one hour to

perform. The stages within a given day must be performed in consecutive periods, while

two-day screenings must be completed in successive days. The stages are:

10

• Exercise-Monitor: The patient is connected to an electrocardiograph

machine and the heart is monitored while the patient exercises.

• Rest-lnject-lmage: The patient rests and is then injected with a radio-opaque

isotope, and a special camera called a gamma camera is then used to take

images of blood flow through the heart.

• Inject-lmage: The patient is injected with the isotope and the camera takes

images of blood flow through the heart.

Then the three screening types are as follows:

1. One Day Regular. Exercise-Monitor.

2. One-day nuclear. Rest-lnject-lmage; Exercise-Monitor; Inject-lmage.

3. Two-day nuclear: Day 1 = Exercise-Monitor; Inject-lmage. Day 2 =

Rest-lnject-lmage.

Exercise or "stress" tests are performed in a different room than imaging. After

the stress test stage of a one- or two-day nuclear screening, once clean-up is

completed, another patient can be brought into the stress room. During each stage of

the screening the patient is observed one-on-one by a licensed clinician. Thus the only

limiting physical resources are the number of stress rooms and the number of cameras.

Because each of the screening types described above can be performed on inpatients

and outpatients, there are six distinct patient classes as shown in Table 2.1. They are:

Inpatient Regular (IR); Outpatient Regular (OR); Inpatient One-Day Nuclear (UN);

Outpatient One-Day Nuclear (01N); Inpatient Two-Day Nuclear (I2N); Outpatient Two-

Day Nuclear (02N)). Table 1 indicates the sequence of resources that are required for

each screening on each day. 'C and 'S' indicate stages that require a camera or a stress

room respectively.

11

Table 2.1: The screening classes at WCMH.

Class Day 1 IR, OR UN, 01N I2N, 02N

S C-S-C S-C

NA NA C

Initial outpatient reservations are made in advance through direct consultation

with the physician's office, the patient, and the scheduler at the CDTC. Appointments will

be made only if, at the time of scheduling, there are projected to be sufficient resources

to handle the various steps of the screening. Therefore initial scheduling of outpatients is

considered to be exogenous to the decision process. If it is determined, before an

outpatient arrives at the hospital, that the reservation cannot be honored, the patient or

doctor's office will be notified and the appointment rescheduled. On the other hand,

since inpatients are already in the hospital there is no need for external communication.

They can simply be rescheduled to a later period in the same day or perhaps to the

following day. Of course these decisions come with costs (see Section 2.4.2). More

significantly it is imperative that the decision model itself is able to verify that the

rescheduled beginning time (or resuming time in the case of two-day nuclear

screenings) is resource feasible.

2.3.1 The State of the Hospital

Since the primary motivation to prioritize inpatient screenings is to avoid patient flow

bottlenecks, the evolving state of the hospital in terms of the number of available beds

on the specialized inpatient units must be considered. A hospital is capable of providing

a number of different patient care services, e.g. cardiology, obstetrics, critical care,

neurosurgery, etc. A small community hospital may provide 1 5 - 2 0 different services,

while an academic medical center could easily provide more than 100 services. The

inpatient units are specialized in a manner that enables them to provide care to patients

12

in need of one or more of these services. A neonatal intensive care unit is an example of

a unit specialized to support a single service. In contrast, a general medical/surgical floor

can support several services. Often more than one unit can support a given service. For

example, in a large hospital there are typically several units that can provide care to a

patient admitted with hyperglycemia, a potentially deadly complication of diabetes.

Some of these services, e.g. cardiology, often require the patient to be on a

telemetry monitor in order to continuously monitor heart rhythm and rate. In this problem

setting, inpatients in need of cardiac screening directly impact the availability of beds

that are capable of supporting services that require telemetry monitoring. However, since

the inpatient units can potentially support more than one service, the decision to accept

an inpatient request for a screening can impact the flow of patients in need of services

that do not require telemetry. For example, consider an inpatient unit that can provide

services to cardiology patients (who require telemetry monitoring) and general medical

patients (who do not require telemetry and can use any type of bed). Assume the unit is

full, but has one patient who can be discharged after a cardiac screening. Once that

patient is discharged, the hospital now has the capacity to admit another cardiology

patient or another medical patient.

The state of the hospital is therefore defined as the number of beds available to

provide care to patients in need of each service. The state evolves over time as a

function of random arrivals and the exogenous decisions of bed managers and

physicians regarding which floors to assign to new patients, random departures (patients

discharged from the hospital), and decisions made by the CDTC. The probability

distributions of the random arrival and departure of patients needing these different

services are easily estimated from historical data.

13

2.4 An Overview of the Markov Decision Process

The problem can be modeled as a finite-horizon, discrete-time, Markov decision process

(MDP). Here we give an overview of the basic decisions, rewards, and costs since they

are needed in order to motivate the snapshot model of the following section. Further

details of the MDP can be found in the next section. Time is partitioned into fixed-length

one-hour time intervals. At the beginning of a period, the state of the process is

observed and a decision is made. Immediate costs and rewards are incurred based on

the state and the decision. As indicated earlier we follow Green et al. (2006) in

maximizing social welfare. That is, the function to be maximized adds the hospital's

expected profit and subtracts the inconvenience costs to the patients who are

rescheduled.

2.4.1 Decisions

The basic decisions to be made for each time period within a given day are:

• The number of each type of screening (IR, OR, UN, 01N, I2N, 02N at WCMH)

to initiate;

• The number of each type of two-day screening (I2N, 02N at WCMH) to resume

on their second day;

• The rescheduled day and time of each type of screening that was scheduled to

begin but was not initiated;

• The rescheduled time in the current day to resume each type of two-day

screening that was scheduled to resume but was not resumed.

2.4.2 Rewards to the hospital

Deterministic rewards to the CDTC are obtained when an outpatient's screening is

realized. These can be monetized based on the fee structure for such screenings.

Positive expected rewards accrue to the hospital based on the DRGs and their

14

corresponding fee structures of arriving inpatients. On the other hand negative expected

rewards to the CDTC are based on the loss of revenue from outpatients who balk after

being rescheduled.

2.4.3 Costs to patients

Both inpatients and outpatients incur inconvenience costs if their screenings are

delayed. How to monetize these is a subtle and flexible decision for the user of the

system. In our simulations we have followed Green et al. (2006) and based these on the

expected hourly wages of a typical patient.

2.5 Details of the MDP

One-day tests collect rewards when started, since the model constraints ensure that

they must finish, while two-day tests collect rewards when resumed. The transition

probabilities for subsequent states are also functions of the initial state and the decision.

2.5.1 Notation

The following notation is used in the MDP model (for some we include their realizations

at WCMH in parentheses):

Parameters

m The number of time periods in a workday. (In the WCMH model, m = 10.)

r The set of resource types consumed by the CDTC, ({camera (c), stress

room (s)}).

p{ The number of each resource, (e.g., pc = 2; ps = 1 indicates that the

hospital has two cameras and one stress room).

Z The set of healthcare services the hospital provides.

Zc c z The set of services that require patients to be on cardiac monitoring.

15

Zu The set of services that can be accommodated by inpatient unit u.

Uz The set of inpatient units that can accommodate service z.

W The set of services impacted by the decision to start or resume an

inpatient screening. A service z is an element of V if one of the following

conditions hold:

(1) zezc

(2){uue[/z}nzc*0

Q The set of all screening classes, ({IR, ..., 02N}).

Qin The set of classes of inpatient screenings, ({IR, UN, I2N}).

qout The set of classes of outpatient screenings, ({OR, 01N, 02N}).

Ql £ Q The set of classes with one-day protocols, ({IR, UN, OR, 01N}).

Q2 £ Q The set of classes with two-day protocols, ({I2N, 02N}).

p[ The number of first day phases in a class i protocol, (e.g., pT 02N = 2).

pf The number of second day phases for class i e Q2.

a?:{ ~ (1. ifthe/t/lfirstday phase of class i uses resource 1 (0, otherwise

a?'? = (1, if the;'tflsecond day phase of class i 6 Q2 uses resource 1 (0, otherwise

T - {1,...,T}: the index set of time periods, where period t is the current

period.

yv(/c) = k + (m-k)modm: the index of the last time period in the day that

includes time k 6 T.

at The reward for testing a patient of class i.

n z The expected reward for providing service z to a newly arrived patient into

the hospital.

16

pin The cost of delaying the scheduled first day start or second day

resumption of an inpatient screening by one period within a day.

pout -rne cost of delaying the scheduled first day start or second day

resumption of an outpatient screening by one period within a day.

ybed The expected opportunity cost (due to the inability to admit a newly arrived

patient) of making an inpatient wait overnight.

Yin The deterministic cost to the hospital of keeping an inpatient an additional

night (because they leave right after screening) in the hospital.

yout t n e expected cost, based on the probability of balking, of making an

outpatient wait an additional day before beginning a screening.

Decision Variables

At any time t, we make the decision to commit to a test by initiating or resuming it in the

current time period. Once a test is initiated or resumed, the necessary resources are

reserved for all time periods needed for the phases in the remainder of the current day.

When a two-day test is initiated, the second day resources are not committed until the

following day. This allows for the possibility that the final stage of the two-day nuclear

test could be performed at any time on the following day.

yf The number of class i 6 Q screenings to initiate at time t.

yf The number of class i G Q2 screenings to resume in their second days at

time t.

xf The number of class i e Q screenings currently scheduled to be initiated IK

at time t, "intraday" rescheduled for time k e [t + 1, ...,N(t) - p{ + 1} in

the current day.

17

x?k The number of class i e Q2 screenings currently scheduled to be

resumed at time t, intraday rescheduled for time fc e {t + 1 N(t)-

v{ +1}.

xf0 The number of class i e Q screenings currently scheduled to be initiated

at time t, that are instead "extraday rescheduled", i.e., not rescheduled

for any time k e [t + 1, ...,N(t) -p[ + l } . Note that patients returning for

the second day of a two-day nuclear screening cannot be extraday

rescheduled.

State Variables

wf the number of class i G Q screenings scheduled to start at time k.

w?k The number of class i e Q2 screenings scheduled to resume at time k.

ol The number of uncommitted resources -6 available at time k today, where

resources become committed from the decisions yf and yf.

bz the total surplus of available beds on units in Uz.

L' = f « » 4. „ °< ",l* + ""'-"•• • » n^ber of newly arrived patients in (-(A6Z + bz), otherwise J r

need of service z, that are diverted to a different hospital due to lack of bed

availability (the bbz are random variables defined below).

L°ut = A,(0-Pf

k=t

the total number of rescheduled class i e Qout patients that choose not to

return (the Ad[k and Adf0 are also defined below).

18

Random Variables

AdL The number of class i e Q°ut screenings rescheduled to start at time k e IK

{t + 1,..../V(t) - p{ + 1} that choose not to return.

Ad/' The number of class extraday rescheduled i e Qout screenings that choose

not to return.

AA£ The number of arrivals of patients in need of service z to inpatient unitu e Uz.

AS? The number of departures of patients in need of service z from inpatient unit

u E Uz.

Abz The change in the surplus of available beds on units in Uz (the change

happens during time t, i.e. entering time t +1). Abz is a function of AA" and

A5" and is used for notational convenience. The number of arrivals and the

number of departures are independent between services and each other.

Hence,

!P(Abz)= £ ^P(AA|-A5zy) uei/z zezu

&wf The number of new test requests to start class i screenings at time k.

2.5.2 Process State

The process state (5) depends on the number of patients waiting to have tests started,

the number of patients waiting to have tests finished, uncommitted numbers of resources

at each time period, and the number of available inpatient and ED beds. When a

decision is made to begin a test, the resources needed to complete the test are

committed to that test in the appropriate time periods.

19

2.5.3 Constraints

Feasible decisions at time t satisfy the following:

w(t)-Pf+i (2.1)

fe=t+i

w(t)-Pf+i (2.2)

*?+ Z fc=t+l

s = W?t V i G Q

Btfyf + otfy;) ieQ

< oi+j_x V ; E { I yv(t)-t}and^er (2.3)

Constraints (2.1) ensure that all screenings scheduled to begin at the current

time f are either initiated or rescheduled. Constraints (2.2) perform the same function for

the second day, except that extraday rescheduling is not an option because, once

started, the two-day nuclear stress tests must be completed by the end of the second

day. Constraints (2.3) guarantee that resource capacities are not exceeded at any period

in the current day.

2.5.4 Stage Reward

At state S, once a feasible decision D ••= (yf,x[0,x[k,y?,x?k) has been made there are

immediate benefits and costs that yield a net reward of R(S,D).

When t mod m * 0:

R(S,D)= h-h-h

where

iEQin ieQ'

N(t)-p{+l

h = pout £ £ x / + £ a.Lout

itQOUt k=t+l iEQout

20

N(t)-pf + l

iEQin k=t+l ZEZ

When t mod m = 0:

K(S,D)= h-h-h

where

f3=Y in Y x{0+Y^Lz

iBQin ZEZ

The reward for providing cardiac screening tests are captured in fx. The cost of

rescheduling outpatients to a different time period within the current day plus the cost of

losing the outpatients that choose to leave and have the service provided elsewhere are

expressed in f2 and /4. From a practical standpoint, the possibility of an outpatient

deciding to have the service provided elsewhere is only a risk when the decision is made

on whether or not to reschedule the initiation of the screening. For outpatients returning

for the second day of a two-day nuclear screening, even if the appointment is

rescheduled, it is not realistic that the patient would leave and start the entire screening

process again with a different provider. The cost of requiring inpatients to wait until a

later period in the day plus the expected cost of potential loss of inpatients due to

overcrowding, are found in /3 and f5. The determination of stage reward when

tmodm = 0 differs from those when tmodm * 0, since the waiting costs associated

with an overnight delay are higher than for those for a single time period within the

current day.

21

2.5.5 State Transition

Let S be the current state, D the chosen decision array, and S the state after arrivals and

departures occur. S is updated as follows:

Whent <N(t):

®fk ~ Wik+Awik + Xik-Mik

vv,> = wA. + xt Hk Hk T Aifc

**> " oh,-Yjtffy[ + alfyf) ieQ

rbz + Abz + Lz,

bz = \b7 + Ab7 + I *'+ Z yi ieQlnnQ1 iEQinnQ2

t = t + 1

Lz = 0

lout = o

When t = N(t),

®tk

K

t

=

=

=

wlk+AwL

bz + Abz + Lz

t + 1

Vi, vk = t + l,...,N(t)

vi, v/c = t + i, ...,yv(t)

V; e { 0 . . . . , t f ( t ) - t } ,

fe r

v z e zyy v z e ¥

V i e Q out

V/c = t + 1, ...,N(t)

Vz £ Z

The state transitions from 5 to 5 with probability

Ps,s(D) = Y\p(Aw[k) Y\ HO fl y « ) FK f[^ ieo J ieoout ieoout \-zez J \-zez I ieQ J [ i e Q 0 U t J Lie<20ut

We make the reasonable assumption that demands for screening classes and time slots

within classes are mutually independent, since demand arises from multiple independent

22

sources. Similarly, we assume that the probability that an outpatient who was

rescheduled decides not to return, is independent of both the screening class and the

time slot. Additionally, the net changes in the number of available beds of the various

services are assumed mutually independent.

2.5.6 Objective Function

Let Vt(S) be the maximum expected reward of a T stage problem that begins in state S,

and D be the set of feasible decisions that can be made. S is the state that results from

beginning in state S and taking action D e D and then random variables are realized.

The objective function is

Vt(S) = max\R(S,D) + £ Ps,§(D)Vt+1(S) , t < T

The "curse of dimensionality" becomes apparent when examining the dimension

of the state space. The state variable bz can range from having no available beds to

every bed capable of accommodating service z being open. Let bz be the total number

beds in Uz plus one (i.e., the cardinality of the range for bz). Within a single time period,

then, we will have a dimension on the order of X\z€Zbz. For example, a hospital with 20

services and a maximum capacity of 20 for each service results in a dimension of 212 0

for the state variable bz. Even before considering the other state variables, it is apparent

that this model becomes intractable quickly and motivates the desire for another

technique.

2.6 A DSS Based on a Multi-commodity Flow Snapshot Model (MFSM)

For any realistic problem instance, the MDP described above is too large to solve in

reasonable time (see Section 2.5). In particular, WCMH wanted to make allocation

23

decisions in an online context in less than 10 minutes. Therefore, we introduce a model

based on a current "snapshot" of the state of the hospital, prior to each decision point

with the stochastic elements represented by the expected profits (positive or negative)

associated with the various decisions. The problem is modeled via a dynamic maximum

profit multi-commodity flow formulation. This approach is both flexible and robust,

making it applicable to the various configurations of patient and test-protocol types that

may be found in practically any CDTC or similar center. The snapshot includes the

number of available beds on the inpatient units and in the ED. This information is

combined with the current CDTC schedule, any outstanding or new requests for cardiac

screenings, and patients currently in the screening process.

An integer multi-commodity time-space network flow model (MFSM) is then

constructed, where each test request is a commodity. Each node in the network

represents a point in time and a physical location. An arc between two nodes represents

a possible step in the process flow for that particular test request. Profits are associated

only with arcs that represent waiting (i.e., the decision to reschedule a patient) and arcs

that represent the initiation or resumption of an outpatient screening. The positive profits

associated with initiating outpatient tests, and the negative profits associated with

making patients wait, are deterministic. In contrast, the negative profits associated with

the opportunity cost of not testing inpatients, are based on expectations of new inpatient

arrivals in the future. Similarly, for outpatients, they are based on the expected loss from

balking.

The stochastic aspect of MFSM is captured by dynamically calculating the profits

on the arcs. Since the underlying structure of the network does not change dramatically

between time periods, updating the network is more efficient than rebuilding it at each

decision epoch. Therefore at each decision point the network is updated using the new

24

information (i.e., nodes and arcs are pruned or added and arc profits are adjusted

accordingly).

2.6.1 Notation

The following notation is used in the formulation of MFSM:

s The source node

t The sink (terminal) node

i,j Node indices

k Test index

K The set of test requests

V The node set

A The arc set

Vc The set of camera nodes

Vs The set of stress nodes

Vw The set of "waiting" nodes

y+co = 0':ay)e^) V~(0 = {j:(j,QeA}

Af+ The set of arcs leaving camera node i, i.e., {(£,;') \i e Vc, j £ K+(£)}

Af+ The set of arcs leaving stress node i, i.e., {(£,;') \i e Vs, j e V+(i)}

A?+ The set of arcs leaving wait node i, i.e., {(£,;') \i e Vw, j e V+(i)}

C The number of available imaging cameras

S The number of available stress rooms

zijk The flow over arc (ij) for test k

vUk The expected profit obtained by sending a single unit of flow over the arc

(i,j) for test k

25

Deterministic positive profit comes from the arcs associated with screening an

outpatient. That is, arcs (ij) where i e Vw and ;' e (Vc u Vs). On the other hand,

negative profit is assigned to the (rescheduling) arcs (i,y) where i e {s} u Vw, and

j £ yw in a future period. The specific derivation of these costs relies on the notation of

Section 2.5 and is shown immediately below in Section 2.6.2.

2.6.2 Derivation of Inpatient Delay Costs under MFSM

The negative profit consists of the sum of the deterministic inconvenience cost to the

patient and the expected lost opportunity associated with being unable to admit a new

inpatient. The opportunity cost relates only to the decision to delay an inpatient

screening request. It is a function of the profit associated with admitting a new inpatient

and the probability that, in the future time period, the number of available beds will be

less than the number of new inpatients. Inpatient arrival rates and the associated profits

are easily estimated from historical data. We used an aggregated Poisson distribution

with parameter Az to describe the arrival distribution for each service. To compute Az

using historical admission data, we aggregated the arrival rate for each service over all

units capable of treating that type of patient. Hence, Az = Zueuz<*z- Kolmogorov-Smirnov

tests at the a = 0.05 level showed that there was a strong goodness-of-fit between the

aggregated Poisson distributions and the historical patient arrivals.

The expected cost of delaying an inpatient is then calculated as£z6Z£2z:P(Az >

bz), where Az is the aggregated arrival rate for patients of service z as described above.

The P(-) term represents the probability that the number of patients that need a bed is

greater than the number of currently available beds. The values for ilz were estimated

using WCMH's billing records.

26

2.6.3 Formulation

Figure 2.1 illustrates the multi-commodity network flow formulation. It shows a simple

example consisting of a single day, three time slot scenario, and two patient classes (UN

and OR). A multi-commodity time-space network is constructed for two test requests

from the OR patient class (Test 1 and Test 3) and a single test request for a patient of

class UN (Test 2). Each node in the network represents a point in space and time, e.g.

the nodes labeled 2, 4, and 6 represent the stress test phase of the screening at a

particular time. The arc (1, 2) represents the possible decision to begin Test 1 during the

first time slot. The arc (1, 3) represents the alternate decision of rescheduling initiation of

Test 1 to the next time slot, etc.

Figure 2.1: An example network representation of a one-day, three-time-slot problem instance. For this miniature and illustrative example, all feasible arcs are shown, indicating the possible sequences of arriving, waiting, and being tested that three patients may experience.

Time Time Time Slot 1 Slot 2 Slot 3

OR Patient Class Waiting Room

UN Patient Class Waiting Room

Stress Room

Camera

• Test 2

* Test 3

27

• Testl

The formulation of the multi-commodity network flow problem is

vijkzijk max V Y„..,7,., (2.4) (i,j)EA k€K

subject to

2 « * " I *(*-<> VI6K\{,.t},*6ir (2.6) y6K+(i) 76V-(i)

V _ _ V 7 - - 1 for i = t, fc E ff (2.7)

I I

fee* ((,y)exf+

zi;'k — ^

V t £ Vs (2.8)

V i e r (2.9)

2Ufc £{0,1} V (i,j)eA,k EK (2.10)

The constraint sets (2.5) - (2.7) ensure the conservation of flow for each node.

The bundle constraints (2.8) - (2.9) restrict the flow out of the stress and camera nodes

to their respective capacities.

2.6.4 Size of the Model

The size of the problem is a function of the current CDTC schedule and the number of

new test requests. For example, for a 10-day time horizon, each with 10 time slots, of

which 80% of the slots are booked, a WCMH instance involving three new UN requests

on the first day results in 3,515 constraints and 6,214 variables. If the initial schedule is

90% filled, the number of constraints increases to 3,948, and the number of variables

increases to 6,928.

28

The model (2.4) - (2.10) is an integer program, and we have verified that its

underlying constraint matrix is not totally unimodular. However, as we discuss in Section

2.7.2, our experiments have shown empirically that this problem is quite easy to solve to

optimality.

2.7 Experimental Analysis

MFSM was evaluated in two ways. First, for small, computationally tractable problems,

we conducted an ex-post analysis to compare it to the MDP. In the second test we

compared it against WCMH's current rule of always prioritizing outpatients and reserving

occasional slots on the schedule to accommodate inpatients should the need arise.

2.7.1 The MFSM versus the MDP

A comparison of MFSM against the MDP required an ex-post analysis because, after the

first instance where the two methods make different decisions, the state space and

transition probabilities change, and we cannot expect them to yield directly comparable

solutions. This makes it impossible to compare performance based on, say the

percentage of time the same decision is made. Rather, after the conclusion of each

complete scenario, we calculate the value of the decisions made by each method using

the same cost structure.

Prior to determining the MDP optimal policy, several simplifying steps were taken

to reduce the size of the state space. This focus on "small" problems was done only for

the sake of comparing the MDP to MFSM. Without such simplifications the MDP quickly

became intractable, providing no basis for comparison; in any practical setting, MFSM

would not be limited to such a simplified, aggregate state space. To give an idea of the

difficulty of implementing the MDP, finding the optimal policy for this small instance took

over one hour. Additionally, the memory utilized during the policy generation was over

29

one gigabyte. The simplified problem setting included only the three most common

patient classes found at WCMH (UN, OR, and 02N), a single camera, a single stress

test room, and a time horizon of two days (i.e., 20 stages). We defined three classes of

unit specialization, cardiac, general medical/surgical, and emergency services. For ease

of exposition we will use bc = cardiac, bg = general, be = emergency services.

Additionally, the state of the hospital was consolidated and limited to four possibilities:

1. State V. bc= 3,bg = be= 2,

2. State 2: bc = bg = be = 1,

3. State 3: bc = bg = be =0,

4. State 4: bc = bg = be = -1.

We further restricted the number of patients of each class requesting a test to at

most two during each time period, and solved the MDP using backward recursion. The

optimal policy was then used in the simulation of the MDP methodology. Using arrival

patterns similar to the ones at WCMH, we generated four sets of new request arrivals for

the time horizon. We examined arrival rates of an average of two, four, six, and eight

new requests over the two-day time horizon. This equates to an average of one, two,

three, and four new requests per day. Two thousand unique requests were generated for

each of these four levels of arrival. The MDP simulation and the MFSM simulation both

used the same arrival sets. Three different initial CDTC schedules were simulated:

empty, 50% full (i.e., 10 screenings already scheduled during the 20 period horizon), and

80% full.

For the analysis between the two approaches, we calculate the percentage

"performance gap" for a single scenario as (om-on)/om, where om is the ex-post objective

value of the MDP and on is the ex-post objective value of the MFSM. In Table 2.2 the

mean percentage performance gap, along with its corresponding 95% confidence

30

Table 2.2: A comparison between the MDP and the MFSM. All numbers are rounded to two significant digits to the right of the decimal.

• > • • -.r- . v - 1 - •'..•

Empty Empty Empty Empty Half Full Half Full Half Full Half Full 80% Full 80% Full 80% Full 80% Full

^ : , ; f e j : r'^f>;iko 1 per day 2 per day 3 per day 4 per day 1 per day 2 per day 3 per day 4 per day 1 per day 2 per day 3 per day 4 per day

• ri&\\>- " : :

0.00% 0.07% 0.10% 0.62% 0.19% 0.76% 1.44% 0.76% 3.21% 7.89% 13.88% 17.99%

(0.00%, 0.00%) (-0.04%, 0.19%) (-0.07%, 0.26%) (0.23%, 1.01%) (0.11%, 0.27%) (0.50%, 1.01%) (0.53%, 2.34%) (-1.97%, 3.49%) (2.90%, 3.52%) (7.39%, 8.38%)

(11.41%, 16.35%) (12.93%, 23.04%)

interval, are reported for each of the twelve design points. Overall, as expected, there is

a small bias in favor of the MDP but MFSM performs well. Also, note that the confidence

intervals for four of the twelve design points indicate that there is no significant difference

between the two approaches (i.e., the ones that contain zero in the 95% confidence

interval). Otherwise, the values for the mean percentage performance gaps indicate that

the ex-post objective value of MFSM closely approximates that of the MDP. The higher

performance gap rates from the last two design points are a result of having more

patients requesting screenings than this experimental CDTC can accommodate within

the time horizon. We expect the MDP to outperform MFSM in these cases, because the

generated MDP policy explored all of the possibilities, while MFSM takes a snapshot

prior to each decision and uses the data to generate expected profits for the various

decisions.

On average, when computationally tractable, we expect the MDP to outperform

the MFSM heuristic. The difference between the two techniques is in the complexity of

31

how future events are anticipated. In any MDP, current decisions are made under the

assumption that future decisions will also be made optimally, but this comes with the

huge computational burden of planning decisions for all possible contingencies or

possible future states. The MFSM heuristic, on the other hand, ignores the impact of

future decisions, and instead simply projects average costs for current decisions, based

on probabilities of future events and their costs. Simply put, the MFSM ignores the fact

that we will respond to mitigate the effects of unexpected or low-probability events,

whether good or bad, while the MDP does not.

Thus for this small-scale test, we do find instances for which the MDP does a

better job of anticipating future events, consequently outperforming the MFSM heuristic.

As seen in Table 2.2, the MFSM heuristic performs very well compared to the MDP until

we get into a full schedule situation and there is an unexpectedly high volume of

inpatient add-on arrivals. It should be noted though, that for this unnaturally small

experiment (which was made small-scale enough to implement the MDP) the impact of

one suboptimal patient routing decision has a large impact on the system as a

percentage. But as we will see in the next section, where we compare MFSM to the

hospital's prior policy, we were able to verify through a larger-scale simulation the that

the MFSM network approach was robust, making vastly improved decisions in situations

where the schedule was full, arrivals were heavy, and the problem instance was of a

real-world size.

2.7.2 The MFSM versus the Current Approach

To compare MFSM with the hospital's current approach we simulated the CDTC's

operations for two-week planning periods. Under the current approach some time slots

are reserved, but when a conflict between an outpatient and an inpatient occurs, the

policy is to always prioritize the outpatient. We therefore refer to WCMH's current policy

32

as "POP" for prioritize outpatients. The 10 working days in the two-week planning period

are divided into 10 one-hour time slots. In order to gain additional insights, we vary the

proportion of CDTC capacity allocated to testing outpatients from 60% to 100% as

shown in Table 2.3 for a total of nine design points. (Prior to our study, WCMH typically

reserved 90% of the schedule for outpatients).

The simulation contained a common set of 200 independent "sequences" of

inpatient test requests. Each sequence of inpatient test requests was created by

randomly generating between zero and three inpatient requests ("add-ons") using a

discrete uniform distribution for each of the 10 days in the planning horizon. This

common set of inpatient test requests was used for each of the nine design points. For

the simulations we did not consider the possibility of rescheduling outpatients for another

day. So, while the outpatients can be delayed, if they cannot be tested on the day they

were originally scheduled we assume that the patient decides to be tested at another

facility and the hospital loses the revenue. This assumption gives a conservative

estimate of the impact of MFSM on the biweekly revenue change because, in reality, a

significant percentage (likely a majority) of these patients would be expected to still

choose to be tested at WCMH. Finally, we do not consider cancellations from the

outpatient schedule since the percentage of "no-shows" at the WCMH CDTC is less than

1%. We analyzed three performance metrics: difference in revenue, the percentage of

the outpatients and inpatients requests served, and the number of additional nights

rescheduled inpatients must remain in the hospital.

We used lp_solve v5.5 to solve all problem instances of MFSM. Problems were

first solved by relaxing the integrality constraints on the decision variables. We verified

that underlying constraint matrix is in fact not totally unimodular, but nonetheless the

structure of the objective function in this setting tends to result in optimal solutions that

are integer valued. In the rare occasions that fractional solutions were obtained (6.5% of

33

the instances during this experiment) lp_solve's branch-and-bound routine quickly

provided an integer solution with the same objective function value after only a single

branching. Thus, in our experience, each MFSM problem instance required a solution of

at most two linear programs, with any linear program instance solving in less than one

second using lp_solve. The average solution time was 0.6 seconds.

2.7.3 Results

Tables 2.3 and 2.4 show the average biweekly increase in revenue due to using MFSM

instead of POP and the 95% confidence interval for revenue increase. Since WCMH

typically allocates 90% of the CDTC capacity to outpatient testing and since the inpatient

census is typically elevated for the six month October-March period, we extrapolate an

expected revenue gain during this period of approximately $150,000 ($11,437.13 * 13).

The other half of the year MFSM could help them realize an additional $40,000 in

revenue gains (reduce the outpatient schedule to 80% corresponding to $3,142.73 * 13),

giving WCMH a yearly expected revenue gain of $190,000.

Table 2.3: Biweekly revenue increase when adopting MFSM. For all design points, the revenue gains were statistically significant using paired f-tests.

60% $1,296.41 80% $3,142.72 85% $5,509.32 87% $8,167.28

90% $11,437.13

92% $14,824.84 95% $18,943.79

8 97% $23,142.20

100% $28,108.32

34

Table 2.4: 95% confidence intervals for biweeklv revenue aain.

ISstelF^:! 1

2

3 4

5 6

'•:i,y.7. ;Vy . \

v-S->t':'^ V . 9 ,•"•""

@@; t̂etflfe'. 60%

80%

85% 87%

90%

92%

95%

97%

100%

J^teMBBi, $1,152.38

$2,623.67

$4,500.76

$6,802.22

$9,642.63 $12,695.79

$16,354.68 $20,115.97

$24,548.83

Igfi^KH $1,440.43

$3,661.77

$6,517.88

$9,532.34

$13,231.63

$16,953.90

$21,532.89 $26,168.42

$31,667.82

Table 2.5 compares MFSM to the POP approach in terms of the service provided

to inpatients and outpatients. Service level is calculated as the average percentage of

inpatient and outpatient requests that are honored during the two-week scheduling

period. The graphical illustration of service levels shown in Figures 2.2(a) and 2.2(b)

provides some interesting observations. By using MFSM, we are able to significantly

improve the inpatient service level with only a small decrease in the outpatient service

Table 2.5: A comparison of service quality.

1

2

3

• • • 4 - : - ' • • ' : . •

5

6

•7,-v-!;..:.;...

& :#B. 9

60%

80%

85%

87%

90%

92%

95%

97%

100%

100%

100%

100%

100%

100%

100%

100%

100%

100%

i mtm v?

100%

97.03%

86.86%

79.58% 66.58%

53.90%

33.89%

19.93%

0.00%

•um&mssxism

0 0.31

0.85

1.34

2.03

2.67

3.62

4.45

5.57

biases : 100%

98.72%

96.81%

95.75% 94.20%

92.83%

90.74%

89.34%

87.45%

100%

99.29%

97.36%

96.54%

95.02%

93.26%

90.62%

88.55%

85.38%

0 0.14

0.25

0.30

0.32

0.35

0.37

0.40

0.42

35

Figure 2.2: A comparison of service levels.

Outpatient Service Level

100% <m% so%

1 70% S 60% 8 50% '£ 40%

•S 30% 20% 10% 0%

• POP

MFSM

60% 80% 8S% 87%. 00% 92% 95% 97% 100%

Percentage of Capacity for Scheduled Outpatients

In patient Service Level

100% 30% 80%

E 70% 5 60% g 50%

i 40% X 30%

70% 10% 0%

• POP

MfSM

l l . (a) Outpatient Service Level

f>0% 80% 85% 87% 90% 92% 95% 97%, 100%

Percentage of Capacity for Scheduled Outpatients

(b) Inpatient Service Level

level. For example, when the CDTC allocates 90% of its testing capacity to outpatients

the POP approach provides an inpatient service level of 66.58% and an outpatient

service level of 100%. In contrast, MFSM provides an inpatient service level of 95.02%

and an outpatient service level of 94.20%.

We also investigate the additional number of nights inpatients have to spend in

the hospital. These results are in Table 2.5. Finally, Figure 2.3 compares the average

number of inpatient bed-days saved using MFSM and in place of the POP policy.

The difference in the reduction of inpatient bed-days was statistically significant

using paired Mests, in all design points except the first design point where only 60% of

the CDTC capacity was dedicated to outpatient tests. The result of reserving so much

capacity for inpatient tests is that even the POP approach is able to serve all requests

resulting in no avoidable additional inpatient nights. For all other design points MFSM

outperforms POP.

36

Figure 2.3: Comparison of the additional inpatient nights.

Average Additional Inpatient Nights

N ig

h

re e

A d d iti

o i

4

3

2 ••

1

n

• POP

MFSM

60% 80% 85% 87% 90% 92% 95% 97% 100%

Percentage of Capacity for Scheduled Outpatients

v.... The graphs in Figure 2.2 can also help the CDTC make strategic decisions

concerning their desired service levels under MFSM. For instance, if the hospital

administration decides to maintain a 90% outpatient service level then Figure 2.2(a)

implies that the hospital should fill 95% of CDTC capacity with outpatient tests. This

capacity allocation would also correspond to a service level of approximately 90% for

inpatients.

2.8 Conclusions

Real-time allocation of cardiac diagnostic screening time slots represents a complex

online decision-making problem of whether to screen inpatients or outpatients when

conflicts arise. The decision can impact the availability of beds on the inpatient units and

the financial performance of the hospital. In this paper we model the decision problem

facing the CDTC as a discrete time, finite horizon MDP. We also conclude that solving

the MDP is computationally intractable given the time constraints of the decision makers,

motivating the development of a decision support system based on a dynamic multi-

37

commodity network flow model. The proposed solution method is flexible and robust,

making it applicable to a CDTC at almost any hospital. The technique was tested with a

local partner hospital and showed that it is possible to achieve significant improvements

in quality of service to inpatients and overall hospital revenue, with only minor decreases

in outpatient service levels.

By considering the impact of inpatient census and ED congestion on patient flow,

we provide hospitals with the ability to dynamically prioritize inpatient and outpatient

requests for cardiac diagnostic testing services. The result is the ability to provide high

service levels to both populations, help ensure the availability of inpatient beds to the

individuals that need them, and a net improvement in financial performance. Future

research should be conducted to determine the optimal location of reserved time slots in

the scheduling horizon, in order to improve inpatient service levels while increasing

utilization. In addition, the capacity of the CDTC itself could be dynamically altered

through the use of overtime and modifications to the contracts with the cardiologists that

provide medical coverage. Dynamic capacity management can enable the CDTC to

achieve higher levels of utilization and service, and enable the hospital to achieve

improved patient flow, quality of care, and financial performance.

38

CHAPTER 3 - Mass-Casualty Triage: Distribution of

Victims to Multiple Hospitals

3.1. Introduction

Effective disaster management has gained the attention of the public due to numerous

recent natural catastrophes. However, in order to succeed at achieving effective disaster

management, the definition of the term "disaster" must first be agreed upon. Surprisingly,

experts in the disaster management field disagree on an exact definition of the term.

However, several common elements appear in most definitions. This dissertation utilizes

the definition of a mass casualty incident (MCI) as defined by the World Health

Organization and we use the terms disaster and MCI interchangeably throughout this

work. Their definition of a mass casualty incident (MCI) is "an incident which generates

more patients at one time than locally available resources can manage using routine

procedures" (WHO, 2007). Examples of MCI include disasters due to natural hazards

(e.g., floods, earthquakes, etc.) or manmade disasters such as a bioterrorism attack.

Effective response to a disaster, whether man-made or natural, should follow a

cyclical, four-step process. These steps include:

1. Preparedness

2. Response

3. Recovery

4. Mitigation

Preparedness refers to the actions taken before the disaster to allow effective response.

Response refers to the emergency actions taken just before, during, and just after the

onset of a disaster to reduce casualties, damage, and disruption. The actions taken after

39

a disaster to repair, rebuild, and restore community life to normalcy is the recovery

stage. Mitigation refers to the actions taken both after one disaster and before another to

reduce the physical impacts of hazards on a community.

We focus on the critical time period immediately following the onset of a MCI.

The typical casualty flow in disasters is shown in Figure 3.1. During the third phase in

Figure 3.1, triage, the medical evaluation and treatment must be rapid and accurate in

identifying the critically injured victims who require immediate life-saving care. The focus

of care must be on the population as a whole rather than the individual in order to

provide the greatest good for the greatest number (Frykberg, 2005). Triage plays a vital

role during this process. It is the process of assigning treatment and evacuation priorities

to victims of an MCI and is used to guide the allocation of the limited healthcare

resources; in effect, it determines who will be transported to the hospital. (For more

details on the principles and practice of triage see Frykberg (2005)).

Once the casualties have been triaged, the orderly distribution of victims from the

scene among different hospitals must take place. However, managing the patient load

during a disaster presents a major challenge to hospitals and disaster response

managers. Patient allocation decisions are typically made by the so-called "Incident

Command" based on resource availability, situation details (mechanism of injury, etc.),

and triage data. Figure 3.2 illustrates an example of the distribution of triaged victims to

hospitals during a disaster.

Initially the goal is to move all victims out of the immediate vicinity of the disaster

scene to a casualty collection area. The location of the casualty collection area should

be situated away from the disaster scene due to the dangers it can pose to both health

Figure 3.1: The typical flow of casualties during a disaster.

Rescue Decontamination Triage Evacuation

40

care providers and the survivors (Frykberg, 2002). Patients sent to the casualty

collection area are triaged, treated for immediately life threatening injuries, and

transported to area hospitals.

In practice, managing patient allocation across a group of hospitals is

challenging. Often the nearest hospitals, typically the major trauma centers, are quickly

overloaded. There are at least three reasons why the nearest hospitals become

overwhelmed. First, critically injured patients require medical intervention as soon as

possible. In many cases this necessitates that these patients go to the nearest hospital.

Second, for most disasters, the majority of the casualties are not critical. These so-called

"walking wounded" find their way on foot or by car to the nearest hospitals. Third,

decisions regarding the allocation of patients are complex and response managers do

not have the ability to dedicate large amounts of resources to that task.

Examples of hospital overload can be seen in the wake of the September 11,

2001 terrorist attacks. In that case the New York Downtown Hospital was so inundated

with casualties that it was forced to turn its cafeteria into a makeshift second emergency

room to treat more than 1200 patients (Landro, 2006). Likewise, in the aftermath of the

1994 aircraft crash at Pope Air Force Base in North Carolina, the closest hospital was

Figure 3.2: An example of victim distribution to a group of area hospitals during a disaster response.

41

inundated with a large number of casualties. Most of these patients were subsequently

sent to other area hospitals.

In order to avoid this geographic effect, a systematic method of distribution of

casualties from the casualty collection area among available hospitals should occur

according to injury severity and urgency, with the added goal of avoiding overloading

any single facility. One suggested approach within the urban setting is "leap-frogging" of

hospitals by sequential loads of transported casualties (Jacobs et al., 1983). The

problem with this type of "round-robin" allocation scheme is that it is often implemented

in a static fashion (e.g., every third casualty is routed to a specific hospital), whereas all

disasters are dynamic in nature. As new information concerning the disaster and the

supporting hospitals becomes available, the allocation mechanism should adjust to this

new information.

Most papers talk only about "doing the most good for the most number of people"

without actually precisely defining the objective. The two prevailing methods are START

(simple triage and rapid treatment) and SALT (sort, assess, life-saving interventions,

treatment and/or transport). Recent work has looked at the effectiveness of START

(Kahn et al., 2009) and compared the two methodologies (Cone et al., 2009; Lemer et

al., 2008). However, neither of these methods precisely defines what is meant by "the

most good for the most people". The one exception is the Sacco Triage Method (STM).

As described in the STM work (Sacco et al., 2005), the limitations of the current triage

methods, such as START, motivated their work.

The rest of this chapter is organized as follows. We provide a brief literature

review, highlighting papers relevant to decision making during the response stage of

disaster management in the next section. Section 3.3 describes the resource

constrained triage problem, highlights a current model, and describes our proposed

enhanced model to solving the problem. Section 3.4 provides the explicit formulation of

42

our enhanced model. We compare our model to several heuristics currently used in the

field in Section 3.5. In the final section we conclude by discussing the managerial and

policy implications of using our enhanced model.

3.2. Literature Review

There has been a great deal of research devoted to the general topic of disaster

response. Not surprisingly, the bulk of this research has been conducted by researchers

in the fields of emergency medicine, emergency response, and public policy. While a

great deal of useful clinical and applied knowledge has been obtained from these

studies, this literature review is focused more narrowly on disaster response literature

dealing with decision support systems and stemming from the operations management

literature.

Due to the inherent uncertainty surrounding the resources required to respond to

a specific mass casualty event, a great deal of research has been focused on issues

specific to certain types of disasters such as earthquakes (Barbarosoglu and Arda, 2004,

Fiedrich et al., 2000, Ozdamar et al., 2004), floods (Shim et al., 2002), industrial

accidents (Jianshe et al., 1994, Mould, 2001, Kourniotis et al., 2001), nuclear events

(Papamichail and French, 1999, Hobeika et al., 1994, French, 1996}, and terrorism

(Reshetin and Regens, 2003, Wright et al., 2006).

There has also been a great deal of research focused on supporting decision­

making and interactive methods to aid response and relief efforts. Barbarosoglu et al.

(2002) developed an interactive approach for the hierarchical analysis of helicopter

logistics for disaster relief operations. Brown and Vassiliou (1993) studied real-time

allocation and assignment of response units in the context of disaster relief, allowing

human intervention in the underlying optimization and simulation processes. The

integration of simulation and geographic information systems to develop spatial decision

43

support tools to aid in recovery operations was studied by de Silva and Eglese (2000).

Other research is again specific to the problems that arise in the context of certain types

of disasters. For example, Lee et al. (2006a, b) develop a decision support system that

evaluates facility layout and staffing allocations for emergency large-scale dispensing of

pharmaceuticals following a disease outbreak. For a more exhaustive list of OR/MS

research in disaster operations management see Altay and Green (2006).

Response logistics has also seen a fair amount of research. These types of

decisions include, among other things, transportation and evacuation after a mass

casualty event occurs. The article by Barbarosoglu et al. (2002), which explores

helicopter logistics, falls into this category. The work by Jotshi et al. (2009) investigates

dispatching and routing emergency vehicles with the support of data fusion during the

aftermath of an earthquake. Christie and Levary (1998) model the transportation of the

critically injured patients to hospitals as a multi-server queuing system and show that the

length of time that patients wait for an ambulance increases rapidly with the increase in

the inter-arrival rate of patients to the initial treatment/waiting area.

This diversity among disaster management research prompted the recent

position paper by Brandeau et al. (2009). In it, the authors provide six recommendations

for the modeling of health sector disaster responses.

3.3. Resource-Constrained Triage

The very definition of an MCI implies that we dealing in a resource-constrained

environment. In particular, the major resources we consider are availability of

ambulances and the capacity for the various severities of casualties at each of the

hospitals. For simplicity, we refer to the hospital capacity as "beds" even though there

are numerous inter-related resources that go into treating and stabilizing a victim of an

MCI.

44

The main decisions we need to make are which patient class to send from the

casualty collection area to a hospital and which specific hospital each patient should be

transported to. Our goal is to maximize the expected number of survivors with respect to

restrictions on the timing and availability of ambulances needed for transport and the

treatment resources available at the hospitals.

In order to calculate the expected number of survivors from an MCI, we need

both an estimate of the survival probability and how that survival probability deteriorates

over time. One way to accomplish this is to assign each victim a "score" that is indicative

of his current condition and follows some deterioration rate. As victims wait to be

transported, their condition deteriorates. In general, a victim's initial severity and type of

injury directly influences his deterioration rate.

The time horizon for the decision-making environment starts after the immediate

onset of the MCI and ends when all of the victims have been transported. The type of

disaster dictates the length of the time horizon. We will focus on a disaster that results in

blunt trauma victims, but the overall process and methodology can be used for any type

of disaster.

3.3.1 Brief introduction to STM

The one paper that attempts to work through the resource-constrained problem is the

Sacco paper (Sacco et al., 2005). Their main motivation is to provide a model and

process that addresses how to specifically articulate the oft-cited objective of "do the

most good for the most people." One of their main contributions is the recommendation

to use a "score", which can be computed based on a victim's severity and is predictive of

his survival probability, as the basis to quantify the objective. They suggest using the

"RPM" score, which includes the respiratory rate, pulse rate, and motor response. They

show that by using RPM scores and associated survival probabilities for blunt injured

45

victims that the STM model performs better than the traditional START method used for

triage in all of their simulations, with marked improvements as the resources to transport

were tightened.

3.3.2 Limitations of STM

Overall, the STM can be viewed as a "macro" model. For example, the model assumes

up front that we know how many resources for transporting victims for each time period

will be available. However, if multiple hospitals are involved, the time it takes to evacuate

casualties will depend on which transport resources are sent to which hospitals in each

period. These decisions influence the number of transport resources available in each

period because some ambulances may still be in transit due to traveling to a more

distant hospital. The STM approach would work if only one hospital exists and the

ambulances will be back by next decision epoch.

The decisions from the STM model are not specific; they simply identify the

number of victims in each class to treat. We are interested in sending patients to multiple

hospitals and want to prescribe where to send each victim of each class at each

decision period.

We are not only looking at score-based treatment estimates (as indicated as one

of the extensions in STM paper), but rather looking at score-based treatment estimates

related to the timing of the decision. We explicitly consider available hospital capacity

when making evacuation decisions.

3.3.3 Motivational Example

We now provide a small, illustrative example, motivating the use of a different approach.

Assume there are five different score levels, two hospitals where one is "local" and the

46

Table 3.1: Victim distribution and survival probabilities.

other is "remote", and we are interested in a time horizon consisting of six time periods.

The one-way travel distance to the local hospital is a single time period, whereas it takes

two time periods to reach the remote hospital from the casualty collection area. Table

3.1 shows the distribution of the victims and their associated survival probabilities as

time progresses. Table 3.2 shows the care times for each score level. For this small

example, we assume both hospitals have the same technology and can treat any of the

score levels in the same amount of time and that the initial capacity for each score level

at both hospitals is five.

Suppose there are five ambulances available at the beginning of the time horizon

and that they are all dedicated to transporting victims from this MCI for the duration of

the time horizon. Recall that the STM model assumes that all of the ambulances will be

back by the next decision point and a single hospital. Let's assume that the STM model

is interested in sending victims to the local hospital because it is closer. Running the

STM model on this data, we get the following result shown in Table 3.3 with an objective

function of 13.37 expected survivors. However, we immediately notice infeasibilities in

this solution. Because we are sending patients to the local hospital, where the one-way

travel time is one time period, the ambulances we send in the first time period will not be

available again until the third time period. The highlighted cells in Table 3.3 illustrate the

ambulance infeasibilities encountered in this small example. We may also confront

47

Table 3.2: Care times for each score level for the time horizon.

1 2 • • • •

v &•••••:

' 4 ::*••

.,-:. - 5 , , i ;

4 M.^tus M^^^.^.^Sft^Jl^feafe^vs-M. .*^***. >>._!, a««~*

6 5 4 3 3

H. , » »

7 6 4 4 3

,.. &**. >&, .. *„-4

8 6 5 4 4

; * < 2 t 9 7 5 5 4

.® '" 10 7 6 5 5

. .„ *}& S

11 8 6 6 5

capacity infeasibilities with the STM model since it does not explicitly handle treatment

resources. The final type of infeasibility that may occur with the STM model is that it may

send more victims of a particular score level than are awaiting transport at the casualty

collection area.

There are many different ways that we could fix the infeasibilities encountered

with the STM model. For this small example, we suggest first eliminating the ambulance

infeasibilities since these are the easiest to recognize and fix. As a second step, we then

fix the capacity infeasibilities by first reducing the current number of victims to transport

to stay within the capacity for each score level. Then add victims (up until we run out of

ambulances) to be transported in the order of the most severe injury to the least severe

(low score first), ensuring capacity constraints are not violated. Finally, we need to verify

that we do not send more victims of each score level than we have waiting at the

casualty collection area.

48

Table 3.3: Decisions from the STM model.

tp<?p||tFW

1 2 3 4 5

»;T-Olŝ |̂ S<Ni.

' *f" *̂ Wlv#r>>''̂ SMfffliijdW^ '*•*?' ' f ^ ' f ^ . ' j • '

0 4 0 0 1

• f t >

;> " <§ 0 5 0 0 0

&$. .

V ' £r," 0 5 0 0 0

l 5-

; - £ > 0 0 4 0 1

. \ * .:

§ ' 0 0 1 4 0

'3 .,

• ' t 0 0 5 0 0

.. -& .

0 14 10 4 2

5? . . ^ T . u . *,k&..,,

Table 3.4 shows how we fixed the ambulance infeasibilities; simply remove the

decisions that were made in the second, fourth, and sixth time periods since ambulances

are in transit back from the local hospital. However, now notice that a violation of the

capacity constraint on the score level two occurs when we try to send five victims with

that score level during time period three. Utilizing the procedure outlined above, we

decrease the number of victims with score level two to a single patient (bringing the

hospital to capacity for that score level). Because the current practice of evacuation of

victims emphasizes sending the worst cases first, the four remaining ambulances are

used to send four victims with score level 1 to the hospital. (As we show in Section 3.5.2,

this may not be the best alternative, but it is what is currently being used in practice.) In

this example, we do not run into the last type of infeasibility. The resulting decisions are

shown in Table 3.5, with highlighted cells indicating where we fixed infeasibilities. The

resulting objective function is reduced to a total 5.975 expected survivors.

By not taking into account decision epochs that are shorter than the round-trip

distance to the hospital, the STM model introduces ambulance infeasibilities. Also, when

the treatment resources are not considered, the STM model may result in additional

infeasibilities that are of practical significance.

Figure 3.3 shows the possible percentage of improvement in the objective

function if we utilize the enhanced model rather than the STM model. The graph on the

49

Table 3.4: The decisions after fixing the ambulance infeasibilities.

1 2 3 4 5

Total Sent

^#«^S»'ii«

0 4 0 0 1 5

• * > | p ^ 0 0 0 0 0 0

V >> W ^ V

0 5 0 0 0 5

S , i $ 4 »

0 0 0 0 0 0

•\*HsK' 0 0 1 4 0 s

: * - § ^ 0 0 0 0 0 0

Q .i-.-9:.: ..:/•':

1 4 1

15

Table 3.5: The final decisions after fixing all of the infeasibilities.

left shows the improvement as we hold the capacity at each hospital constant at six beds

per score level and vary the number of ambulances that are used for transporting victims

during the MCI. Both implementations of the STM model are shown. The other graph

shows how our enhanced model improves as the hospital capacity for each score level

varies and we only have six ambulances at our disposal.

3.3.4 Our Enhanced Model

We add three major enhancements to the STM model.

1. The time needed to treat/process a victim at a hospital varies depending on the

severity of the injury to the victim. Care times for a particular victim will increase

as his score decreases (gets worse).

2. We explicitly consider the treatment capacities of the area hospitals.

3. The availability of ambulances for transporting victims depends upon where and

when it was sent.

50

Figure 3.3: Improvement over the STM model when using the enhanced model and varying either the hospital capacity or the ambulance capacity.

80.0%

70.0% ;

60.0% ;..

I 50.0% j

S 40.0% : ex

£ 30.0% '

20.0%

10.0% i

0.0% •

4

ImprovementOverSTM Model as Ambulance Capacity Changes

— — Local

Remote

5 6 7 8

Ambulance Capacity, ft

em

m p

ro ve

m

55

90.0% !

80.0% '

50.0% |

40.0% \

30.0%

20.0% [ ,

10.0%

0.0'<s

5

ImprovementOverSTM Model as Hospital

Loc.ll

Remote

Hospital Capacity, ft beds

The first enhancement should seem logical. As a victim waits at the casualty

collection area for transport, his condition will worsen (according to some deterioration

rate). As his condition worsens, more resources and time will generally be needed to

stabilize the patient once he arrives at the hospital. While the STM model does consider

deterioration rates of patients, it uses them exclusively for calculating the appropriate

objective function coefficients.

We consider the treatment capacities for each score at each hospital explicitly in

our model. The decision methodology must be cognizant of the ability of the hospitals to

be able to accept and process victims at a score-based level. In particular, we ensure

that a hospital has the capacity to accept a patient with his updated score level before

sending the patient to that hospital. Recall that a patient's condition deteriorates over

time, meaning that he will switch score levels as time progresses. We make sure that the

hospital can accept a patient with the updated score at the decision point rather than a

patient with the initial score. The STM model does not include treatment resources.

The final enhancement we add considers how available ambulances affect our

decisions and vice versa. Because we consider more than a single hospital, an

ambulance that travels to the "remote" hospital will be unavailable longer than one sent

51

to the "local" hospital. Resources for transportation of victims are handled differently in

the STM model. There, they simply assume a fixed number of ambulances and that all of

the vehicles return by the next decision epoch. (They use 30-minute decision epochs.)

3.4. Formulation of the Resource-Constrained Triage Problem

3.4.1 Notation

Data

S Set of victim scores (e.g., RPM values)

s Score index

H Set of area hospitals

h Hospital index

T The set of time periods

t Time period index

Dh One-way trip distance (in time periods) to hospital h from the casualty collection

area

C£ The amount of care time needed to "process" a victim of score s at hospital h

A The initial number of ambulances available

Bl The initial number of "beds" available at hospital h for score s

Pt s The probability of survival of a victim with score s at time r

Vs The number of victims of score s awaiting transport from the casualty collection

to one of the hospitals

Variables

xlt Number of patients with score s to send to hospital h during time period f

52

at The number of ambulances available at time period t

bfo The number of patients with score s that can be cared for at hospital h during

time period f (e.g., the number of resources such as beds available for victim

score s at hospital h during time period r)

Note that the "real" decision variables are xht and that the variables at and bht

are the "auxiliary" variables. For example, we will know the values of ax and bhx since

these represent the initial state.

3.4.2 Formulation

m a x ZZZ P t S ^ f

hen ter ses

(3.1)

subject to

heH

*ht < &ht V/iGtf, V t G 7 \ V s G 5 (3.3)

ax=A (3.4)

V t > 1 G T (3.5) at — ° t - i — y xM-\ + y xM-2x.Dh

hSH heH

b s h l = Bs

h V h G H, V s G S (3.6)

Ht = Kt-r ~ Xht-i + <t-{Dh+cH) VhEH, V t > 1 G 7, V s G 5 (3.7)

££*/[t<r' VSG5 (3'8)

7l6H teT

Xs ht G Z + V f t £ W , V t £ T , V S 6 5 (3.9)

53

The objective function expressed in (3.1) indicates that we are maximizing the number of

expected survivals from the MCI. Constraint (3.2) enforces the restriction that we cannot

send more patients to hospitals than we have ambulances. We ensure that we do not

violate the capacity of the hospitals via constraint set (3.3). Constraints (3.4) and (3.6)

simply set the initial values of the "state". The availability of ambulances at each time

period beyond the initial period is defined by (3.5). Similarly, (3.7) defines the capacity

availability of the hospitals during each time period beyond the first one. It should be

noted that the way (3.7) is formulated gives the implicit assumption that a bed is

reserved at the hospital once the decision is made to send a victim there. This is

expressed in the final term and the second subscript on the x variable: t - {Dh + Cft).

The Dh expresses the time it takes the ambulance to get to the hospital and is added to

the care time at that hospital, forcing the bed to remain "in use" during that time.

Constraint (3.8) ensures that we do not send more victims of a particular score to the

hospitals than actually exist. Finally, integer restrictions are expressed in (3.9).

3.4.3 A Small Example

We consider a small instance of the problem as an illustrative example to aid in

understanding the enhanced model. Using the same data from example from Section

3.3.3, we are interested in determining how the decisions from the proposed model differ

from those generated by the STM model. First, we expect several changes because we

are considering two hospitals, a local and a remote. Also, we avoid capacity

infeasibilities because we explicitly model these in our model.

Solving this small problem using Excel's built in Solver functionality, the decisions

shown in Table 3.6 are obtained. The table is expanded to show how many victims of

each score are sent to either the local ('L') or the remote ('R') hospital. We see that

having a second hospital as an option allows us to send more victims of score level 3

54

Table 3.6: The results using the proposed enhanced model.

J;-/ „\"

1 .': '2"

3 ; • - . • • > • : # : • : • • • ' • " •

5 Total Sent

• • • iV ^.' V Bcosf̂ tMfel ,* mmmf, f^'

0 0 3 0 2 5

' • $ >

0 0 0 0 0 0

i-AW^M'lk<M' -«. ' Q*\

0 0 0 0 0 0

• ts« 0 0 0 0 0 0

:' (T 0 0 1 4 0 5

: ® - 0 0 0 0 0 0

i R ^ - w M ^ l i ^ "§? "4.-"

0 0 0 0 0 0

, e r 0 0 0 0 0 0

nr 0 0 1 0 0 1

m 0 0 4 0 0 4

>• ' '•

- v ; . ^ . ' , . & » : 0 0 0 0 0 0

.m ' 0 0 0 0 0 0

? € « •^en^S&ly ^%

Q

W:':' • , . - ; § • •

•.••':;4Vf

2 11

' ' ^ ^ 0 0

• : T ' 4

•M : p ;.:,-

0 4

than with the STM model. Here, we send five, the capacity, to the local hospital and then

send four to the remote hospital. The corresponding objective function value is 9.067

expected survivors. This equates to a 51.7% increase over the STM model's objective

function value.

3.4.4 Enhanced Model's Performance vs. STM Model

The small example from before provides some motivation of the potential benefits to

using the enhanced model. We were also interested in getting a better picture of how it

may react when the number of available ambulances and the capacity levels of the

hospitals change. We varied the number of available ambulances from four to eight and

let the capacity levels for each score level range from five to seven. We then compared

the enhanced model's objective function to the STM model when it sent all victims to the

local hospital and when it sent victims to only the remote hospital. Over these 15

comparisons (five levels of ambulances and three levels of capacity at the hospitals), the

enhanced model provides an expected increase in the expected number of survivors.

The overall average improvement of the enhanced model over the STM model that uses

only the local hospital was 36.5% and the increase over the STM remote model was

69.0%.

55

Figure 3.4: Model reactions to varying ambulances and hospital capacity.

Objective Function as Ambulance Capacity 12 Increases

;- 10

S 6 i. •

Ambulance Capacity, H

10.00

9.50

9.00

8.50

3.00

7.S0

7.00

6.50

6.00

5.50

5.00

Objective Function as Hospital Capacity Increases

• • •

— • - Local

•••*••• Remote

• Enhanced

— — — — — — ——

Hospital Capacity, tf beds

Figure 3.4 provides a look at how the models react when holding one variable

constant and varying the level of the second. The graph on the left fixes the capacity of

the hospitals to six and varies the number of ambulances. The rightmost graph holds the

number of available ambulances constant at five and varies the hospitals' capacities.

3.5. Experiments

We are interested in determining how well our enhanced model does in comparison to

several heuristics that are currently used in real life. Three of the most common "rule of

thumb" approaches used are:

1. Closest first - send as many victims as possible to the closest hospital first.

When that hospital reaches capacity, send victims to the next closest hospital.

2. Furthest first - send as many victims as possible to the furthest hospital first.

When that hospital reaches capacity, send victims to the next furthest hospital.

3. Cyclical - send victims in a round-robin fashion to the area hospitals.

The logic behind using the closest first heuristic is to transport those victims

which we have decided to treat to a hospital as quickly as possible. The furthest first

heuristic is also used at times. It is known that victims deteriorate over time. If a large

56

number of serious casualties is expected, then the first ones found are sent to the

furthest hospital because the expectation is that they can survive the transport. After

some time passes the next batch of casualties will have deteriorated and the "reserved"

capacity at the closer hospital is a benefit because those victims could not survive the

wait and an extended travel time. Another part of the thinking is that many of the

"walking wounded" will make their own way to the closest hospital. By sending victims to

hospitals further away, we avoid inundating the closest hospital with both the transported

victims and those who find their own way there. The final common heuristic considers

sending victims in an alternating fashion to the different area hospitals. The benefit of

this heuristic is that we provide some "breathing room" for each hospital. That is, it

essentially assumes that no victims will get effective care if a hospital is blitzed beyond

its ability to respond.

Each of these heuristics can be implemented several ways. The most common

implementation in use is to transport the most critical, non-expectant victims first.

Intuitively, this seems like the best approach. However, as we show below, this may not

result in the best outcome if we measure the total expected number of survivors. In

direct contrast, another implementation possibility is to transport the least critical victims

first, leaving the most severely injured patients for transportation later. Obviously, these

are two extremes and various other schemes could be thought of.

The cyclical heuristic has even more flexibility in terms of implementation

possibilities. When alternating between different area hospitals, how many victims are

sent to each hospital before switching the destination to a different hospital is one way to

vary how the heuristic in implemented. For example, we could send one victim to the first

hospital, the next victim to the second hospital, etc. Alternatively, we could send the first

two victims to the first hospital, the next two victims to the second hospital, and so on.

Again, there exist many different ways to implement the cyclical heuristic.

57

3.5.1 Experimental Setup

For all of our experiments we use the following setup. We follow Sacco et al. (2005) in

many regards. For victim scores, we utilize the "RPM" score, which includes the

respiratory rate, pulse rate, and motor response (Sacco and Champion, 1983). The

RPM score is the sum of coded values as shown in Table 3.7. Additionally, we assume

we are dealing with blunt trauma victims and utilize both the survival probabilities and

the victim deterioration rates derived by Sacco et al. (2005). The survival probabilities

are shown in Table 3.8 and the deterioration rates based on 30-minute intervals are

shown in Table 3.9.

One of the enhancements of our model over the STM is that care time is directly

related to the current status (i.e., RPM score) of the victim. The care times we use in our

experiments is shown in Table 8. The care time is measured in minutes. It represents an

estimate of how long it would take to stabilize a patient with the associated score, thus

freeing the resources to be used on another victim. For simplicity and exposition

purposes, we refer to these resources simply as "beds" even though there are many

other intertwined resources involved.

We consider a fixed time horizon of six hours after the onset of a MCI. We

assume that victims are gathered at a common collection area from which emergency

medical vehicles depart to transport the victims to a hospital. We also assume that both

the number of victims and their associated distribution is known at the beginning of the

Table 3.7: RPM Coded Values.

<p(^l}^fal0\ ',,2 Respiratory Rate

Pulse Rate

Motor Response

4 ' 0

0

None

" 4 [ ' ,11- "> ' -< 1 - 9

1 -40 Extends/flexes from pain

r •-§. ' 36+

4 1 - 6 0 Withdraws from pain

• < ; « § ' • *

2 5 - 3 5

121 + Localizes pain

• a"* ; : ' 1 0 - 2 4

61 -120 Obeys commands

Source: Sacco et al. (2005).

58

Table 3.8: Probability survival estimates for RPM values and the associated care time.

BIB'̂ IiS1"" : . - ^ t r " • • • - • •

:...:''•: : - ' : 1 - ; - - s : > - v ; - •

• - • ' • • • • 2 '• ••

' 3 : A

5 6

. . / ; ' 7 • • . . :

•"':. 8

v • * • • - & t : : •••••-

; TO :

11 12

,vv^^^Q^®^fflS^''" 0.052 0.089 0.15 0.23 0.35 0.49 0.63 0.75 0.84 0.90 0.94 0.97 0.98

:mmmmmmmm]

180 170 160 150 140 130 120 110 90 60 50 40 30

Sacco et al. (2005) is the source of first two columns

time horizon. We consider three different distribution patterns of victims that are awaiting

transport: a uniform distribution, a left-skewed distribution, and a right-skewed

distribution. See Figure 3.5 for a pictorial representation of the victim distributions. Each

distribution considers the same number of overall victims: 130.

We also consider transit time between each hospital. We assume that the transit

time is deterministic, known, and the same for both the outbound and inbound directions.

Four different levels for the number of ambulances are examined. We consider two

hospitals: a "local" hospital and a "remote" one. Both hospitals have the same

technology resources and are able to treat all of the victims. Additionally, both hospitals

treat the victims in the same amount of time. We vary each of the hospitals capacity and

the distance of the remote hospital from the casualty collection area. Table 3.10 shows

the different levels we consider. Altogether we look at 384 different design settings. The

models were solved using the Gurobi solver.

59

Table 3.9: Deterioration of RPM scores in 30-minute time intervals.

0 1 2 3 4 5 6 7 8 9 10 11 12

: WT

• fe 0 0 0 1 2 3 4 6 8 9 10 11 12

*U ' '" *»

0 0 0 0 1 2 3 5 7 8 9 11 12

w ' A >

0 0 0 0 0 1 2 4 6 8 9 10 11

>

, * . v S

0 0 0 0 0 0 1 3 5 7 8 10 11

"* r 1i?Kaia'tote#a 'A. W ,NJL © I*

0 0 0 0 0 0 0 2 4 6 8 9 10

0 0 0 0 0 0 0 1 3 5 7 8 10

>•,#,.* 0 0 0 0 0 0 0 0 2 4 6 8 10

m... 0 0 0 0 0 0 0 0 1 3 6 7 10

'I1 i " - j !

k »«L.'. 0 0 0 0 0 0 0 0 0 2 5 7 9

, y -v i * •

m., 0 0 0 0 0 0 0 0 0 1 5 6 9

'

'm 0 0 0 0 0 0 0 0 0 0 4 6 8

y • , % «

, t ^ : 0 0 0 0 0 0 0 0 0 0 4 5 8

Adapted from Sacco, et al. (2005)

Figure 3.5: Distribution used during experiments.

Uniform Distribution

3 10 11 12

Left-Skewed Distribution

Right-Skewed Distribution

5 5 7 8 9 10 11 12

RPM Score

60

Table 3.10: Parameter settings for the experiments.

^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ H "• ' . ' • •;&; ." ' Initial available ambulances Distribution of victims Local hospital capacity for each RPM score Remote hospital capacity for each RPM score One-way distance to local hospital (minutes) One-way distance to remote hospital (minutes)

5, 10, 15,20 Uniform, left-skewed, right-skewed

2, 4, 6, 8 2, 4, 6, 8

5 8,20

3.5.2 Low Score First Not Always Best Approach

The logic behind using the closest first heuristic is to get critically injured victims

qualifying for transport needed care as quickly as possible. This approach seems

intuitive: see if we can provide the most badly injured victims help in time for them

survive. However, when we quantify the probability of survival and want to maximize the

overall number of expected survivors, the low-score-first heuristic may fall short of its

intended outcome. Instead, this objective may indicate that we should use a high-score-

first approach when making transportation decisions. Using our experimental setting

described above, we explore which circumstances lead to use of the low-score-first

heuristic versus the high-score-first approach.

In our experimental design victims with RPM scores of 0 and 1 would most likely

be declared "expectant" and tagged with a black tag (in START language). Therefore, in

reality we would not consider these victims when making transportation decisions. This

allows us to consider victims with RPM scores of 2 and higher. We look at the closest-

first heuristic, comparing a low-score-first implementation to a high-score-first one. We

vary the lowest score considered from 2 to 6. The results showing the percentage of the

time the low-score-first is at least as good as the high-score-first, in terms of the

objective function, are displayed in Table 10. Percentages for each of the individual

61

Figure 3.11: Results of testing a low-score-first implementation versus a high-score-first implementation.

3 'est Score nsidered

2 3 4 5 6

> ' - • ! < * • . » :

58.6% 79.7% 83.6% 93.8% 98.4%

52.3% 59.4% 72.7% 74.2% 93.0%

^^^ff i '»V fg»: 11.7% 36.7% 76.6% 88.3% 99.2%

wJJSKT 40.9% 58.6% 77.6% 85.4% 96.9%

distributions are based off of the 128 runs for that particular distribution. The overall

percentage (last column) is over all 384 runs.

From Table 3.11, we see that whether a low-score-first implementation is better

depends on both the underlying distribution of victims and which RPM score values we

consider. For example, when considering an RPM score of 2 as the lowest score with a

right-skewed distribution, only 11.7% of the runs showed a better objective function

using a low-score-first approach instead of a high-score-first one.

3.5.3 Comparison of Enhanced Model to Heuristics

We wish to see how well our enhanced model does against the commonly used

heuristics. As discussed earlier, each heuristic can be implemented in numerous ways.

Following the discussion above, we utilize a high-score-first implementation of each

heuristic, giving a conservative estimate of the improvement of our model over each

heuristic. For the cyclical heuristic, we provide three specific implementations: switch

hospitals after each patient, switch to a different hospital after two patients have been

sent to the current hospital in row, and go in round-robin fashion in groups of three. We

also implement a simplistic random heuristic for comparison purposes. For each time

period, if an ambulance is available, the random heuristic first picks the hospital with a

simple flip of a fair coin. Based on destination hospital, it then randomly picks one of the

62

Figure 3.6: Average improvement over the commonly used heuristics when using the enhanced model.

^ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 60.00% •••: _. ,

Average Improvement 50.00%

40.00%

30.00%

20.00%

10.00%

0.00% .1111 Closest First Furthest Cyelical-1 Cyelical-2 Cyclical-3 Random

First

awaiting scores for which the destination hospital has capacity. The results showing

average improvement of our model over each heuristic are shown in Figure 3.6.

We are also interested in how the enhanced model compares to the various

heuristics as other attributes about the problem vary. First, we hold the capacity at each

hospital constant at six beds and vary the number of available ambulances. The results

are depicted in Figure 3.7 on the left. Next, we fix the number of available ambulances to

15, the capacity at the remote hospital to six beds, and vary the capacity at the local

hospital. The results are shown on the right in Figure 3.7. Only the closest-first and

cyclical-3 heuristics are shown in order to avoid cluttered graphs. The cyclical-3 heuristic

is very representative of the other two cyclical heuristics.

63

Figure 3.7: Improvement over heuristics when using the enhanced model and varying

50.00% ;

45.00%

40.00%

35.00% ;

s I 25.00* a- 2o.oo«

15.00*

10.00%

5.001,

0.00%

0

Average Improvement as Ambulances Vary

\ \ \ Closest First

\ Cvdical-3

\

\ ^ _ _ ^ ~ ~

5 10 15 20 25

Number of Ambulances

.... ,̂

i \

\.

35.00%

30.00%

25.00%

20.00%

15.00%

10.00%

5.00%

0.00%

t

Average Improvement as Local Capacity Varies \

\ \ \ \

Closest First

_^--~ . ^ _ <*<*«'-*

2 4 6 8 10

Local Capacity

3.6. Managerial Implications and Conclusion

The current triage methods of START and SALT do not provide specifics on how to

"provide the most good to the most people." The STM model filled this gap by

quantifying survival probabilities of victims and suggesting that use of overall number of

expected survivors as a quantifiable objective to use in order to capture the idea of "most

good to most people." However, the STM model is a macro model. We have suggested

a micro model that provides three major enhancements to the STM model. We have

shown that our enhanced model makes more sense, especially when we consider

multiple hospitals to which victims may be sent during an MCI. We have provided an

experimental design setting in which we are able to test various in-use heuristics. In

particular, we showed that the effectiveness of using a low-score-first implementation of

the closest-first heuristic depends upon the underlying victim distribution and which RPM

score value we start with. We also investigated how well our enhanced model performs

in comparison to several heuristics that are currently used in real life. We tested our

model against the heuristics that used a high-score-first implementation. A high-score-

first approach is not currently the way most evacuation heuristics are implemented in

64

practice, but comparing the enhanced model to the heuristics implemented in this

manner provides more conservative estimates of the improvement measures. The

enhanced model outperforms all of the heuristics on the order of 10% - 30% on average.

As the number of available ambulances varies but the hospital capacities remain

constant, we saw that the improvement of our model over the closest-first heuristic

stayed relatively stable at approximately 10%. Not surprisingly, as more ambulances are

added, the round-robin heuristics perform better. However, our model performs better

and it appears that the improvement gap between models starts to level off. We also

investigated keeping the number of available ambulances and the remote hospital's

capacity constant, while increasing the capacity at the local hospital. This experiment

shows the performance gap between our enhanced model and the heuristics as

remaining steady after having reached a capacity of four beds at the local hospital.

65

CHAPTER 4 - Investigating the Distribution of Victims to

Multiple Hospitals in a Dynamic and Stochastic Setting

4.1. Introduction

The previous chapter dealt with determining where, and in which order, to send victims

of a mass casualty incident (MCI) to various area hospitals in a deterministic setting.

While the model developed therein is useful, it is worthwhile to also investigate this

problem in a dynamic and stochastic setting. The situation surrounding an MCI will

unfold over time and inherently involves elements for which much of the data is

uncertain. For example, the travel times of the vehicles used for transport will likely vary

based on several exogenous influencing factors.

One of the issues with utilizing a deterministic model is that once a data

parameter deviates from its assumed value, "the script" no longer applies and

improvisation must be used in the decision making process. For example, consider the

possibility where the round-trip transportation time takes 30 minutes longer (e.g., due to

damaged or heavily congested roads) than the value used when solving a deterministic

model. How does this situation affect the well-thought out plans? What other alternative

courses of action are feasible to help compensate for this unplanned event?

One way to combat this issue is to model the problem as dynamic and

stochastic. That is, routing and allocation decisions are made sequentially, explicitly

taking into account the variability of the various input data elements. One modeling

technique to accomplish this task is stochastic dynamic programming (SDP). SDP

66

provides a formalized and eloquent way of describing sequential decision problems.

However, solving sequential decision problems can be another matter entirely.

One of the issues to be cognizant of is how the problem is modeled and defined.

Generally, when a sequential decision problem is formally described, the "state" of the

system must have an explicit form. The state definition must fully capture all the

information needed to make a decision, as well as the information needed to describe

how the system evolves over time. This may mean that a large number of attributes,

each of which may be multi-dimensional, are needed to describe the state. The more

descriptive and detailed the state variable, the more difficult the problem is to solve. In

fact, the explosion of the state space is the oft-cited "curse of dimensionality" that often

makes SDPs intractable for real-sized problems.

A major contribution of this work is that a novel solution methodology and

framework that combines SDP with Monte Carlo simulation as a way to overcome some

of these limitations in a practical setting is proposed. There are two end goals. First, an

easy-to-implement policy table that provides recommended decisions, even in the face

of uncertain data elements for the distribution of victims of an MCI to the surrounding,

appropriate healthcare providers, should be provided. In other words, this is a real

problem that requires providing practical recommendations to the decision maker that

will result in saving more lives. Second, the methodology proposed is applicable to other

settings where solving a stochastic sequential decision problem using the classical

approach is impractical.

The rest of this chapter is organized as follows. A brief review of the relevant

literature is provided first. Then, in Section 4.3, the problem of allocation and

transportation of victims from an MCI is modeled as a stochastic sequential decision

problem. The data necessary in order to capture an accurate depiction of the overall

problem is described. In Section 4.4, a new methodology, termed "scenario iteration," is

67

introduced. Its aim is overcoming some of the limitations of a typical SDP model. The

methodological framework and basic algorithm are detailed and the advantages of the

methodology are discussed. Next, in Section 4.5 the performance of the proposed

heuristic is examined through experimental analysis. In particular, a focus on the

performance of the model and on the performance of heuristic as a solution

methodology is considered. Finally, in Section 4.6, the chapter concludes and

recapitulates the main managerial insights.

4.2. Review of the Relevant Literature

For a review of the general literature on the broad field of disaster management and in

particular the literature stemming from the decision support and operations management

fields, please refer to the Section 3.2 of the dissertation. For this chapter, there are two

foci for the literature review. First, an exploration of some of the more relevant literature

from the operations management field is conducted and then the literature that is related

to the scenario iteration heuristic that is formally propose in Section 4.4 is reviewed.

One way to model the evacuation of victims of disasters that has been prevalent

in the extant literature is by using some variant of a multi-commodity flow formulation.

For example, Carolan et al. (1990) introduce the Patient Distribution System which

models the evacuation of patients from a European military conflict to U.S. hospitals over

a specified time horizon. They utilize a constant 11 commodities in a deterministic multi-

commodity network flow model, but the problem becomes quite large and more difficult

as the length of the time horizon, measured in days, extends. The study by Barbarosoglu

and Arda (2004) proposes a two-stage stochastic programming model to help plan the

transportation of medicine, food, clothing, etc. to disaster-affected areas. They, too, use

a multi-commodity formulation to describe the flow of these first-aid commodities over an

68

urban transportation network, and incorporate randomness by considering a finite set of

realizations of the random variables.

Much of the transportation literature focuses on evacuation of people for short-

notice disasters (e.g., an ensuing hurricane in the Gulf of Mexico). Examples of such

papers include Han et al. (2006) who explore the best way to assign destinations for

evacuees and Chiu and Zhen (2007) who explore a similar concept but consider

emergency response evacuation and logistic decisions within a multi-dimensional

simultaneous environment. Liu et al. (2006) propose a two-level integrated optimization

system for large-scale network evacuation planning. The top level optimization has the

goal of maximize the throughput for a mass evacuation scenario, while the second level

optimization seeks to minimize the travel and waiting time for the entire operation. For a

comprehensive review of the OR/MS research, the reader is urged to consult the article

by Altay and Green (2006).

The proposed methodology in Section 4.4 most closely resembles what is

commonly becoming known as approximate dynamic programming (ADP). ADP is a

blend of operations research methods such as dynamic programming and mathematical

programming, computer science fields such as artificial intelligence, simulation, and

statistics. This field is also sometimes known as neuro-dynamic programming or

reinforcement learning. ADP has received a large amount of interest in the recent past,

spawning, among many other things, the publication a book dedicated to the subject

(Powell 2007). The main difference between ADP and the proposed methodology is that

often the main goal an ADP is to approximate the optimal cost (objective) function. This

is done by generating one or more simulated system trajectories and associated costs

and using some form of "least squares fit" to see how good the approximated objective

function is. The approach described in this chapter is different. The interest here is in

finding the good approximations for the initially unknown transition probabilities. As

69

discussed in Section 4.4.3 once the process has converged, there will be two weapons

at our disposal: the "optimal" policy from the SDP and the ability to use the scaled-up

Monte Carlo simulation step of the algorithm to help make decisions for larger sized

problems in the field.

Simulation optimization (e.g., April 2003) is the other broad field that is

tangentially related to the scenario iteration heuristic. With simulation optimization the

goal is to find the best input parameters for a simulation. This is not the goal of the

proposed approach. Instead, good estimates of the unknown transition probability

distributions that are associated with our sequential decision problem are the desire. As

detailed in Section 4.4, the use of simulation helps accomplish this goal, rather than the

other way around.

4.3. Problem Description

The focus of this work is on the immediate time period following an MCI. In many

disaster scenarios, a casualty collection area is established outside of the immediate

vicinity of the disaster scene in order to avoid potential dangers to both the victims and

the disaster responders. Patients sent to the casualty collection area are triaged, treated

for immediately life threatening injuries, and transported to area hospitals. Figure 4.1

illustrates an example of the distribution of triaged victims to hospitals during a disaster.

Ideally, the distribution of victims to the various area hospitals should take place

in an orderly and systematic fashion. However, managing the patient load during a

disaster presents a major challenge to hospitals and disaster response managers.

Patient allocation decisions made by the so-called "Incident Command" should

incorporate information on resource availability, situation details (mechanism of injury,

etc.), and triage data. This information, however, is not static; it changes as the disaster

situation unfolds over time.

70

Figure 4.1: An example of victim distribution to a group of area hospitals during a disaster response.

Hospital

^ v ^^^w Hospital

^ ^ ^ / ^ Casualty " ^ y ^ - " - " " " " ' ^ I Hncnitai L— —'— V Collection Area J

The decisions include where a victim should be sent, which victim "class" to

send, and when to send the victim. As in the previous chapter, the overall objective is to

maximize the expected number of survivors from the MCI. The major resources on

which we have restrictions include availability of transport and the capacity level for

casualties at each of the hospitals. Hospital capacity is referred to simply as "beds" even

though there are numerous inter-related resources that go into treating and stabilizing a

victim of an MCI. Additionally, the term "ambulance" is used to mean any form of

transportation (e.g., a helicopter) that is available to help evacuate victims of the MCI to

the appropriate healthcare providers.

4.3.1 Data Elements

In the setting for this chapter, some of the data are stochastic. In particular, the transport

times and the care times are each independent random variables, and may depend on

the distance to be traveled and the severity of the injuries respectively. The care time

could also depend on which hospital a particular victim is sent to. In addition to the

decision of where to send a victim, the variability in the travel times directly influence the

availability of transport vehicles at each decision point. The variability in the care times,

which can differ with respect to both the hospital and the score, determines the capacity

71

availability at the healthcare provider's facility to take in more victims. The capacity also

depends on the number of beds, doctors and other resources the hospital may have on

a routine basis, and the surge capacity they may have during emergencies.

In order to accurately capture the system dynamics of the response to the MCI,

additional data needs to be kept track of. Instead of simply keeping track of the number

of ambulances, information on each ambulance must be retained and updated

accordingly. Relevant data on each ambulance includes

• when it became available (i.e., arrived at the casualty collection area),

• when the transport was dispatched,

• to where was the transport dispatched,

• which type of victim is it carrying, and

• the expected duration of the round-trip for this vehicle.

Also a more accurate accounting of each hospital is necessary. In particular, data

on each specific victim should be kept to allow easy calculation of the provider's

capacity availability and its current "load" or utilization at any point in time. Three of the

important data elements needed to help with these calculations include

• when did a patient arrive at the facility,

• the expected care time for the patient with that survival score in that

hospital, and

• how many resources (beds, surgeons, emergency room hours, surge

capacity, etc.) are available in each hospital.

These metrics directly influence the decisions on whether or not to send more

victims to a particular provider. For example, if a hospital has very little remaining

capacity and a victim is currently en route to the facility, then intuitively, the chance that

72

the facility will be able to effectively handle another victim in the near future is quite

small.

4.3.2 The Stochastic and Dynamic Problem

The objective we wish to maximize is the expected total number of survivors from the

MCI. A fixed time horizon, consisting of T time periods, is considered. A fixed time

horizon is consistent with this setting because the recommended decisions involve

sending injured victims immediately after the onset of an MCI and at some point in the

future resources (such as ambulances) are reallocated to causes other than the disaster

response effort.

The state of the process for a particular time period is described by the following

attributes:

• the number of victims for each score awaiting transport from the casualty

collection area to one of the surrounding hospitals,

• the set of hospitals that can handle at least some of the victims from the

MCI, and

• the set of ambulances or transport resources available.

On the surface this compact state representation might appear to be easily

solvable. However, some important historical information involving both the hospitals

and the ambulances must be kept track of in order to know when these valuable

resources become available for reuse. To know when capacity becomes free at a

provider's facility, the minimal attributes that must be monitored include when a victim

arrived, what score level he was, and his associated care time. Recall that in this setting

the care time will be a random variable, making it more difficult to accurately capture the

"release" time of the associated hospital resources. With the arrival time and an estimate

of the care time for each patient, the expected "processing" time can be calculated. This

73

allows an estimation of when the provider's resources will be freed to concentrate on

another victim. Similarly, to keep track of when a transport vehicle will return to the

casualty collection area and be able to evacuate another victim the minimal information

to keep track of includes when and to where each particular ambulance was dispatched.

The travel time of each ambulance is a random variable whose value can be estimated

by using the destination hospital and perhaps exogenous information regarding the

current traffic patterns, etc. This estimated return time then indicates when the transport

becomes reusable.

As indicated earlier, accurately capturing the problem using a SDP approach

may provide a useful description of the problem, but the size of the state space can

quickly make solving it intractable. This fact should be readily apparent from the

discussion above regarding the historical data that must be kept on each ambulance,

each victim that is transported, and each bed or resource that is used in the hospital.

Retaining this information makes the state space explode quickly. Exploring such a large

number of states makes solving even small instances of the problem prohibitive.

4.4. Scenario Iteration

The size of the fully-descriptive SDP model grows quickly making it intractable for real-

sized instances of the problem. This "curse of dimensionality" is one of the motivating

factors to find an alternative solution approach. Another, more subtle, reason to look for

an alternative approach is the fact that in this problem setting the outcome space can be

very large. That is, the random variables of travel time and care time can result in an

inordinately large number of possible realizations. Suppose, however, that it was

possible to aggregate the outcome space of the random variables in some meaningful

fashion (as is often done, perhaps implicitly instead of explicitly, in many applications).

74

The issue then changes from keeping track of history in a compact way to knowing the

transition probabilities between the various states. One way to accomplish this is to keep

track of state transitions from one period to the next. But doing this also requires keeping

track of history. Further, some compact way to determine when patients are discharged

is needed. Utilizing the concepts of waiting line theory, the proposed scenario iteration

approach keeps track of the utilization of resources at any time, rather than each

individual admit and discharge time. Then it is possible to track how the utilizations

transition from one time period to the other. But even doing this can be a challenge. For

instance, what is the best way to properly capture the transition of a hospital from one

utilization level to another level? The inability to measure the transition probabilities is a

concern in the application of distributing victims of an MCI to area hospitals. The

methodology proposed attempts to overcome both the explosion of the state space and

the inability to directly measure the transition probabilities.

The methodology developed combines the use of a SDP policy and Monte Carlo

simulation to arrive at a final decision policy. The simulation helps predict the probability

of occurrence of status of several state variables so they do not have to be explicitly

modeled in the SDP, thus reducing the size of the state space dramatically. The SDP

optimal policy table is then used in the simulation whenever decisions need to be made.

The simulation then generates updated probability transition matrices to be used in the

SDP. During the simulation, scaling is used and enables this iterative solving of the SDP

and simulation to converge to a stable policy table fairly quickly. The term "scenario

iteration" is coined for the proposed methodology and its purpose is twofold. First, a

tractable way to arrive at the transition probabilities associated with the problem is

desired. Second, the end goal is to provide the decision makers both a condensed,

easy-to-use decision-rule table and a framework to explore when different decision

policies might be necessary.

75

4.4.1 General Heuristic Algorithm

The basic steps of the scenario iteration methodology are:

Step 1. Generate the initial transition probability matrices.

Step 2. Solve the SDP, generating a decision policy.

Step 3. Run a scaled-up Monte Carlo simulation over an extended time horizon,

using the decision policy found in Step 2 to make decisions during the

simulation.

Step 4. Calculate the transition probability matrices from the Monte Carlo

simulation.

Step 5. Check to see if convergence has occurred. If so, generate the final

"optimal" SDP policy using the final transition probabilities. Otherwise, use

these new transition matrices and go to Step 2.

These steps are also depicted in the flowchart shown in Figure 4.2. Step 1 is

accomplished by running a small inline simulation over an extended time horizon. The

allocation decisions are based on an equal chance of sending a particular victim class to

each of the hospitals. These initial probability vectors are then used as input for Step 2.

The details of Steps 2 and 3, the SDP and the Monte Carlo simulation, are explained in

turn below. Step 4 uses the results from Monte Carlo simulation to calculate new

transition probabilities. A check is then performed to determine whether the process has

converged. If it has not, then the algorithm returns to Step 2 and begins again.

76

Figure 4.2: Flowchart showing the basic steps of the scenario iteration methodology.

Start

Generate Initial Transition

Probabilities using Simulation

Run the SDP

No

End W

Calculate New Transition Probabilities

using Simulation

Yes

_L Run the SDP to Generate Final

Policy Table

4.4.2 The Stochastic Dynamic Programming Model

The following notation is used to help explain the SDP.

Data

S Set of victim scores (e.g., RPM values)

5 Score index

H Set of area hospitals

h Hospital index

T The final time period

t Time period index

Sh One-way trip distance (in time periods) to hospital h from the casualty collection

area

c* The amount of care time needed to "process" / care for a victim of score s at

hospital h

A The index used to indicate we are talking about the transport / ambulance

77

vs The number of victims of score s awaiting transport from the casualty collection

to one of the healthcare provider facilities

i,j Indices for the different load groups

Ys The probability of survival of a victim with score s

ah The indicator bit of whether or not hospital h has available capacity

aA The indicator bit of whether or not an ambulance is available

ph The current load group for hospital h

pA The current load group for the ambulances

pi The probability of moving from load group /'to load groupy'for hospital h

pf- The probability of moving from load group /'to load groupy'for the ambulances

qf The probability that there will be availability at hospital h during the next time

period given that we are in load group /'this time period

qf The probability that an ambulance will be available during the next time period

given that we are in load group /' now

Variables

d% The decision to send a patient with score s to hospital h during the current time

period

Time is discretized into equal length intervals in a manner that only allows for the

decision of the routing and allocation of at most a single victim within each time period.

The decision to send a victim needs to be made only when we have the available

resources to both transport the victim and treat the victim at the healthcare provider's

78

facility. At the end of the time horizon it is assumed the only the option is to "do nothing".

At all other time periods, assuming the appropriate resources are available, the available

actions are denoted by d% for all h e H,s e S. Each decision variable is binary, with 0

meaning "do nothing," and, for example, d[ = 1 would mean "send the patient of score 1

to the local hospital."

The following attributes are used to capture the essence of the problem and

specify the state of the system. For each score class, the number of victims awaiting

transport are kept track of and denoted with vs. The next piece of information retained

concerns the availability of the main resources. Does each hospital have capacity? That

is, a single bit that tells indicates yes or no is recorded instead of using the specific

number of available beds. The notation utilized here is ah. The availability of transport

vehicles is treated similarly and aA represents whether or not an ambulance is available.

The final information incorporated into the definition of the state are the proxies for how

busy each of the resources are. The general idea is that some way to measure the

current capability (or utilization) of the hospitals and the ambulances is needed.

To accomplish the task of estimating the current capabilities of the hospitals and

the ambulances, the capability of a resource is broken into groups based on their

"loads." The load of a resource could be defined multiple ways. The definition used in

this chapter identifies the total number minutes remaining to finish processing the current

victims if no new victims were sent. For example, suppose the care time for a victim of

score 1 is 130 minutes. Suppose the current time period is the third in the time horizon

and in each of the previous two time periods, a victim of score 1 arrived at the local

hospital. If each time period is 15 minutes long, the load at the local hospital is 215

minutes. The first victim arrived at time period 1 giving a load of 130, but the current time

period is time period 3, indicating that the victim has been under care for 30 minutes.

79

This results in the first victim contributing 100 minutes to the load at the third time period.

The second victim was also a score 1 victim, indicating that his initial load is 130, but he

has been at the hospital for 15 minutes. Therefore the second victim contributes 115

minutes to the local hospital's load. Combined, the local hospital has a load of 215.

Using this definition for a load group, five load groups are used for both the

hospitals and the ambulances. Then the interest is in examining how the system

transitions between the different load groups. The load groups for the hospitals are

denoted by ph, and the load groups for the ambulances are denoted by pA.

The state of the system in time period t can be specified by (vs;a h,aA;ph,pA).

Let ft(ys',a h,aA;ph,pA') be the maximal expected number survivors from being in state

(ys; a h,aA;ph,pA) when optimal actions are taken in each time period from t to T.

The following recursive functional equation specifies the model:

ft T(vs;a

h,ccA;p\pA) = max Ys + E[ft T

+1(vs;d h,aA;ph

lp h)] (4.1)

where ys is the immediate "reward" of making the decision to send a victim of score s to

hospital h. This is simply the victim's survival probability. This immediate reward is

added to the expectation term E[ft'+1(vs;a h,aA;ph,ph)]. The expectation is calculated

over the set of reachable states, indicated by the tilde on each state variable, from the

current state. The boundary condition is given by:

ff(vs;a\aA;p\pA) = 0 (4.2)

4.4.3 The Monte Carlo Simulation

The immediate purpose of the Monte Carlo (MC) simulation portion of the algorithm is

meant to help predict the probability of occurrence of the status of several state variables

80

so they do not have to be explicitly modeled in the SDP. This allows a dramatic

reduction in the size of the state space of the SDP. For example, instead of explicitly

keeping track of each individual patient at the hospitals, the simple yes/no information

concerning the availability of beds at the hospitals is kept.

The beauty of using the MC simulation is that it is somewhat easy to keep track

of an "expanded state space." Another advantage is the ability to run the simulation for

an extended length of time to help find some resemblance of steady-state equilibrium.

By using the simulation scaling the condensed and aggregated SDP into a more

complex picture of the setting is possible. The Monte Carlo simulation enables the

scaling of the smaller SDP results to a problem of virtually any size that may be

encountered during a disaster. In Section 4.5, it is shown that as the results from the

Monte Carlo simulation are scaled back down, they closely approximate the results from

the SDP.

Within the simulation, the generated decision policy from the SDP is used to

make decisions at each decision epoch. Each of the various SDP-equivalent states

visited during the simulation is recorded. From this information the transition probabilities

between the different load groups and the probability of each resource being available in

the next time period are calculated. These probability distributions are then fed back to

another iteration of the heuristic that solves the SDP again. The pseudo-code for the MC

is shown in Figure 4.3, but to fully understand it, some preliminary information is

necessary.

Recall that in the SDP, the state is defined as (vs;a h,aA;ph,pA). In the MC

simulation, an "extended state", referred to as s ta tes im, is utilized instead. It is

composed of a time period, an array of awaiting victims, a set of hospitals, and a set of

ambulances. Each hospital keeps track of individual victims/patients that arrive to it from

81

the MCI. Each patient has an arrival time, a score level, a care time, and a "release"

time. From the set of patients, the hospital's associated load can be calculated. Each

ambulance in the set keeps track of the type of victim it is transporting, when it left the

casualty collection area, which hospital is it traveling to, and when it will return to the

casualty collection area. The load on the ambulances can be calculated from this

information. The MC must translate the s ta tes im to the state that is understood by the

SDP in order to look up the corresponding decision from the policy table. The MC scaled

up by a user-prescribed factor. In the experiments of Section 4.5, a factor of 60 is used,

meaning that the total number of time periods considered is 480 (since there are only

eight "real" decision points in the SDP).

When scaling is employed, translating from the s t a t e s i m to an SDP state is a

little tricky because the time periods and the number of awaiting victims will not match

because there are, for example, 60 times more time periods and initial awaiting victims

in the s ta tes im when compared to the SDP state. To reconcile these differences, the

"ceiling" of the result from taking the s ta tes im time period and dividing by the scaling

factor, e.g., 60, is used as the corresponding time period for the SDP state. Similarly,

each of the victim score levels is translated in the same manner.

During the simulation a warm-up period is used because the starting state

includes "empty" hospitals and all of the ambulances available for transporting victims.

After the warm-up period, each of the corresponding SDP states is captured for the

remaining portion of the time horizon. After the time horizon has been exhausted, the

new transition probabilities can be gleaned from the recorded SDP states.

82

Figure 4.3: The pseudo-code describing the Monte Carlo simulation step of the scenario iteration methodology.

t = 1; visitedSDPStates := empty; Initialize expandedState := (t; [vs]; [H]; [A]);

Do SDP state = convert(expandedState) If (t > warm-up time)

visitedSDPStates.add(SDP state);

dj = lookupDecision(SDP state); expandedState.[A] .nextAvailable().update(); expandedState.[Hn] .update(); expandedState. [vs] .update () ; expandedState.t++ ;

expandedState.[Hh] .dischargePatients(); Until t = T;

Iterate over visitedSDPStates calculate the new transition probabilities

Return new transition probabilities (to be tested against previous ones for convergence)

4.4.3 Convergence

There are numerous ways to test for convergence of the probability matrices between

SDP and MC iterations. As an example, descriptions of three of the ideas considered are

discussed. Two of the measures were ultimately rejected in favor of the third. Initially, the

idea was to test each element of each transition matrix against the corresponding

element of the previous matrices to see if each element was within a pre-specified

distance, e. This check turns out to be too strict because it means that every element of

every matrix must be within a certain percentage after the same iteration. The second

idea for testing for convergence considered was to check how close the objective

83

function value for the SDP was to the one from the previous iteration. This check is not

strict enough because it is too aggregated. That is, many different fluctuations could

occur within different transition matrices and still end up with an objective function that is

in roughly the same vicinity of another.

The convergence metric used in this chapter strikes a balance between being too

restrictive and too lenient. For each probability matrix, a single metric is calculated and

used it to determine if this probability matrix is within the pre-specified epsilon of the

corresponding previous one. In particular, each entry of the probability matrix is weighted

by its associated load group and weighted values are added up. The logic behind

weighting the probabilities by their load groups is as follows. For true convergence to

occur, the load group probabilities will be "close" to the previous ones. Weighting each

probability by its associated load group will provide a decent proxy for this "closeness"

measure. Intuitively, using this measure helps ensure that the system will not jump too

far away from load group that it was in during the previous iteration.

Consider the following simplified example to understand the process of checking

for convergence. Suppose the following probability matrix represents the availability of

an ambulance in the next time period when considering three load groups: qf =

[0.9 0.6 0.1]. Note that the elements do not have to sum to one. The first element

means that there is a 90% chance that an ambulance will be available next time period if

the system is currently in the first load group. (Obviously, this corresponds to a 10%

chance that no ambulance will be available during the next time period.) Similarly, the

second element indicates a 60% chance of an available ambulance in the next time

period if the system is currently in the second load group. After a single iteration of the

scenario iteration process, the Monte Carlo simulation gives a new probability matrix of

qf = [o.7 0.4 0.0]. If e is 0.05, has convergence occurred yet? First, calculate the

84

metric for the two probability matrices, the previous one and the current one. The

corresponding metric for the previous matrix is (0.9 x 1 + 0.6 x 2 + 0.1 x 3) = 2.4. The

current matrix has a corresponding result of 1.5. Now compare the two metrics by

looking at the absolute value of (current metric - previous metric) / previous metric and

see if it is within the given epsilon. In this small illustrative example, convergence has not

occurred yet because the result is 0.375. This result indicates that the new probability

matrices should be fed into the next iteration and used in the solving of the SDP.

This check is performed for each probability matrix found in the SDP. In other

words, 2 x (\H\ + 1) different matrices need to be checked. If any one of these fall

outside of the pre-specified value of epsilon, the conclusion is that convergence has not

occurred yet and another iteration of the process begins.

A time or iteration cutoff can also be used during the determination of whether

the process has converged. So, for example, in the experiments in Section 4.5, a cutoff

of six iterations is utilized. On the machine running these experiments, this equated to

approximately one hour of CPU time. More details can be found Section 4.5.

4.5. Experimental Analysis

Two main avenues are explored. First, an examination of how the model performs under

various configurations is desired. Second, the performance of the scenario iteration

heuristic process itself is of interest. For both investigations, a common experimental

setup is used and it is described next.

4.5.1 Experimental Setup

The SDP contains a total of nine time periods with the last one having only the option of

"do nothing." So, in effect, there are eight decision points. Two score levels, referred to

as Score 1 and Score 2 for simplicity, are considered. For the SDP, each score level can

85

have at most four victims awaiting transport at the casualty collection area. Two

hospitals are investigated, where one is "local" and one is "remote". Each hospital can

be in one of a total of five load groups. Similarly, five load groups are defined for the set

of ambulances. The size of the state space for the SDP that is used during Step 2 of the

scenario iteration process is therefore ( 9 x 5 x 5 x 2 x 2 x 2 x 5 x 5 x 5 ) = 225,000

states.

Five different probability distribution patterns for travel time are considered.

Specifically, we look at constant travel times, uniformly distributed, normally distributed,

a left-skewed distribution, and a right-skewed distribution. Similarly, we examine five

different distributions for the care times. The care time distributions also follow either a

constant time, uniformly distributed, normally distributed, skewed left, or a skewed right

distribution. Care times are score-dependent. To give a general idea of the shape of the

distributions, Figure 4.4 provides a pictorial representation of the local travel time

distributions used in the experiments. The remote travel times follow the distributions

from the same families, but the mean of each distribution is higher because the "remote"

hospital is further away.

Three different levels of capacity at the hospitals are used in the experiments:

five, six, and seven "beds." The final parameter in the experiments involves how close

the survival probabilities of the scores are to one another. The specific settings for the

survival probabilities are shown in Table 4.1 and are termed "base", "squeezed", and

"spread".

86

Figure 4.4: The local travel time distributions used in the experiments.

XI «a XI o

0.45

0.40

0.35 -

0.30

0.25

0.20

0.15

0.10 j

0.05 |

0.00 4-

0

Local Travel Time Distributions \

• Uniform

Normal

LeftSkewed

RightSkewecl

\

\ %. /

/ X ' • •

10 20 30 40

Minutes

50

V„..,

For all of the design points, in the SDP the start point for t = 1 in state is

(vs;a h,aA;ph,pA) = (v1,v2;a

L,aR,aA;pL,pR, pA) = (4,4; 1,1,1; 1,1,1).

That is, for both Score 1 and Score 2, there are four victims each waiting to transported

to a healthcare provider's facility; both hospitals have available beds and an ambulance

is available; and each of the load groups is at its minimum value. We set T = 9 for all

design points. During the scenario iteration process, a cutoff of six iterations is utilized if

the convergence has not yet been met. The heuristic is implemented in this fashion for

these experiments in order to be able to test a large number of different design points.

On the 2.0 GHz, dual core processor with 4.0 GB memory machine we were running the

experiments, a single iteration took approximately 15 minutes to complete. If the iteration

Table 4.1: The different survival probabilities used in the experiments.

Score 1 Score 2

& & * : • . '

Es|i=v>*# Illf'-̂ :

0.49 0.60 0.30

0.63 0.63 0.63

87

cutoff point (6 iterations) was reached before converging, it means approximately 1.5

hours was spent on each design point. With a large number of design points of interest,

this cutoff point seemed reasonable and the results in general do not appear to be

adversely affected (see below).

During the simulation portion of the experiments, a "warm-up" period (a user-

inputted parameter) is used before the start of recording data on the progress of the

simulation. The general idea is that it is better to let the system get to some steady-state

before data collected that will be used in the calculation of the new probability matrices is

started. Using this setup, the first exploration involves how well the model is doing. Then

the attention is turned to investigating the heuristic methodology itself.

4.5.2 Model Performance

To determine how well the model is doing, several different perspectives of the overall

outcome are provided. First, the care time for victims is held constant and the survival

probability is fixed at the base level. For each these 15 design points, the expected

number of survivors over the time horizon is captured. These results are shown in Table

4.2 (a). As the initial capacity of the hospitals increases, the results are monotonically

increasing with respect to each particular travel time distribution. This is the result

expected a priori: the greater the initial capacity at the hospitals, the higher the number

of survivors from the MCI. Basically, with additional capacity at each of the hospitals,

more victims are able to be "placed." The other noticeable result is that when the left-

skewed distribution is used for the travel times (hence they tend to be longer) the

objective function value is smaller than, say, the uniformly distributed travel times.

Similarly, the right-skewed travel time distribution provides a better outcome (larger

number of expected survivors) over the time horizon than, say, if the travel times are

normally distributed. Again, these results are expected a priori. Finally, these results

88

appear to confirm that the scenario iteration heuristic works well for a diverse set of

distributions for travel time and care times.

Table 4.2 (b) provides another example of the results indicating the model reacts

in the expected way. Here, the left-skewed care time distribution is used while holding

the survival probability at the base level. Again, the results are monotonic in each

particular row as expected. Table 4.4 (a) shows similar results when using normal travel

times and normal care times, but varying the survival probabilities.

Figure 4.5 illustrates one specific design point and tracks each iteration as it

converges to the ending survival score of the SDP. We chose the design point

corresponding to hospital capacity of five, uniformly distributed travel times, constant

care time, and the base survival probabilities. We allow this design point run longer in

order to try to capture the possible oscillations in the survival score.

Table 4.2: Two examples showing the results of the model.

Uniform 2.70 2.79 2.89 Normal 2.66 2.74 2.87 feftSkewed 2.64 2.68 2.81 Right Skewed 2.77 2.85 2.95

(a) Constant care times and a base (b) Left skewed care times and a survival probability. base survival probability.

Uniform 3.41 3.47 3.49 Normal 3.18 3.28 3.29 Left Skewed 2.86 2.93 3.02 RightSkewed 4.08 4.16 4.31

89

Figure 4.5: Convergence of the scenario iteration heuristic's survival score for the hospital capacities of five patients, uniformly distributed travel times, constant care times, and the base survival probabilities.

3.70

3.50

<u 3.30 o u (A

« 3.10

<̂ 2.90

2.70

2.50

Convergence of Scenario Iteration Heuristic

10 12

Iterations

4.5.3 Performance of the Heuristic Methodology

It is also important to investigate how the proposed scenario iteration methodology itself

performs. To understand how the heuristic reacts, the number of iterations each design

point required to converge was recorded. A few representative results are shown in

Table 4.3. The term "6*" indicates that the process stopped due to the imposed iteration

count limit. Additionally, see Table 4.4 (b) for how the heuristic methodology reacts when

using normal travel times and normal care times, but the survival probabilities vary.

4.5.4 Ability to Scale

As discussed earlier, one of the advantages to using the scenario iteration heuristic is

the ability to scale up the smaller-sized SDP results to a larger, and perhaps more

realistic, sized problem. In the experiments, an initial SDP state with four victims of each

patient class was used and then progressed through the time horizon of nine time

periods. For the simulation step of the process, a scaling factor of 60 was used. This

90

Table 4.3: Two examples showing the results of the how the scenario iteration process converges.

Constant Uniform Normal Left Skewed Right Skewed

(a) Constant care times and a base survival probability.

Constant Uniform Normal LeftSkewed Right Skewed

(b) Left skewed care times and a base survival probability.

scaling factor was chosen because it gave us approximately 500 time periods and a

large number of victims. At the end of the Monte Carlo simulation the survival score can

be calculated for the simulation and compared to the survival score generated by the

SDP. As Figure 4.6 shows, once the Monte Carlo results have been scaled back down

they are quite close to the SDP results. Here we use the design point corresponding to

hospital capacity of five, uniformly distributed travel times, constant care times, and the

base survival probabilities. We interpret the closeness of the two results to mean that the

scaling up during the simulation works well.

91

Figure 4.6: An evaluation of the SDP results to the results of the Monte Carlo simulation (after they have been scaled back down).

3.70

3.50

(U s_ o u </i

1 1

Su r

3.30 •-

3.10

2.90

Evaluation of the SDP and Monte Carlo Results

2.70

2.50

— SDP

— Monte Carlo

6 8

Iteration #

10 12

Table 4.4: Examples of how the different survival probabilities affect the survival scores and the number of iterations until convergence.

(a) Normal travel times and normal

care times.

(b) Normal travel times and normal

care times.

92

4.6. Conclusion

The problem of evacuating victims of an MCI in a dynamic setting was investigated in

this chapter. To accurately describe the problem in this setting, the state space grows

large quickly making it a challenge to explore the enormous state space for the optimal

answer. The novel methodology proposed, scenario iteration, significantly reduces the

state space helping overcome the "curse of dimensionality." The heuristic framework

combines stochastic dynamic programming with Monte Carlo simulation, resulting in a

final decision policy that can be easily understood and scaled up to meet the specific

attributes of a given MCI. Using the proposed scenario iteration methodology, the

dynamic setting is explored through the use of experiments. The model reacts in many

of the anticipated manners and appears to be providing policies that are better than the

heuristics currently used in practice. Additionally, the methodology itself seems robust

and flexible, making it applicable to other settings where the solving the stochastic

sequential decision problem using the classical approach is prohibitive. It should be

noted, however, that there is some art to implementing the scenario iteration

methodology including deciding which state variables should be estimated with the

equivalent "load groups" (using the language of this research) instead of explicitly

modeled. These decisions also directly affect the Monte Carlo implementation which

estimates the corresponding transition probabilities.

An avenue that should be explored in future research is the design of a proper

experiment to compare the dynamic model to the deterministic model described in

Chapter 3. Another interesting avenue for future research includes exploring different

ways to dynamically discover and modify the load groups as the process progresses.

93

Chapter 5 - Conclusion and Future Directions

In the preceding sections, several challenges that occur in the healthcare operations

field were examined and solutions were presented to provide improvements within the

decision making environments surrounding these operations. The three problems

examined in the paper, while distinct, demonstrate both the need and the opportunities

to improve decision making in diverse healthcare settings. Many of the current

procedures utilized by the decision makers in the different settings examined are not

providing the optimal outcomes. This dissertation attempts to address this fact by

providing better dynamic, real-time allocation decision aids for each of the settings.

We first examine the decision making process involved with allocating testing

time slots for a Cardiac Diagnostic Testing Center (CDTC). The challenge comes with

allocating the time slots among multiple competing patient classes. Real-time scheduling

decisions made in the CDTC are highly complex and have direct revenues and costs

associated with the patients undergoing cardiac diagnosis and the costs associated with

other hospital resources, such as bed availability. Thus, the decisions made in the CDTC

have a significant impact on the entire hospital.

We provide a superior solution for the CDTC's scheduling problem through the

implementation of a dynamic model that incorporates a multi-commodity network flow

formulation. We used the term MFSM to refer to the proposed model. The formulation

introduced allows for the possibility of "bumping" different patient classes in order to

accommodate requests from other patient classes. The solution is shown, through

extensive simulation studies using a partner hospital's data, to provide significant

improvement in the quality of service provided to inpatients while ensuring little to no

94

degradation in the quality of service provided to outpatients. Using the online decision

support tool and framework provided, hospital administrators are also able to make

better informed and more profitable decisions when it comes to long-term capacity

planning.

We then investigate the decision making involved in determining which of several

area hospitals to send victims of a Mass Casualty Incident (MCI). The heuristics

currently used after an MCI are not able to take into account complexities such as

resource availability and the survival probabilities of patients and seldom accomplish the

oft-cited goal of doing the most good for the most people. We first look at the problem in

a deterministic setting and, focusing on the critical time period immediately following the

onset of an MCI, develop a model that enhances the Sacco Treatment Method (STM)

model. The STM model is currently the only model that quantifies the term "greatest

good to the greatest number." The enhancements proposed in this dissertation help

overcome the difficulty in realistically implementing the STM model in real-time settings.

Our enhanced model, formulated as a mixed-integer program, has an average

improvement of 36.5% and 69.0% in the expected number of survivors when compared

to results from the STM model. Additionally, significant improvements are shown when

our proposed model is used in place of three commonly used policies for allocating

victims to hospitals during an MCI.

Finally, we look at the problem of evacuating the victims of an MCI in a dynamic

setting. The challenge in this setting lies in exploring the large state space created by the

state variables that need to be modeled. The novel methodology proposed significantly

reduces the state space helping overcome the "curse of dimensionality." The heuristic

framework combines stochastic dynamic programming with Monte Carlo simulation,

resulting in a final decision policy that can be easily understood and scaled up to meet

the specific attributes of a given MCI. Using our proposed "scenario iteration"

95

methodology, we compare the dynamic model results with the results from the static

model, discussed earlier, and provide managerial insights that can be applied in the

event of a MCI.

While the three essays contained in this dissertation make solid progress

towards helping use healthcare resources more efficiently, there is still much to be done

in the general healthcare setting. The three problems presented here are rich enough

that some future fruitful work can be explored. From the CDTC chapter, while we allude

and suggest that strategic level capacity decisions can benefit from our work, it would be

interesting to fully develop a decision support system whose main goal is to focus on

these types of decisions. Partnering with a supportive hospital with buy-in to the idea

would be vital to ensuring the success and use of such a tool.

Within the MCI setting, perhaps the most potential for fruitful future research is

within the stochastic setting. For example, can "optimal" scaling parameters be found for

use within Monte Carlo simulation step of the scenario iteration process? Is there a

minimum threshold that should be used for the scaling? Another interesting avenue

might be to see if there is an inventive way to be able to include many more score levels

within the heuristic. Incorporating them directly into the SDP intuitively seems somewhat

prohibitive. Can the addition of score levels somehow be accomplished in the Monte

Carlo portion?

One of the differences between the MIP enhanced model of Chapter 3 and the

scenario iteration heuristic of Chapter 4 is that the MIP can somewhat easily look at

hospital resources on a score-level basis whereas the scenario iteration methodology

has more difficulty implementing this effectively. The straight-forward approach to

incorporating this option in the scenario iteration process is to expand the SDP state

definition. However, a large state space for the SDP is exactly what we wish to avoid. Is

96

there an imaginative way to reconcile this difference between the two methods without

increasing the computational costs of the scenario iteration process too much?

Additionally, the methodology proposed in this dissertation has the strong

potential to be useful in other practical problem settings. Are there settings that are

"better" than others? If this is case, then why does it occur? Are some seemingly well-

matched problem settings that have the end result that scenario iteration is not that

effective? What are the common characteristics of the problem settings that indicate a

favorable outcome, and conversely an unfavorable one, when scenario iteration is used

in practice?

97

References

Nezih Altay and Walter G. Green, III. OR/MS research in disaster operations

management. European Journal of Operational Research, 175(3):475-493, 2006.

Jay April, Fred Glover, James P. Kelly, and Manuel Laguna. Practical Introduction to

Simulation Optimization. Proceedings of the 2003 Winter Simulation Conference.

71 - 7 8 .

Gulay Barbarosoglu and Y Arda. A two-stage stochastic programming framework for

transportation planning in disaster response. Journal of the Operational

Research Society, 55(1): 122 43-53, 2004.

Gulay Barbarosoglu, Linet Ozdamar, and Ahmet Cevik. An interactive approach for

hierarchical analysis of helicopter logisitics in disaster relief operations. European

Journal of Operational Research, 140(1): 118-133, 2002.

P. P. Belobaba. Application of a Probabilistic Decision Model to Airline Seat Inventory

Control. Operations Research, 37(2): 183-197, 1989.

G. R. Bitran and S. M. Gilbert. Managing Hotel Reservations with Uncertain Arrivals.

Operations Research, 44(1):35^9, 1996.

98

Margaret L. Brandeau, Jessica H. McCoy, Nathaniel Hupert, Jon-Erik Holty, and Dena

M. Bravata. Recommendations for modeling disaster responses in public health

and medicine: A position paper of the society for medical decision making.

Medical Decision Making, 29(4):438-460, 2009.

M. L. Brandeau, F. Sainfort, and W. P. Pierskalla, editors. Operations Research and

Health Care: A Handbook of Methods and Applications. International Series in

Operations Research and Management Science. Springer, 2004.

GG Brown and AL Vassiliou. Optmizing disaster relief-real-time operational and tactical

decision support. Naval Research Logistics, 40(1): 1-23, 1993.

William J. Carolan, James E. Hill, Jeffrey L. Kennington, Sandra Niemi, and Stephen J.

Wichmann. An Empirical Evaluation of the KORBX® Algorithms for Military Airlift

Applications. Operations Research, 38(2): 240-248, 1990.

W. J. Carrol and R. C. Grimes. Evolutionary Change in Product Management:

Experiences in the Car Rental Industry. Interfaces, 25(5):84-104, 1995.

Yi-Chang Chiu and Hong Zheng. Real-time mobilization decisions for multi-priority

emergency response resources and evacuation groups: Model formulation and

solution. Transportation Research Part E, 43(6): 710 - 736, 2007.

P. Maria Joseph Christie and Reuven R. Levary. The use of simulation in planning the

transportation of patients to hospitals following a disaster. Journal of Medical

Systems, 22(5):289-300, 1998.

99

David C. Cone, John Sera, Kevin Burnes, Donald S. MacMillan, Lias Kurland, and Carin

Van Gelder. Pilot test of the SALT mass casualty triage system. Prehospital

Emergency Care, 13 (4):536-540, 2009.

FN de Silva and RW Eglese. Integrating simulation modelling and GIS: Spatial decision

support for evacuation planning. Journal of the Operational Research Society,

51(4):423-430, 2000.

Martin Durbin and Karla Hoffman. The Dance of the Thirty-ton Trucks: Dispatching and

Scheduling in a Dynamic Environment. Operations Research, 56(1):3-19, 2008.

F. Fiedrich, F. Gehbauer, and U. Rickers. Optimized resource allocation for emergency

response after earthquake disasters. Safety Science, 35(1-3):41-57, 2000.

C. D. Flagle. Some Origins of Operations Research in the Health Services. Operations

Research, 50(1):52-60, 2002.

Simon French. Multi-attribute decision support in the event of a nuclear accident. Journal

of Multi-Criteria Decision Analysis, 5(1):39-57, 1996.

Eric R. Frykberg. Medical management of disasters and mass casualties from terrorist

bombings: How can we cope? The Journal of Trauma Injury, Infection, and

Critical Care, 53(2):201-212, 2002.

100

Eric R. Frykberg. Triage: Principles and practice. Scandinavian Journal of Surgery,

94:272-278, 2005.

M. Geraghty and E. Johnson. Revenue Management Saves National Car Rental.

Interfaces, 27(1): 107-127, 1997.

L. V. Green, S. V. Savin, and B. Wang. Managing Patient Service in a Diagnostic

Medical Facility. Operations Research, 54(1):11-25, 2006.

Lee D. Han, Fang Yuan, Shih-Miao Chin, and Holing Hwang. Global Optimization of

Emergency Evacuation Assignments. Interfaces, 36(6): 502 - 513, 2006.

Antoine G. Hobeika, Sigon Kim, and Robert E. Beckwith. A decision support system for

developing evacuation plans around nuclear power stations. Interfaces, 24(5):22-

35, 1994.

Lenworth M. Jacobs, Jr., Michelle M. Goody, and Anna Sinclair. The role of the trauma

center in disaster management. The Journal of Trauma Injury, Infection, and

Critical Care, 23(8):697-701, 1983.

Dai Jianshe, Wang Shuning, and Yang Xiaoyin. Computerized support systems for

emergency decision making. Annals of Operations Research, 51 (7):315-325,

1994.

101

Arun Jotshi, Qiang Gong, and Rajan Batta. Dispatching and routing of emergency

vehicles in disaster mitigation using data fusion. Soci-Economic Planning

Sciences, 43:1-24, 2009.

Christopher A. Kahn, Carl H. Schultz, Ken T. Miller, and Craig L. Anderson. Does

START triage work? An outcomes assessment after a disaster. Annals of

Emergency Medicine, 54(3): 424-430, 2009.

S.P. Kourniotis, C.T. Kiranoudis, and N.C. Markatos. A systematic approach to effective

chemical emergency management. Safety Science, 38(1):49-61, 2001.

Laura Landro. The informed patient: Hospitals step up disaster-preparedness. The Wall

Street Journal, pg. D.4, September 6, 2006.

Eva K. Lee, Siddhartha Maheshwary, Jacquelyn Mason, and William 210 Glisson.

Large-scale dispensing for emergency response to bioterrorism and infectious-

disease outbreak. Interfaces, 36(6):591-607, 2006a.

Eva K. Lee, Siddhartha Maheshwary, Jacuelyn Mason, and William Glisson. Decision

support system for mass dispensing of medications for infectious disease

outbreaks and bioterrorist attacks. Annals of Operations Research, 148:25-53,

2006b.

102

E. Brooke Lerner, Richard B. Schwartz, Phillip L. Coule, Eric S. Weinstein, David C.

Cone, Richard C. Hunt, Scott M. Sasswer, J. Marc Liu, Nikiah G. Nudell, Ian S.

Wedmore, Jeffrey Hammond, Eileen M. Bulger, Jeffrey P. Salomone, Teri L.

Sanddal, Graydon C. Lord, David Markenson, and Robert E. O'Connor. Mass

casualty triage: An evaluation of the data and development of a proposed

national guideline. Disaster Medicine and Public Health Preparedness,

2(Supplement 1):S25-S34, 2008.

Gl Mould. Assessing systems for offshore emergency evacuation. Journal of the

Operational Research Society, 52(4):401-408, 2001.

V. Lieberman and U. Yechiali. On the Hotel Overbooking Problem: An Inventory System

with Stochastic Cancellations. Management Science, 24(11):1117-1126, 1978.

Ying Liu, Xiaorong Lai, and Gang-Len Chang. Two-level Integrated Optimization System

for Planning of Emergency Evacuation. Journal of Transportation Engineering.

132(10): 800-807, 2006.

Linet Ozdamar, Ediz Ekinci, and Beste Kuciikyazici. Emergency logisitics planning in

natural disasters. Annals of Operations Research, 129(1-4):217-245, 2004.

E. Litvak, M. C. Long, A. B. Cooper, and M. L. McManus. Emergency Department

Diversion: Causes and Solutions. Academic Emergency Medicine, 8(11): 1108-

1110,2001.

103

KN Papamichail and Simon French. Generating feasible strategies in nuclear

emergencies-a constraint satisfaction problem. Journal of the Operational

Research Society, 50(6):617-626, 1999.

P. Parker. Move Care to a Higher Level with Emergency Systems. Nursing

Management, 35(9):82-84, 2004.

J. Patrick, M. L. Puterman, and M. Queyranne. Dynamic Multipriority Patient Scheduling

for a Diagnostic Resource. Operations Research, 56(6): 1507-1525, 2008.

Warren B. Powell. Approximate Dynamic Programming: Solving the Curses of

Dimensionality. 2007. Wiley Interscience.

Vladimir P. Reshetin and James L. Regens. Simulation modeling of antrhax spore

dispersion in a bioterrorism incident. Risk Analysis, 23(6): 1135-1145, 2003.

William J. Sacco and H. Champion. Field Testing of the Trauma Score. Honolulu, HI:

Proceedings of the Sixteenth Hawaii International Conference on System

Sciences, 1983.

William J. Sacco, D. Michael Navin, Katherine E. Fiedler, Robert K. Waddell II, William

B. Long, and Robert F. Buckman, Jr. Precise Formulation and Evidence-based

Application of Resource-constrained Triage, Academic Emergency Medicine,

12(8):759-770, 2005.

104

Kyu-Cheoul Shim, Darrell G. Fontane, and JohnW. Labadie. Spatial decision support

system for integrated river basin flood control. Journal of Water Resources

Planning and Management, 128(3): 190-201, 2002.

S. Thompson, M. Nunez, R. Garfinkel, and M. D. Dean. Efficient Short-Term Allocation

and Reallocation of Patients to Floors of a Hospital During Demand Surges.

Operations Research, 57(2):261-273, 2009.

World Health Organization (WHO). Mass Casualty Systems: Strategies and guidelines

for building health sector capacity. April 2007.

http://www.who.int/hac/techguidance/tools/mcm_guidelines_en.pdf. Last

accessed on March 30, 2010.

P. Daniel Wright, Matthew J. Liberatore, and Robert L. Nydick. A survey of operations

research models and applications in homeland security. Interfaces, 36(6):514-

529, 2006.

105