OM470ChenWSJTheScienceBehindYourLongWaitinLine_Oct2016WAb.pdf

OM470. Chen (Waiting line analysis. Little’s Law (B)) 1

The Science Behind Your Long Wait in Line by: Jo Craven McGinty

Oct 08, 2016

Click here to view the full article on WSJ.com

You’ve probably participated in this familiar dance: Given a choice of checkout lines, you’ve

somehow picked the slowest.

You could wait it out. You could chassé to another queue. Or you could bail out altogether. After

all, no one likes to wait. But are the other lines really faster?

When parallel lines feed multiple cashiers, you may not be in the slowest one, but chances are,

you also are not in the fastest.

Bill Hammack, a professor at the University of Illinois at Urbana-Champaign and YouTube’s

“Engineer Guy,” explained it like this:

Imagine three lines feeding three cash registers. Some shoppers will have more items than others,

or there may be a delay for something like a price check. The rate of service in the different lines

will tend to vary. If the delays are random, there are six ways three lines could be ordered from

fastest to slowest—1-2-3, 1-3-2, 2-1-3, 2-3-1, 3-1-2 or 3-2-1. Any one of the three (including the

one you picked) is quickest in only two of the permutations, or one-third of the time.

There are two sources of variability, according to Linda V. Green, a professor of health-care

management at the Columbia University Business School who specializes in mathematical

models of service systems.

“When the demands for service come in and how long it takes the servers to process them,” she

said. “That inevitably causes temporary mismatches between supply and demand and hence

backups, delays and congestion.”

OM470. Chen (Waiting line analysis. Little’s Law (B)) 2

Luckily, most service providers take steps to manage the wait. A supermarket with parallel lines

reserves some registers for customers with fewer items. Airline security uses a single serpentine

line to feed multiple agents to mitigate bottlenecks at individual checkpoints. Emergency rooms

and 911 dispatchers give priority to those whose needs are most urgent.

Each approach is based on queuing theory, or the mathematical study of lines.

Queues can be trivial, like a line at an ATM, or they can be serious, like a list of people waiting

for an organ transplant, said Richard Larson, director of Massachusetts Institute of Technology’s

Center for Engineering Systems and an expert on queuing, but the fundamentals are the same: A

basic queue funnels clients demanding service to one or more servers who respond. If the servers

are busy, other demands must wait.

The clients may include a line of people, a series of 911 calls, or a string of commands issued

over a computer network (think of a printer queue). The servers are the cashiers, the dispatchers

or the devices that respond.

Queuing theory helps untangle the mess of requests, or at least smooth it out, by estimating the

number of servers needed to meet demand over a given period and designing rules for advancing

the queue.

The best system depends on the situation. “First come, first served” is most familiar, and people

often prefer it because it seems fair. But most also accept that a heart attack should take

precedence over a sprained ankle or someone with five items shouldn’t have to wait behind a

procession of brimming shopping carts.

Queuing theory originated in 1908 when a Danish scientist named Agner Krarup Erlang went to

work for the Copenhagen Telephone Company and set about trying to determine how many

telephone trunks, or lines, were needed to serve callers.

The company could have provided a trunk for each telephone, Dr. Hammack said, but that would

have been wasteful because not everyone calls at the same time. It could have provided enough

trunks to handle the average number of calls, but too many would be blocked when the average

was exceeded.

“Think of it like a bike wheel,” said Dr. Larson. “The switchboard is the hub. Each spoke is a

wire coming in from a home to a human operator who can connect your wire to some other

spoke in the wheel. What should the capacity be? If it’s too big, you’ll spend too much money on

capital investments. If it’s too small customers won’t be able to be connected reliably.”

OM470. Chen (Waiting line analysis. Little’s Law (B)) 3

Customers waited in line to check out at an Ikea store in New York City in September. PHOTO: MICHAEL NAGLE/BLOOMBERG NEWS

Erlang’s basic formula included three parameters: the number of trunks, the number of calls per

hour and their average length. Adjusting the number of trunks in the formula raised or lowered

the probability that a call would be blocked during the busiest hour.

Since then, retail stores, banks, call centers, emergency rooms, manufacturing plants, computer

networks and all variety of queuing environments have used Erlang’s formulas or related models

to figure out how to manage their lines.

Meanwhile, clients—at least the human variety—can’t help but wonder if there isn’t a way to

game the system.

Raj Jain, a computer-science and engineering professor at Washington University in St. Louis

takes an analytical approach: When he’s in line, he times the service. “Then I count the number

of people ahead of me, and I know how much time I am to wait.”

Dr. Larson takes a different approach. “I just strike up a conversation with an adjacent queue

dweller,” he said, “and wait.”

OM470. Chen (Waiting line analysis. Little’s Law (B)) 4

Appendix:

Erlang B & Erlang B Formula

- details & tutorial describing the Erlang B unit, and Erlang B formula developed from the basic

Erlang unit to handle blocked calls into the calculations.

ERLANG TUTORIAL INCLUDES

 Erlang telecommunications tutorial

 Erlang B

 Erlang C

The Erlang B is used to work out how many lines are required from a knowledge of the traffic figure during the busiest hour. The Erlang B figure assumes that any blocked calls are cleared immediately. This is the most commonly used figure to be used in any telecommunications capacity calculations.

Erlang B

It is particularly important to understand the traffic volumes at peak times of the day. Telecommunications traffic, like many other commodities, varies over the course of the day, and also the week. It is therefore necessary to understand the telecommunications traffic at the peak times of the day and to be able to determine the acceptable level of service required. The Erlang B figure is designed to handle the peak or busy periods and to determine the level of service required in these periods.

Essentially, the Erlang B traffic model is used by telephone system designers to estimate the number of lines required for PSTN connections or private wire connections. The three variables involved are Busy Hour Traffic (BHT), Blocking and Lines.

Busy Hour Traffic (in Erlangs) is the number of hours of call traffic there are during the busiest hour of operation of a telephone system.

Blocking is the failure of calls due to an insufficient number of lines being available. E.g. 0.03 mean 3 calls blocked per 100 calls attempted.

Lines is the number of lines in a trunk group.

The Extended Erlang B is similar to Erlang B, but it can be used to factor in the number of calls that are blocked and immediately tried again.

Erlang B formula

Where: B=Erlang B loss probability N=Number of trunks in full availability group A=Traffic offered to group in Erlangs