Assignment

profilenickyr
chapter1-converted.pdf

Part I Introduction to Analytics and AI

Chapter 1Overview of Business Intelligence, Analytics, Data Science, and Artificial Intelligence: Systems for Decision Support

Learning Objectives • Understand the need for computerized support of managerial decision making • Understand the development of systems for providing decision-making support • Recognize the evolution of such computerized support to the current state of

analytics/data science and artificial intelligence • Describe the business intelligence (BI) methodology and concepts • Understand the different types of analytics and review selected applications • Understand the basic concepts of artificial intelligence (AI) and see selected

applications • Understand the analytics ecosystem to identify various key players and career

opportunities

The business environment (climate) is constantly changing, and it is becoming more and more

complex. Organizations, both private and public, are under pressures that force them to respond

quickly to changing conditions and to be innovative in the way they operate. Such activities

require organizations to be agile and to make frequent and quick strategic, tactical, and

operational decisions, some of which are very complex. Making such decisions may require

considerable amounts of relevant data, information, and knowledge. Processing these in the

framework of the needed decisions must be done quickly, frequently in real time, and usually

requires some computerized support. As technologies are evolving, many decisions are being

automated, leading to a major impact on knowledge work and workers in many ways.

This book is about using business analytics and artificial intelligence (AI) as a computerized

support portfolio for managerial decision making. It concentrates on the theoretical and

conceptual foundations of decision support as well as on the commercial tools and techniques

that are available. The book presents the fundamentals of the techniques and the manner in which

these systems are constructed and used. We follow an EEE (exposure, experience, and exploration) approach to introducing these topics. The book primarily provides exposure to various analytics/AI techniques and their applications. The idea is that students will be inspired

to learn from how various organizations have employed these technologies to make decisions or

to gain a competitive edge. We believe that such exposure to what is being accomplished with

analytics and that how it can be achieved is the key component of learning about analytics. In

describing the techniques, we also give examples of specific software tools that can be used for

developing such applications. However, the book is not limited to any one software tool, so

students can experience these techniques using any number of available software tools. We hope

that this exposure and experience enable and motivate readers to explore the potential of these

techniques in their own domain. To facilitate such exploration, we include exercises that direct

the reader to Teradata University Network (TUN) and other sites that include team-oriented

exercises where appropriate. In our own teaching experience, projects undertaken in the class

facilitate such exploration after students have been exposed to the myriad of applications and

concepts in the book and they have experienced specific software introduced by the professor.

This introductory chapter provides an introduction to analytics and artificial intelligence as well

as an overview of the book. The chapter has the following sections:

1. 1.1 Opening Vignette: How Intelligent Systems Work for KONE Elevators and Escalators Company 3

2. 1.2 Changing Business Environments and Evolving Needs for Decision Support and Analytics 5

3. 1.3 Decision-Making Processes and Computer Decision Support Framework 9 4. 1.4 Evolution of Computerized Decision Support to Business Intelligence/Analytics/Data

Science 22 5. 1.5 Analytics Overview 30 6. 1.6 Analytics Examples in Selected Domains 38 7. 1.7 Artificial Intelligence Overview 52 8. 1.8 Convergence of Analytics and AI 59 9. 1.9 Overview of the Analytics Ecosystem 63 10. 1.10 Plan of the Book 65 11. 1.11 Resources, Links, and the Teradata University Network Connection 66

1.1 Opening Vignette: How Intelligent Systems Work for KONE

Elevators and Escalators Company

KONE is a global industrial company (based in Finland) that manufactures mostly elevators and

escalators and also services over 1.1 million elevators, escalators, and related equipment in

several countries. The company employs over 50,000 people.

The Problem

Over 1 billion people use the elevators and escalators manufactured and serviced by KONE every day. If equipment does not work properly, people may be late to work, cannot get home in time, and may miss important meetings and events. So, KONE’s objective is to minimize the downtime and users’ suffering.

The company has over 20,000 technicians who are dispatched to deal with the elevators anytime a problem occurs. As buildings are getting higher (the trend in many places), more people are using elevators, and there is more pressure on elevators to handle the growing amount of traffic. KONE faced the responsibility to serve users smoothly and safely.

The Solution

KONE decided to use IBM Watson IoT Cloud platform. As we will see in Chapter 6, IBM installed cognitive abilities in buildings that make it possible to recognize situations and behavior of both people and equipment. The Internet of Things (IoT), as we will see in Chapter 13, is a platform that can connect millions of “things” together and to a central command that can manipulate the connected things. Also, the IoT connects sensors that are attached to KONE’s elevators and escalators. The sensors collect information and data about the elevators (such as noise level) and other equipment in real time. Then, the IoT transfers to information centers via the collected data “cloud.” There, analytic systems (IBM Advanced Analytic Engine) and AI process the collected data and predict things such as potential failures. The systems also identify the likely causes of problems and suggest potential remedies. Note the predictive power of IBM Watson Analytics (using machine learning, an AI technology described in Chapters 4–6) for finding problems before they occur. The KONE system collects a significant amount of data that are analyzed for other purposes so that future design of equipment can be improved. This is because Watson Analytics offers a convenient environment for communication of and collaboration around the data. In addition, the analysis suggests how to optimize buildings and equipment operations. Finally, KONE and its customers can get insights regarding the financial aspects of managing the elevators.

KONE also integrates the Watson capabilities with Salesforce’s service tools (Service Cloud Lightning and Field Service Lightning). This combination helps KONE to immediately respond to emergencies or soon-to-occur failures as quickly as possible, dispatching some of its 20,000 technicians to the problems’ sites. Salesforce also provides superb customer relationship management (CRM). The people–machine communication, query, and collaboration in the system are in a natural language (an AI capability of Watson Analytics; see Chapter 6). Note that IBM Watson analytics includes two types of analytics: predictive, which predicts when failures may occur, and prescriptive, which recommends actions (e.g., preventive maintenance).

The Results

KONE has minimized downtime and shortened the repair time. Obviously, elevators/- escalators users are much happier if they do not have problems because of equipment downtime, so they enjoy trouble-free rides. The prediction of “soon-to-happen” can save many problems for the equipment owners. The owners can also optimize the schedule of their own employees (e.g., cleaners and maintenance workers). All in all, the decision makers at both KONE and the buildings can make informed and better decisions. Some day in the future, robots may perform maintenance and repairs of elevators and escalators.

NOTE: This case is a sample of IBM Watson’s success using its cognitive buildings capability. To learn more, we suggest you view the following YouTube videos: (1) youtube.com/watch?v=6UPJHyiJft0 (1:31 min.) (2017); (2) youtube.com/watch?v=EVbd3ejEXus (2:49 min.) (2017).

Sources: Compiled from J. Fernandez. (2017, April). “A Billion People a Day. Millions of Elevators. No Room for Downtime.” IBM

developer Works Blog. developer.ibm.com/dwblog/2017/kone-watson-video/ (accessed September 2018); H. Srikanthan. “KONE

Improves ‘People Flow’ in 1.1 Million Elevators with IBM Watson IoT.” Generis. https://generisgp.com/2018/01/08/ibm-case-study-

kone-corp/ (accessed September 2018); L. Slowey. (2017, February 16). “Look Who’s Talking: KONE Makes Elevator Services Truly

Intelligent with Watson IoT.” IBM Internet of Things Blog. ibm.com/blogs/internet-of-things/kone/ (accessed September 2018).

Questions for the Opening Vignette

1. It is said that KONE is embedding intelligence across its supply chain and enables smarter buildings. Explain.

2. Describe the role of IoT in this case. 3. What makes IBM Watson a necessity in this case? 4. Check IBM Advanced Analytics. What tools were included that relate to this case? 5. Check IBM cognitive buildings. How do they relate to this case?

What Can We Learn from This Vignette?

Today, intelligent technologies can embark on large-scale complex projects when they include

AI combined with IoT. The capabilities of integrated intelligent platforms, such as IBM Watson,

make it possible to solve problems that were economically and technologically unsolvable just a

few years ago. The case introduces the reader to several of the technologies, including advanced

analytics, sensors, IoT, and AI that are covered in this book. The case also points to the use of

“cloud.” The cloud is used to centrally process large amounts of information using analytics and

AI algorithms, involving “things” in different locations. This vignette also introduces us to two

major types of analytics: predictive analytics (Chapters 4–6) and prescriptive analytics

(Chapter 8).

Several AI technologies are discussed: machine learning, natural language processing, computer

vision, and prescriptive analysis.

The case is an example of augmented intelligence in which people and machines work together. The case illustrates the benefits to the vendor, the implementing companies, and their employees

and to the users of the elevators and escalators.

1.2 Changing Business Environments and Evolving Needs for Decision Support and Analytics

Decision making is one of the most important activities in organizations of all kind—

probably the most important one. Decision making leads to the success or failure of

organizations and how well they perform. Making decisions is getting difficult due to

internal and external factors. The rewards of making appropriate decisions can be very

high and so can the loss of inappropriate ones.

Unfortunately, it is not simple to make decisions. To begin with, there are several types

of decisions, each of which requires a different decision-making approach. For

example, De Smet et al. (2017) of McKinsey & Company management consultants

classify organizational decision into the following four groups:

• Big-bet, high-risk decisions.

• Cross-cutting decisions, which are repetitive but high risk that require group work

(Chapter 11).

• Ad hoc decisions that arise episodically.

• Delegated decisions to individuals or small groups.

Therefore, it is necessary first to understand the nature of decision making. For a

comprehensive discussion, see (De Smet et al. 2017).

Modern business is full of uncertainties and rapid changes. To deal with these,

organizational decision makers need to deal with ever-increasing and changing data.

This book is about the technologies that can assist decision makers in their jobs.

Decision-Making Process

For years, managers considered decision making purely an art—a talent acquired over a long

period through experience (i.e., learning by trial and error) and by using intuition. Management

was considered an art because a variety of individual styles could be used in approaching and

successfully solving the same types of managerial problems. These styles were often based on

creativity, judgment, intuition, and experience rather than on systematic quantitative methods

grounded in a scientific approach. However, recent research suggests that companies with top

managers who are more focused on persistent work tend to outperform those with leaders whose

main strengths are interpersonal communication skills. It is more important to emphasize

methodical, thoughtful, analytical decision making rather than flashiness and interpersonal

communication skills.

Managers usually make decisions by following a four-step process (we learn more about these in

the next section):

1. Define the problem (i.e., a decision situation that may deal with some difficulty or with an opportunity).

2. Construct a model that describes the real-world problem. 3. Identify possible solutions to the modeled problem and evaluate the solutions. 4. Compare, choose, and recommend a potential solution to the problem.

A more detailed process is offered by Quain (2018), who suggests the following steps:

1. Understand the decision you have to make.

2. Collect all the information. 3. Identify the alternatives. 4. Evaluate the pros and cons. 5. Select the best alternative. 6. Make the decision. 7. Evaluate the impact of your decision.

We will return to this process in Section 1.3.

The Influence of the External and Internal Environments on the Process

To follow these decision-making processes, one must make sure that sufficient alternative

solutions, including good ones, are being considered, that the consequences of using these

alternatives can be reasonably predicted, and that comparisons are done properly. However,

rapid changes in internal and external environments make such an evaluation process difficult for

the following reasons:

• Technology, information systems, advanced search engines, and globalization result in

more and more alternatives from which to choose.

• Government regulations and the need for compliance, political instability and terrorism,

competition, and changing consumer demands produce more uncertainty, making it more

difficult to predict consequences and the future.

▪ POLITICAL FACTORS. Major decisions may be influenced by both external and

internal politics. An example is the 2018 trade war on tariffs.

▪ ECONOMIC FACTORS. These range from competition to the genera and state of

the economy. These factors, both in the short and long run, need to be considered.

▪ SOCIOLOGICAL AND PSYCHOLOGICAL FACTORS REGARDING

EMPLOYEES AND CUSTOMERS. These need to be considered when changes are

being made.

▪ ENVIRONMENT FACTORS. The impact on the physical environment must be

assessed in many decision-making situations.

Other factors include the need to make rapid decisions, the frequent and unpredictable changes

that make trial-and-error learning difficult, and the potential costs of making mistakes that may

be large.

These environments are growing more complex every day. Therefore, making decisions today is

indeed a complex task. For further discussion, see Charles (2018). For how to make effective

decisions under uncertainty and pressure, see Zane (2016).

Because of these trends and changes, it is nearly impossible to rely on a trial-and-error approach

to management. Managers must be more sophisticated; they must use the new tools and

techniques of their fields. Most of those tools and techniques are discussed in this book. Using

them to support decision making can be extremely rewarding in making effective decisions.

Further, many tools that are evolving impact even the very existence of several decision-making

tasks that are being automated. This impacts future demand for knowledge workers and begs

many legal and societal impact questions.

Data and Its Analysis in Decision Making

We will see several times in this book how an entire industry can employ analytics to develop

reports on what is happening, predict what is likely to happen, and then make decisions to make

the best use of the situation at hand. These steps require an organization to collect and analyze

vast stores of data. In general, the amount of data doubles every two years. From traditional uses

in payroll and bookkeeping functions, computerized systems are now used for complex

managerial areas ranging from the design and management of automated factories to the

application of analytical methods for the evaluation of proposed mergers and acquisitions.

Nearly all executives know that information technology is vital to their business and extensively

use these technologies.

Computer applications have moved from transaction-processing and monitoring activities to

problem analysis and solution applications, and much of the activity is done with cloud-based

technologies, in many cases accessed through mobile devices. Analytics and BI tools such as

data warehousing, data mining, online analytical processing (OLAP), dashboards, and the use of

cloud-based systems for decision support are the cornerstones of today’s modern management.

Managers must have high-speed, networked information systems (wired or wireless) to assist

them with their most important task: making decisions. In many cases, such decisions are

routinely being fully automated (see Chapter 2), eliminating the need for any managerial

intervention.

Technologies for Data Analysis and Decision Support

Besides the obvious growth in hardware, software, and network capacities, some developments

have clearly contributed to facilitating the growth of decision support and analytics technologies

in a number of ways:

• GROUP COMMUNICATION AND COLLABORATION. Many decisions are made

today by groups whose members may be in different locations. Groups can collaborate and

communicate readily by using collaboration tools as well as the ubiquitous smartphones.

Collaboration is especially important along the supply chain, where partners—all the way

from vendors to customers—must share information. Assembling a group of decision

makers, especially experts, in one place can be costly. Information systems can improve the

collaboration process of a group and enable its members to be at different locations (saving

travel costs). More critically, such supply chain collaboration permits manufacturers to

know about the changing patterns of demand in near real time and thus react to marketplace

changes faster. For a comprehensive coverage and the impact of AI, see Chapters 2, 10,

and 14.

• IMPROVED DATA MANAGEMENT. Many decisions involve complex computations.

Data for these can be stored in different databases anywhere in the organization and even

possibly outside the organization. The data may include text, sound, graphics, and video,

and these can be in different languages. Many times it is necessary to transmit data quickly

from distant locations. Systems today can search, store, and transmit needed data quickly,

economically, securely, and transparently. See Chapters 3 and 9 and the online chapter for

details.

• MANAGING GIANT DATA WAREHOUSES AND BIG DATA. Large data

warehouses (DWs), like the ones operated by Walmart, contain huge amounts of data.

Special methods, including parallel computing and Hadoop/Spark, are available to organize,

search, and mine the data. The costs related to data storage and mining are declining

rapidly. Technologies that fall under the broad category of Big Data have enabled massive

data coming from a variety of sources and in many different forms, which allows a very

different view of organizational performance that was not possible in the past.

See Chapter 9 for details.

• ANALYTICAL SUPPORT. With more data and analysis technologies, more alternatives

can be evaluated, forecasts can be improved, risk analysis can be performed quickly, and

the views of experts (some of whom may be in remote locations) can be collected quickly

and at a reduced cost. Expertise can even be derived directly from analytical systems. With

such tools, decision makers can perform complex simulations, check many possible

scenarios, and assess diverse impacts quickly and economically.This, of course, is the focus

of several chapters in the book. See Chapters 4–7.

• OVERCOMING COGNITIVE LIMITS IN PROCESSING AND STORING

INFORMATION. The human mind has only a limited ability to process and store

information. People sometimes find it difficult to recall and use information in an error-free

fashion due to their cognitive limits. The term cognitive limits indicates that an individual’s problem-solving capability is limited when a wide range of diverse information and

knowledge is required. Computerized systems enable people to overcome their cognitive

limits by quickly accessing and processing vast amounts of stored information. One way to

overcome humans’ cognitive limitations is to use AI support. For coverage of cognitive

aspects, see Chapter 6.

• KNOWLEDGE MANAGEMENT. Organizations have gathered vast stores of

information about their own operations, customers, internal procedures, employee

interactions, and so forth through the unstructured and structured communications taking

place among various stakeholders. Knowledge management systems (KMS) have become

sources of formal and informal support for decision making to managers, although

sometimes they may not even be called KMS. Technologies such as text analytics and IBM Watson are making it possible to generate value from such knowledge stores.

(See Chapters 6 and 12 for details.

• ANYWHERE, ANYTIME SUPPORT. Using wireless technology, managers can access

information anytime and from any place, analyze and interpret it, and communicate with

those using it. This perhaps is the biggest change that has occurred in the last few years. The

speed at which information needs to be processed and converted into decisions has truly

changed expectations for both consumers and businesses. These and other capabilities have

been driving the use of computerized decision support since the late 1960s, especially since

the mid-1990s. The growth of mobile technologies, social media platforms, and analytical

tools has enabled a different level of information systems (IS) to support managers. This

growth in providing data-driven support for any decision extends not just to managers but

also to consumers. We will first study an overview of technologies that have been broadly

referred to as BI. From there we will broaden our horizons to introduce various types of

analytics.

• INNOVATION AND ARTIFICIAL INTELLIGENCE. Because of the complexities in

the decision-making process discussed earlier and the environment surrounding the process,

a more innovative approach is frequently need. A major facilitation of innovation is

provided by AI. Almost every step in the decision-making process can be influenced by AI.

AI is also integrated with analytics, creating synergy in making decisions (Section 1.8).

Section 1.2 Review Questions

1. Why is it difficult to make organizational decisions? 2. Describe the major steps in the decision-making process. 3. Describe the major external environments that can impact decision making. 4. What are some of the key system-oriented trends that have fostered IS-supported

decision making to a new level? 5. List some capabilities of information technologies that can facilitate managerial

decision making.

1.3 Decision-Making Processes and Computerized Decision Support Framework

In this section, we focus on some classical decision-making fundamentals and in more

detail on the decision-making process. These two concepts will help us ground much of

what we will learn in terms of analytics, data science, and artificial intelligence.

Decision making is a process of choosing among two or more alternative courses of

action for the purpose of attaining one or more goals. According to Simon (1977),

managerial decision making is synonymous with the entire management process.

Consider the important managerial function of planning. Planning involves a series of

decisions: What should be done? When? Where? Why? How? By whom? Managers set

goals, or plan; hence, planning implies decision making. Other managerial functions,

such as organizing and controlling, also involve decision making.

Simon’s Process: Intelligence, Design, and Choice

It is advisable to follow a systematic decision-making process. Simon (1977) said that this

involves three major phases: intelligence, design, and choice. He later added a fourth phase:

implementation. Monitoring can be considered a fifth phase—a form of feedback. However, we

view monitoring as the intelligence phase applied to the implementation phase. Simon’s model is the most concise and yet complete characterization of rational decision making. A conceptual

picture of the decision-making process is shown in Figure 1.1. It is also illustrated as a decision

support approach using modeling.

Figure 1.1 The Decision-Making/Modeling Process.

There is a continuous flow of activity from intelligence to design to choice (see the solid lines

in Figure 1.1), but at any phase, there may be a return to a previous phase (feedback). Modeling

is an essential part of this process. The seemingly chaotic nature of following a haphazard path

from problem discovery to solution via decision making can be explained by these feedback

loops.

The decision-making process starts with the intelligence phase; in this phase, the decision maker examines reality and identifies and defines the problem. Problem ownership is established as well. In the design phase, a model that represents the system is constructed. This is done by making assumptions that simplify reality and by writing down the relationships

among all the variables. The model is then validated, and criteria are determined in a principle of

choice for evaluation of the alternative courses of action that are identified. Often, the process of

model development identifies alternative solutions and vice versa.

The choice phase includes the selection of a proposed solution to the model (not necessarily to the problem it represents). This solution is tested to determine its viability. When the proposed

solution seems reasonable, we are ready for the last phase: implementation of the decision (not

necessarily of a system). Successful implementation results in solving the real problem. Failure

leads to a return to an earlier phase of the process. In fact, we can return to an earlier phase

during any of the latter three phases. The decision-making situations described in the opening

vignette follow Simon’s four-phase model, as do almost all other decision-making situations.

The Intelligence Phase: Problem (or Opportunity) Identification

The intelligence phase begins with the identification of organizational goals and objectives

related to an issue of concern (e.g., inventory management, job selection, lack of or incorrect

Web presence) and determination of whether they are being met. Problems occur because of

dissatisfaction with the status quo. Dissatisfaction is the result of a difference between what

people desire (or expect) and what is occurring. In this first phase, a decision maker attempts to

determine whether a problem exists, identify its symptoms, determine its magnitude,

and explicitly define it. Often, what is described as a problem (e.g., excessive costs) may be only

a symptom (i.e., measure) of a problem (e.g., improper inventory levels). Because real-world

problems are usually complicated by many interrelated factors, it is sometimes difficult to

distinguish between the symptoms and the real problem. New opportunities and problems

certainly may be uncovered while investigating the causes of symptoms.

The existence of a problem can be determined by monitoring and analyzing the organization’s

productivity level. The measurement of productivity and the construction of a model are based

on real data. The collection of data and the estimation of future data are among the most difficult

steps in the analysis.

Issues in Data Collection

The following are some issues that may arise during data collection and estimation and thus

plague decision makers:

• Data are not available. As a result, the model is made with and relies on potentially

inaccurate estimates.

• Obtaining data may be expensive.

• Data may not be accurate or precise enough.

• Data estimation is often subjective.

• Data may be insecure.

• Important data that influence the results may be qualitative (soft).

• There may be too many data (i.e., information overload).

• Outcomes (or results) may occur over an extended period. As a result, revenues, expenses,

and profits will be recorded at different points in time. To overcome this difficulty, a

present-value approach can be used if the results are quantifiable.

• It is assumed that future data will be similar to historical data. If this is not the case, the

nature of the change has to be predicted and included in the analysis.

When the preliminary investigation is completed, it is possible to determine whether a problem

really exists, where it is located, and how significant it is. A key issue is whether an information

system is reporting a problem or only the symptoms of a problem. For example, if reports

indicate that sales are down, there is a problem, but the situation, no doubt, is symptomatic of the

problem. It is critical to know the real problem. Sometimes it may be a problem of perception,

incentive mismatch, or organizational processes rather than a poor decision model.

To illustrate why it is important to identify the problem correctly, we provide a classical example

in Application Case 1.1.

Application Case 1.1 Making Elevators Go Faster!

This story has been reported in numerous places and has almost become a classic example to

explain the need for problem identification. Ackoff (as cited in Larson, 1987) described the

problem of managing complaints about slow elevators in a tall hotel tower. After trying many

solutions for reducing the complaint—staggering elevators to go to different floors, adding

operators, and so on—the management determined that the real problem was not about

the actual waiting time but rather the perceived waiting time. So the solution was to install full- length mirrors on elevator doors on each floor. As Hesse and Woolsey (1975) put it, “The

women would look at themselves in the mirrors and make adjustments, while the men would

look at the women, and before they knew it, the elevator was there.” By reducing the perceived

waiting time, the problem went away. Baker and Cameron (1996) give several other examples of

distractions, including lighting and displays, that organizations use to reduce perceived waiting

time. If the real problem is identified as perceived waiting time, it can make a big difference in the proposed solutions and their costs. For example, full-length mirrors probably cost a whole lot

less than adding an elevator! Sources: Based on J. Baker and M. Cameron. (1996, September). “The Effects of the Service Environment on Affect and Consumer Perception of Waiting Time: An Integrative Review and Research Propositions,” Journal of the Academy of Marketing Science, 24, pp. 338–349; R. Hesse and G. Woolsey (1975). Applied Management Science: A Quick and Dirty Approach. Chicago, IL: SRA Inc; R. C. Larson. (1987, November/December). “Perspectives on Queues: Social Justice and the Psychology of Queuing.” Operations Research, 35(6), pp. 895–905.

Questions for Case 1.1

1. Why this is an example relevant to decision making? 2. Relate this situation to the intelligence phase of decision making.

Problem Classification

Problem classification is the conceptualization of a problem in an attempt to place it in a

definable category, possibly leading to a standard solution approach. An important approach

classifies problems according to the degree of structuredness evident in them. This ranges from

totally structured (i.e., programmed) to totally unstructured (i.e., unprogrammed).

Problem Decomposition

Many complex problems can be divided into subproblems. Solving the simpler subproblems may

help in solving a complex problem. Also, seemingly poorly structured problems sometimes have

highly structured subproblems. Just as a semistructured problem results when some phases of

decision making are structured whereas other phases are unstructured, and when some

subproblems of a decision-making problem are structured with others unstructured, the problem

itself is semistructured. As a decision support system is developed and the decision maker and

development staff learn more about the problem, it gains structure.

Problem Ownership

In the intelligence phase, it is important to establish problem ownership. A problem exists in an

organization only if someone or some group takes the responsibility for attacking it and if the

organization has the ability to solve it. The assignment of authority to solve the problem is

called problem ownership. For example, a manager may feel that he or she has a problem because interest rates are too high. Because interest rate levels are determined at the national and

international levels and most managers can do nothing about them, high interest rates are the

problem of the government, not a problem for a specific company to solve. The problem that

companies actually face is how to operate in a high interest-rate environment. For an individual

company, the interest rate level should be handled as an uncontrollable (environmental) factor to

be predicted.

When problem ownership is not established, either someone is not doing his or her job or the

problem at hand has yet to be identified as belonging to anyone. It is then important for someone

to either volunteer to own it or assign it to someone.

The intelligence phase ends with a formal problem statement.

The Design Phase

The design phase involves finding or developing and analyzing possible courses of action. These

include understanding the problem and testing solutions for feasibility. A model of the decision-

making problem is constructed, tested, and validated. Let us first define a model.

Models

A major characteristic of computerized decision support and many BI tools (notably those of

business analytics) is the inclusion of at least one model. The basic idea is to perform the

analysis on a model of reality rather than on the real system. A model is a simplified representation or abstraction of reality. It is usually simplified because reality is too complex to

describe exactly and because much of the complexity is actually irrelevant in solving a specific

problem.

Modeling involves conceptualizing a problem and abstracting it to quantitative and/or qualitative

form. For a mathematical model, the variables are identified and their mutual relationships are

established. Simplifications are made, whenever necessary, through assumptions. For example, a

relationship between two variables may be assumed to be linear even though in reality there may

be some nonlinear effects. A proper balance between the level of model simplification and the

representation of reality must be obtained because of the cost–benefit trade-off. A simpler model

leads to lower development costs, easier manipulation, and a faster solution but is less

representative of the real problem and can produce inaccurate results. However, a simpler model

generally requires fewer data, or the data are aggregated and easier to obtain.

The Choice Phase

Choice is the critical act of decision making. The choice phase is the one in which the actual

decision and the commitment to follow a certain course of action are made. The boundary

between the design and choice phases is often unclear because certain activities can be

performed during both of them and because the decision maker can return frequently from choice

activities to design activities (e.g., generate new alternatives while performing an evaluation of

existing ones). The choice phase includes the search for, evaluation of, and recommendation of

an appropriate solution to a model. A solution to a model is a specific set of values for the

decision variables in a selected alternative. Choices can be evaluated as to their viability and

profitability.

Each alternative must be evaluated. If an alternative has multiple goals, they must all be

examined and balanced against each other. Sensitivity analysis is used to determine the

robustness of any given alternative; slight changes in the parameters should ideally lead to slight

or no changes in the alternative chosen. What-if analysis is used to explore major changes in the

parameters. Goal seeking helps a manager determine values of the decision variables to meet a

specific objective. These topics are addressed in Chapter 8.

The Implementation Phase

In The Prince, Machiavelli astutely noted some 500 years ago that there was “nothing more difficult to carry out, nor more doubtful of success, nor more dangerous to handle, than to initiate

a new order of things.” The implementation of a proposed solution to a problem is, in effect, the

initiation of a new order of things or the introduction of change. And change must be managed.

User expectations must be managed as part of change management.

The definition of implementation is somewhat complicated because implementation is a long, involved process with vague boundaries. Simplistically, the implementation phase involves putting a recommended solution to work, not necessarily implementing a computer system.

Many generic implementation issues, such as resistance to change, degree of support of top

management, and user training, are important in dealing with information system–supported

decision making. Indeed, many previous technology-related waves (e.g., business process

reengineering [BPR] and knowledge management) have faced mixed results mainly because of

change management challenges and issues. Management of change is almost an entire discipline

in itself, so we recognize its importance and encourage readers to focus on it independently.

Implementation also includes a thorough understanding of project management. The importance

of project management goes far beyond analytics, so the last few years have witnessed a major

growth in certification programs for project managers. A very popular certification now is the

Project Management Professional (PMP). See pmi.org for more details. Implementation must also involve collecting and analyzing data to learn from the previous

decisions and improve the next decision. Although analysis of data is usually conducted to

identify the problem and/or the solution, analytics should also be employed in the feedback

process. This is especially true for any public policy decisions. We need to be sure that the data

being used for problem identification is valid. Sometimes people find this out only after the

implementation phase.

The decision-making process, though conducted by people, can be improved with computer

support, which is introduced next.

The Classical Decision Support System Framework

The early definitions of decision support system (DSS) identified it as a system intended to

support managerial decision makers in semistructured and unstructured decision situations. DSS

was meant to be an adjunct to decision makers, extending their capabilities but not replacing

their judgment. DSS was aimed at decisions that required judgment or at decisions that could not

be completely supported by algorithms. Not specifically stated but implied in the early

definitions was the notion that the system would be computer based, would operate interactively

online, and preferably would have graphical output capabilities, now simplified via browsers and

mobile devices.

An early framework for computerized decision support includes several major concepts that are

used in forthcoming sections and chapters of this book. Gorry and Scott-Morton created and used

this framework in the early 1970s, and the framework then evolved into a new technology

called DSS. Gorry and Scott-Morton (1971) proposed a framework that is a 3-by-3 matrix, as shown

in Figure 1.2. The two dimensions are the degree of structuredness and the types of control.

Figure 1.2 Decision Support Frameworks.

Degree of Structuredness

The left side of Figure 1.2 is based on Simon’s (1977) idea that decision-making

processes fall along a continuum that ranges from highly structured (sometimes

called programmed) to highly unstructured (i.e., non-programmed) decisions. Structured

processes are routine and typically repetitive problems for which standard solution

methods exist.

Unstructured processes are fuzzy, complex problems for which there are no cut-and-

dried solution methods.

An unstructured problem is one where the articulation of the problem or the solution

approach may be unstructured in itself. In a structured problem, the procedures for

obtaining the best (or at least a good enough) solution are known. Whether the problem

involves finding an appropriate inventory level or choosing an optimal investment

strategy, the objectives are clearly defined. Common objectives are cost minimization

and profit maximization.

Semistructured problems fall between structured and unstructured problems, having

some structured elements and some unstructured elements. Keen and Scott-Morton

(1978) mentioned trading bonds, setting marketing budgets for consumer products, and

performing capital acquisition analysis as semistructured problems.

Types of Control

The second half of the Gorry and Scott-Morton (1971) framework (refer to Figure 1.2) is based

on Anthony’s (1965) taxonomy, which defines three broad categories that encompass all

managerial activities: strategic planning, which involves defining long-range goals and policies for resource allocation; management control, the acquisition and efficient use of resources in the accomplishment of organizational goals; and operational control, the efficient and effective execution of specific tasks.

The Decision Support Matrix

Anthony’s (1965) and Simon’s (1977) taxonomies are combined in the nine-cell decision support

matrix shown in Figure 1.2. The initial purpose of this matrix was to suggest different types of

computerized support to different cells in the matrix. Gorry and Scott-Morton (1971) suggested,

for example, that for making semistructured decisions and unstructured decisions, conventional management information systems (MIS) and management science (MS) tools are

insufficient. Human intellect and a different approach to computer technologies are necessary.

They proposed the use of a supportive information system, which they called a DSS. Note that the more structured and operational control-oriented tasks (such as those in cells 1, 2,

and 4 of Figure 1.2) are usually performed by lower-level managers, whereas the tasks in cells 6,

8, and 9 are the responsibility of top executives or highly trained specialists.

Computer Support for Structured Decisions

Since the 1960s, computers have historically supported structured and some semistructured

decisions, especially those that involve operational and managerial control. Operational and

managerial control decisions are made in all functional areas, especially in finance and

production (i.e., operations) management.

Structured problems, which are encountered repeatedly, have a high level of structure, as their

name suggests. It is therefore possible to abstract, analyze, and classify them into specific

categories. For example, a make-or-buy decision is one category. Other examples of categories

are capital budgeting, allocation of resources, distribution, procurement, planning, and inventory

control decisions. For each category of decision, an easy-to-apply prescribed model and solution

approach have been developed, generally as quantitative formulas. Therefore, it is possible to use

a scientific approach for automating portions of managerial decision making. Solutions to many structured problems can be fully automated (see Chapters 2 and 12).

Computer Support for Unstructured Decisions

Unstructured problems can be only partially supported by standard computerized quantitative

methods. It is usually necessary to develop customized solutions. However, such solutions may

benefit from data and information generated from corporate or external data sources. Intuition

and judgment may play a large role in these types of decisions, as may computerized

communication and collaboration technologies, as well as cognitive computing (Chapter 6) and

deep learning (Chapter 5).

Computer Support for Semistructured Problems

Solving semistructured problems may involve a combination of standard solution procedures and

human judgment. Management science can provide models for the portion of a decision-making

problem that is structured. For the unstructured portion, a DSS can improve the quality of the

information on which the decision is based by providing, for example, not only a single solution,

but also a range of alternative solutions along with their potential impacts. These capabilities

help managers to better understand the nature of problems and, thus, to make better decisions.

Decision Support System: Capabilities

The early definitions of DSS identified it as a system intended to support managerial decision

makers in semistructured and unstructured decision situations. DSS was meant to be an adjunct

to decision makers, extending their capabilities but not replacing their judgment. It was aimed at

decisions that required judgment or at decisions that could not be completely supported by

algorithms. Not specifically stated but implied in the early definitions was the notion that the

system would be computer based, would operate interactively online, and preferably would have

graphical output capabilities, now simplified via browsers and mobile devices.

A DSS Application

A DSS is typically built to support the solution of a certain problem or to evaluate an

opportunity. This is a key difference between DSS and BI applications. In a very strict

sense, business intelligence (BI) systems monitor situations and identify problems and/or opportunities using analytic methods. Reporting plays a major role in BI; the user generally must

identify whether a particular situation warrants attention and then can apply analytical methods.

Again, although models and data access (generally through a data warehouse) are included in BI,

a DSS may have its own databases and is developed to solve a specific problem or set of

problems and are therefore called DSS applications.

Formally, a DSS is an approach (or methodology) for supporting decision making. It uses an

interactive, flexible, adaptable computer-based information system (CBIS) especially developed

for supporting the solution to a specific unstructured management problem. It uses data, provides

an easy user interface, and can incorporate the decision maker’s own insights. In addition, a DSS

includes models and is developed (possibly by end users) through an interactive and iterative

process. It can support all phases of decision making and may include a knowledge component.

Finally, a DSS can be used by a single user or can be Web based for use by many people at

several locations.

The Characteristics and Capabilities of DSS

Because there is no consensus on exactly what a DSS is, there is obviously no agreement

on the standard characteristics and capabilities of DSS. The capabilities

in Figure 1.3 constitute an ideal set, some members of which are described in the

definitions of DSS and illustrated in the application cases. Figure 1.3 Key Characteristics and Capabilities of DSS.

The key characteristics and capabilities of DSS (as shown in Figure 1.3) are as follows: 1. Supports decision makers, mainly in semistructured and unstructured situations,

by bringing together human judgment and computerized information. Such problems cannot be solved (or cannot be solved conveniently) by other

computerized systems or through use of standard quantitative methods or tools. Generally, these problems gain structure as the DSS is developed. Even some structured problems have been solved by DSS.

2. Supports all managerial levels, ranging from top executives to line managers. 3. Supports individuals as well as groups. Less-structured problems often require

the involvement of individuals from different departments and organizational levels or even from different organizations. DSS supports virtual teams through collaborative Web tools. DSS has been developed to support individual and group work as well as to support individual decision making and groups of decision makers working somewhat independently.

4. Supports interdependent and/or sequential decisions. The decisions may be made once, several times, or repeatedly.

5. Supports all phases of the decision-making process: intelligence, design, choice, and implementation.

6. Supports a variety of decision-making processes and styles. 7. Is flexible, so users can add, delete, combine, change, or rearrange basic elements.

The decision maker should be reactive, able to confront changing conditions quickly, and able to adapt the DSS to meet these changes. It is also flexible in that it can be readily modified to solve other, similar problems.

8. Is user-friendly, has strong graphical capabilities, and a natural language interactive human-machine interface can greatly increase the effectiveness of DSS. Most new DSS applications use Web-based interfaces or mobile platform interfaces.

9. Improves the effectiveness of decision making (e.g., accuracy, timeliness, quality) rather than its efficiency (e.g., the cost of making decisions). When DSS is deployed, decision making often takes longer, but the decisions are better.

10. Provides complete control by the decision maker over all steps of the decision-- making process in solving a problem. A DSS specifically aims to support, not to replace, the decision maker.

11. Enables end users to develop and modify simple systems by themselves. Larger systems can be built with assistance from IS specialists. Spreadsheet packages have been utilized in developing simpler systems. OLAP and data mining software in conjunction with data warehouses enable users to build fairly large, complex DSS.

12. Provides models that are generally utilized to analyze decision-making situations. The modeling capability enables experimentation with different strategies under different configurations.

13. Provides access to a variety of data sources, formats, and types, including GIS, multimedia, and object-oriented data.

14. Can be employed as a stand-alone tool used by an individual decision maker in one location or distributed throughout an organization and in several organizations along the supply chain. It can be integrated with other DSS and/or applications, and it can be distributed internally and externally, using networking and Web technologies.

These key DSS characteristics and capabilities allow decision makers to make better,

more consistent decisions in a timely manner, and they are provided by major DSS

components,

Components of a Decision Support System

A DSS application can be composed of a data management subsystem, a model

management subsystem, a user interface subsystem, and a knowledge-based

management subsystem. We show these in Figure 1.4. Figure 1.4 Schematic View of DSS.

The Data Management Subsystem

The data management subsystem includes a database that contains relevant data for the situation

and is managed by software called the database management system (DBMS). DBMS is used as both singular and plural (system and systems) terms, as are many other acronyms in this text. The data management subsystem can be interconnected with the corporate data warehouse, a

repository for corporate relevant decision-making data. Usually, the data are stored or accessed

via a database Web server. The data management subsystem is composed of the following

elements:

• DSS database

• Database management system

• Data directory

• Query facility

Many of the BI or descriptive analytics applications derive their strength from the data

management side of the subsystems.

The Model Management Subsystem

The model management subsystem is the component that includes financial, statistical,

management science, or other quantitative models that provide the system’s analytical

capabilities and appropriate software management. Modeling languages for building custom

models are also included. This software is often called a model base management system

(MBMS). This component can be connected to corporate or external storage of models. Model

solution methods and management systems are implemented in Web development systems (such

as Java) to run on application servers. The model management subsystem of a DSS is composed

of the following elements:

• Model base

• MBMS

• Modeling language

• Model directory

• Model execution, integration, and command processor

Because DSS deals with semistructured or unstructured problems, it is often necessary to

customize models, using programming tools and languages. Some examples of these are .NET

Framework languages, C++, and Java. OLAP software may also be used to work with models in

data analysis. Even languages for simulations such as Arena and statistical packages such as

those of SPSS offer modeling tools developed through the use of a proprietary programming

language. For small- and medium-sized DSS or for less complex ones, a spreadsheet (e.g., Excel)

is usually used. We use Excel for several examples in this book. Application Case 1.2 describes a

spreadsheet-based DSS.

Application Case 1.2 SNAP DSS Helps OneNet Make Telecommunications Rate Decisions

Telecommunications network services to educational institutions and government entities are

typically provided by a mix of private and public organizations. Many states in the United States

have one or more state agencies that are responsible for providing network services to schools,

colleges, and other state agencies. One example of such an agency is OneNet in Oklahoma.

OneNet is a division of the Oklahoma State Regents for Higher Education and operated in

cooperation with the Office of State Finance.

Usually agencies such as OneNet operate as an enterprise-type fund. They must recover their

costs through billing their clients and/or by justifying appropriations directly from the state

legislatures. This cost recovery should occur through a pricing mechanism that is efficient,

simple to implement, and equitable. This pricing model typically needs to recognize many

factors: convergence of voice, data, and video traffic on the same infrastructure; diversity of user

base in terms of educational institutions and state agencies; diversity of applications in use by

state clients from e-mail to videoconferences, IP telephoning, and distance learning; recovery of

current costs as well as planning for upgrades and future developments; and leverage of the

shared infrastructure to enable further economic development and collaborative work across the

state that leads to innovative uses of OneNet.

These considerations led to the development of a spreadsheet-based model. The system, SNAP-

DSS, or Service Network Application and Pricing (SNAP)-based DSS, was developed in

Microsoft Excel 2007 and used the VBA programming language.

The SNAP-DSS offers OneNet the ability to select the rate card options that best fit the preferred

pricing strategies by providing a real-time, user-friendly, graphical user interface (GUI). In

addition, the SNAP-DSS not only illustrates the influence of the changes in the pricing factors on

each rate card option but also allows the user to analyze various rate card options in different

scenarios using different parameters. This model has been used by OneNet financial planners to

gain insights into their customers and analyze many what-if scenarios of different rate plan

options.

Source: Based on J. Chongwatpol and R. Sharda. (2010, December). “SNAP: A DSS to Analyze Network Service Pricing for State Networks.” Decision Support Systems, 50(1), pp. 347–359.

The User Interface Subsystem

The user communicates with and commands the DSS through the user interface subsystem. The

user is considered part of the system. Researchers assert that some of the unique contributions of

DSS are derived from the intensive interaction between the computer and the decision maker. A

difficult user interface is one of the major reasons that managers do not use computers and

quantitative analyses as much as they could, given the availability of these technologies. The

Web browser provided a familiar, consistent GUI structure for many DSS in the 2000s. For

locally used DSS, a spreadsheet also provides a familiar user interface. The Web browser has

been recognized as an effective DSS GUI because it is flexible, user-friendly, and a gateway to

almost all sources of necessary information and data. Essentially, Web browsers have led to the

development of portals and dashboards, which front end many DSS.

Explosive growth in portable devices, including smartphones and tablets, has changed the DSS

user interfaces as well. These devices allow either handwritten input or typed input from internal

or external keyboards. Some DSS user interfaces utilize natural language input (i.e., text in a

human language) so that the users can easily express themselves in a meaningful way. Cell

phone inputs through short message service (SMS) or chatbots are becoming more common for

at least some consumer DSS-type applications. For example, one can send an SMS request for

search on any topic to GOOGL (46645). Such capabilities are most useful in locating nearby

businesses, addresses, or phone numbers, but it can also be used for many other decision support

tasks. For example, users can find definitions of words by entering the word “define” followed

by a word, such as “define extenuate.” Some of the other capabilities include

• Price lookups: “Price 64GB iPhone X.”

• Currency conversions: “10 US dollars in euros.”

• Sports scores and game times: Just enter the name of a team (“NYC Giants”), and Google

SMS will send the most recent game’s score and the date and time of the next match.

This type of SMS-based search capability is also available for other search engines such as

Microsoft’s search engine Bing.

With the emergence of smartphones such as Apple’s iPhone and Android smartphones from

many vendors, many companies are developing apps to provide purchasing-decision support. For example, Amazon’s app allows a user to take a picture of any item in a store (or wherever)

and send it to Amazon.com. Amazon.com’s graphics-understanding algorithm tries to match the

image to a real product in its databases and sends the user a page similar to Amazon.com’s

product info pages, allowing users to perform price comparisons in real time. Millions of other

apps have been developed that provide consumers support for decision making on finding and

selecting stores/restaurants/service providers on the basis of location, recommendations from

others, and especially from your own social circles. Search activities noted in the previous

paragraph are also largely accomplished now through apps provided by each search provider.

Voice input for these devices and the new smart speakers such as Amazon Echo (Alexa) and

Google Home is common and fairly accurate (but not perfect). When voice input with

accompanying speech-recognition software (and readily available text-to-speech software) is

used, verbal instructions with accompanied actions and outputs can be invoked. These are readily

available for DSS and are incorporated into the portable devices described earlier. An example of

voice inputs that can be used for a general-purpose DSS is Apple’s Siri application and Google’s

Google Now service. For example, a user can give her or his zip code and say “pizza delivery.”

These devices provide the search results and can even place a call to a business.

The Knowledge-Based Management Subsystem

Many of the user interface developments are closely tied to the major new advances in their

knowledge-based systems. The knowledge-based management subsystem can support any of the

other subsystems or act as an independent component. It provides intelligence to augment the

decision maker’s own or to help understand a user’s query so as to provide a consistent answer.

It can be interconnected with the organization’s knowledge repository (part of a KMS), which is

sometimes called the organizational knowledge base, or connect to thousands of external knowledge sources. Many artificial intelligence methods have been implemented in the current

generation of learning systems and are easy to integrate into the other DSS components. One of

the most widely publicized knowledge-based DSS is IBM’s Watson, which was introduced in the

opening vignette and will be described in more detail later.

This section has covered the history and progression of Decision Support Systems in brief. In the

next section we discuss evolution of this support to business intelligence, analytics, and data

science.

Section 1.3 Review Questions

1. List and briefly describe Simon’s four phases of decision making. 2. What is the difference between a problem and its symptoms? 3. Why is it important to classify a problem? 4. Define implementation. 5. What are structured, unstructured, and semistructured decisions? Provide two

examples of each. 6. Define operational control, managerial control, and strategic planning. Provide

two examples of each. 7. What are the nine cells of the decision framework? Explain what each is for. 8. How can computers provide support for making structured decisions? 9. How can computers provide support for making semistructured and unstructured

decisions?

1.4 Evolution of Computerized Decision Support to Business Intelligence/Analytics/Data Science

The timeline in Figure 1.5 shows the terminology used to describe analytics since the

1970s. During the 1970s, the primary focus of information systems support for decision

making focused on providing structured, periodic reports that a manager could use for

decision making (or ignore them). Businesses began to create routine reports to inform

decision makers (managers) about what had happened in the previous period (e.g., day,

week, month, quarter). Although it was useful to know what had happened in the past,

managers needed more than this: They needed a variety of reports at different levels of

granularity to better understand and address changing needs and challenges of the

business. These were usually called management information systems (MIS). In the early

1970s, Scott-Morton first articulated the major concepts of DSS. He defined DSS as

“interactive computer-based systems, which help decision makers

utilize data and models to solve unstructured problems” (Gorry and Scott-Morton,

1971). The following is another classic DSS definition provided by Keen and Scott-

Morton (1978): Figure 1.5 Evolution of Decision Support, Business Intelligence, Analytics, and AI.

Decision support systems couple the intellectual resources of individuals with the

capabilities of the computer to improve the quality of decisions. It is a computer-based

support system for management decision makers who deal with semistructured

problems.

Note that the term decision support system, like management information system and

several other terms in the field of IT, is a content-free expression (i.e., it means different

things to different people). Therefore, there is no universally accepted definition of DSS.

During the early days of analytics, data were often obtained from the domain experts

using manual processes (i.e., interviews and surveys) to build mathematical or

knowledge-based models to solve constrained optimization problems. The idea was to

do the best with limited resources. Such decision support models were typically called

operations research (OR). The problems that were too complex to solve optimally (using

linear or nonlinear mathematical programming techniques) were tackled using heuristic

methods such as simulation models. (We will introduce these as prescriptive analytics

later in this chapter).

In the late 1970s and early 1980s, in addition to the mature OR models that were being

used in many industries and government systems, a new and exciting line of models

had emerged: rule-based expert systems (ESs). These systems promised to capture

experts’ knowledge in a format that computers could process (via a collection of if-then-

else rules or heuristics) so that these could be used for consultation much the same way

that one would use domain experts to identify a structured problem and to prescribe

the most probable solution. ESs allowed scarce expertise to be made available where

and when needed, using an “intelligent” DSS.

The 1980s saw a significant change in the way organizations captured business-related

data. The old practice had been to have multiple disjointed information systems tailored

to capture transactional data of different organizational units or functions (e.g.,

accounting, marketing and sales, finance, manufacturing). In the 1980s, these systems

were integrated as enterprise-level information systems that we now commonly

call enterprise resource planning (ERP) systems. The old mostly sequential and

nonstandardized data representation schemas were replaced by relational database

management (RDBM) systems. These systems made it possible to improve the capture

and storage of data as well as the relationships between organizational data fields while

significantly reducing the replication of information. The need for RDBM and ERP

systems emerged when data integrity and consistency became an issue, significantly

hindering the effectiveness of business practices. With ERP, all the data from every

corner of the enterprise is collected and integrated into a consistent schema so that

every part of the organization has access to the single version of the truth when and

where needed. In addition to the emergence of ERP systems, or perhaps because of

these systems, business reporting became an on-demand, as-needed business practice.

Decision makers could decide when they needed to or wanted to create specialized

reports to investigate organizational problems and opportunities.

In the 1990s, the need for more versatile reporting led to the development of executive

information systems (EISs; DSS designed and developed specifically for executives and

their decision-making needs). These systems were designed as graphical dashboards

and scorecards so that they could serve as visually appealing displays while focusing

on the most important factors for decision makers to keep track of the key performance

indicators. To make this highly versatile reporting possible while keeping the

transactional integrity of the business information systems intact, it was necessary to

create a middle data tier known as a DW as a repository to specifically support business

reporting and decision making. In a very short time, most large- to medium-sized

businesses adopted data warehousing as their platform for enterprise-wide decision

making. The dashboards and scorecards got their data from a DW, and by doing so,

they were not hindering the efficiency of the business transaction systems mostly

referred to as ERP systems.

In the 2000s, the DW-driven DSS began to be called BI systems. As the amount of

longitudinal data accumulated in the DWs increased, so did the capabilities of

hardware and software to keep up with the rapidly changing and evolving needs of the

decision makers. Because of the globalized competitive marketplace, decision makers

needed current information in a very digestible format to address business problems

and to take advantage of market opportunities in a timely manner. Because the data in a

DW are updated periodically, they do not reflect the latest information. To elevate this

information latency problem, DW vendors developed a system to update the data more

frequently, which led to the terms real-time data warehousing and, more

realistically, right-time data warehousing, which differs from the former by adopting a

data-refreshing policy based on the needed freshness of the data items (i.e., not all data

items need to be refreshed in real time). DWs are very large and feature rich, and it

became necessary to “mine” the corporate data to “discover” new and useful

knowledge nuggets to improve business processes and practices, hence, the terms data

mining and text mining. With the increasing volumes and varieties of data, the needs for

more storage and more processing power emerged. Although large corporations had

the means to tackle this problem, small- to medium-sized companies needed more

financially manageable business models. This need led to service-oriented architecture

and software and infrastructure-as-a-service analytics business models. Smaller

companies, therefore, gained access to analytics capabilities on an as-needed basis and

paid only for what they used, as opposed to investing in financially prohibitive

hardware and software resources.

In the 2010s, we are seeing yet another paradigm shift in the way that data are captured

and used. Largely because of the widespread use of the Internet, new data generation

mediums have emerged. Of all the new data sources (e.g., radio-frequency

identification [RFID] tags, digital energy meters, clickstream Web logs, smart home

devices, wearable health monitoring equipment), perhaps the most interesting and

challenging is social networking/social media. These unstructured data are rich in

information content, but analysis of such data sources poses significant challenges to

computational systems from both software and hardware perspectives. Recently, the

term Big Data has been coined to highlight the challenges that these new data streams

have brought on us. Many advancements in both hardware (e.g., massively parallel

processing with very large computational memory and highly parallel multiprocessor

computing systems) and software/algorithms (e.g., Hadoop with MapReduce and

NoSQL, Spark) have been developed to address the challenges of Big Data.

The last few years and the upcoming decade are bringing massive growth in many

exciting dimensions. For example, streaming analytics and the sensor technologies have

enabled the IoT. Artificial Intelligence is changing the shape of BI by enabling new ways

of analyzing images through deep learning, not just traditional visualization of data.

Deep learning and AI are also helping grow voice recognition and speech synthesis,

leading to new interfaces in interacting with technologies. Almost half of U.S.

households already have a smart speaker such as Amazon Echo or Google Home and

have begun to interact with data and systems using voice interfaces. Growth in video

interfaces will eventually enable gesture-based interaction with systems. All of these are

being enabled due to massive cloud-based data storage and amazingly fast processing

capabilities. And more is yet to come.

It is hard to predict what the next decade will bring and what the new analytics-related

terms will be. The time between new paradigm shifts in information systems and

particularly in analytics has been shrinking, and this trend will continue for the

foreseeable future. Even though analytics is not new, the explosion in its popularity is

very new. Thanks to the recent explosion in Big Data, ways to collect and store these

data and intuitive software tools, data-driven insights are more accessible to business

professionals than ever before. Therefore, in the midst of global competition, there is a

huge opportunity to make better managerial decisions by using data and analytics to

increase revenue while decreasing costs by building better products, improving

customer experience, and catching fraud before it happens, improving customer

engagement through targeting and customization, and developing entirely new lines of

business, all with the power of analytics and data. More and more companies are now

preparing their employees with the know-how of business analytics to drive

effectiveness and efficiency in their day-to-day decision-making processes.

The next section focuses on a framework for BI. Although most people would agree that

BI has evolved into analytics and data science, many vendors and researchers still use

that term. So the next few paragraphs pay homage to that history by specifically

focusing on what has been called BI. Following the next section, we introduce analytics

and use that as the label for classifying all related concepts.

A Framework for Business Intelligence

The decision support concepts presented in Sections 1.2 and 1.3 have been implemented

incrementally, under different names, by many vendors that have created tools and

methodologies for decision support. As noted in Section 1.2, as the enterprise-wide systems

grew, managers were able to access user-friendly reports that enabled them to make decisions

quickly. These systems, which were generally called EISs, then began to offer additional

visualization, alerts, and performance measurement capabilities. By 2006, the

major commercial products and services appeared under the term business intelligence (BI).

Definitions of BI

Business intelligence (BI) is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies. It is, like DSS, a content-free expression, so it

means different things to different people. Part of the confusion about BI lies in the flurry of

acronyms and buzzwords that are associated with it (e.g., business performance management

[BPM]). BI’s major objective is to enable interactive access (sometimes in real time) to data, to

enable manipulation of data, and to give business managers and analysts the ability to conduct

appropriate analyses. By analyzing historical and current data, situations, and performances,

decision makers get valuable insights that enable them to make more informed and better

decisions. The process of BI is based on the transformation of data to information, then to decisions, and finally to actions.

A Brief History of BI

The term BI was coined by the Gartner Group in the mid-1990s. However, as the history in the previous section points out, the concept is much older; it has its roots in the MIS reporting

systems of the 1970s. During that period, reporting systems were static, were two dimensional,

and had no analytical capabilities. In the early 1980s, the concept of EISs emerged. This concept

expanded the computerized support to top-level managers and executives. Some of the

capabilities introduced were dynamic multidimensional (ad hoc or on-demand) reporting,

forecasting and prediction, trend analysis, drill-down to details, status access, and critical success

factors. These features appeared in dozens of commercial products until the mid-1990s. Then the

same capabilities and some new ones appeared under the name BI. Today, a good BI-based

enterprise information system contains all the information that executives need. So, the original

concept of EIS was transformed into BI. By 2005, BI systems started to include artificial intelligence capabilities as well as powerful analytical capabilities. Figure 1.6 illustrates the various tools and techniques that may be included in a BI system. It illustrates the evolution of

BI as well. The tools shown in Figure 1.6 provide the capabilities of BI. The most sophisticated

BI products include most of these capabilities; others specialize in only some of them.

Figure 1.6 Evolution of Business Intelligence (BI).

The Architecture of BI

A BI system has four major components: a DW, with its source data; business analytics, a

collection of tools for manipulating, mining, and analyzing the data in the DW; BPM for

monitoring and analyzing performance; and a user interface (e.g., a dashboard). The

relationship among these components is illustrated in Figure 1.7. Figure 1.7 A High-Level Architecture of BI.

The Origins and Drivers of BI

Where did modern approaches to DW and BI come from? What are their roots, and how do those

roots affect the way organizations are managing these initiatives today? Today’s investments in

information technology are under increased scrutiny in terms of their bottom-line impact and

potential. The same is true of DW and the BI applications that make these initiatives possible.

Organizations are being compelled to capture, understand, and harness their data to support

decision making to improve business operations. Legislation and regulation (e.g., the Sarbanes-

Oxley Act of 2002) now require business leaders to document their business processes and to

sign off on the legitimacy of the information they rely on and report to stakeholders. Moreover,

business cycle times are now extremely compressed; faster, more informed, and better decision

making is, therefore, a competitive imperative. Managers need the right information at the right time and in the right place. This is the mantra for modern approaches to BI. Organizations have to work smart. Paying careful attention to the management of BI initiatives is

a necessary aspect of doing business. It is no surprise, then, that organizations are increasingly

championing BI and under its new incarnation as analytics.

Data Warehouse as a Foundation for Business Intelligence

BI systems rely on a DW as the information source for creating insight and supporting

managerial decisions. A multitude of organizational and external data is captured, transformed,

and stored in a DW to support timely and accurate decisions through enriched business insight.

In simple terms, a DW is a pool of data produced to support decision making; it is also a repository of current and historical data of potential interest to managers throughout the

organization. Data are usually structured to be available in a form ready for analytical processing

activities (i.e., OLAP, data mining, querying, reporting, and other decision support applications).

A DW is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of

management’s decision-making process.

Whereas a DW is a repository of data, data warehousing is literally the entire process. Data

warehousing is a discipline that results in applications that provide decision support capability,

allows ready access to business information, and creates business insight. The three main types

of data warehouses are data marts (DMs), operational data stores (ODS), and enterprise data

warehouses (EDW). Whereas a DW combines databases across an entire enterprise, a DM is

usually smaller and focuses on a particular subject or department. A DM is a subset of a data

warehouse, typically consisting of a single subject area (e.g., marketing, operations). An

operational data store (ODS) provides a fairly recent form of customer information file. This

type of database is often used as an interim staging area for a DW. Unlike the static contents of a

DW, the contents of an ODS are updated throughout the course of business operations. An EDW

is a large-scale data warehouse that is used across the enterprise for decision support. The large-

scale nature of an EDW provides integration of data from many sources into a standard format

for effective BI and decision support applications. EDWs are used to provide data for many types

of DSS, including CRM, supply chain management (SCM), BPM, business activity monitoring,

product life-cycle management, revenue management, and sometimes even KMS.

In Figure 1.8, we show the DW concept. Data from many different sources can be extracted,

transformed, and loaded into a DW for further access and analytics for decision support. Further

details of DW are available in an online chapter on the book’s Web site.

Figure 1.8 Data Warehouse Framework and Views.

Transaction Processing versus Analytic Processing

To illustrate the major characteristics of BI, first we will show what BI is not—namely,

transaction processing. We are all familiar with the information systems that support our

transactions, like ATM withdrawals, bank deposits, and cash register scans at the grocery store.

These transaction processing systems are constantly involved in handling updates to what we might call operational databases. For example, in an ATM withdrawal transaction, we need to reduce our bank balance accordingly; a bank deposit adds to an account; and a grocery store

purchase is likely reflected in the store’s calculation of total sales for the day, and it should

reflect an appropriate reduction in the store’s inventory for the items we bought, and so on.

These online transaction processing (OLTP) systems handle a company’s routine ongoing business. In contrast, a DW is typically a distinct system that provides storage for data that will

be used for analysis. The intent of that analysis is to give management the ability to scour data for information about the business, and it can be used to provide tactical or operational decision

support whereby, for example, line personnel can make quicker and/or more informed decisions.

DWs are intended to work with informational data used for online analytical processing (OLAP) systems. Most operational data in ERP systems—and in their complementary siblings like SCM or CRM— are stored in an OLTP system, which is a type of computer processing where the computer

responds immediately to user requests. Each request is considered to be a transaction, which is a computerized record of a discrete event, such as the receipt of inventory or a customer order. In

other words, a transaction requires a set of two or more database updates that must be completed

in an all-or-nothing fashion.

The very design that makes an OLTP system efficient for transaction processing makes it

inefficient for end-user ad hoc reports, queries, and analysis. In the 1980s, many business users

referred to their mainframes as “black holes” because all the information went into them, but

none ever came back. All requests for reports had to be programmed by the IT staff, whereas

only “precanned” reports could be generated on a scheduled basis, and ad hoc real-time querying

was virtually impossible. Although the client/server-based ERP systems of the 1990s were

somewhat more report friendly, they have still been a far cry from a desired usability by regular,

nontechnical end users for things such as operational reporting and interactive analysis. To

resolve these issues, the notions of DW and BI were created.

DWs contain a wide variety of data that present a coherent picture of business conditions at a

single point in time. The idea was to create a database infrastructure that was always online and

contained all the information from the OLTP systems, including historical data, but reorganized

and structured in such a way that it was fast and efficient for querying, analysis, and decision

support. Separating the OLTP from analysis and decision support enables the benefits of BI that

were described earlier.

A Multimedia Exercise in Business Intelligence

TUN includes videos (similar to the television show CSI) to illustrate concepts of analytics in different industries. These are called “BSI Videos (Business Scenario Investigations).” Not only

are these entertaining, but they also provide the class with some questions for discussion. For

starters, please go to https://www.teradatauniversitynetwork.com/Library/Items/BSI-The-Case-

of-the-Misconnecting-Passengers/ or www.youtube.com/watch?v=NXEL5F4_aKA. Watch the

video that appears on YouTube. Essentially, you have to assume the role of a customer service

center professional. An incoming flight is running late, and several passengers are likely to miss

their connecting flights. There are seats on one outgoing flight that can accommodate two of the

four passengers. Which two passengers should be given priority? You are given information

about customers’ profiles and relationships with the airline. Your decisions might change as you

learn more about those customers’ profiles.

Watch the video, pause it as appropriate, and answer the questions on which passengers should

be given priority. Then resume the video to get more information. After the video is complete,

you can see the slides related to this video and how the analysis was prepared on a slide set

at www.slideshare.net/teradata/bsi-how-we-did-it-the-case-of-the-misconnecting-passengers.

This multimedia excursion provides an example of how additional available information through

an enterprise DW can assist in decision making.

Although some people equate DSS with BI, these systems are not, at present, the same. It is

interesting to note that some people believe that DSS is a part of BI—one of its analytical tools.

Others think that BI is a special case of DSS that deals mostly with reporting, communication,

and collaboration (a form of data-oriented DSS). Another explanation (Watson, 2005) is that BI

is a result of a continuous revolution, and as such, DSS is one of BI’s original elements. Further,

as noted in the next section onward, in many circles, BI has been subsumed by the new

terms analytics or data science.

Appropriate Planning and Alignment with the Business Strategy

First and foremost, the fundamental reasons for investing in BI must be aligned with the

company’s business strategy. BI cannot simply be a technical exercise for the information

systems department. It has to serve as a way to change the manner in which the company

conducts business by improving its business processes and transforming decision-making

processes to be more data driven. Many BI consultants and practitioners involved in successful

BI initiatives advise that a framework for planning is a necessary precondition. One framework,

proposed by Gartner, Inc. (2004), decomposed planning and execution into business, organization, functionality, and infrastructure components. At the business and organizational levels, strategic and operational objectives must be defined while considering the available

organizational skills to achieve those objectives. Issues of organizational culture surrounding BI

initiatives and building enthusiasm for those initiatives and procedures for the intra-

organizational sharing of BI best practices must be considered by upper management—with

plans in place to prepare the organization for change. One of the first steps in that process is to

assess the IS organization, the skill sets of the potential classes of users, and whether the culture

is amenable to change. From this assessment, and assuming there are justification and the need to

move ahead, a company can prepare a detailed action plan. Another critical issue for BI

implementation success is the integration of several BI projects (most enterprises use several BI

projects) among themselves and with the other IT systems in the organization and its business

partners.

Gartner and many other analytics consulting organizations promoted the concept of a BI

competence center that would serve the following functions:

• A center can demonstrate how BI is clearly linked to strategy and execution of strategy.

• A center can serve to encourage interaction between the potential business user

communities and the IS organization.

• A center can serve as a repository and disseminator of best BI practices between and among

the different lines of business.

• Standards of excellence in BI practices can be advocated and encouraged throughout the

company.

• The IS organization can learn a great deal through interaction with the user communities,

such as knowledge about the variety of types of analytical tools that are needed.

• The business user community and IS organization can better understand why the DW

platform must be flexible enough to provide for changing business requirements.

• The center can help important stakeholders like high-level executives see how BI can play

an important role.

Over the last 10 years, the idea of a BI competence center has been abandoned because many

advanced technologies covered in this book have reduced the need for a central group to

organize many of these functions. Basic BI has now evolved to a point where much of it can be

done in “self-service” mode by the end users. For example, many data visualizations are easily

accomplished by end users using the latest visualization packages (Chapter 3 will introduce some

of these). As noted by Duncan (2016), the BI team would now be more focused on producing

curated data sets to enable self-service BI. Because analytics is now permeating across the whole

organization, the BI competency center could evolve into an analytics community of excellence

to promote best practices and ensure overall alignment of analytics initiatives with organizational

strategy.

BI tools sometimes needed to be integrated among themselves, creating synergy. The need for

integration pushed software vendors to continuously add capabilities to their products.

Customers who buy an all-in-one software package deal with only one vendor and do not have to

deal with system connectivity. But they may lose the advantage of creating systems composed

from the “best-of-breed” components. This led to major chaos in the BI market space. Many of

the software tools that rode the BI wave (e.g., Savvion, Vitria, Tibco, MicroStrategy, Hyperion)

have either been acquired by other companies or have expanded their offerings to take advantage

of six key trends that have emerged since the initial wave of surge in business intelligence:

• Big Data.

• Focus on customer experience as opposed to just operational efficiency.

• Mobile and even newer user interfaces—visual, voice, mobile.

• Predictive and prescriptive analytics, machine learning, artificial intelligence.

• Migration to cloud.

• Much greater focus on security and privacy protection.

This book covers many of these topics in significant detail by giving examples of how the

technologies are evolving and being applied, and the managerial implications.

Section 1.4 Review Questions

1. List three of the terms that have been predecessors of analytics.

2. What was the primary difference between the systems called MIS, DSS, and Executive Information Systems?

3. Did DSS evolve into BI or vice versa? 4. Define BI. 5. List and describe the major components of BI. 6. Define OLTP. 7. Define OLAP. 8. List some of the implementation topics addressed by Gartner’s report. 9. List some other success factors of BI.

1.5 Analytics Overview

The word analytics has largely replaced the previous individual components of

computerized decision support technologies that have been available under various

labels in the past. Indeed, many practitioners and academics now use the

word analytics in place of BI. Although many authors and consultants have defined it

slightly differently, one can view analytics as the process of developing actionable

decisions or recommendations for actions based on insights generated from historical

data. According to the Institute for Operations Research and Management Science

(INFORMS), analytics represents the combination of computer technology,

management science techniques, and statistics to solve real problems. Of course, many

other organizations have proposed their own interpretations and motivations for

analytics. For example, SAS Institute Inc. proposed eight levels of analytics that begin

with standardized reports from a computer system. These reports essentially provide a

sense of what is happening with an organization. Additional technologies have enabled

us to create more customized reports that can be generated on an ad hoc basis. The next

extension of reporting takes us to OLAP-type queries that allow a user to dig deeper

and determine specific sources of concern or opportunities. Technologies available

today can also automatically issue alerts for a decision maker when performance

warrants such alerts. At a consumer level, we see such alerts for weather or other issues.

But similar alerts can also be generated in specific settings when sales fall above or

below a certain level within a certain time period or when the inventory for a specific

product is running low. All of these applications are made possible through analysis

and queries of data being collected by an organization. The next level of analysis might

entail statistical analysis to better understand patterns. These can then be taken a step

further to develop forecasts or models for predicting how customers might respond to a

specific marketing campaign or ongoing service/product offerings. When an

organization has a good view of what is happening and what is likely to happen, it can

also employ other techniques to make the best decisions under the circumstances.

This idea of looking at all the data to understand what is happening, what will happen,

and how to make the best of it has also been encapsulated by INFORMS in proposing

three levels of analytics. These three levels are identified as descriptive, predictive, and

prescriptive. Figure 1.9 presents a graphical view of these three levels of analytics. It

suggests that these three are somewhat independent steps and one type of

analytics applications leads to another. It also suggests that there is actually some

overlap across these three types of analytics. In either case, the interconnected nature of

different types of analytics applications is evident. We next introduce these three levels

of analytics. Figure 1.9 Three Types of Analytics.

Descriptive Analytics

Descriptive (or reporting) analytics refers to knowing what is happening in the organization and understanding some underlying trends and causes of such occurrences. First, this involves

the consolidation of data sources and availability of all relevant data in a form that enables

appropriate reporting and analysis. Usually, the development of this data infrastructure is part of

DWs. From this data infrastructure, we can develop appropriate reports, queries, alerts, and

trends using various reporting tools and techniques.

A significant technology that has become a key player in this area is visualization. Using the

latest visualization tools in the marketplace, we can now develop powerful insights in the

operations of our organization. Application Cases 1.3 and 1.4 highlight some such applications.

Application Case 1.3 Silvaris Increases Business with Visual Analysis and Real- Time Reporting Capabilities

Silvaris Corporation was founded in 2000 by a team of forest industry professionals to provide

technological advancement in the lumber and building material sector. Silvaris is the first e--

commerce platform in the United States specifically for forest products and is headquartered in

Seattle, Washington. It is a leading wholesale provider of industrial wood products and surplus

building materials.

Silvaris sells its products and provides international logistics services to more than 3,500

customers. To manage various processes that are involved in a transaction, the company created

a proprietary online trading platform to track information flow related to transactions between

traders, accounting, credit, and logistics. This allowed Silvaris to share its real-time information

with its customers and partners. But due to the rapidly changing prices of materials, it became

necessary for Silvaris to get a real-time view of data without moving them into a separate

reporting format.

Silvaris started using Tableau because of its ability to connect with and visualize live data. With

dashboards created by Tableau that are easy to understand and explain, Silvaris started using it

for reporting purposes. This helped Silvaris in pulling out information quickly from the data and

identifying issues that impact its business. Silvaris succeeded in managing online versus offline

orders with the help of reports generated by Tableau. Now, Silvaris keeps track of online orders

placed by customers and knows when to send renew pushes to which customers to keep them

purchasing online. Also, analysts of Silvaris can save time by generating dashboards instead of

writing hundreds of pages of reports by using Tableau.

Sources: Tableau.com. “Silvaris Augments Proprietary Technology Platform with Tableau’s Real-Time Reporting Capabilities.” http://www.tableau.com/sites/default/files/case-studies/silvaris-business-dashboards_0.pdf (accessed September 2018); Silvaris.com. http://www.silvaris.com (accessed September 2018).

Questions for Case 1.3

1. What was the challenge faced by Silvaris? 2. How did Silvaris solve its problem using data visualization with Tableau?

What We Can Learn from This Application Case

Many industries need to analyze data in real time. Real-time analysis enables the analysts to

identify issues that impact their business. Visualization is sometimes the best way to begin

analyzing the live data streams. Tableau is one such data visualization tool that has the capability

to analyze live data without bringing live data into a separate reporting format.

Application Case 1.4 Siemens Reduces Cost with the Use of Data Visualization

Siemens is a German company headquartered in Berlin, Germany. It is one of the world’s largest

companies focusing on the areas of electrification, automation, and digitalization. It has an

annual revenue of 76 billion euros.

The visual analytics group of Siemens is tasked with end-to-end reporting solutions and

consulting for all of Siemens internal BI needs. This group was facing the challenge of providing

reporting solutions to the entire Siemens organization across different departments while

maintaining a balance between governance and self-service capabilities. Siemens needed a

platform that could analyze its multiple cases of customer satisfaction surveys, logistic processes,

and financial reporting. This platform should be easy to use for their employees so that they

could use these data for analysis and decision making. In addition, the platform should be easily

integrated with existing Siemens systems and give employees a seamless user experience.

Siemens started using Dundas BI, a leading global provider of BI and data visualization

solutions. It allowed Siemens to create highly interactive dashboards that enabled it to detect

issues early and thus save a significant amount of money. The dashboards developed by Dundas

BI helped Siemens global logistics organization answer questions like how different supply rates

at different locations affect the operation, thus helping the company reduce cycle time by 12

percent and scrap cost by 25 percent.

Questions for Case 1.4

1. What challenges were faced by Siemens visual analytics group? 2. How did the data visualization tool Dundas BI help Siemens in reducing cost?

What We Can Learn from This Application Case

Many organizations want tools that can be used to analyze data from multiple divisions. These

tools can help them improve performance and make data discovery transparent to their users so

that they can identify issues within the business easily.

Sources: Dundas.com. “How Siemens Drastically Reduced Cost with Managed BI Applications.” https://www.dundas.com/Content/pdf/siemens-case-study.pdf (accessed September 2018); Wikipedia.org. “SIEMENS.” https://en.wikipedia.org/wiki/Siemens (accessed September 2018); Siemens.com. “About Siemens.” http://www.siemens.com/about/en/ (accessed September 2018).

Predictive Analytics

Predictive analytics aims to determine what is likely to happen in the future. This analysis is based on statistical techniques as well as other more recently developed techniques that fall

under the general category of data mining. The goal of these techniques is to be able to predict whether the customer is likely to switch to a competitor (“churn”), what and how much the

customer would likely buy next, what promotions the customer would respond to, whether the

customer is a creditworthy risk, and so forth. A number of techniques are used in developing

predictive analytical applications, including various classification algorithms. For example, as

described in Chapters 4 and 5, we can use classification techniques such as logistic regression,

decision tree models, and neural networks to predict how well a motion picture will do at the box

office. We can also use clustering algorithms for segmenting customers into different clusters to

be able to target specific promotions to them. Finally, we can use association mining techniques

(Chapters 4 and 5) to estimate relationships between different purchasing behaviors. That is, if a

customer buys one product, what else is the customer likely to purchase? Such analysis can assist

a retailer in recommending or promoting related products. For example, any product search

on Amazon.com results in the retailer also suggesting similar products that a customer may be

interested in. We will study these techniques and their applications

in Chapters 3 through 6. Application Case 1.5 illustrates one such application in sports.

Application Case 1.5 Analyzing Athletic Injuries

Any athletic activity is prone to injuries. If the injuries are not handled properly, then the team

suffers. Using analytics to understand injuries can help in deriving valuable insights that would

enable coaches and team doctors to manage the team composition, understand player profiles,

and ultimately aid in better decision making concerning which players might be available to play

at any given time.

In an exploratory study, Oklahoma State University analyzed U.S. football-related sports injuries

by using reporting and predictive analytics. The project followed the Cross-Industry Standard

Process for Data Mining (CRISP-DM) methodology (to be described in Chapter 4) to understand

the problem of making recommendations on managing injuries, understanding the various data

elements collected about injuries, cleaning the data, developing visualizations to draw various

inferences, building predictive models to analyze the injury healing time period, and drawing

sequence rules to predict the relationships among the injuries and the various body part parts

afflicted with injuries.

The injury data set consisted of more than 560 football injury records, which were categorized

into injury-specific variables—body part/site/laterality, action taken, severity, injury type, injury

start and healing dates—and player/sport-specific variables—player ID, position played, activity,

onset, and game location. Healing time was calculated for each record, which was classified into

different sets of time periods: 0–1 month, 1–2 months, 2–4 months, 4–6 months, and 6–24

months.

Various visualizations were built to draw inferences from injury–data set information depicting

the healing time period associated with players’ positions, severity of injuries and the healing

time period, treatment offered and the associated healing time period, major injuries afflicting

body parts, and so forth.

Neural network models were built to predict each of the healing categories using IBM SPSS

Modeler. Some of the predictor variables were current status of injury, severity, body part, body

site, type of injury, activity, event location, action taken, and position played. The success of

classifying the healing category was quite good: Accuracy was 79.6 percent. Based on the

analysis, many recommendations were suggested, including employing more specialists’ input

from injury onset instead of letting the training room staff screen the injured players; training

players at defensive positions to avoid being injured; and holding practice to thoroughly safety-

check mechanisms.

Sources: “Sharda, R., Asamoah, D., & Ponna, N. (2013). “Research and Pedagogy in Business Analytics: Opportunities and Illustrative Examples.” Journal of Computing and Information Technology, 21(3), pp. 171–182.

Questions for Case 1.5

1. What types of analytics are applied in the injury analysis? 2. How do visualizations aid in understanding the data and delivering insights into

the data? 3. What is a classification problem? 4. What can be derived by performing sequence analysis?

What We Can Learn from This Application Case

For any analytics project, it is always important to understand the business domain and the

current state of the business problem through extensive analysis of the only resource—historical

data. Visualizations often provide a great tool for gaining the initial insights into data, which can

be further refined based on expert opinions to identify the relative importance of the data

elements related to the problem. Visualizations also aid in generating ideas for obscure problems,

which can be pursued in building PMs that could help organizations in decision making.

Prescriptive Analytics

The third category of analytics is termed prescriptive analytics. The goal of prescriptive analytics is to recognize what is going on as well as the likely forecast and make decisions to

achieve the best performance possible. This group of techniques has historically been studied

under the umbrella of OR or management sciences and is generally aimed at optimizing the

performance of a system. The goal here is to provide a decision or a recommendation for a

specific action. These recommendations can be in the form of a specific yes/no decision for a

problem, a specific amount (say, price for a specific item or airfare to charge), or a complete set

of production plans. The decisions may be presented to a decision maker in a report or may be

used directly in an automated decision rules system (e.g., in airline pricing systems). Thus, these

types of analytics can also be termed decision or normative analytics. Application Case 1.6 gives an example of such prescriptive analytic applications. We will learn about some

aspects of prescriptive analytics in Chapter 8.

Application Case 1.6A Specialty Steel Bar Company Uses Analytics to Determine Available-to-Promise Dates

This application case is based on a project that involved one of the coauthors A company that

does not wish to disclose its name (or even its precise industry) was facing a major problem of

making decisions on which inventory of raw materials to use to satisfy which customers. This

company supplies custom configured steel bars to its customers. These bars may be cut into

specific shapes or sizes and may have unique material and finishing requirements. The company

procures raw materials from around the world and stores them in its warehouse. When a

prospective customer calls the company to request a quote for the specialty bars meeting specific

material requirements (composition, origin of the metal, quality, shapes, sizes, etc.), the

salesperson usually has just a little bit of time to submit such a quote including the date when the

product can be delivered and, of course, prices, and so on. It must make available-to-promise

(ATP) decisions, which determine in real time the dates when the salesperson can promise

delivery of products that customers requested during the quotation stage. Previously, a

salesperson had to make such decisions by analyzing reports on available inventory of raw

materials. Some of the available raw material may have already been committed to another

customer’s order. Thus, the inventory in stock might not really be inventory available. On the

other hand, there may be raw material that is expected to be delivered in the near future that

could also be used for satisfying the order from this prospective customer. Finally, there might

even be an opportunity to charge a premium for a new order by repurposing previously

committed inventory to satisfy this new order while delaying an already committed order. Of

course, such decisions should be based on the cost–benefit analyses of delaying a previous order.

The system should thus be able to pull real-time data about inventory, committed orders,

incoming raw material, production constraints, and so on.

To support these ATP decisions, a real-time DSS was developed to find an optimal assignment

of the available inventory and to support additional what-if analysis. The DSS uses a suite of

mixed-integer programming models that are solved using commercial software. The company

has incorporated the DSS into its enterprise resource planning system to seamlessly facilitate its

use of business analytics.

Questions for Case 1.6

1. Why would reallocation of inventory from one customer to another be a major issue for discussion?

2. How could a DSS help make these decisions?

Source: M. Pajouh Foad, D. Xing, S. Hariharan, Y. Zhou, B. Balasundaram, T. Liu, & R. Sharda, R. (2013). “Available-to-Promise in Practice: An Application of Analytics in the Specialty Steel Bar Products Industry.” Interfaces, 43(6), pp. 503– 517. http://dx.doi.org/10.1287/inte.2013.0693 (accessed September 2018).

Analytics Applied to Different Domains

Applications of analytics in various industry sectors have spawned many related areas or at least

buzzwords. It is almost fashionable to attach the word analytics to any specific industry or type of data. Besides the general category of text analytics—aimed at getting value out of text (to be

studied in Chapter 7)—or Web analytics—analyzing Web data streams (also in Chapter 7)—

many industry- or problem-specific analytics professions/streams have been developed.

Examples of such areas are marketing analytics, retail analytics, fraud analytics, transportation

analytics, health analytics, sports analytics, talent analytics, behavioral analytics, and so forth.

For example, we will soon see several applications in sports analytics. Application Case 1.5 could also be termed a case study in health analytics. The next section will introduce

health analytics and market analytics broadly. Literally, any systematic analysis of data in a

specific sector is being labeled as “(fill-in-blanks)” analytics. Although this may result in

overselling the concept of analytics, the benefit is that more people in specific industries are

aware of the power and potential of analytics. It also provides a focus to professionals

developing and applying the concepts of analytics in a vertical sector. Although many of the

techniques to develop analytics applications may be common, there are unique issues within each

vertical segment that influence how the data may be collected, processed, analyzed, and the

applications implemented. Thus, the differentiation of analytics based on a vertical focus is good

for the overall growth of the discipline.

Analytics or Data Science?

Even as the concept of analytics is receiving more attention in industry and academic circles,

another term has already been introduced and is becoming popular. The new term is data science. Thus, the practitioners of data science are data scientists. D. J. Patil of LinkedIn is sometimes credited with creating the term data science. There have been some attempts to describe the differences between data analysts and data scientists (e.g., see “Data Science

Revealed,” 2018) (emc.com/collateral/about/news/emc-data-science-study-wp.pdf). One view is

that data analyst is just another term for professionals who were doing BI in the form of data compilation, cleaning, reporting, and perhaps some visualization. Their skill sets included Excel

use, some SQL knowledge, and reporting. You would recognize those capabilities as descriptive

or reporting analytics. In contrast, data scientists are responsible for predictive analysis,

statistical analysis, and use of more advanced analytical tools and algorithms. They may have a

deeper knowledge of algorithms and may recognize them under various labels—data mining,

knowledge discovery, or machine learning. Some of these professionals may also need deeper

programming knowledge to be able to write code for data cleaning/analysis in current Web-

oriented languages such as Java or Python and statistical languages such as R. Many analytics

professionals also need to build significant expertise in statistical modeling, experimentation, and

analysis. Again, our readers should recognize that these fall under the predictive and prescriptive

analytics umbrella. However, prescriptive analytics also includes more significant expertise in

OR including optimization, simulation, and decision analysis. Those who cover these fields are

more likely to be called data scientists than analytics professionals. Our view is that the distinction between analytics professional and data scientist is more of a

degree of technical knowledge and skill sets than functions. It may also be more of a distinction

across disciplines. Computer science, statistics, and applied mathematics programs appear to

prefer the data science label, reserving the analytics label for more business-oriented

professionals. As another example of this, applied physics professionals have proposed

using network science as the term for describing analytics that relate to groups of people—social networks, supply chain networks, and so forth. See http://barabasi.com/networksciencebook/ for

an evolving textbook on this topic.

Aside from a clear difference in the skill sets of professionals who only have to do

descriptive/reporting analytics versus those who engage in all three types of analytics, the

distinction between the two labels is fuzzy at best. We observe that graduates of our analytics

programs tend to be responsible for tasks that are more in line with data science professionals (as

defined by some circles) than just reporting analytics. This book is clearly aimed at introducing

the capabilities and functionality of all analytics (which include data science), not just reporting

analytics. From now on, we will use these terms interchangeably.

What is Big Data?

Any book on analytics and data science has to include significant coverage of what is called Big Data analytics. We cover it in Chapter 9 but here is a very brief introduction. Our brains work extremely quickly and efficiently and are versatile in processing large amounts of all kinds of

data: images, text, sounds, smells, and video. We process all different forms of data relatively

easily. Computers, on the other hand, are still finding it hard to keep up with the pace at which

data are generated, let alone analyze them quickly. This is why we have the problem of Big Data.

So, what is Big Data? Simply put, Big Data refers to data that cannot be stored in a single storage

unit. Big Data typically refers to data that come in many different forms: structured,

unstructured, in a stream, and so forth. Major sources of such data are clickstreams from Web

sites, postings on social media sites such as Facebook, and data from traffic, sensors, or weather.

A Web search engine such as Google needs to search and index billions of Web pages to give

you relevant search results in a fraction of a second. Although this is not done in real time,

generating an index of all the Web pages on the Internet is not an easy task. Luckily for Google,

it was able to solve this problem. Among other tools, it has employed Big Data analytical

techniques.

There are two aspects to managing data on this scale: storing and processing. If we could

purchase an extremely expensive storage solution to store all this at one place on one unit,

making this unit fault tolerant would involve a major expense. An ingenious solution was

proposed that involved storing these data in chunks on different machines connected by a

network—putting a copy or two of this chunk in different locations on the network, both

logically and physically. It was originally used at Google (then called the Google File System)

and later developed and released by an Apache project as the Hadoop Distributed File System

(HDFS).

However, storing these data is only half of the problem. Data are worthless if they do not provide

business value, and for them to provide business value, they must be analyzed. How can such

vast amounts of data be analyzed? Passing all computation to one powerful computer does not

work; this scale would create a huge overhead on such a powerful computer. Another ingenious

solution was proposed: Push computation to the data instead of pushing data to a computing

node. This was a new paradigm and gave rise to a whole new way of processing data. This is

what we know today as the MapReduce programming paradigm, which made processing Big

Data a reality. MapReduce was originally developed at Google, and a subsequent version was

released by the Apache project called Hadoop MapReduce. Today, when we talk about storing, processing, or analyzing Big Data, HDFS and MapReduce

are involved at some level. Other relevant standards and software solutions have been proposed.

Although the major toolkit is available as an open source, several companies have been launched

to provide training or specialized analytical hardware or software services in this space. Some

examples are HortonWorks, Cloudera, and Teradata Aster.

Over the past few years, what was called Big Data changed more and more as Big Data

applications appeared. The need to process data coming in at a rapid rate added velocity to the

equation. An example of fast data processing is algorithmic trading. This uses electronic

platforms based on algorithms for trading shares on the financial market, which operates in

microseconds. The need to process different kinds of data added variety to the equation. Another

example of a wide variety of data is sentiment analysis, which uses various forms of data from

social media platforms and customer responses to gauge sentiments. Today, Big Data is

associated with almost any kind of large data that have the characteristics of volume, velocity,

and variety. As noted before, these are evolving quickly to encompass stream analytics, IoT,

cloud computing, and deep learning–enabled AI. We will study these in various chapters in the

book.

Section 1.5 Review Questions

1. Define analytics. 2. What is descriptive analytics? What are the various tools that are employed in

descriptive analytics? 3. How is descriptive analytics different from traditional reporting? 4. What is a DW? How can DW technology help enable analytics? 5. What is predictive analytics? How can organizations employ predictive analytics? 6. What is prescriptive analytics? What kinds of problems can be solved by

prescriptive analytics?

7. Define modeling from the analytics perspective. 8. Is it a good idea to follow a hierarchy of descriptive and predictive analytics

before applying prescriptive analytics? 9. How can analytics aid in objective decision making? 10. What is Big Data analytics? 11. What are the sources of Big Data? 12. What are the characteristics of Big Data? 13. What processing technique is applied to process Big Data?

1.6 Analytics Examples in Selected Domains

You will see examples of analytics applications throughout various chapters. That is

one of the primary approaches (exposure) of this book. In this section, we highlight

three application areas—sports, healthcare, and retail—where there have been the most

reported applications and successes.

Sports Analytics—An Exciting Frontier for Learning and Understanding Applications of Analytics

The application of analytics to business problems is a key skill, one that you will learn in this

book. Many of these techniques are now being applied to improve decision making in all aspects

of sports, a very hot area called sports analytics. It is the art and science of gathering data about athletes and teams to create insights that improve sports decisions, such as deciding which

players to recruit, how much to pay them, who to play, how to train them, how to keep them

healthy, and when they should be traded or retired. For teams, it involves business decisions such

as ticket pricing as well as roster decisions, analysis of each competitor’s strengths and

weaknesses, and many game-day decisions.

Indeed, sports analytics is becoming a specialty within analytics. It is an important area because

sport is a big business, generating about $145$145 billion in revenues each year plus an additional $100$100 billion in legal and $300$300 billion in illegal gambling, according to Price Waterhouse (“Changing the Game: Outlook for the Global Sports Market to 2015” (2015)).

In 2014, only $125$125 million was spent on analytics (less than 0.1 percent of revenues). This is expected to grow at a healthy rate to $4.7$4.7 billion by 2021 (“Sports Analytics Market Worth $4.7B$4.7B by 2021” (2015)). The use of analytics for sports was popularized by the Moneyball book by Michael Lewis in 2003 and the movie starring Brad Pitt in 2011. It showcased Oakland A’s general manager Billy

Beane and his use of data and analytics to turn a losing team into a winner. In particular, he hired

an analyst who used analytics to draft players who were able to get on base as opposed to players

who excelled at traditional measures like runs batted in or stolen bases. These insights allowed

the team to draft prospects overlooked by other teams at reasonable starting salaries. It worked—

the team made it to the playoffs in 2002 and 2003.

Now analytics are being used in all parts of sports. The analytics can be divided between the

front office and back office. A good description with 30 examples appears in Tom Davenport’s

survey article (). Front-office business analytics include analyzing fan behavior ranging from

predictive models for season ticket renewals and regular ticket sales to scoring tweets by fans

regarding the team, athletes, coaches, and owners. This is very similar to traditional CRM.

Financial analysis is also a key area such as when salary cap (for pros) or scholarship (for

colleges) limits are part of the equation.

Back-office uses include analysis of both individual athletes and team play. For individual

players, there is a focus on recruitment models and scouting analytics, analytics for strength and

fitness as well as development, and PMs for avoiding overtraining and injuries. Concussion

research is a hot field. Team analytics include strategies and tactics, competitive assessments,

and optimal roster choices under various on-field or on-court situations.

The following representative examples illustrate how two sports organizations use data and

analytics to improve sports operations in the same way that analytics have improved traditional

industry decision making.

Example 1: The Business Office Dave Ward works as a business analyst for a major pro baseball team, focusing on revenue. He

analyzes ticket sales, both from season ticket holders and single-ticket buyers. Sample questions

in his area of responsibility include why season ticket holders renew (or do not renew) their

tickets as well as what factors drive last-minute individual seat ticket purchases. Another

question is how to price the tickets.

Some of the analytical techniques Dave uses include simple statistics on fan behavior such as

overall attendance and answers to survey questions about likelihood to purchase again. However,

what fans say versus what they do can be different. Dave runs a survey of fans by ticket seat

location (“tier”) and asks about their likelihood of renewing their season tickets. But when he

compares what they say versus what they do, he discovers big differences. (See Figure 1.10.) He

found that 69 percent of fans in Tier 1 seats who said on the survey that they would “probably

not renew” actually did. This is useful insight that leads to action—customers in the green cells

are the most likely to renew tickets and so require fewer marketing touches and dollars to convert

compared to customers in the blue cells.

Figure 1.10 Season Ticket Renewals—Survey Scores.

However, many factors influence fan ticket purchase behavior, especially price, which

drives more sophisticated statistics and data analysis. For both areas, but especially

single-game tickets, Dave is driving the use of dynamic pricing—moving the business

from simple static pricing by seat location tier to day-by-day up-and-down pricing of

individual seats. This is a rich research area for many sports teams and has huge upside

potential for revenue enhancement. For example, his pricing takes into account the

team’s record, who they are playing, game dates and times, which star athletes play for

each team, each fan’s history of renewing season tickets or buying single tickets, and

factors such as seat location, number of seats, and real-time information like traffic

congestion historically at game time and even the weather. See Figure 1.11. Figure 1.11 Dynamic Pricing Previous Work—Major League Baseball.

Source: Based on C. Kemper and C. Breuer, “How Efficient is Dynamic Pricing for Sports Events? Designing a Dynamic Pricing

Model for Bayern Munich”, Intl. Journal of Sports Finance, 11, pp. 4–25, 2016.

Figure 1.11 Full Alternative Text

Which of these factors are important and by how much? Given his extensive statistics

background, Dave builds regression models to pick out key factors driving these historic

behaviors and create PMs to identify how to spend marketing resources to drive revenues. He

builds churn models for season ticket holders to create segments of customers who will renew,

will not renew, or are fence-sitters, which then drives more refined marketing campaigns.

In addition, Dave does sentiment scoring on fan comments such as tweets that help him segment

fans into different loyalty segments. Other studies about single-game attendance drivers help the

marketing department understand the impact of giveaways like bobble-heads or T-shirts or

suggestions on where to make spot TV ad buys.

Beyond revenues, there are many other analytical areas that Dave’s team works on, including

merchandising, TV and radio broadcast revenues, inputs to the general manager on salary

negotiations, draft analytics especially given salary caps, promotion effectiveness including

advertising channels, and brand awareness, as well as partner analytics. He’s a very busy guy!

Example 2: The Coach Bob Breedlove is the football coach for a major college team. For him, everything is about

winning games. His areas of focus include recruiting the best high school players, developing

them to fit his offense and defense systems, and getting maximum effort from them on game

days. Sample questions in his area of responsibility include: Whom do we recruit? What drills

help develop their skills? How hard do I push our athletes? Where are opponents strong or weak,

and how do we figure out their play tendencies?

Fortunately, his team has hired a new team operations expert, Dar Beranek, who specializes in

helping the coaches make tactical decisions. She is working with a team of student interns who

are creating opponent analytics. They used the coach’s annotated game film to build a cascaded

decision tree model (Figure 1.12) to predict whether the next play will be a running play or

passing play. For the defensive coordinator, they have built heat maps (Figure 1.13) of each

opponent’s passing offense, illustrating their tendencies to throw left or right and into which

defensive coverage zones. Finally, they built some time-series analytics (Figure 1.14) on

explosive plays (defined as a gain of more than 16 yards for a passing play or more than 12 yards

for a run play). For each play, they compare the outcome with their own defensive formations

and the other team’s offensive formations, which help Coach Breedlove react more quickly to

formation shifts during a game. We explain the analytical techniques that generated these figures

in much more depth in Chapters 3–6 and Chapter 9.

Figure 1.12 Cascaded Decision Tree for Run or Pass Plays.

Source: Contributed by Dr. Dave Schrader, who retired after 24 years in advanced development and marketing at Teradata. He

has remained on the Board of Advisors of the Teradata University Network, where he spends his retirement helping students and

faculty learn more about sports analytics. Graphics by Peter Liang and Jacob Pearson, graduate students at Oklahoma State

University, as part of a student project in the spring of 2016 in Prof. Ramesh Sharda’s class under Dr. Dave Schrader’s coaching.

Figure 1.13 Heat Map Zone Analysis for Passing Plays.

Source: Contributed by Dr. Dave Schrader, who retired after 24 years in advanced development and marketing at Teradata. He

has remained on the Board of Advisors of the Teradata University Network, where he spends his retirement helping students and

faculty learn more about sports analytics. Graphics by Peter Liang and Jacob Pearson, graduate students at Oklahoma State

University, as part of a student project in the spring of 2016 in Prof. Ramesh Sharda’s class under Dr. Dave Schrader’s coaching.

Figure 1.14 Time-Series Analysis of Explosive Plays.

New work that Dar is fostering involves building better high school athlete recruiting models.

For example, each year the team gives scholarships to three students who are wide receiver

recruits. For Dar, picking out the best players goes beyond simple measures like how fast

athletes run, how high they jump, or how long their arms are to newer criteria like how quickly

they can rotate their heads to catch a pass, what kinds of reaction times they exhibit to multiple

stimuli, and how accurately they run pass routes. Some of her ideas illustrating these concepts

appear on the TUN Web site; look for the Business Scenario Investigation (2015) “The Case of

Precision Football.”

What Can We Learn from These Examples?

Beyond the front-office business analysts, the coaches, trainers, and performance experts, there

are many other people in sports who use data, ranging from golf groundskeepers who measure

soil and turf conditions for PGA tournaments to baseball and basketball referees who are rated

on the correct and incorrect calls they make. In fact, it is hard to find an area of sports that

is not being impacted by the availability of more data, especially from sensors. Skills you will learn in this book for business analytics will apply to sports. If you want to dig

deeper into this area, we encourage you to look at the Sports Analytics section of the TUN, a free

resource for students and faculty. On its Web site, you will find descriptions of what to read to

find out more about sports analytics, compilations of places where you can find publically

available data sets for analysis, as well as examples of student projects in sports analytics and

interviews of sports professionals who use data and analytics to do their jobs. Good luck learning

analytics!

Analytics Applications in Healthcare—Humana Examples

Although healthcare analytics span a wide variety of applications from prevention to diagnosis to

efficient operations and fraud prevention, we focus on some applications that have been

developed at a major health insurance company in the United States, Humana. According to its

Web site, “The company’s strategy integrates care delivery, the member experience, and clinical

and consumer insights to encourage engagement, behavior change, proactive clinical outreach

and wellness. . . .” Achieving these strategic goals includes significant investments in information

technology in general and analytics in particular. Brian LeClaire is senior vice president and CIO

of Humana. He has a PhD in MIS from Oklahoma State University. He has championed

analytics as a competitive differentiator at Humana—including cosponsoring the creation of a

center for excellence in analytics. He described the following projects as examples of Humana’s

analytics initiatives, led by Humana’s chief clinical analytics officer, Vipin Gopal.

Humana Example 1: Preventing Falls in a Senior Population— An Analytic Approach Accidental falls are a major health risk for adults age 65 years and older with one-third

experiencing a fall every year.1 The costs of falls pose a significant strain on the U.S. healthcare

system; the direct costs of falls were estimated at $34$34 billion in 2013 alone.1 With the percent of seniors in the U.S. population on the rise, falls and associated costs are anticipated to

increase. According to the Centers for Disease Control and Prevention (CDC), “Falls are a public

health problem that is largely preventable”

(www.cdc.gov/homeandrecreationalsafety/falls/adultfalls.html).1 Falls are also the leading factor for both fatal and nonfatal injuries in older adults with injurious falls increasing

the risk of disability by up to 50 percent (Gill et al., 2013).2 Humana is the nation’s second-

largest provider of Medicare Advantage benefits with approximately 3.2 million members, most

of whom are seniors. Keeping its senior members well and helping them live safely at their

homes is a key business objective of which prevention of falls is an important component.

However, no rigorous methodology was available to identify individuals most likely to fall, for

whom falls prevention efforts would be beneficial. Unlike chronic medical conditions such as

diabetes and cancer, a fall is not a well-defined medical condition. In addition, falls are usually

underreported in claims data as physicians typically tend to code the consequence of a fall such

as fractures and dislocations. Although many clinically administered assessments to identify

fallers exist, they have limited reach and lack sufficient predictive power (Gates et al.,

2008).3 As such, there is a need for a prospective and accurate method to identify individuals at

greatest risk of falling so that they can be proactively managed for fall prevention. The Humana

analytics team undertook the development of a Falls Predictive Model in this context. It is the

first comprehensive PM reported that utilizes administrative medical and pharmacy claims,

clinical data, temporal clinical patterns, consumer information, and other data to identify

individuals at high risk of falling over a time horizon.

Today, the Falls PM is central to Humana’s ability to identify seniors who could benefit from fall

mitigation interventions. An initial proof-of-concept with Humana consumers, representing the

top 2 percent of those at the highest risk of falling, demonstrated that the consumers had

increased utilization of physical therapy services, indicating consumers are taking active steps to

reduce their risk for falls. A second initiative utilizes the Falls PM to identify high-risk

individuals for remote monitoring programs. Using the PM, Humana was able to identify 20,000

consumers at a high risk of falling who benefited from this program. Identified consumers wear a

device that detects falls and alerts a 24/724/7 service for immediate assistance. This work was recognized by the Analytics Leadership Award by Indiana University Kelly

School of Business in 2015, for innovative adoption of analytics in a business environment.

Contributors: Harpreet Singh, PhD; Vipin Gopal, PhD; Philip Painter, MD.

Humana Example 2: Humana’s Bold Goal—Application of Analytics to Define the

Right Metrics In 2014, Humana, Inc. announced its organization’s Bold Goal to improve the health of the

communities it serves by 20 percent by 2020 by making it easy for people to achieve their best

health. The communities that Humana serves can be defined in many ways, including

geographically (state, city, neighborhood), by product (Medicare Advantage, employer-based

plans, individually purchased), or by clinical profile (priority conditions including diabetes,

hypertension, congestive heart failure [CHF], coronary artery disease [CAD], chronic obstructive

pulmonary disease [COPD], or depression). Understanding the health of these communities and

how they track over time is critical not only for the evaluation of the goal, but also in crafting

strategies to improve the health of the whole membership in its entirety.

A challenge before the analytics organization was to identify a metric that captures the essence

of the Bold Goal. Objectively measured traditional health insurance metrics such as hospital

admissions or emergency room visits per 1,000 persons would not capture the spirit of this new

mission. The goal was to identify a metric that captures health and its improvement in a

community and was relevant to Humana as a business. Through rigorous analytic evaluations,

Humana eventually selected “Healthy Days,” a four-question, quality-of-life questionnaire

originally developed by the CDC to track and measure Humana’s overall progress toward the

Bold Goal.

It was critical to make sure that the selected metric was highly correlated to health and business

metrics so that any improvement in Healthy Days resulted in improved health and better business

results. Some examples of how “Healthy Days” is correlated to metrics of interest include the

following:

• Individuals with more unhealthy days (UHDs) exhibit higher utilization and cost patterns.

For a five-day increase in UHDs, there are (1) an $82$82 increase in average monthly medical and pharmacy costs, (2) an increase of 52 inpatient admits per 1,000 patients, and

(3) a 0.28-day increase in average length of stay (Havens, Peña, Slabaugh, Cordier, Renda,

& Gopal, 2015).1

• Individuals who exhibit healthy behaviors and have their chronic conditions well managed

have fewer UHDs. For example, when we look at individuals with diabetes, UHDs are

lower if they obtained an LDL screening (−4.3−4.3 UHDs) or a diabetic eye exam (−2.3−2.3 UHDs). Likewise, if they have controlled blood sugar levels measured by HbA1C (−1.8−1.8 UHDs) or LDL levels (−1.3−1.3 UHDs) (Havens, Slabaugh, Peña, Haugh, & Gopal 2015).2

• Individuals with chronic conditions have more UHDs than those who do not have (1) CHF

(16.9 UHDs), (2) CAD (14.4 UHDs), (3) hypertension (13.3 UHDs), (4) diabetes (14.7

UHDs), (5) COPD (17.4 UHDs), or (6) depression (22.4 UHDs) (Havens, Peña, Slabaugh et

al., 2015; Chiguluri, Guthikonda, Slabaugh, Havens, Peña, & Cordier, 2015; Cordier et al.,

2015).1,3,4

Humana has since adopted Healthy Days as their metric for the measurement of progress toward

Bold Goal (Humana, http://populationhealth.humana.com/wp-

content/uploads/2016/05/BoldGoal2016ProgressReport_1.pdf).5 Contributors: Tristan Cordier, MPH; Gil Haugh, MS; Jonathan Peña, MS; Eriv Havens, MS; Vipin Gopal, PhD.

Humana Example 3: Predictive Models to Identify the Highest Risk Membership in a

Health Insurer

The 80/2080/20 rule generally applies in healthcare; that is, roughly 20 percent of consumers account for 80 percent of healthcare resources due to their deteriorating health and chronic

conditions. Health insurers like Humana have typically enrolled the highest-risk enrollees in

clinical and disease management programs to help manage the chronic conditions the members

have.

Identification of the correct members is critical for this exercise, and in the recent years, PMs

have been developed to identify enrollees with high future risk. Many of these PMs were

developed with heavy reliance on medical claims data, which results from the medical services

that the enrollees use. Because of the lag that exists in submitting and processing claims data,

there is a corresponding lag in identification of high-risk members for clinical program

enrollment. This issue is especially relevant when new members join a health insurer as they

would not have a claims history with an insurer. A claims-based PM could take on average of 9–

12 months after enrollment of new members to identify them for referral to clinical programs.

In the early part of this decade, Humana attracted large numbers of new members in its Medicare

Advantage products and needed a better way to clinically manage this membership. As such, it

became extremely important that a different analytic approach be developed to rapidly and

accurately identify high-risk new members for clinical management, to keep this group healthy

and costs down.

Humana’s Clinical Analytics team developed the New Member Predictive Model (NMPM) that

would quickly identify at-risk individuals soon after their new plan enrollments with Humana

rather than waiting for sufficient claim history to become available for compiling clinical profiles

and predicting future health risk. Designed to address the unique challenges associated with new

members, NMPM developed a novel approach that leveraged and integrated broader data sets

beyond medical claims data such as self-reported health risk assessment data and early indicators

from pharmacy data, employed advanced data mining techniques for pattern discovery, and

scored every Medicare Advantage (MA, a specific insurance plan) consumer daily based on the

most recent data Humana has to date. The model was deployed with a cross-functional team of

analytics, IT, and operations to ensure seamless operational and business integration.

Since NMPM was implemented in January 2013, it has been rapidly identifying high-risk new

members for enrollment in Humana’s clinical programs. The positive outcomes achieved through

this model have been highlighted in multiple senior leader communications from Humana. In the

first quarter 2013 earnings release presentation to investors, Bruce Broussard, CEO of Humana,

stated the significance of “improvement in new member PMs and clinical assessment processes,”

which resulted in 31,000 new members enrolled in clinical programs, compared to 4,000 in the

same period a year earlier, a 675 percent increase. In addition to the increased volume of clinical

program enrollments, outcome studies showed that the newly enrolled consumers identified by

NMPM were also referred to clinical programs sooner with over 50 percent of the referrals

identified within the first three months after new MA plan enrollments. The consumers identified

also participated at a higher rate and had longer tenure in the programs.

Contributors: Sandy Chiu, MS; Vipin Gopal, PhD.

These examples illustrate how an organization explores and implements analytics applications to

meet its strategic goals. You will see several other examples of healthcare applications

throughout various chapters in the book.

Analytics in the Retail Value Chain

The retail sector is where you would perhaps see the most applications of analytics.

This is the domain where the volumes are large but the margins are usually thin.

Customers’ tastes and preferences change frequently. Physical and online stores face

many challenges to succeed. And market dominance at one time does not guarantee

continued success. So investing in learning about your suppliers, customers, employees,

and all the stakeholders that enable a retail value chain to succeed and using that

information to make better decisions has been a goal of the analytics industry for a long

time. Even casual readers of analytics probably know about Amazon’s enormous

investments in analytics to power their value chain. Similarly, Walmart, Target, and

other major retailers have invested millions of dollars in analytics for their supply

chains. Most of the analytics technology and service providers have a major presence in

retail analytics. Coverage of even a small portion of those applications to achieve our

exposure goal could fill a whole book. So this section highlights just a few potential

applications. Most of these have been fielded by many retailers and are available

through many technology providers, so in this section, we will take a more general

view rather than point to specific cases. This general view has been proposed by

Abhishek Rathi, CEO of vCreaTek.com. vCreaTek, LLC is a boutique analytics software

and service company that has offices in India, the United States, the United Arab

Emirates (UAE), and Belgium. The company develops applications in multiple

domains, but retail analytics is one of its key focus areas.

Figure 1.15 highlights selected components of a retail value chain. It starts with

suppliers and concludes with customers but illustrates many intermediate strategic and

operational planning decision points where analytics—descriptive, predictive, or

prescriptive—can play a role in making better data-driven decisions. Table 1.1 also

illustrates some of the important areas of analytics applications, examples of key

questions that can be answered through analytics, and of course, the potential business

value derived from fielding such analytics. Some examples are discussed next.

Figure 1.15 Example of Analytics Applications in a Retail Value Chain.

Table 1.1 Examples of Analytics Applications in the Retail Value Chain

Analytic

Application

Business Question Business Value

Inventory

Optimization

1. Which products have high demand?

2. Which products are slow moving or

becoming obsolete?

1. Forecast the consumption of fast-moving

products and order them with sufficient

inventory to avoid a stock out scenario.

2. Perform fast inventory turnover of slow-

moving products by combining them with

one in high demand.

Price

Elasticity

1. How much net margin do I have on

the product?

2. How much discount can I give on

this product?

1. Markdown prices for each product can be

optimized to reduce the margin dollar

loss.

2. Optimized price for the bundle of

products is identified to save the margin

dollar.

Market-Basket

Analysis

1. What products should I combine to

create a bundle offer?

2. Should I combine products based on

slow-moving and fast-moving

characteristics?

1. The affinity analysis identifies the hidden

correlations between the products, which

can help in following values:

a. Strategize the product bundle offering

based on focus on inventory or margin.

Analytic

Application

Business Question Business Value

3. Should I create a bundle from the

same category or a different

category line?

b. Increase cross-selling or up-selling by

creating bundle from different

categories or the same categories,

respectively.

Shopper

Insight

1. Which customer is buying what

product at what location?

1. By customer segmentation, the business

owner can create personalized offers

resulting in better customer experience

and retention of the customer.

Customer

Churn

Analysis

1. Who are the customers who will not

return?

2. How much business will I lose?

3. How can I retain the customers?

4. What demography of customer is

my loyal customer?

1. Businesses can identify the customer and

product relationships that are not

working and show high churn. Thus, they

can have better focus on product quality

and the reason for that churn.

2. Based on the customer lifetime value

(LTV), the business can do targeted

marketing resulting in retention of the

customer.

Analytic

Application

Business Question Business Value

Channel

Analysis

1. Which channel has lower customer

acquisition cost?

2. Which channel has better customer

retention?

3. Which channel is more profitable?

1. Marketing budget can be optimized based

on insight for better return on

investment.

New Store

Analysis

1. What location should I open?

2. What and how much opening

inventory should I keep?

1. Best practices of other locations and

channels can be used to get a jump-start.

2. Comparison with competitor data can

help to create a differentiator to attract

the new customers.

Store Layout 1. How should I do store layout for

better topline?

2. How can I increase my in-store

customer experience?

1. Understand the association of products to

decide store layout and better alignment

with customer needs.

2. Workforce deployment can be planned

for better customer interactivity and thus

satisfying customer experience.

Analytic

Application

Business Question Business Value

Video

Analytics

1. What demography is entering the

store during the peak period of

sales?

2. How can I identify a customer with

high LTV at the store entrance so

that a better personalized

experience can be provided to this

customer?

1. In-store promotions and events can be

planned based on the demography of

incoming traffic.

2. Targeted customer engagement and

instant discount enhances the customer

experience resulting in higher retention.

An online retail site usually knows its customer as soon as the customer signs in, and

thus they can offer customized pages/offerings to enhance the experience. For any retail

store, knowing its customer at the store entrance is still a huge challenge. By combining

the video analytics and information/badge issued through its loyalty program, the store

may be able to identify the customer at the entrance itself and thus enable an extra

opportunity for a cross-selling or up-selling. Moreover, a personalized shopping

experience can be provided with more customized engagement during the customer’s

time in the store.

Store retailers invest lots of money in attractive window displays, promotional events,

customized graphics, store decorations, printed ads, and banners. To discern the

effectiveness of these marketing methods, the team can use shopper analytics by

observing closed-circuit television (CCTV) images to figure out the demographic details

of the in-store foot traffic. The CCTV images can be analyzed using advanced

algorithms to derive demographic details such as age, gender, and mood of the person

browsing through the store.

Further, the customer’s in-store movement data when combined with shelf layout and

planogram can give more insight to the store manager to identify the hot-

selling/profitable areas within the store. Moreover, the store manager also can use this

information to plan the workforce allocation for those areas for peak periods.

Market-basket analysis has commonly been used by the category managers to push the

sale of slowly moving stock keeping units (SKUs). By using advanced analytics of data

available, the product affinity can be identified at the lowest level of SKU to drive better

returns on investments (ROIs) on the bundle offers. Moreover, by using price elasticity

techniques, the markdown or optimum price of the bundle offer can also be deduced,

thus reducing any loss in the profit margin.

Thus, by using data analytics, a retailer can not only get information on its current

operations but can also get further insight to increase the revenue and decrease the

operational cost for higher profit. A fairly comprehensive list of current and potential

retail analytics applications that a major retailer such as Amazon could use is proposed

by a blogger at Data Science Central. That list is available

at www.datasciencecentral.com/profiles/blogs/20-data-science-systems-used-by-

amazon-to-operate-its-business. As noted earlier, there are too many examples of these

opportunities to list here, but you will see many examples of such applications

throughout the book.

Image Analytics

As seen in this section, analytics techniques are being applied to many diverse industries and

data. An area of particular growth has been analysis of visual images. Advances in image

capturing through high-resolution cameras, storage capabilities, and deep learning algorithms

have enabled very interesting analyses. Satellite data have often proven their utility in many

different fields. The benefits of satellite data at high resolution and in different forms of imagery

including multi-spectral are significant to scientists who need to regularly monitor global change,

land usage, and weather. In fact, by combining the satellite imagery and other data including

information on social media, government filings, and so on, one can surmise business planning

activities, traffic patterns, changes in parking lots or open spaces. Companies, government

agencies, and non-governmental organizations (NGOs) have invested in satellites to try to image

the whole globe every day so that daily changes can be tracked at any location and the

information can be used for forecasting. In the last few months, many interesting examples of

more reliable and advanced forecasts have been reported. Indeed, this activity is being led by

different industries across the globe, and has added a term to Big Data called Alternative Data. Here are a few examples from Tartar et al. (2018). We will see more in Chapter 9 when we study

Big Data.

• World Bank researchers used satellite data to propose strategic recommendations for urban

planners and officials from developing nations. This analysis arose due to the recent natural

disaster where at least 400 people died in Freetown, Sierra Leone. Researchers clearly

demonstrated that Freetown and some other developing cities lacked systematic planning of

their infrastructure that resulted in the loss of life. The bank researchers are using satellite

imagery now to make critical decisions regarding risk-prone urban areas.

• EarthCast provides accurate weather updates for a large commercial U.S. airline based on

the data it pulls from a constellation of 60 government-operated satellites combined with

ground and aircraft-based sensors, tracking almost anything from lightning to turbulence. It

has even developed the capability to map out conditions along a flight path and provides

customized forecasts for everything from hot air balloons to drones.

• Imazon started using satellite data to develop a picture of close real-time information on

Amazon deforestation. It uses advanced optical and infrared imagery that has led to

identifying illegal sawmills. Imazon is now focused more on getting data to local

governments through its “green municipalities” program that trains officials to identify and

curb deforestation.

• The Indonesian government teamed up with international nonprofit Global Fishing Watch,

which processes satellite extracted information on ship movement to spot where and when

vessels are fishing illegally (Emmert, 2018). This initiative delivered instant results:

Government revenue from fishing went up by 129 percent in 2017 compared to 2014. It is

expected that by next decade, the organization would track vessels that are responsible for

75 percent of the world’s catch.

These examples illustrate just a sample of ways that satellite data can be combined with analytics

to generate new insights. In anticipation of the coming era of abundant earth observations from

satellites, scientists and communities must put some thought into recognizing key applications

and key scientific issues for the betterment of society. Although such concerns will eventually be

resolved by policymakers, what is clear is that new and interesting ways of combining satellite

data and many other data sources is spawning a new crop of analytics companies.

Such image analysis is not limited to satellite images. Cameras mounted on drones and traffic

lights on every conveyable pole in buildings and streets provide the ability to capture images

from just a few feet high. Analysis of these images coupled with facial recognition technologies

is enabling all kinds of new applications from customer recognition to governments’ ability to

track all subjects of interest. See Yue (2017) as an example. Applications of this type are leading

to much discussion on privacy issues. In Application Case 1.7, we learn about a more benevolent

application of image analytics where the images are captured by a phone and a mobile

application provides immediate value to the user of the app.

Application Case 1.7 Image Analysis Helps Estimate Plant Cover

Estimating how much ground is covered by green vegetation is important in analysis of a forest

or even a farm. In case of a forest, such analysis helps users understand how the forest is

evolving, its impact on surrounding areas, and even climate. For a farm, similar analysis can help

understand likely plant growth and help estimate future crop yields. It is obviously impossible to

measure all forest cover manually and is challenging for a farm. The common method is to

record images of a forest/farm and then analyze these images to estimate the ground cover. Such

analysis is expensive to perform visually and is also error prone. Different experts looking at the

ground cover might estimate the percentage of ground covering differently. Thus, automated

methods to analyze these images and estimate the percentage of ground covered by vegetation

are being developed. One such method and an app to make it practical through a mobile phone

has been developed at Oklahoma State University by researchers in the Department of Plant and

Soil Sciences in partnership with the university’s App Center and the Information Technology

group within the Division of Agricultural Sciences and Natural Resources.

Canopeo is a free desktop or mobile app that estimates green canopy cover in near real-time from

images taken with a smartphone or digital camera. In experiments in corn, wheat, canola, and

other crops, Canopeo calculated the percentage of canopy covering dozens to thousands of times

faster than existing software without sacrificing accuracy. And unlike other programs, the app

can acquire and analyze video images, says Oklahoma State University (OSU) soil physicist,

Tyson Ochsner—a feature that should reduce the sampling error associated with canopy cover

estimates. “We know that plant cover, plant canopies, can be quite variable in space,” says

Ochsner, who led the app’s development with former doctoral student Andres Patrignani, now a

faculty member at Kansas State University. “With Canopeo, you can just turn on your [video]

device, start walking across a portion of a field, and get results for every frame of video that

you’re recording.” By using a smartphone or tablet’s digital camera, Canopeo users in the field

can take photos or videos of green plants, including crops, forages, and turf, and import them to

the app, which analyzes each image pixel, classifying them based on its red-green-blue

(RGB) color values. Canopeo analyzes pixels based on a ratio of red to green and blue to green

pixels as well as an excess green index. The result is an image where color pixels are converted

into black and white with white pixels corresponding to green canopy and black pixels

representing the background. Comparison tests showed that Canopeo analyzes images more

quickly and just as accurately as two other available software packages.

Developers of Canopeo expect the app to help producers judge when to remove grazing cattle

from winter wheat in “dual-purpose” systems where wheat is also harvested for grain. Research

by others at OSU found that taking cattle off fields when at least 60 percent green canopy cover

remained ensured a good grain yield. “So, Canopeo would be useful for that decision,”

Patrignani says. He and Ochsner also think the app could find use in turfgrass management; in

assessments of crop damage from weather or herbicide drift; as a surrogate for the Normalized

Difference Vegetation Index (NDVI) in fertilizer recommendations; and even in UAV-based

photos of forests or aquatic systems.

Analysis of images is a growing application area for deep learning as well as many other AI

techniques. Chapter 9 includes several examples of image analysis that have spawned another

term—alternative data. Applications of alternative data are emerging in many

fields. Chapter 6 also highlights some applications. Imagining innovative applications by being

exposed to others’ ideas is one of the main goals of this book!

Questions for Discussion 1. What is the purpose of knowing how much ground is covered by green foliage on

a farm? In a forest? 2. Why would image analysis of foliage through an app be better than a visual

check?

3. Explore research papers to understand the underlying algorithmic logic of image analysis. What did you learn?

4. What other applications of image analysis can you think of?

Source: Compiled from A. Patrignani and T. E. Ochsner. (2015). “Canopeo: A Powerful New Tool for Measuring Fractional Green Canopy Cover.” Agronomy Journal, 107(6), pp. 2312–2320; R. Lollato, A. Patrignani, T. E. Ochsner, A. Rocatelli, P. Tomlinson, & J. T. Edwards. (2015). Improving Grazing Management Using a Smartphone App. www.bookstore.ksre.ksu.edu/pubs/MF3304.pdf (accessed October 2018); http://canopeoapp.com/ (accessed October 2018); Oklahoma State University press releases.

Analytics/data science initiatives are quickly embracing and even merging with new

developments in artificial intelligence. The next section provides an overview of artificial

intelligence followed by a brief discussion of convergence of the two.

Section 1.6 Review Questions

1. What are three factors that might be part of a PM for season ticket renewals? 2. What are two techniques that football teams can use to do opponent analysis? 3. What other analytics uses can you envision in sports? 4. Why would a health insurance company invest in analytics beyond fraud

detection? Why is it in its best interest to predict the likelihood of falls by patients?

5. What other applications similar to prediction of falls can you envision? 6. How would you convince a new health insurance customer to adopt healthier

lifestyles (Humana Example 3)? 7. Identify at least three other opportunities for applying analytics in the retail value

chain beyond those covered in this section. 8. Which retail stores that you know of employ some of the analytics applications

identified in this section? 9. What is a common thread in the examples discussed in image analytics? 10. Can you think of other applications using satellite data along the lines presented

in this section?

1.7 Artificial Intelligence Overview

On September 1, 2017, the first day of the school year in Russia, Vladimir Putin, the

Russian President, lectured to over 1,000,000 school children in what is called in Russia

the National Open Lesson Day. The televised speech was titled “Russia Focused on the

Future.” In this presentation, the viewers saw what Russian scientists are achieving in

several fields. But, what everyone remembers from this presentation is one sentence:

“The country that takes the lead in the sphere of computer-based artificial intelligence

will become the ruler of the world.”

Putin is not the only one who knows the value of AI. Governments and corporations are

spending billions of dollars in a race to become a leader in AI. For example, in July 2017,

China unveiled a plan to create an AI industry worth $150$150 billion to the Chinese

economy by 2030 (Metz, 2018). China’s Badu Company today employs over 5,000 AI

engineers. The Chinese government facilitates research and applications as a national

top priority. The accounting firm PricewaterhouseCoopers calculated that AI will

add $15.7$15.7 trillion to the global economy by 2030 (about 14 percent; see Liberto, 2017). Thus, there is no wonder that AI is clearly the most talked about technology topic

in 2018.

What is Artificial Intelligence?

There are several definitions of what is AI (Chapter 2). The reason is that AI is based on theories

from several scientific fields, and it encompasses a wide collection of technologies and

applications. So, it may be beneficial to look at some of the characteristics of AI in order to

understand what it is. The major goal of AI is to create intelligent machines that can do tasks

currently done by people. Ideally, these tasks include reasoning, thinking, learning, and problem

solving. AI studies the human thought processes’ ability to understand what intelligence is so AI

scientists can duplicate the human processes in machines. eMarketer (2017) provides a

comprehensive report, describing AI as

• Technology that can learn to do things better over time.

• Technology that can understand human language.

• Technology that can answer questions.

The Major Benefits of AI

Since AI appears in many shapes, it has many benefits. They are listed in Chapter 2. The major

benefits are as follows:

• Significant reduction in the cost of performing work. This reduction continues over time

while the cost of doing the same work manually increases with time.

• Work can be performed much faster.

• Work is consistent in general, more consistent than human work.

• Increased productivity and profitability as well as a competitive advantage are the major

drivers of AI.

The Landscape of AI

There are many parts in the landscape (or ecosystem) of AI. We decided to organize them into

five groups as illustrated in Figure 1.16. Four of the groups constitute the basis for the fifth one,

which is the AI applications. The groups are as follows:

Major Technologies

Here we elected to include machine learning (Chapter 5), deep learning (Chapter 6), and

intelligent agents (Chapter 2).

Knowledge-Based Technologies

(all covered in Chapter 12) Topics covered are expert systems, recommendation engines,

chatbots, virtual personal assistants, and robo-advisors.

Biometric-Related Technologies

This includes natural language processing (understanding and generation, machine vision and

scene and image recognition and voice and other biometric recognition (Chapter 6).

Support Theories, Tools, and Platforms

Academic disciplines include computer science, cognitive science, control theory, linguistics,

mathematics, neuroscience, philosophy, psychology, and statistics.

Devices and methods include sensors, augmented reality, context awareness, logic, gestural

computing collaborative filtering, content recognition, neural networks, data mining, humanoid

theories, case-based reasoning, predictive application programming interfaces (APIs), knowledge

management, fuzzy logic, genetic algorithm, bin data, and much more.

Tools and Platforms

These are available from IBM, Microsoft, Nvidia, and several hundred vendors specializing in

the various aspects of AI.

AI Applications

There are several hundred or may be thousands of them. We provide here only a sample:

Smart cities, smart homes, autonomous vehicles (Chapter 13), automatic decisions (Chapter 2),

language translation, robotics (Chapter 10), fraud detection, security protection, content

screening, prediction, personalized services, and more. Applications are in all business areas

(Chapter 2), and in almost any other area ranging from medicine and healthcare to transportation

and education.

NOTE: Lists of all these are available at Faggela (2018) and Jacquet (2017). Also see

Wikipedia, “Outline of artificial intelligence,” and a list of “AI projects” (several hundred items.)

Figure 1.16 The Landscape (Ecosystem) of AI.

In Application Case 1.8, we describe how several of these technologies are combined in

improving security and in expediting the processing of passengers in airports.

Application Case 1.8 AI Increases Passengers’ Comfort and Security in Airports and Borders

We may not like the security lines at airports or the idea that terrorists may board our plane or

enter our country. Several AI devices are designed to minimize these possibilities.

1. FACIAL RECOGNITION AT AIRPORTS. Jet Blue is experimenting with facial- recognition technology (a kind of machine vision to match travelers’ faces against prestored photos, such as passport, driver’s license). This will eliminate the need for boarding passes and increase security. The match is of high quality. The technology pioneered by British Airways is used by Delta, KLM, and other airlines using similar technologies for self-checking of bags. Similar technology is used by the U.S. Immigration and Customs Enforcement agency where people’s photos taken at arrivals are matched against the database of photos and other documents.

2. CHINA’S SYSTEM. The major airports in China are using a system similar to that of Jet Blue, using facial recognition for verifying the identity of passengers. The idea is to eliminate boarding passes and expedite the flow of boarding. The system is also used to recognize airport employees entering restricted areas.

3. USING BOTS. Several airports (e.g., New York, Beijing) offer conversational bots (Chapter 12) to provide travelers with airport guidance. Bots provide also information about customs and immigration services.

4. SPOTTING LIARS AT AIRPORT. This application is emerging to help immigration services to vet passengers at airports and land entry borders. With increased security, both immigration and airline personal may need to query passengers. Here is the solution that can be economically used to query all passengers at high

speed, so there will be short waiting lines. This emerging system is called Automated Virtual Agent for Truth Assessments in Real Time (AVATAR). The essentials of the system are as follows:

a. AVATAR is a bot in which you first scan your passport. b. AVATAR asks you a few questions. Several AI technologies are used in

this project, such as AI, Big Data analytics, the “Cloud,” robotics, machine learning, machine vision, and bots.

c. You answer the questions. d. AVATAR’s sensors and other AI technologies collect data from your

body, such as voice variability, facial expression (e.g., muscle engagement), eyes’ position and movements, mouth movements, and body posture. Researchers feel that it takes less effort to tell the truth than to die, so researchers compared the answers to routine questions.

The machine then will flag suspects for further investigation. The machine is already in use by

immigration agents in several countries.

Sources: Condensed from Thibodeaux, W. (2017, June 29). “This Artificial Intelligence Kiosk Is Designed to Spot Liars at Airports.” Inc.com.; Silk, R. (2017, November). “Biometrics: Facial Recognition Tech Coming to an Airport Near You.” Travel Weekly, 21.

Questions for Case 1.8 1. List the benefits of AI devices to travelers. 2. List the benefits to governments and airline companies. 3. Relate this case to machine vision and other AI tools that deal with people’s

biometrics.

Narrow (Weak) versus General (Strong) AI

The AI field can be divided into two major categories of applications: narrow (or weak) and

general (or strong).

A Narrow AI Field Focuses on One Narrow Field (Domain)

Well-known examples of this are SIRI and Alexa (Chapter 12) that, at least in their early years of

life, operated in limited, predefined areas. As time has passed, they have become more general,

acquiring additional knowledge. Most expert systems (Chapter 12) were operating in fairly

narrow domains. If you notice, when you converse with an automated call center, the

computer (which is usually based on some AI technology) is not too intelligent. But, it is getting

“smarter” with time. Speech recognition allows computers to convert sound to text with great

accuracy. Similarly, computer vision is improving, recognizes objects, classifies them, and even

understands their movements. In sum, there are millions of narrow AI applications, and the

technology is improving every day. However, AI is not strong enough yet because it does not

exhibit the true capabilities of human intelligence (Chapter 2).

General (Strong) AI

To exhibit real intelligence, machines need to perform the full range of human cognitive

capabilities. Computers can have some cognitive capabilities (e.g., some reasoning and problem

solving) as will be shown in Chapter 6 on cognitive computing.

The difference between the two classes of AI is getting smaller as AI is getting smarter. Ideally,

strong AI will be able to replicate humans. But true intelligence is happening only in narrow

domains, such as game playing, medical diagnosis, and equipment failure diagnosis.

Some feel that we never will be able to build a truly strong AI machine. Others think differently;

see the debate in Section 14.9. The following is an example of a strong AI bot in a narrow

domain.

Example 3: AI Makes Coke Vending Machine Smarter If you live in Australia or New Zealand and you are near a Coca-Cola vending machine, you can

order a can or a bottle of the soft drink using your smartphone. The machines are cloud

connected, which means you can order the Coke from any place in the world, not only for

yourself but also for any friend who is near a vending machine in Australia or New Zealand.

See Olshansky (2017).

In addition, the company can adjust pricing remotely, offer promotions, and collect inventory

data so that restocking can be made. Converting existing machines to AI-enabled takes about 1

hour each.

Wait a minute, what if something goes wrong? No problem, you can chat with Coca-Cola’s bot

via Facebook Messenger (Chapter 12).

The Three Flavors of AI Decisions

Staff (2017) divided the capabilities of AI systems into three levels: assisted, autonomous, and

augmented.

Assisted Intelligence

This is equivalent mostly to the weak AI, which works only in narrow domains. It requires

clearly defined inputs and outputs. Examples are some monitoring systems and low-level virtual

personal assistants (Chapter 12). Our cars are full of such monitoring systems that give us alerts.

Similarly, there are many healthcare applications (monitoring, diagnosis).

Autonomous AI

These systems are in the realm of the strong AI but in very narrow domain. Eventually, the

computer will take over. Machines will act as experts and have absolute decision-making power.

Pure robo-advisors (Chapter 12) are examples of such machines. Autonomous vehicles and

robots that can fix themselves are also good examples.

Augmented Intelligence

Most of the existing AI applications are between assisted and autonomous and/are referred to

as augmented intelligence (or intelligence augmentation). The technology focuses on augmenting computer abilities to extend human cognitive abilities (see Chapter 6 on cognitive

computing), resulting in high performance as described in Technology Insights 1.1.

Technology Insights 1.1 Augmented Intelligence The idea of combining the performance of people and machines is not new. Here we combine

(augmenting) human capabilities with powerful machine intelligence. That is, not replacing

people which is done by autonomous AI, but extending human cognitive abilities. The result is

the ability to solve complex human problems as in the opening vignette to this chapter. The

computers enabled people to solve problems that were unsolved before. Padmanabhan

(2018) distinguishes the following differences between traditional and augmented AI:

1. Augmented machines extend rather than replace human decision making, and they facilitate creativity.

2. Augmentation excels in solving complex human and industry problems in specific domains in contrast with strong, general AI.

3. In contrast with a “black box” model of some AI and analytics, augmented intelligence provides insights and recommendations, including explanations.

4. In addition, augmentation technology can offer new solutions by combining existing and discovered information in contrast with assisted AI, which identifies problems or symptoms and suggests predetermined solutions.

Padmanabhan (2018) and many others believe that at the moment, augmented AI is the best

option to move toward the transformation of the AI world.

In contrast with autonomous AI, which describes machines with a wide range of cognitive

abilities (e.g., driverless vehicles), augmented intelligence has only a few cognitive abilities.

Examples of Augmented Intelligence Staff (2017) provides the following examples:

• CYBERCRIME FIGHTING. AI can identify forthcoming attacks and suggest

solutions.

• E-COMMERCE DECISIONS. Marketing tools make testing 100 times faster and adapt

the layout and response functions of a Web site to users. The machines make

recommendations and the marketers can accept or reject them.

• HIGH-FREQUENCY STOCK MARKET TRADING. This is done either completely

autonomously or in some cases with control and calibration by humans.

Discussion Questions 1. What is the basic premise of augmented intelligence? 2. List the major differences between augmented intelligence and traditional AI. 3. What are some benefits of augmented intelligence? 4. How does technology relate to cognitive computing?

Societal Impacts

Much talk is on the topics of AI and productivity, speed, and cost reduction. In a national

conference hosted by Gartner, the famous IT consulting company, nearly half of 3,000

participating U.S. CIOs reported plans to deploy AI-now (Weldon, 2018). Industry cannot ignore

the potential benefits of AI, especially its increased productivity gains, cost reduction and

quality, and speed. Conference participants there talked about strategy and implementation

(Chapter 14). It seems that every company is at least involved in piloting and experimentation

AI. However, in all this excitement, we should not neglect the societal impacts. Many of these

are positive, some are negative, and most are unknown. A comprehensive discussion is provided

in Chapter 14. Here we provide three examples of AI impacts.

Impact on Agriculture

A major impact of AI will be on agriculture. One major anticipated result is to provide more

food, especially in third world countries. Here are a few examples:

• According to Lopez (2017), using AI and robots can help farmers produce 70 percent more

food by 2050. This increase is a result of higher productivity of farm equipment boosted by

IoT (see opening vignette to Chapter 13) and a reduced cost of producing food. (Today only

10 percent of a family’s budget is spent on food versus. 17.5 percent in 1960).

• Machine vision helps in improved planting and harvesting. Also, AI helps to pick good

kernels of grain.

• AI will help to improve the nutrition of food.

• AI will reduce the cost of food processing.

• Driverless tractors are already being experimented with.

• Robots know how to pick fruits and to plant vegetables can solve the shortage of farm

workers.

• Crop yields are continuously increasing in India and other countries.

• Pest control improves. For example, AI can predict pest attacks, facilitating planning.

• Weather conditions are monitored by satellites. AI algorithms tell farmers when to plant

and/or harvest.

The list can go on and on. For countries such as India and Bangladesh, these activities will

critically improve the life of farmers. All in all, AI will help farmers make better decisions. For a

Bangladesh case, see PE Report (2017). See alsonews.microsoft.com/en-in/features/ai-

agriculture-icrisat-upl-india/.

NOTE: AI can help hungry pets too. A food and water dispenser, called Catspad, is available in

the United Kingdom for about US $470$470 You need to put an ID tag on your pet (only cats and small dogs). The dispenser knows which pet comes to the food and dispenses the type and

amount of appropriate food. In addition, sensors (Chapter 13) can tell you how much food each

pet ate. You will also be notified if water needs to be added. Interested? See Deahl (2018) for

details.

Intelligent Systems Contribution to Health and Medical Care

Intelligent systems provide a major contribution to our health and medical care. New innovations

arrive almost any day in some place in the world (governments, research institutions, and

corporation-sponsored active medical AI research). Here are some interesting innovations.

• AI excels in disease prediction (e.g., predicting the occurrence of infective diseases one

week in advance).

• AI can detect brain bleeds.

• AI can track medication intake, send medical alerts, order medicine refills, and improve

prescription compliance.

• Mobile telepresence robots remotely connect physicians and patients.

• NVIDIA’s medical imaging supercomputer helps diagnosticians and facilitates cures of

diseases.

• Robotics and AI can redesign pharmaceutical supply chains.

• AI predicts cardiovascular risks from retinal images.

• Cancer predictions are made with deep learning, and machine learning performs melanoma

diagnosis.

• A virtual personal assistant can assess a patient’s mood and feeling by cues provided (e.g.,

speech gesture or inflection).

• Many portals provide medical information to patients and even surgeons. Adoptive spine IT

is an example.

• Aging-based AI center for research on people who are elderly operates in the United States.

Similar activities exist in Japan.

• The use of bionic hands and legs is improving with AI.

• Healthcare IT News (2017) describes how AI is solving healthcare problems by using

virtual assistants (Chapter 12).

The list can go on and on. Norman (2018) describes the scenario of replacing doctors with

intelligent systems.

NOTE: AI in medicine is recognized as a scientific field with national and international annual

conferences. For a comprehensive book on the subject, see Agah (2017).

Other Societal Applications

There are many AI applications in transportation, utilities, education, social services, and other

fields. Some are covered under the topic of smart cities (Chapter 13). AI is used by social media

and others to control content including fake news. Finally, how about using technology to

eradicate child slavery in the Middle East? See Application Case 1.9.

Application Case 1.9 Robots Took the Job of Camel-Racing Jockeys for Societal Benefits

In several Middle Eastern countries, notably Jordan, Abu Dhabi, and other Gulf nations, racing

camels has been a popular activity for generations. The owners of the winning camels can make

a huge bonus (up to $1,000,000$1,000,000 for first place). Also, the events are considered cultural and social.

The Problem For a long time, the racing camels were guided by human jockeys. The lighter the weight of the

rider, the better is the chance to win. So the owners of the camels trained children (as young as

seven) to be jockeys. Young male children were bought (or kidnapped) from poor families in

Sudan, India, Bangladesh, and other poor countries and were trained as child jockeys. In fact,

this practice was using child slave labor to race the camels. This practice was used for

generations until it was banned in all Middle Eastern countries during 2005–2010. A major

factor that resulted in the banning was the utilization of robots.

The Robots’ Solution Racing camels was a tradition for many generations and become a lucrative sport. So, no one

wanted to discontinue it. According to Opfer (2016), there was a humanistic reason for using

robots to race camels—to save the children. Today, all camel race tracks in the Middle East

employ only robots. The robots are tied to the hump of the camels, looking like small jockeys

and are remote controlled from cars that drive parallel to the racing camels. The owners can

command the camels by voice, and they can also operate a mechanical whip to beat the animals

so they will run faster, much like human jockeys do. Note that camels would not run unless they

hear the voice of a human or see something that looks like a human on their humps.

The Technology There is a video camera that shows the people that are in cars driving alongside of the camels,

what is going on in real time. The owner can provide voice commands to the camel from the car.

A mechanical whip attached to the hump of the camel can be remotely operated to induce the

animal.

The Results The results are astonishing. Not only was the child slavery practice eliminated, but also the speed

obtained by the camels increased. After all, the robots used weigh only 6 pounds and do not get

tired. To see how this works watch the video at youtube.com/watch?v=GVeVhWXB7sk (2:47

min.). To view a complete race,

see youtube.com/watch?v=xFCRhk4GYds (9:08 min.)(9:08 min.). You may have a chance to see the royal family when you go to the track. Finally, you can see more details

in youtube.com/watch?v=C1uYAXJIbYg (8:08 min.)(8:08 min.). Sources: Compiled from C. Chung. (2016, April 4). “Dubai Camel Race Ride- Along.” YouTube.com. youtube.com/watch?v=xFCRhk4GYds (accessed September 2018); P. Boddington. (2017, January 3). “Case Study: Robot Camel Jockeys. Yes, really.” Ethics for Artificial Intelligence; and L. Slade. (2017, December 21). “Meet the Jordanian Camel Races Using Robot Jockeys.” Sbs.com.au.

Discussion Questions 1. It is said that the robots eradicated the child slavery. Explain. 2. Why do the owners need to drive by their camels while they are racing? 3. Why not duplicate the technology for horse racing? 4. Summarize ethical aspects of this case (Read Boddington, 2017). Do this exercise

after you have read about ethics in Chapter 14.

Section 1.7 Review Questions

1. What are the major characteristics of AI? 2. List the major benefits of AI. 3. What are the major groups in the ecosystem of AI? List the major contents of each.

4. Why is machine learning so important? 5. Differentiate between narrow and general AI. 6. Some say that no AI application is strong. Why? 7. Define assisted intelligence, augmented intelligence, and autonomous intelligence. 8. What is the difference between traditional AI and augmented intelligence? 9. Relate types of AI to cognitive computing. 10. List five major AI applications for increasing the food supply. 11. List five contributions of AI in medical care.

1.8 Convergence of Analytics and AI

Until now we have presented analytics and AI as two independent entities. But, as

illustrated in the opening vignette, these technologies can be combined in solving

complex problems. In this section, we discuss the convergence of these techniques and

how they complement each other. We also describe the possible addition of other

technologies, especially IoT, that enable the solutions to very complex problems.

Major Differences between Analytics and AI

As you recall from Section 1.4, analytics process historical data using statistical, management science and other computational tools to describe situations (descriptive analytics), to predict results including forecasting (predictive analytics), and to propose recommendations for

solutions to problems (predictive analytics). The emphasis is on the statistical, management

science, and other computational tools that help analyze historical data.

AI, on the other hand, also uses different tools, but its major objective is to mimic the manner in

which people think, learn, reason, make decisions, and solve problems. The emphasis here is

on knowledge and intelligence as major tools for solving problems rather than relying on computation, which we do in analysis. Furthermore, AI also is dealing with cognitive computing.

In reality, this difference is not so clear because in advanced analytic applications, there are

situations of using machine learning (an AI technology), supporting analytics in both prediction

and prescription. In this section, we describe the convergence of intelligent technologies.

Why Combine Intelligent Systems?

Both analytics and AI and their different technologies are making useful contributions to many

organizations when each is applied by itself. But each does have limitations According to a

Gartner study, the chance that business analytics initiatives will not meet the enterprise

objectives is 70–80 percent. Namely, at least 70 percent of corporate needs are not fulfilled. In

other words, there is only a small chance that business intelligence initiatives will result in

organizational excellence. There are several reasons for this situation including:

• Predictive models have unintended effects (see Chapter 14).

• Models must be used ethically, responsibly, and mindfully (Chapter 14). They may not be

used this way.

• The results of analytics may be very good for some applications but not for others.

• Models are as good as their input data and assumptions (garbage-in, garbage-out).

• Data could be incomplete. Changing environments can make data obsolete very quickly.

Models may be unable to adapt.

• Data that come from people may not be accurate.

• Data collected from different sources can vary in format and quality.

Additional reasons for combining intelligent systems are generic to IT projects, and they are

discussed in Section 14.2.

The failure rate of AI initiatives is also high. Some of the reasons are similar to the rate of

analytics. However, a major reason is that some AI technologies need a large amount of data,

sometimes Big Data. For example, many millions of data items are fed to Alexa every day to

increase its knowledge. Without continuous flow of data, there would not be good learning in AI.

The question is whether AI and analytics (and other intelligent systems) can be combined in such

a way that there will be synergy for better results.

How Convergence Can Help?

According to Nadav (2017), business intelligence and its analytics answer most of

the why and what questions regarding the sufficiency of problem solving. Adding prescriptive analytics will add more cost but not necessarily better performance. Therefore, the next

generation of business intelligence platforms will use AI to automatically locate, visualize, and

narrate important things. This can also be used to create automatic alerts and notifications. In

addition, machine learning and deep learning can support analytics by conducting pattern

recognition and more accurate predictions. AI will help to compare actual performance with the

predicted one (see Section 14.6). Machine learning and other AI technologies also provide for

constant improvement strategy. Nadav also suggested adding expert opinions via collective

intelligence, as presented in Chapter 11.

In the remaining part of this section, we present detailed aspects of convergence of some

intelligent systems.

Big Data is Empowering AI Technologies

Big Data is characterized by its volume, variety, and velocity that exceed the reach of commonly

used hardware environments and/or the capabilities of software tools to process data. However,

today there are technologies and methods that enable capturing, cleaning, and analyzing Big

Data. These technologies and methods enable companies to make real-time decisions. The

convergence with AI and machine learning is a major force in this direction. The availability of

new Big Data analytics enables new capabilities in AI technologies that were not possible until

recently. According to Bean (2017), Big Data can empower AI due to:

• The new capabilities of processing Big Data at a much reduced cost.

• The availability of large data sets online.

• The scale up of algorithms, including deep learning, is enabling powerful AI capabilities.

MetLife Example: Convergence of AI and Big Data MetLife is a Canadian-based global insurance company that is known for its use of IT to smooth

its operation and increase customer satisfaction. To get the most from technology, the company

uses AI that has been enabled by Big Data analysis as follows:

• Tracking incidents and their outcomes has been improved by speech recognition.

• Machine learning indicates pending failures. In addition, handwritten reports made by

doctors about people injured or were sick and claims paid by the insurance company are

analyzed in seconds by the system.

• Expediting the execution of underwriting policies in property and casualty insurance is done

by using both AI and analytics.

• The back-office side of claim processing includes many unstructured data that are

incorporated in claims. Part of the analysis includes patients’ health data. Machine learning

is used to recognize anomalies in reports very quickly.

For more about AI and the insurance business, see Chapter 2. For more on the convergence of

Big Data and AI in general and at MetLife, see Bean (2017).

The Convergence of AI and the IoT

The opening vignette illustrated to us how AI technologies when combined with IoT can provide

solutions to complex problems. IoT collects a large amount of data from sensors and other

“things.” These data need to be processed for decision support. Later we will see how

Microsoft’s Cortana does this. Butner (2018) describes how combining AI and IoT can lead to

the “next-level solutions and experiences.” The emphasis in such combination is on learning

more about customers and their needs. This integration also can facilitate competitive analysis

and business operation (see the opening vignette). The combined pair of AI and IoT, especially

when combined with Big Data, can help facilitate the discovery of new products, business

processes, and opportunities. The full potential of IoT can be leveraged with AI technologies. In

addition, the only way to make sense of the data streamed from the “things” via IoT and to

obtain the insight from them is to subject them to AI analysis. Faggela (2017) provides the

following three examples of combining AI and IoT:

1. The smart thermostat of Nest Labs (see smart homes in Chapter 13). 2. Automated vacuum cleaners, like iRobot Roomba (see Chapter 2, intelligent

vacuums). 3. Self-driving vehicles (see Chapter 13).

The IoT can become very intelligent when combined with IBM Watson Analytics that includes

machine learning. Examples are presented in the opening vignette and the opening vignette

to Chapter 13.

The Convergence with Blockchain and Other Technologies

Several experts raise the possibility of the convergence of AI, analytics, and blockchain

(e.g., Corea, 2017; Kranz, 2017). The idea is that such convergence may contribute to design or

redesign of paradigms and technologies. The blockchain technology can add security to data

shared by all parties in a distributed network, where transaction data can be recorded. Kranz

believes that the conversion with blockchain will power new solutions to complex problems.

Such a convergence should include the IoT. Kranz also see a role for fog computing (Chapter 9).

Such a combination can be very useful in complex applications such as autonomous vehicles and

in Amazon’s Go (Application Case 1.10).

Application Case 1.10 Amazon Go Is Open for Business

In early 2018, Amazon.com opened its first fully automated convenience store in downtown

Seattle. The company had had success with this type of store during 2017, experimenting with

only the company’s employees.

Shoppers enter the store, pick up products, and go home. Their accounts are charged later on.

Sounds great! No more waiting in line for the packing of your goods and paying for them – no

cashiers, no hassle.

In some sense, shoppers are going through a process similar to what they do online—find desired

products/services, buy them, and wait for the monthly electronic charge.

The Shopping Process

To participate, you need a special free app on your smartphone. You need to connect it to your

regular Amazon.com account. Here is what you do next:

1. Open your app. 2. Wave your smartphone at a gate to the store. It will work with a QR code there. 3. Enter the store. 4. Start shopping. All products are prepacked. You put them in a shopping bag

(yours or one borrowed at the store). The minute you pick an item from the shelf, it is recorded in a virtual shopping cart. This activity is done by sensors/cameras. Your account is debited. If you change your mind, and return an item, the system will credit your account instantly. The sensors also track your movements in the store. (This is an issue of digital privacy; see Chapter 14, Section 14.3). The sensors are of RFID type (Chapter 13).

5. Finished shopping? Just leave the store (make sure your app is open for the gate to let you leave). The system knows that you have left and what products you took, and your shopping trip is finished. The system will total your cost, which you can check anytime on your smartphone.

6. Amazon.com records your shopping habits (again, a privacy issue), which will help your future shopping experience and will help Amazon to build recommendations for you (Chapter 2). The objective of Go is to guide you to healthy food! (Amazon sells its meal kits of healthy food there.)

NOTE: Today, only few people work in the store! Employees stock shelves and assist you

otherwise. The company plans to open several additional stores in 2018.

The Technology Used

Amazon disclosed some of the technologies used. These are deep learning algorithms, computer

vision, and sensor fusion. Other technologies were not disclosed. See

the videoyoutube.com/watch?v=NrmMk1Myrxc (1:50 min.)(1:50 min.). Sources: Condensed for C. Jarrett. (2018). “Amazon Set to Open Doors on AI-Powered Grocery Store.” Venturebeat.com. venturebeat.com/2018/01/21/amazon-set-to-open-doors-on-ai-powered-grocery-store/ (accessed September 2018); D. Reisinger. (2018, February 22). “Here Are the Next Cities to Get Amazon Go Cashier-Less Stores.” Fortune.

Questions for Case 1.9

1. Watch the video. What did you like in it, and what did you dislike? 2. Compare the process described here to a self-check available today in many

supermarkets and “big box” stores (Home Depot, etc.). 3. The store was opened in downtown Seattle. Why was the downtown location

selected? 4. What are the benefits to customers? To Amazon? 5. Will customers be ready to trade privacy for convenience? Discuss.

For a comprehensive report regarding convergence of intelligent technologies,

see reportbuyer.com/product/5023639/.

In addition to blockchain, one can include IoT and Big Data, as suggested earlier, as well as

more intelligent technologies (e.g., machine vision, voice technologies). These may have

enrichment effects. In general, the more technologies are used (presumably properly), the more

complex problems may be solved, and the more efficient the performance of the convergence

systems (e.g., speed, accuracy) will be. For a discussion, see i-scoop.eu/convergence-ai-iot-big-

data-analytics/.

IBM and Microsoft Support for Intelligent Systems Convergence

Many companies provide tools or platforms for supporting intelligent systems convergence. Two

examples follow.

IBM

IBM is combining two of its platforms to support the convergence of AI and analytics. Power AI

is a distribution platform for AI and machine learning. This is a way to support the IBM analytics

platform called Data Science Experience (cloud enabled). The combination of the two enables

improvements in data analytics process. It also enables data scientists to facilitate the training of

complex AI models and neural networks. Researchers can use the combined system for deep

learning projects. All in all, this combination provides better insight to problem solving. For

details, see FinTech Futures (2017).

As you may recall from the opening vignette, IBM Watson is also combining analytics, AI, and

IoT in cognitive buildings projects.

Microsoft’s Cortana Intelligence Suite

Microsoft offers from its AZURE cloud (Chapter 13) a combination of advanced analytics,

traditional BI, and Big Data analytics. The suite enables users to transform data into intelligent

actions.

Using Cortana, one can transform data from several sources, including from IoT sensors, and

apply both advanced analytics (e.g., data mining) and AI (e.g., machine learning) and extract

insights and actionable recommendations, which are delivered to decision makers, to apps, or to

fully automated systems. For the details of the system and the architecture of Cortana,

see mssqltips.com/sqlservertip/4360/introduction-to-microsoft-cortana-intelligence-suite/.

Section 1.8 Review Questions

1. What are the major benefits of intelligent systems convergences? 2. Why did analytics initiatives fail at such a high rate in the past? 3. What synergy can be created by combining AI and analytics? 4. Why is Big Data preparation essential for AI initiatives? 5. What are the benefits of adding IoT to intelligent technology applications? 6. Why it is recommended to use blockchain in support of intelligent applications?

7. nalytics Ecosystem 8. So you are excited about the potential of analytics, data science, and AI and want

to join this growing industry. Who are the current players, and what to do they

do? Where might you fit in? The objective of this section is to identify various

sectors of the analytics industry, provide a classification of different types of

industry participants, and illustrate the types of opportunities that exist for

analytics professionals. Eleven different types of players are identified in

an analytics ecosystem. An understanding of the ecosystem also gives the

reader a broader view of how the various players come together. A

secondary purpose of understanding the analytics ecosystem for a professional is

also to be aware of organizations and new offerings and opportunities in sectors

allied with analytics.

9. Although some researchers have distinguished business analytics professionals

from data scientists (Davenport and Patil, 2012), as pointed out previously, for

the purpose of understanding the overall analytics ecosystem, we treat them as

one broad profession. Clearly, skill needs can vary for a strong mathematician to

a programmer to a modeler to a communicator, and we believe this issue is

resolved at a more micro/individual level rather than at a macro level of

understanding the opportunity pool. We also take the widest definition of

analytics to include all three types as defined by INFORMS—

descriptive/reporting/visualization, predictive, and prescriptive as described

earlier. We also include AI within this same pool.

10. Figure 1.17 illustrates one view of the analytics ecosystem. The components of

the ecosystem are represented by the petals of an analytics flower. Eleven key

sectors or clusters in the analytics space are identified. The components of the

analytics ecosystem are grouped into three categories represented by the inner

petals, outer petals, and the seed (middle part) of the flower. The outer six petals

can be broadly termed technology providers. Their primary revenue comes from

providing technology, solutions, and training to analytics user organizations so

they can employ these technologies in the most effective and efficient manner.

The inner petals can be generally defined as the analytics accelerators. The

accelerators work with both technology providers and users. Finally, the core of

the ecosystem comprises the analytics user organizations. This is the most

important component as every analytics industry cluster is driven by the user

organizations. 11. Figure 1.17 Analytics Ecosystem.

14. The metaphor of a flower is well suited for the analytics ecosystem as multiple

components overlap each other. Similar to a living organism like a flower, all

these petals grow and wither together. Many companies play in multiple sectors

within the analytics industry and thus offer opportunities for movement within

the field both horizontally and vertically.

15. More details for the analytics ecosystem are included in our shorter book

(Sharda, Delen, and Turban, 2017) as well as in Sharda and Kalgotra (2018). Matt

Turck, a venture capitalist with FirstMark, has also developed and updates an

analytics ecosystem focused on Big Data. His goal is to keep track of new and

established players in various segments of the Big Data industry. A very nice

visual image of his interpretation of the ecosystem and a comprehensive listing

of companies is available through his Web

site: http://mattturck.com/2016/02/01/big-data-landscape/ (accessed September

2018).

1.10 Plan of the Book

The previous sections have given you an understanding of the need for information

technology in decision making, the evolution of BI, analytics, data science, and artificial

intelligence. In the last several sections, we have seen an overview of various types of

analytics and their applications. Now we are ready for a more detailed managerial

excursion into these topics along with some deep hands-on experience in some of the

technical topics. Figure 1.18 presents a plan on the rest of the book. Figure 1.18 Plan of the Book

In this chapter, we have provided an introduction, definitions, and overview of DSS, BI,

and analytics, including Big Data analytics and data science. We also gave you an

overview of the analytics ecosystem to have you appreciate the breadth and depth of

the industry. Chapters 2 and 3 cover descriptive analytics and data issues. Data clearly

form the foundation for any analytics application. Thus, we cover an introduction to

data warehousing issues, applications, and technologies. This chapter also covers

business reporting and visualization technologies and applications.

We follow the current chapter with a deeper introduction to artificial intelligence

in Chapter 2. Because data are fundamental to any analysis, Chapter 3 introduces data

issues as well as descriptive analytics, including statistical concepts and visualization.

An online chapter covers data warehousing processes and fundamentals for those who

like to dig more deeply into these issues. The next section of the book covers predictive

analytics and machine learning. Chapter 4 provides an introduction to data mining

applications and the data mining process. Chapter 5 introduces many of the common

data mining techniques: classification, clustering, association mining, and so

forth. Chapter 6 includes coverage of deep learning and cognitive

computing. Chapter 7 focuses on text mining applications as well as Web analytics,

including social media analytics, sentiment analysis, and other related topics. The

following section brings the “data science” angle into further depth. Chapter 8 covers

prescriptive analytics. Chapter 9 includes more details of Big Data analytics. It also

includes an introduction to cloud-based analytics as well as location analytics. The next

section covers robotics, social networks, AI, and IoT. Chapter 10 introduces robots in

business and consumer applications and discusses the future impact of such devices on

society. Chapter 11 focuses on collaboration systems, crowdsourcing, and social

networks. Chapter 12 reviews personal assistants, chatbots, and the exciting

developments in this space. Chapter 13 studies IoT and its potential in decision support

and a smarter society. The ubiquity of wireless and GPS devices and other sensors is

resulting in the creation of massive new databases and unique applications. A new

breed of analytics companies is emerging to analyze these new databases and create a

much better and deeper understanding of customers’ behaviors and movements. It is

leading to the automation of analytics and has spanned a new area called the “Internet

of Things.” Finally, Chapter 14 concludes with a brief discussion of security, privacy,

and societal dimensions of analytics/AI.

1.11 Resources, Links, and the Teradata University Network Connection

The use of this chapter and most other chapters in this book can be enhanced by the

tools described in the following sections.

Resources and Links

We recommend the following major organizational resources and links:

• The Data Warehousing Institute (tdwi.org).

• Data Science Central (datasciencecentral.com).

• DSS Resources (dssresources.com).

• Microsoft Enterprise Consortium (enterprise.waltoncollege.uark.edu/mec.asp).

Vendors, Products, and Demos

Most vendors provide software demos of their products and applications. Information about

products, architecture, and software is available at dssresources.com.

Periodicals

We recommend the following periodicals:

• Decision Support Systems (www.journals.elsevier.com/decision-support-systems). • CIO Insight (www.cioinsight.com).

The Teradata University Network Connection

This book is tightly connected with the free resources provided by TUN

(see www.teradatauniversitynetwork.com). The TUN portal is divided into two major parts: one

for students and one for faculty. This book is connected to the TUN portal via a special section at

the end of each chapter. That section includes appropriate links for the specific chapter, pointing

to relevant resources. In addition, we provide hands-on exercises using software and other

materials (e.g., cases) available at TUN.

The Book’s Web Site

This book’s Web site, pearsonhighered.com/sharda, contains supplemental textual material

organized as Web chapters that correspond to the printed book’s chapters. The topics of these

chapters are listed in the online chapter table of contents.

As this book went to press, we verified that all cited Web sites were active and valid. However,

URLs are dynamic. Web sites to which we refer in the text sometimes change or are

discontinued because companies change names, are bought or sold, merge, or fail. Sometimes

Web sites are down for maintenance, repair, or redesign. Many organizations have dropped the

initial “www” designation for their sites, but some still use it. If you have a problem connecting

to a Web site that we mention, please be patient and simply run a Web search to try to identify a

possible new site. Most times, you can quickly find the new site through one of the popular

search engines. We apologize in advance for this inconvenience.

Chapter Highlights

• The business environment is becoming more complex and is rapidly changing,

making decision making more difficult.

• Businesses must respond and adapt to the changing environment rapidly by

making faster and better decisions.

• A model is a simplified representation or abstraction of reality.

• Decision making involves four major phases: intelligence, design, choice, and

implementation.

• In the intelligence phase, the problem (opportunity) is identified, classified, and

decomposed (if needed), and problem ownership is established.

• In the design phase, a model of the system is built, criteria for selection are agreed

on, alternatives are generated, results are predicted, and a decision methodology is

created.

• In the choice phase, alternatives are compared, and a search for the best (or a good-

enough) solution is launched. Many search techniques are available.

• In implementing alternatives, a decision maker should consider multiple goals and

sensitivity-analysis issues.

• The time frame for making decisions is shrinking, whereas the global nature of

decision making is expanding, necessitating the development and use of

computerized DSS.

• An early decision support framework divides decision situations into nine

categories, depending on the degree of structuredness and managerial activities.

Each category is supported differently.

• Structured repetitive decisions are supported by standard quantitative analysis

methods, such as MS, MIS, and rule-based automated decision support.

• DSS use data, models, and sometimes knowledge management to find solutions for

semistructured and some unstructured problems.

• The major components of a DSS are a database and its management, a model base

and its management, and a user-friendly interface. An intelligent (knowledge-

based) component can also be included. The user is also considered to be a

component of a DSS.

• BI methods utilize a central repository called a DW that enables efficient data

mining, OLAP, BPM, and data visualization.

• BI architecture includes a DW, business analytics tools used by end users, and a

user interface (such as a dashboard).

• Many organizations employ descriptive analytics to replace their traditional flat

reporting with interactive reporting that provides insights, trends, and patterns in

the transactional data.

• Predictive analytics enables organizations to establish predictive rules that drive

the business outcomes through historical data analysis of the existing behavior of

the customers.

• Prescriptive analytics helps in building models that involve forecasting and

optimization techniques based on the principles of OR and management science to

help organizations to make better decisions.

• Big Data analytics focuses on unstructured, large data sets that may also include

vastly different types of data for analysis.

• Analytics as a field is also known by industry-specific application names, such as

sports analytics. It is also known by other related names such as data science or

network science.

• Healthcare and retail chains are two areas where analytics applications abound,

with much more to come.

• Image analytics is a rapidly evolving field leading to many applications of deep

learning.

• The analytics ecosystem can be first viewed as a collection of providers, users, and

facilitators. It can be broken into 11 clusters.

Key Terms • analytics • analytics ecosystem • artificial intelligence • augmented intelligence • Big Data analytics • business intelligence (BI) • choice phase • dashboard • data mining • decision or normative analytics • descriptive (or reporting) analytics • design phase • implementation phase • intelligence phase

• intelligent agents

• Internet of Things (IoT)

• narrow (weak) AI • online analytical processing (OLAP)

• online transaction processing (OLTP) • predictive analytics • prescriptive analytics

• strong (general) AI

Questions for Discussion 1. Survey the literature from the past six months to find one application each for

DSS, BI, and analytics. Summarize the applications on one page, and submit it with the exact sources.

2. Your company is considering opening a branch in China. List typical activities in each phase of the decision (intelligence, design, choice, and implementation) regarding whether to open a branch.

3. You are about to buy a car. Using Simon’s (1977) four-phase model, describe your activities at each step in making the decision.

4. Explain, through an example, the support given to decision makers by computers in each phase of the decision process.

5. Comment on Simon’s (1977) philosophy that managerial decision making is synonymous with the whole process of management. Does this make sense? Explain. Use a real-world example in your explanation.

6. Review the major characteristics and capabilities of DSS. How does each of them relate to the major components of DSS?

7. List some internal data and external data that could be found in a DSS for a university’s admissions office.

8. Distinguish BI from DSS. 9. Compare and contrast predictive analytics with prescriptive and descriptive

analytics. Use examples. 10. Discuss the major issues in implementing BI.

Exercises

Teradata University Network and Other Hands-On Exercises

1. Go to the TUN site teradatauniversitynetwork.com. Using the site password your instructor provides, register for the site if you have not already previously registered. Log on and learn the content of the site. You will receive assignments related to this site. Prepare a list of 20 items on the site that you think could be beneficial to you.

2. Go to. Explore the Sports Analytics page, and summarize at least two applications of analytics in any sport of your choice.

3. Go to. The TUN site, and select “Cases, Projects, and Assignments.” Then select the case study “Harrah’s High Payoff from Customer Information.” Answer the following questions about this case:

a. What information does the data mining generate?

b. How is this information helpful to management in decision making? (Be specific.)

c. List the types of data that are mined. d. Is this a DSS or BI application? Why?

4. Go to teradatauniversitynetwork.com and find the paper titled “Data Warehousing Supports Corporate Strategy at First American Corporation” (by Watson, Wixom, and Goodhue). Read the paper, and answer the following questions:

a. What were the drivers for the DW/BI project in the company? b. What strategic advantages were realized? c. What operational and tactical advantages were achieved? d. What were the critical success factors for the implementation?

5. Go to http://analytics-magazine.org/issues/digital-editions and find the January/February 2012 edition titled “Special Issue: The Future of Healthcare.” Read the article “Predictive Analytics—Saving Lives and Lowering Medical Bills.” Answer the following questions:

a. What problem is being addressed by applying predictive analytics? b. What is the FICO Medication Adherence Score? c. How is a prediction model trained to predict the FICO Medication

Adherence Score HoH? Did the prediction model classify the FICO Medication Adherence Score?

d. Zoom in on Figure 4, and explain what technique is applied to the generated results.

e. List some of the actionable decisions that were based on the prediction results.

6. Go to http://analytics-magazine.org/issues/digital-editions, and find the January/February 2013 edition titled “Work Social.” Read the article “Big Data, Analytics and Elections,” and answer the following questions:

a. What kinds of Big Data were analyzed in the article’s Coo? Comment on some of the sources of Big Data.

b. Explain the term integrated system. What is the other technical term that suits an integrated system?

c. What data analysis techniques are employed in the project? Comment on some initiatives that resulted from data analysis.

d. What are the different prediction problems answered by the models? e. List some of the actionable decisions taken that were based on the

prediction results. f. Identify two applications of Big Data analytics that are not listed in the

article. 7. Search the Internet for material regarding the work of managers and the role

analytics plays in it. What kinds of references to consulting firms, academic departments, and programs do you find? What major areas are represented? Select five sites that cover one area, and report your findings.

8. Explore the public areas of dssresources.com. Prepare a list of its major available resources. You might want to refer to this site as you work through the book.

9. Go to microstrategy.com. Find information on the five styles of BI. Prepare a summary table for each style.

10. Go to oracle.com, and click the Hyperion link under Applications. Determine what the company’s major products are. Relate these to the support technologies cited in this chapter.

11. Go to the TUN questions site. Look for BSI videos. Review the video of “Case of Retail Tweeters.” Prepare a one-page summary of the problem, proposed solution, and the reported results. You can also find associated slides on slideshare.net.

12. Review the Analytics Ecosystem section. Identify at least two additional companies in at least five of the industry clusters noted in the discussion.

13. The discussion for the analytics ecosystem also included several typical job titles for graduates of analytics and data science programs. Research Web sites such as datasciencecentral.com and tdwi.org to locate at least three similar job titles that you may find interesting for your career.

14. Go to Brainspace at MIT lab brainspace.com. View the video about “Augmented Human Intelligence.” Find the activities that deal with the enabling of meaningful combination of people and machines. Write a report.

15. Find information about IBM Watson’s activities in the healthcare field. Write a report.

16. Examine Daniel Power’s DSS Resources site at dssresources.com. Take the Decision Support Systems Web Tour (dssresources.com/tour/index.html). Explore other areas of the Web site. List at least three recent resources related to analytics. What topics do these cover?

ferences • http://canopeoapp.com/ (accessed October 2018).

• http://imazon.org.br/en/imprensa/mapping-change-in-the-amazon-how-satellite-images- are-halting-deforestation/ (accessed October 2018).

• http://www.earthcastdemo.com/2018/07/bloomberg-earthcast-customizing- weather/ (accessed October 2018)

• https://www.worldbank.org/en/news/press-release/2018/02/22/world-bank-supports- sierra-leones-efforts-in-landslide-recovery (accessed October 2018)

• Siemens.com. About Siemens. siemens.com/about/en/ (accessed September 2018). • Silvaris.com. Silvaris overview. silvaris.com (accessed September 2018). • Agah, A. (2017). Medical Applications of Artificial Intelligence. Boca Raton, FL: CRC Press. • Anthony, R. N. (1965). Planning and Control Systems: A Framework for Analysis.

Cambridge, MA: Harvard University Graduate School of Business.

• Baker, J., and M. Cameron. (1996, September). “The Effects of the Service Environment on Affect and Consumer Perception of Waiting Time: An Integrative Review and Research Propositions.” Journal of the Academy of Marketing Science, 24, pp. 338–349.

• Bean, R. (2017, May 8). “How Big Data Is Empowering AI and Machine Learning at Scale.” MIT Sloan Management Review.

• Boddington, P. (2017, January 3). “Case Study: Robot Camel Jockeys. Yes, really.” Ethics for Artificial Intelligence.

• Brainspace. (2016, June 13). “Augmenting Human Intelligence.” MIT Technology Review Insights.

• Butner, K. (2018, January 8). “Combining Artificial Intelligence with the Internet of Things Could Make Your Business Smarter.” IBM Consulting Blog.

• CDC.gov. (2015, September 21). “Important Facts about Falls.” cdc.gov/homeandrecreationalsafety/falls/adultfalls.html (accessed September 2018).

• Charles, T. (2018, May 21). “Influence of the External Environment on Strategic Decision.” Azcentral. yourbusiness.azcentral.com/influence-external-environment- strategic-decisions-17628.html/ (accessed October 2018).

• Chiguluri, V., Guthikonda, K., Slabaugh, S., Havens, E., Peña, J., & Cordier, T. (2015, June). Relationship Between Diabetes Complications and Health Related Quality of Life Among an Elderly Population in the United States. Poster presentation at the American Diabetes Association Seventy-Fifth Annual Scientific Sessions. Boston, MA.

• Chongwatpol, J., & R. Sharda. (2010, December). “SNAP: A DSS to Analyze Network Service Pricing for State Networks.” Decision Support Systems, 50(1), pp. 347–359.

• Chung, C. (2016). “Dubai Camel Race Ride- Along.” YouTube.com. youtube.com/watch?v=xFCRhk4GYds (accessed September 2018).

• Cordier, T., Slabaugh, L., Haugh, G., Gopal, V., Cusano, D., Andrews, G., & Renda, A. (2015, September). Quality of Life Changes with Progressing Congestive Heart Failure. Poster presentation at the Nineteenth Annual Scientific Meeting of the Heart Failure Society of America, Washington, DC.

• Corea, F. (2017, December 1). “The Convergence of AI and Blockchain: What’s the Deal?” Medium.com. medium.com/@Francesco_AI/the-convergence-of-ai-and-blockchain- whats-the-deal-60c618e3accc (accessed September 2018).

• Davenport, T., & SAS Institute Inc. (2014, February). “Analytics in Sports: The New Science of Winning.” sas.com/content/dam/SAS/en_us/doc/whitepaper2/iia-analytics-in-sports- 106993.pdf (accessed September 2018).

• Davenport, T. H., & Patil, D. J. (2012). “Data Scientist.” Harvard Business Review, 90, 70–76. • Deahl, D. (2018, January 7). “This Automatic Feeder Can Tell the Difference Between Your

Pets.” The Verge. • De Smet, A., et al. (2017, June). “Untangling Your Organization’s Decision Making.” McKinsey

Quarterly. • Duncan, A. (2016). “The BICC Is Dead.” https://blogs.gartner.com/alan-

duncan/2016/03/11/the-bicc-is-dead/ (accessed October 2018).

• Dundas.com. “How Siemens Drastically Reduced Cost with Managed BI Applications.” www.dundas.com/Content/pdf/siemens-case-study.pdf (accessed September 2018).

• Eckerson, W. (2003). Smart Companies in the 21st Century: The Secrets of Creating Successful Business Intelligent Solutions. Seattle, WA: The Data Warehousing Institute.

• eMarketer. (2017, May). “Artificial Intelligence: What’s Now, What’s New and What’s Next?” EMarketer Inc.

• Emc.com. (n.d.). “Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field.” emc.com/collateral/about/news/emc-data-science-study-wp.pdf (accessed September 2018)

• Emmert, Samantha. (2018, March 19). “Fighting Illegal Fishing.” Global Fishing Watch. globalfishingwatch.org/research/fighting-illegal-fishing/ (accessed October 2018).

• Faggela, D. (2017, August 24). “Artificial Intelligence Plus the Internet of Things (IoT): 3 Examples Worth Learning From.” TechEmergence.

• Faggela, D. (2018, March 29). “Artificial Intelligence Industry: An Overview by Segment.” TechEmergence.

• Fernandez, J. (2017, April). “A Billion People a Day. Millions of Elevators. No Room for Downtime.” IBM developer Works Blog. developer.ibm.com/dwblog/2017/kone-watson- video/ (accessed September 2018).

• FinTech Futures. (2017, October 11). “IBM Combining Data Science and AI for Analytics Advance.” BankingTech.com.

• Gartner, Inc. (2004). Using Business Intelligence to Gain a Competitive Edge. A special report.

• Gates, S., Smith, L. A., Fisher, J. D., et al. (2008). Systematic Review of Accuracy of Screening Instruments for Predicting Fall Risk Among Independently Living Older Adults. Journal of Rehabilitation Research and Development, 45(8), pp. 1105–1116.

• Gill, T. M., Murphy, T. E., Gahbauer, E. A., et al. (2013). “Association of Injurious Falls with Disability Outcomes and Nursing Home Admissions in Community Living Older Persons.” American Journal of Epidemiology, 178(3), pp. 418–425.

• Gorry, G. A., & Scott-Morton, M. S. (1971). “A Framework for Management Information Systems.” Sloan Management Review, 13(1), pp. 55–70.

• Havens, E., Peña, J., Slabaugh, S., Cordier, T., Renda, A., & Gopal, V. (2015, October). Exploring the Relationship Between Health-Related Quality of Life and Health Conditions, Costs, Resource Utilization, and Quality Measures. Podium presentation at the ISOQOL Twenty-Seventh Annual Conference, Vancouver, Canada.

• Havens, E., Slabaugh, L., Peña, J., Haugh, G., & Gopal, V. (2015, February). Are There Differences in Healthy Days Based on Compliance to Preventive Health Screening Measures? Poster presentation at Preventive Medicine 2015, Atlanta, GA.

• Healthcare IT News. (2017, November 9). “How AI Is Transforming Healthcare and Solving Problems in 2017.” Slideshow. healthcareitnews.com/slideshow/how-ai-transforming- healthcare-and-solving-problems-2017?page=4/ (accessed September 2018).

• Hesse, R., & G. Woolsey. (1975). Applied Management Science: A Quick and Dirty Approach. Chicago, IL: SRA Inc.

• Humana. 2016 Progress Report. populationhealth.humana.com/wp- content/uploads/2016/05/BoldGoal2016ProgressReport_1.pdf (accessed September 2018).

• INFORMS. Analytics Section Overview. informs.org/Community/Analytics (accessed September 2018).

• Jacquet, F. (2017, July 4). “Exploring the Artificial Intelligence Ecosystem: AI, Machine Learning, and Deep Learning.” DZone.

• Jarrett, C. (2018, January 21). “Amazon Set to Open Doors on AI-Powered Grocery Store.” Venturebeat.com. venturebeat.com/2018/01/21/amazon-set-to-open-doors-on-ai- powered-grocery-store/ (accessed September 2018).

• Keen, P. G. W., & M. S. Scott-Morton. (1978). Decision Support Systems: An Organizational Perspective. Reading, MA: Addison-Wesley.

• Kemper, C., and C. Breuer. (2016). “How Efficient Is Dynamic Pricing for Sports Events? Designing a Dynamic Pricing Model for Bayern Munich.” International Journal of Sports Finance, 11, pp. 4–25.

• Kranz, M. (2017, December 27). “In 2018, Get Ready for the Convergence of IoT, AI, Fog, and Blockchain.” Insights.

• Liberto, D. (2017, June 29). “Artificial Intelligence Will Add Trillion to the Global Economy: PwC.” Investopedia.

• Lollato, R., Patrignani, A., Ochsner, T. E., Rocatelli, A., Tomlinson, P. & Edwards, J. T. (2015). “Improving Grazing Management Using a Smartphone App.” www.bookstore.ksre.ksu.edu/pubs/MF3304.pdf (accessed October 2018).

• Lopez, J. (2017, August 11). “Smart Farm Equipment Helps Feed the World.” IQintel.com.

• Metz, C. (2018, February 12). “As China Marches Forward on A.I., the White House Is Silent.” The New York Times.

• Nadav, S. (2017, August 9). “Business Intelligence Is Failing; Here Is What Is Coming Next.” Huffington Post.

• Norman, A. (2018, January 31). “Your Future Doctor May Not Be Human. This Is the Rise of AI in Medicine.” Futurism.com.

• Olshansky, C. (2017, August 24). “Coca-Cola Is Bringing Artificial Intelligence to Vending Machines.” Food & Wine.

• Opfer, C. (2016, June 22). “There’s One Terrific Reason to Race Camels Using Robot Jockeys.” Howstuffworks.com.

• Padmanabhan, G. (2018, January 4). “Industry-Specific Augmented Intelligence: A Catalyst for AI in the Enterprise.” Forbes.

• Pajouh Foad, M., Xing, D., Hariharan, S., Zhou, Y., Balasundaram, B., Liu, T., & Sharda, R. (2013). Available-to-Promise in Practice: An Application of Analytics in the Specialty Steel

Bar Products Industry. Interfaces, 43(6), pp. 503– 517. dx.doi.org/10.1287/inte.2013.0693 (accessed September 2018).

• Patrignani, A., & Ochsner, T. E., (2015). Canopeo: A Powerful New Tool for Measuring Fractional Green Canopy Cover. Agronomy Journal, 107(6), pp. 2312–2320;

• PE Report. (2017, July 29). “Satellite-Based Advance Can Help Raise Farm Output by 20 Percent Experts.” Financial Express.

• PricewaterhouseCoopers Report. (2011, December). “Changing the Game: Outlook for the Global Sports Market to 2015.” pwc.com/gx/en/hospitality-leisure/pdf/changing-the- game-outlook-for-the-global-sports-market-to-2015.pdf (accessed September 2018).

• Quain, S. (2018, June 29). “The Decision-Making Process in an Organization.” Small Business Chron.

• Reisinger, D. (2018, February 22). “Here Are the Next Cities to Get Amazon Go Cashier-Less Stores.” Fortune.

• Sharda, R., Asamoah, D., & Ponna, N. (2013). “Research and Pedagogy in Business Analytics: Opportunities and Illustrative Examples.” Journal of Computing and Information Technology, 21(3), pp. 171–182.

• Sharda, R., Delen, D., & Turban, E. (2016). Business Intelligence, Analytics, and Data Science: A Managerial Perspective on Analytics. 4th ed. NJ: Pearson.

• Sharda, R., & P. Kalgotra. (2018). “The Blossoming Analytics Talent Pool: An Overview of the Analytics Ecosystem.” In James J. Cochran (ed.). INFORMS Analytics Body of Knowledge. John Wiley, Hoboken, NJ

• Silk, R. (2017, November). “Biometrics: Facial Recognition Tech Coming to an Airport Near You.” Travel Weekly, 21.

• Simon, H. (1977). The New Science of Management Decision. Englewood Cliffs, NJ: Prentice Hall.

• Slade, L. (2017, December 21). “Meet the Jordanian Camel Races Using Robot Jockeys.” Sbs.com.au.

• Slowey, L. (2017, February 16). “Look Who’s Talking: KONE Makes Elevator Services Truly Intelligent with Watson IoT.” IBM Internet of Things Blog. ibm.com/blogs/internet-of- things/kone/ (accessed September 2018).

• “Sports Analytics Market Worth by 2021.” (2015, June 25). Wintergreen Research Press Release. Covered by PR Newswire at http://www.prnewswire.com/news-releases/sports- analytics-market-worth-47-billion-by-2021-509869871.html.

• Srikanthan, H. . (2018, January 8). “KONE Improves ‘People Flow’ in 1.1 Million Elevators with IBM Watson IoT.” Generis. https://generisgp.com/2018/01/08/ibm-case-study-kone- corp/ (accessed September 2018).

• Staff. “Assisted, Augmented and Autonomous: The 3Flavours of AI Decisions.” (2017, June 28). Software and Technology.

• Tableau.com. Silvaris Augments Proprietary Technology Platform with Tableau’s Real-Time Reporting Capabilities. tableau.com/sites/default/files/case-studies/silvaris-business- dashboards_0.pdf (accessed September 2018).

• Tartar, Andre, et al. (2018, 26 July). “All the Things Satellites Can Now See from Space.” Bloomberg.com. www.bloomberg.com/news/features/2018-07-26/all-the-things- satellites-can-now-see-from-space (accessed October 2018).

• TeradataUniversityNetwork.com. (2015, Fall). “BSI: Sports Analytics—Precision Football” (video). teradatauniversity network.com/About-Us/Whats-New/BSI–Sports-Analytics- Precision-Football/ (accessed September 2018).

• Thibodeaux, W. (2017, June 29). “This Artificial Intelligence Kiosk Is Designed to Spot Liars at Airports.” Inc.com.

• Turck, Matt. “Is Big Data Still a Thing? (The 2016 Big Data Landscape).” http://mattturck.com/2016/02/01/big-data-landscape/ (accessed September 2018).

• Watson, H. (2005, Winter). Sorting Out What’s New in Decision Support. Business Intelligence Journal.

• Weldon, D. (2018, March 6). “Nearly Half of CIOs Now Plan to Deploy Artificial Intelligence.” Information Management.

• Wikipedia.org. On-base Percentage. wikipedia.org/wiki/On_base_percentage. (accessed September 2018).

• Wikipedia.org. Sabermetrics. wikipedia.org/wiki/Sabermetrics (accessed September 2018).

• Wikipedia.org. SIEMENS. wikipedia.org/wiki/Siemens (accessed September 2018). • YouTube.com. (2013, December 17). CenterPoint Energy Talks Real Time Big Data

Analytics. youtube.com/watch?v=s7CzeSlIEfI (accessed September 2018).

• Yue, P. (2017, August 24). “Baidu, Beijing Airport Launch Facial Recognition for Passenger Check-In.” China Money Network. https://www.chinamoneynetwork.com/2017/08/24/baidu-capital-airport- launch-facial-recognition-system-airport (accessed October 2018).

• Zane, E. B. (2016). Effective Decision-Making: How to Make Better Decisions Under Uncertainty And Pressure. Kindle ed. Seattle, WA: Amazon Digital Services.