luxury hotels in paris maicro

stude1207
HWA3.pdf

DEC215 Business Statistics

Dr. A. Ampountolas Fall22

Assignment sheet 3

Due date: 11.12.2022 @ 11:59pm

Exercise 1.

The National Insurance Crime Bureau says that Miami- Dade, Broward, and palm

Beach counties account for a substantial number of questionable insurance claims

referred to investigators. Assume that the number of questionable insurance claims

referred to investigators by Miami-Dade, Broward, and Palm Beach counties is

distributed as a Poisson random variable with a mean of 10 per day.

Questions:

1. What assumptions need to be made so that the number of questionable insurance

claims referred to investigators by Miami- Dade, Broward, and Palm Beach

counties is distributed as a Poisson random variable?

Making the assumptions given in (1), what is the probability that:

2. 5 questionable insurance claims will be referred to investigators by Miami-Dade,

Broward, and Palm Beach counties in a day?

3. 10 or fewer questionable insurance claims will be referred to investigators by

Miami-Dade, Broward, and Palm Beach counties in a day?

4. 11 or more questionable insurance claims will be referred to investigators by

Miami-Dade, Broward, and Palm Beach counties in a day?

Exercise 2.

Spurious correlation refers to the apparent relationship between variables that either

has no true relationship or are re-related to other variables that have not been

measured. One widely publicized stock market indicator in the United States that is an

example of spurious correlation is the relationship between the National Football

League Super Bowl winner and the per- performance of the Dow Jones Industrial

Average in that year. The indicator states that when a team that existed before the

National Football League merged with the American Football League wins the Super

Bowl, the Dow Jones Industrial Average will increase in that year. Since the first Super

1

Bowl was held in 1967 through 2012, the indicator has been correct 37 out of 46 times.

Assuming that this indicator is a random event with no predictive value, you would

expect that the indicator would be correct 50% of the time.

Questions:

1. What is the probability that the indicator would be correct 37 or more times in 46

years?

2. What does this tell you about the usefulness of this indicator?

Exercise 3.

In the Florida lottery Lotto game, you select six numbers from a pool of numbers from

1 to 53. each wager costs $1. You win the jackpot if you match all six numbers that you

have selected.

Questions:

Find the probability of:

1. winning the jackpot.

2. matching five numbers.

3. matching four numbers.

4. matching three numbers.

5. matching two numbers.

6. matching one number.

7. matching none of the numbers.

8. If you match zero, one, or two numbers, you do not win anything. What is the

probability that you will not win anything?

9. The Lotto ticket gives complete game rules and probabilities of matching zero

through six numbers. The lottery ticket has the saying ”A Win for education” on

the back of the ticket. Do you think Florida’s slogan and the printed complete

game rules and probabilities of matching zero through six numbers is an ethical

approach to running the lottery game?

Exercise 4.

The Consumer Financial Protection Bureaus consumer response team hears directly

from consumers about the challenges they face in the marketplace, brings their concerns

2

to the attention of financial institutions, and assists in addressing their complaints.

Consumer response accepts complaints related to mortgages, bank accounts and

services, private student loans, other consumer loans, and credit reporting. Of the

consumers who registered a bank account and service complaint, 41% cited ”account

management” as the type of complaint; these complaints are related to opening, closing,

or managing the account and addressing issues, such as confusing marketing, denial,

fees, statements, and joint accounts. Consider a sample of 20 consumers who registered

bank accounts and service complaints. Use the binomial model to answer the following

questions:

Questions:

1. What is the expected value, or mean, of the binomial distribution?

2. What is the standard deviation of the binomial distribution?

3. What is the probability that 10 of the 20 consumers cited ”account management”

as the type of complaint?

4. What is the probability that no more than 5 of the consumers cited ”account

management” as the type of complaint?

5. What is the probability that 5 or more of the consumers cited ”account

management” as the type of complaint?

Exercise 5.

Zagat’s publishes restaurant ratings for various locations in the United States. the file

restaurants contain the Zagat rating for food, decor, service, and the cost per person for

a sample of 100 restaurants located in New York City and in a suburb of New York

City. Develop a regression model to predict the cost per person based on a variable that

represents the sum of the ratings for food, decor, and service (use the enclosed excel file:

NYC Restaurants.xlsx).

Questions:

1. Construct a scatter plot.

2. Assuming a linear relationship, use the simple linear regression method to

compute the regression coefficients b0 and b1.

3. Interpret the meaning of the Y-intercept, b0, and the slope, b1, in this problem.

4. Predict the mean cost per person for a restaurant with a summated rating of 50.

3

5. What should you tell the owner of a group of restaurants in this geographical area

about the relationship between the summated rating and the cost of a meal?

Exercise 6.

Measuring the height of a California redwood tree is very difficult because these trees

grow to heights over 300 feet. People familiar with these trees understand that the

height of a California redwood tree is related to other characteristics of the tree,

including the diameter of the tree at the breast height of a person (in inches) and the

thickness of the bark of the tree (in inches). The excel file Redwood.xlsx contains the

height, diameter at breast height of a person, and bark thickness for a sample of 21

California redwood trees.

Questions:

1. Use the multiple regression method to compute the regression coefficients.

2. State the multiple regression equation that predicts the height of a tree based on

the tree’s diameter at breast height and the thickness of the bark.

3. Interpret the meaning of the slopes in this equation.

4. Predict the mean height for a tree that has a breast height diameter of 25 inches

and a bark thickness of 2 inches.

5. Interpret the meaning of the coefficient of multiple determination in this problem.

———

Thank you!

4