Help with Statistics 200

HelloThere
STAT_Final.pdf

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 1 of 10

STAT 200

OL4/US2 Sections

Final Exam

Fall 2018

The final exam will be posted at 12:01 am on December 14, and it is

due at 11:59 pm on December 16, 2018. Eastern Time is our

reference time.

This is an open-book exam. You may refer to your text and other course materials

for the current course as you work on the exam, and you may use a calculator. You

must complete the exam individually. Neither collaboration nor consultation with

others is allowed. It is a violation of the UMUC Academic Dishonesty and

Plagiarism policy to use unauthorized materials or work from others.

Answer all 20 questions. Make sure your answers are as complete as possible.

Show all of your supporting work and reasoning. Answers that come straight from

calculators, programs or software packages without any explanation will not be

accepted. If you need to use technology (for example, Excel, online or hand-held

calculators, statistical packages) to aid in your calculation, you must cite the sources

and explain how you get the results.

Record your answers and work on the separate answer sheet provided.

This exam has 20 questions; 5% for each question.

You must include the Honor Pledge on the title page of your submitted final exam.

Exams submitted without the Honor Pledge will not be accepted.

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 2 of 10

1. The U.S. Census Bureau needs to estimate the median income of females in the U.S. They collect

incomes from 3500 females. Choose the best answer. Justify for full credit.

(a) Which of the followings is the parameter?

(i) Set of income responses from 3500 females in the US

(ii) Median Income of set of 3500 females in the US

(iii) Set of income responses from all females in the US

(iv) Median income of set of all females in the US

(b) Which of the followings is the sample?

(i) Set of income responses from 3500 females in the US

(ii) Median income of set of 3500 females in the US

(iii) Set of income responses from all females in the US

(iv) Median income of set of all females in the US

2. Choose the best answer. Justify for full credit.

(a) What type of data are student ID numbers considered?

(i) interval

(ii) nominal

(iii) ordinal

(iv) ratio

(b) The quality control department of a semiconductor manufacturing company tests every 100th

product from the assembly line. This type of sampling is called:

(i) cluster

(ii) convenience

(iii) systematic

(iv) stratified

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 3 of 10

3. The midterm exam scores in a statistics class are shown in the following table:

67 92 76 45 85 70 87 84 67 72

84 85 55 76 84 98 59 93 87 83

(a) Complete the following frequency distribution table using 6 classes: 40-49, 50-59, 60-69,

70-79, 80-89, and 90-99. Express the cumulative relative frequency to two decimal places.

(Show all work. Just the answer, without supporting work, will receive no credit.)

Scores Frequency Relative

Frequency

Cumulative

Relative

Frequency

40 - 49

50 - 59

60 - 69

70 - 79

80 - 89

90 - 99

(b) What percentage of the midterm exam scores was at least 80?

4. Answer the following questions based on the midterm exam score data given in Question # 3:

(Show all work. Just the answer, without supporting work, will receive no credit.)

(a) What is the range of the midterm exam scores?

(b) What is the median of the midterm exam scores?

(c) What is the mode of the midterm exam scores?

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 4 of 10

5. A STAT 200 professor took a sample of 10 midterm exam scores from a class of 30 students. The

10 scores are shown in the table below:

95 68 76 51 85 70 89 84 67 72

(a) What is the sample mean?

(b) What is the sample standard deviation? (Round your answer to two decimal places)

(c) If you leveraged technology to get the answers for part (a) and/or part (b), what technology

did you use? If an online applet was used, please list the URL, and describe the steps. If a

calculator or Excel was used, please write out the function.

6. There are 4 suits (heart, diamond, clover, and spade) in a 52-card deck, and each suit has 13

cards. Suppose your experiment is to draw one card from a deck and observe what suit it is.

Express the probability in fraction format. (Show all work. Just the answer, without supporting

work, will receive no credit.)

(a) Find the probability of drawing a diamond or clover.

(b) Find the probability that the card is not a spade.

7. There are 6 white balls and 4 red balls in an urn. Consider selecting one ball at a time from the

urn. What is the probability that the first ball is white and the second ball is also white? Express

the probability in fraction format. (Show all work. Just the answer, without supporting work, will

receive no credit.)

(a) Assuming the ball selection is with replacement.

(b) Assuming the ball selection is without replacement.

8. There are twenty stores for a grocery chain in the Mid-Atlantic region. The regional executive

wants to visit five of the twenty stores. She asks her assistant to choose five stores and arrange the

visit schedule. (Show all work. Just the answer, without supporting work, will receive no credit).

(a) Does the order matter in the scheduling?

(b) Based on your answer to part (a), should you use permutation or combination to find the

different schedules that the assistant may arrange?

(c) How many different schedules can the assistant recommend?

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 5 of 10

9. Mimi has eight books from the Statistics is Fun series. She plans on bringing two of the eight

books with her in a road trip. (Show all work. Just the answer, without supporting work, will

receive no credit).

(a) Does the order matter in the book selection?

(b) Based on your answer to part (a), should you use permutation or combination to find the

number of the different ways the two books can be selected?

(c) How many different ways can the two books be selected?

10. Let random variable x represent the number of heads when a fair coin is tossed three times.

(a) Construct a table describing the probability distribution.

x P(x)

0

1

2

3

(b) Determine the mean and standard deviation of x. Show all work. Just the answer, without supporting

work, will receive no credit.

11. Mimi joined UMUC basketball team since summer 2018. On average, she is able to score 30% of the

field goals. Assume she tries 15 field goals in a game.

(a) Let X be the number of field goals that Mimi scores in the game. As we know, the distribution of X

is a binomial probability distribution. What is the number of trials (n), probability of successes (p)

and probability of failures (q), respectively?

(b) Find the probability that Mimi scores at least 5 of the 15 field goals. (Round the answer to 3 decimal

places).

(c) To get the answers for part (b), what technology did you use? If an online applet was used,

list the URL and describe the steps. If a calculator or Excel was used, write out the function.

12. The heights of pecan trees are normally distributed with a mean of 10 feet and a standard deviation

of 2 feet. Show all work. Just the answer, without supporting work, will receive no credit.

(a) What is the probability that a randomly selected pecan tree is between 8.5 and 12.5 feet tall?

(round the answer to 4 decimal places)

(b) Find the 90th percentile of the pecan tree height distribution. (round the answer to 2 decimal places)

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 6 of 10

(c) To get the answers for part (a) and part (b), what technology did you use? If an online applet was

used, list the URL and describe the steps. If a calculator or Excel was used, write out the function

13. Based on the performance of all individuals who tested between July 1, 2014 and June 30, 2017,

the GRE Quantitative Reasoning scores are normally distributed with a mean of 152.8 and a

standard deviation of 9.13. (https://www.ets.org/s/gre/pdf/gre_guide_table1a.pdf). Show all work.

Just the answer, without supporting work, will receive no credit.

(a) Consider all random samples of 36 test scores. What is the standard deviation of the sample

means? (Round your answer to three decimal places)

(b) What is the probability that 36 randomly selected test scores will have a mean test score that is

between 150 and 155? (Round your answer to four decimal places)

(c) To get the answer for part (b), what technology did you use? If an online applet was used, list the

URL and describe the steps. If a calculator or Excel was used, write out the function

14. A survey showed that 850 of the 1600 adult respondents who live in a household without landline

phones.

(a) Construct a 95% confidence interval estimate of the proportion of adults living in a household

without landline phones. Show all work. Just the answer, without supporting work, will receive no

credit. Include description of how confidence interval was constructed.

(b) Describe the confidence interval in everyday language.

15. A random sample of 900 SAT scores has a sample mean of 1100. Assume that SAT scores have a

population standard deviation of 300.

(a) Construct a 95% confidence interval estimate of the mean SAT scores. Show all work. Just the

answer, without supporting work, will receive no credit. Include description of how confidence

interval was constructed.

(b) Describe the confidence interval in everyday language.

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 7 of 10

16. A researcher claims the proportion of adults who live in a household without landline phones is

greater than 50%. A survey showed that 850 of the 1600 adult respondents who live in a

household without landline phones.

Assume you want to use a 0.05 significance level to test the researcher’s claim.

(a) What is the appropriate hypothesis test to use for this analysis: one-sample z-test for the

population proportion, one-sample t-test for population proportion, one-sample z-test for

population mean, or one-sample t- test for population mean? Please identify and explain why it is

appropriate.

(b) Identify the null hypothesis and the alternative hypothesis.

(c) Determine the test statistic. Round your answer to two decimal places. Show all work; writing the

correct test statistic, without supporting work, will receive no credit.

(d) Determine the P-value for this test. Round your answer to three decimal places. Show all work;

writing the correct P-value, without supporting work, will receive no credit.

(e) Compare p-value and significance level α. What decision should be made regarding the null hypothesis

(e.g., reject or fail to reject) and why?

(f) Is there sufficient evidence to support the researcher’s claim that the proportion of adults living in a

household without landline phones is greater than 50%? Explain.

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 8 of 10

17. In a study of memory recall, 5 people were given 10 minutes to memorize a list of 20 words.

Each was asked to list as many of the words as he or she could remember both 1 hour and 24

hours later. The result is shown in the following table.

Number of Words Recalled

Subject 1 hour later 24 hours later

1 14 12

2 18 15

3 11 9

4 13 12

5 12 12

Is there evidence to suggest that the mean number of words recalled after 1 hour exceeds the

mean recall after 24 hours?

Assume we want to use a 0.05 significance level to test the claim.

(a) What is the appropriate hypothesis test to use for this analysis: z-test for two proportions, t-test for

two proportions, t-test for two dependent samples (matched pairs), or t-test for two independent

samples? Please identify and explain why it is appropriate.

(b) Let μ1 = mean number of words recalled 1 hour later. Let μ2 = mean number of words recalled 24

hours later. Which of the following statements correctly defines the null hypothesis?

(i) μ1 - μ2 > 0 (μd > 0)

(ii) μ1 - μ2 = 0 (μd = 0)

(iii) μ1 - μ2 < 0 (μd < 0)

(c) Let μ1 = mean number of words recalled 1 hour later. Let μ2 = mean number of words recalled 24

hours later. Which of the following statements correctly defines the alternative hypothesis?

(i) μ1 - μ2 > 0 (μd > 0)

(ii) μ1 - μ2 = 0 (μd = 0)

(iii) μ1 - μ2 < 0 (μd < 0)

(d) Determine the test statistic. Round your answer to three decimal places. Show all work; writing the

correct test statistic, without supporting work, will receive no credit.

(e) Determine the p-value. Round your answer to three decimal places. Show all work; writing the

correct critical value, without supporting work, will receive no credit.

(f) Compare p-value and significance level α. What decision should be made regarding the null

hypothesis (e.g., reject or fail to reject) and why?

(g) Is there sufficient evidence to support the claim that the mean number of words recalled after 1

hour exceeds the mean recall after 24 hours? Justify your conclusion.

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 9 of 10

18. The UMUC Daily News reported that the color distribution for plain M&M’s was: 40% brown,

20% yellow, 20% orange, 10% green, and 10% tan. Each piece of candy in a random sample of

100 plain M&M’s was classified according to color, and the results are listed below. Use a 0.05

significance level to test the claim that the published color distribution is correct. Show all work

and justify your answer.

Color Brown Yellow Orange Green Tan

Number 38 22 13 9 18

(a) What is the appropriate hypothesis test: z-test for sample proportion, t-test for sample mean, chi-

square goodness of fit test, F-test for ANOVA? Please identify and explain why it is appropriate

for analyzing this data.

(b) Identify the null hypothesis and the alternative hypothesis.

(c) Determine the test statistic. Round your answer to two decimal places. Show all work; writing the

correct test statistic, without supporting work, will receive no credit.

(d) Determine the P-value. Round your answer to two decimal places. Show all work; writing the

correct P-value, without supporting work, will receive no credit.

(e) Compare p-value and significance level α. What decision should be made regarding the null

hypothesis (e.g., reject or fail to reject) and why?

(f) Is there sufficient evidence to support the claim that the published color distribution is correct?

Justify your answer.

19. A STAT 200 instructor believes that the average quiz score is a good predictor of final exam score.

A random sample of 10 students produced the following data where x is the average quiz score and

y is the final exam score.

x 86 95 50 65 98 55 85 70 75 85

y 85 96 60 63 96 60 83 60 77 87

(a) Find an equation of the least squares regression line. Round the slope and y-intercept value to

two decimal places. Describe method for obtaining results. Show all work; writing the correct

equation, without supporting work, will receive no credit.

(b) Based on the equation from part (a), what is the predicted final exam score if the average quiz

score is 80? Show all work and justify your answer.

(c) Based on the equation from part (a), what is the predicted final exam score if the average quiz

score is 40? Show all work and justify your answer.

(d) Which predicted final exam score that you calculated for (b) and (c) do you think is closer to the

true final exam score and why?

STAT 200: Introduction to Statistics Final Examination, Fall 2018 OL4_US2 Page 10 of 10

20. What is the appropriate statistical analysis to use: t-test for two independent samples, t-test for two

dependent samples, ANOVA, or chi-square test of independence? Please identify and explain why

it is appropriate.

(a) A study was conducted to see whether monetary incentives to use less water during times of

drought had an effect on water usage. Sixty single family homeowners were randomly

assigned to one of two groups: 1) monetary incentives and 2) no monetary incentives. At the

end of three months, the total amount of water usage for each household, in gallons, was

measured.

(b) A study was conducted to see whether the mean weight loss is the same for 10 different weight

loss programs. Each of the 10 programs had 50 subjects in it. The subjects were followed for

12 months. Weight change for each subject was recorded.