Business Statistics


“Cover page for the word file: you must ensure you submit the correct file, and check afterwards that you have submitted the correct file. Email [email protected] immediately if you accidentally submit something that contains someone else’s work.”

“Title: Semester 3, 2020 BUS105 computing assignment” “Name:” “Student number:” “Sample Number: ”

“I am using the sample that is allocated to me based on my student number”

Instructions for the computing assignment: word file worth 18% of your final grade and excel file worth 2% of your final grade

Overview

Materials that must be used in the assignment - these are provided on moodle

· Videos that show how to get the correct output. Note that the vast majority of marks come from commenting on the output.

Use this video to check that you have the correct output for question 1a and 6a

https://youtu.be/WqeNW0rbVH8

Use this video to check that you have the correct output for question 2a and 7a and 7c

https://youtu.be/gfRqxHwnwXk

Use this video to check that you have the correct output for question 3a and 8a and 8c

https://youtu.be/5XzOvWtPam8

· An excel file with the datasets for all the students. Each student must follow the instructions and get 3 datasets using their student number, so each student will have different datasets.

· Automatic dataset summarizer.

Submission

1 Students must submit a word file (worth 18%) to Moodle AND an excel file (worth 2%) to moodle

2 The word file needs to be submitted to the Turnitin link – instructions are given on page 6

3 The word file needs a cover page

4 The word file needs the answers to 9 questions, given in full detail later in this document. A vital part of answering the questions is using the datasets and the dataset summarizer.

5 The excel file needs to be submitted to the assignment dropbox.

6 The excel file should have the student’s 3 datasets and summaries NOT made by the automatic dataset summarizer - summarize the dataset using PivotTables and the scatterplot. (Instructions for submitting the excel file are given on page 7)

The computing assignment also consists of 5 preparation quizzes worth 1% each. These preparation quizzes are on moodle.

Instructions for the major part of the assignment: the word file, worth 18% of your final grade, which you submit to Turnitin.

Overview

You need to submit a word file with the answers to 9 questions - the first 8 questions are about the datasets

The last question is a paraphrasing task (refer to page 5)

You will use your datasets and the automatic dataset summarizer to get the descriptive statistics that are used in questions 1 to 5 and the inferential statistics that are used in questions 6 to 8. To check that you have correctly obtained your datasets, check that both p-values are correct when you investigate both categorical variables (questions 6 to 8). There are videos on moodle explaining how to check that you have properly obtained your sample.

Use this video to check that you have the correct output for question 1a and 6a

https://youtu.be/WqeNW0rbVH8

Use this video to check that you have the correct output for question 2a and 7a and 7c

https://youtu.be/gfRqxHwnwXk

Use this video to check that you have the correct output for question 3a and 8a and 8c

https://youtu.be/5XzOvWtPam8

The word count can be less than 1500 words if you are giving answers that demonstrate you have understood the material.

Summary of the datasets (questions 1 to 8 are about the datasets)

Dataset 1

University XYZ records the following information for 100 students

Number of Files downloaded from moodle and final mark

Dataset 2

University XYZ records the following information for 100 students

1) Is the student learning online?

2) How many files were downloaded from moodle?

Dataset 3

University XYZ gives out a survey to 100 students in a statistics course

The survey questions were

1) Do you think the course is useful?

2) Did you learn the course online, or in the classroom on campus?

The dataset consists of the questions and the answers to the survey above.

Question 1

Paste dataset 1 into the dataset summarizer

a) Paste the descriptive sample statistics and the scatterplot into the word file. The descriptive statistics let you investigate the relationship between the variables “Number of files downloaded from moodle?” and “Final mark?” using the sample

b) Use the output in part (a) to describe the relationship between the two variables. Do not use any numbers in your discussion.

c) Also describe the relationship using one of the following numbers (select the correct option)

· The difference between sample means -

· The difference between sample proportions -

· The correlation coefficient r

d) Write an equation that lets you predict the final mark Y given the number of files downloaded X

e) Use the information in part (d) to predict the final mark if 10 files are downloaded from moodle
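
As a way to check your understanding of parts (d) and (e), here is a minimal sketch of how a least-squares prediction equation Y = b0 + b1·X is obtained and used. It assumes the summarizer reports an ordinary least-squares line; the function name `fit_line` and the five data points are hypothetical and are not taken from any assignment dataset.

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and sum of cross-products of deviations.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

# Hypothetical data (assumption): files downloaded and final marks for 5 students.
files = [2, 4, 6, 8, 10]
marks = [50, 55, 65, 70, 80]

b0, b1 = fit_line(files, marks)
predicted_mark = b0 + b1 * 10  # prediction when X = 10 files are downloaded
print(f"Y = {b0:.2f} + {b1:.2f} X, predicted mark at X = 10: {predicted_mark:.2f}")
```

Your own equation must come from your allocated dataset 1, so your coefficients will differ from these.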

Question 2

Paste dataset 2 into the dataset summarizer

a) Paste the descriptive statistics into the word file. The descriptive sample statistics let you investigate the relationship between the variables “Is the student learning online?” and “Number of files downloaded from Moodle?” using the sample

b) Describe the relationship between the variables without using any numbers

c) Describe the relationship between the variables using one of the following numbers, select the correct option

· The difference between sample means -

· The difference between sample proportions -

· The correlation coefficient r
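
For reference, the three candidate numbers listed in the options can each be computed as sketched below. This is only an illustration of the definitions; it does not say which option fits which question. The function names and all the numbers are hypothetical.

```python
from math import sqrt

def difference_in_means(group_a, group_b):
    """Mean of a numeric variable in group A minus its mean in group B."""
    return sum(group_a) / len(group_a) - sum(group_b) / len(group_b)

def difference_in_proportions(successes_a, n_a, successes_b, n_b):
    """Proportion of 'successes' in group A minus the proportion in group B."""
    return successes_a / n_a - successes_b / n_b

def correlation(xs, ys):
    """Pearson correlation coefficient r between two numeric variables."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy)

# Hypothetical inputs (assumptions), chosen only to make the arithmetic easy to follow.
d_means = difference_in_means([12, 15, 9], [7, 9, 8])   # 12 - 8
d_props = difference_in_proportions(30, 50, 20, 50)     # 0.6 - 0.4
r = correlation([1, 2, 3], [2, 4, 6])                   # points lie exactly on a rising line
print(d_means, round(d_props, 3), round(r, 3))
```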

Question 3

Paste dataset 3 into the dataset summarizer

a) Paste the descriptive statistics into the word file. The descriptive sample statistics let you investigate the relationship between the variables “Did you learn the course online?” and “Do you think the course is useful ?” using the sample

b) Describe the relationship between the variables without using any numbers

c) Describe the relationship between the two variables using one of the following numbers, choose the correct option

· The difference between sample means -

· The difference between sample proportions -

· The correlation coefficient r

Question 4

Paste dataset 1 into the summarizer to obtain the information needed to answer the question below

a) Considering the variable “files downloaded”, find the z-score of the sample mean if you assume the population mean is µ=10 and the population standard deviation is σ=6

b) Considering the variable “Final mark”, find the z-score of the sample mean if you assume the population mean is µ=5 and the population standard deviation is σ=3
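
The z-score of a sample mean is usually computed as z = (x̄ − µ) / (σ / √n), where σ / √n is the standard error of the mean. The sketch below assumes that definition is the one intended here. It uses µ=10 and σ=6 from part (a) and n=100 from the dataset description; the sample mean of 11.2 is a made-up value, so replace it with the mean from your own summarizer output.

```python
from math import sqrt

def z_score_of_sample_mean(sample_mean, mu, sigma, n):
    """z = (sample mean - population mean) / (sigma / sqrt(n))."""
    standard_error = sigma / sqrt(n)
    return (sample_mean - mu) / standard_error

# mu, sigma and n follow part (a); the sample mean 11.2 is hypothetical (assumption).
z = z_score_of_sample_mean(sample_mean=11.2, mu=10, sigma=6, n=100)
print(round(z, 2))
```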

Question 5

Paste dataset 2 into the summarizer to obtain the information needed to answer the question below. You also need to use the t table given at the end of sample final exams

a) Considering only the students who are learning On campus, find a 95% confidence interval for the average number of files downloaded

b) Considering only the students who are learning Online, find a 95% confidence interval for the average number of files downloaded
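
A t-based confidence interval for a mean has the form x̄ ± t* · s / √n, where t* is read from the t table with n − 1 degrees of freedom. The following is a minimal sketch under that assumption. The ten values are hypothetical, and the critical value 2.262 is the 95% table entry for 9 degrees of freedom; with your own group size you must look up the entry for your own degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev

def t_confidence_interval(values, t_critical):
    """Return (lower, upper) for mean +/- t_critical * s / sqrt(n)."""
    n = len(values)
    centre = mean(values)
    margin = t_critical * stdev(values) / sqrt(n)  # stdev is the sample standard deviation
    return centre - margin, centre + margin

# Hypothetical files-downloaded counts for 10 students in one group (assumption).
group_files = [4, 6, 8, 10, 12, 5, 7, 9, 11, 8]
T_CRITICAL_DF9 = 2.262  # 95% two-sided t value for n - 1 = 9 degrees of freedom

low, high = t_confidence_interval(group_files, T_CRITICAL_DF9)
print(f"95% CI: ({low:.2f}, {high:.2f})")
```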

Question 6

Paste dataset 1 into the dataset summarizer

a) Paste in computer output that measures evidence for the claim there is a relationship between the variables “Files downloaded from moodle?” and “Final mark?” if you consider the whole population

b) Make suitable comments about the output in part (a)

c) If another sample had a lower coefficient of determination R², would you expect the p-value to be lower or higher?
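
When thinking about part (c), it may help to see how the test statistic depends on R². In simple linear regression with one predictor, the size of the t statistic for the slope can be written as |t| = √( R² (n − 2) / (1 − R²) ), and the p-value is then read from the t distribution with n − 2 degrees of freedom. The sketch below evaluates this for two hypothetical R² values with the same sample size; the function name and the R² values are assumptions for illustration only.

```python
from math import sqrt

def t_statistic_from_r_squared(r_squared, n):
    """|t| for testing 'slope = 0' in simple linear regression with n observations."""
    return sqrt(r_squared * (n - 2) / (1 - r_squared))

n = 100  # sample size stated for dataset 1
t_for_higher_r2 = t_statistic_from_r_squared(0.36, n)  # hypothetical R² (assumption)
t_for_lower_r2 = t_statistic_from_r_squared(0.04, n)   # hypothetical R² (assumption)
print(round(t_for_higher_r2, 2), round(t_for_lower_r2, 2))
```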

Question 7

Paste dataset 2 into the dataset summarizer

a) Paste in inferential statistics that measure evidence for the claim there is a relationship between the variables “Is the student learning online?” and “How many files were downloaded from Moodle?” if you consider the whole population

b) Make suitable comments about the output in part (a)

c) Go back to the dataset summarizer and scroll down. Paste in the output for question 7c given below the inferential statistics

d) Compare the case given in question 2 part (a) with the output in part (c) above. In which case is there more evidence of a relationship between the variables? Which case would have a lower p-value?

Question 8

Paste dataset 3 into the dataset summarizer

a) Paste in computer output that measures evidence for the claim there is a relationship between the variables “Do you think the course is useful ?” and “Did you learn the course online?” if you consider the whole population

Hint: inferential statistics measure evidence for a claim.

b) Make suitable comments about the output in part (a)

c) Go back to the dataset summarizer and scroll down. Paste in the output for question 8c given below the inferential statistics

d) Compare the case given in question 3 part (a) with the case given in part (c) above. In which case is there more evidence of a relationship between the variables? Which case would have a lower p-value?

Question 9

Briefly discuss the sample report given in the link below (you need to click download; logging in will not work). In particular, discuss the data, how the data was analysed, the main message of the report, and how it is communicated.

https://app.box.com/s/xf1f5vazt3deb9p6c35mmhma1ltnlfq1

Do not cut and paste text and use a computer to randomly change the words.

Upload the word file to the Turnitin link on moodle

Instructions for the excel file

This is worth 2% of your final grade. You have to use the excel commands discussed below and not the dataset summarizer. However, you should check that your summaries are the same as the output from the dataset summarizer you used in the word file. If you have different information you will get at most 1 out of 2.

You need to cut and paste just your dataset into a new excel file and follow the instructions below. DO NOT use a cover page for the excel file. You must check that you have the correct sample.

Note that you do not have to use excel to make the summaries; you can use google sheets.

A) Select all of dataset 1 and use excel commands to make a graph that lets you investigate the relationship between the fields (variables) “Number of files downloaded?” and “mark?”

B) Select all of dataset 2 and use excel PivotTable commands (or google sheet pivot table commands) to find appropriate sample statistics that let you investigate the relationship between the fields (variables) “course online?” and “Number of Files Downloaded?”

C) Select all of dataset 3 and use excel PivotTable commands (or google sheet pivot table commands) to find appropriate sample statistics that let you investigate the relationship between the fields (variables) “Is the course online?” and “Is the Course useful?”

D) Upload the excel file with the pivot tables and scatterplot to the assignment dropbox
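
A pivot table on two categorical fields, as in step C, amounts to a two-way table of counts. The sketch below shows that idea outside a spreadsheet, purely as a way to understand what the pivot table is doing; it does not replace the excel or google sheets work required above. The function names and the six survey responses are hypothetical.

```python
from collections import Counter

def two_way_counts(row_values, col_values):
    """Count how often each (row category, column category) pair occurs."""
    return Counter(zip(row_values, col_values))

def row_proportion(counts, row_value, col_value):
    """Share of the rows with row_value that also have col_value."""
    row_total = sum(c for (r, _), c in counts.items() if r == row_value)
    return counts[(row_value, col_value)] / row_total

# Hypothetical survey answers for 6 students (assumption).
online = ["Yes", "Yes", "No", "No", "Yes", "No"]
useful = ["Yes", "Yes", "No", "Yes", "Yes", "No"]

counts = two_way_counts(online, useful)
p_useful_online = row_proportion(counts, "Yes", "Yes")
p_useful_campus = row_proportion(counts, "No", "Yes")
print(dict(counts), round(p_useful_online, 2), round(p_useful_campus, 2))
```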
