Assignment
School of Computer & Information Sciences
ITS836 Data Science and Big Data Analytics
ITS 836
HW 03 Textbook Review Exercises
Review the R code to perform all exercises
Section 3.1 Introduction to R
Section 3.2 Exploratory Data Analysis
Section 3.3 Statistical Methods for Evaluation
Save your output and attach it to your homework submission, with your name in the file name
Create at least 3 graphs with ggplot
Attach all the graphs in a PowerPoint file
HW 3.1
> ggplot(sales, aes(num_of_orders, sales_total)) + geom_point() + geom_smooth(method = lm, se = FALSE)
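The call above needs a `sales` data frame to be loaded first. As a minimal sketch, assuming the ggplot2 package is installed, the following builds a simulated stand-in for `sales` (the values are random and are not the textbook's data), draws the same scatter plot with a fitted line, and saves it. The file name is a hypothetical placeholder; replace it with one containing your own name.

```r
library(ggplot2)

# Simulated stand-in for the textbook's `sales` data (assumption: only the
# two columns used in the plot are needed)
set.seed(836)
sales <- data.frame(num_of_orders = rpois(200, lambda = 5) + 1)
sales$sales_total <- 40 * sales$num_of_orders + rnorm(200, sd = 25)

# Scatter plot with a least-squares line and no confidence band
p <- ggplot(sales, aes(x = num_of_orders, y = sales_total)) +
  geom_point(alpha = 0.5) +
  geom_smooth(method = "lm", se = FALSE) +
  labs(x = "Number of orders", y = "Total sales", title = "HW 3.1")

# Hypothetical file name; put your own name in it
ggsave("hw03_3-1_yourname.png", plot = p, width = 6, height = 4)
```

The saved PNG can then be pasted into the PowerPoint slide for this exercise.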
# section 3.1.3 Attribute and Data Types
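A short base R sketch of the basic data types may help when working through this section. All values and variable names below are illustrative and are not taken from the textbook.

```r
# Illustrative values only
i <- 1L                     # integer
x <- 2.5                    # double (numeric)
s <- "analytics"            # character
flag <- x > i               # logical

class(i)
typeof(x)
is.character(s)
is.logical(flag)

# Atomic vectors hold one type; mixing types coerces to the most general
v <- c(1, "a", TRUE)
class(v)

# Factors store categorical data as integer codes plus a set of levels
size <- factor(c("small", "large", "medium", "small"),
               levels = c("small", "medium", "large"), ordered = TRUE)
levels(size)
as.integer(size)

# A data frame is a list of equal-length columns, each with its own type
df <- data.frame(id = 1:3,
                 score = c(90.5, 82, 77.25),
                 passed = c(TRUE, TRUE, FALSE))
sapply(df, class)
str(df)
```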
Statistical Analysis
3.3.1 Hypothesis Testing
3.3.2 Difference of Means
3.3.3 Wilcoxon Rank-Sum Test
3.3.4 Type I and Type II Errors
3.3.5 Power and Sample Size
3.3.6 ANOVA (Analysis of Variance)
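The topics above map onto functions in R's built-in stats package. The sketch below uses simulated samples; the means, standard deviations, sample sizes, and group labels are assumptions chosen for illustration, not values from the textbook.

```r
set.seed(836)

# Simulated samples (assumed parameters)
x <- rnorm(30, mean = 100, sd = 10)
y <- rnorm(30, mean = 105, sd = 10)

# 3.3.2 Difference of means: Welch's t-test (equal variances not assumed)
tt <- t.test(x, y)
tt$p.value

# 3.3.3 Wilcoxon rank-sum test: a nonparametric alternative
wt <- wilcox.test(x, y)
wt$p.value

# 3.3.5 Power and sample size: n per group needed to detect a 5-unit
# difference at the 5% significance level with 80% power
pw <- power.t.test(delta = 5, sd = 10, sig.level = 0.05, power = 0.8)
ceiling(pw$n)

# 3.3.6 ANOVA: compare means across three groups
d <- data.frame(
  value = c(x, y, rnorm(30, mean = 110, sd = 10)),
  group = factor(rep(c("A", "B", "C"), each = 30))
)
fit <- aov(value ~ group, data = d)
summary(fit)
TukeyHSD(fit)   # pairwise comparisons after a significant ANOVA
```

A Type I error corresponds to rejecting a true null hypothesis (controlled by `sig.level`), and a Type II error to failing to reject a false one (one minus `power`).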
“R for Data Science” 5 Modules
R for Data Science, Garrett Grolemund & Hadley Wickham
https://r4ds.had.co.nz/index.html
I Explore
II Wrangle
III Program
IV Model
V Communicate
Wrangle Part I
10 Tibble
10.5
11 Data Import
11.2.2
11.3.5
12 Tidy Data
12.2.1
12.3.3
12.4.3
12.5.1
12.6 Case Study
12.6.1
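As a warm-up for the tibble and tidy data exercises, here is a minimal sketch assuming the tibble and tidyr packages are installed. The table `wide` and its values are hypothetical; it only mimics the common problem of column names that are really values of a variable.

```r
library(tibble)
library(tidyr)

# Hypothetical wide table: the year columns are values, not variables
wide <- tribble(
  ~country, ~`1999`, ~`2000`,
  "A",          10,      15,
  "B",         200,     260
)

# Tidy it: one row per country-year observation
long <- pivot_longer(wide,
                     cols = c(`1999`, `2000`),
                     names_to = "year",
                     values_to = "cases")
long

# Tibbles never convert strings to factors and print only the first rows
class(long)

# The inverse operation spreads the observations back out
pivot_wider(long, names_from = year, values_from = cases)
```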
Wrangle Part II
13 Relational Data
13.2.1
13.3.1
13.4.6
13.5.1
14 Strings
14.2.5
14.3.3.1, 14.3.4.1, 14.3.5.1
14.4.1.1, 14.4.2.1, 14.4.3.1, 14.4.4.1, 14.4.5.1
14.5.1
14.7.1
https://r4ds.had.co.nz/relational-data.html
https://r4ds.had.co.nz/strings.html
https://r4ds.had.co.nz/factors.html
https://r4ds.had.co.nz/dates-and-times.html
15 Factors
15.3.1
15.4.1
15.5.1
16 Dates and Times
16.2.4
16.3.4
16.4.5
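The four chapters above rely on dplyr, stringr, forcats, and lubridate respectively. The sketch below shows one or two representative calls from each, assuming those packages are installed. The tables `students` and `grades`, the factor `f`, and the date `due` are hypothetical examples, not data from the book.

```r
library(dplyr)
library(stringr)
library(forcats)
library(lubridate)

# Hypothetical tables sharing the key column `id`
students <- tibble(id = c(1, 2, 3), name = c("Ana Li", "Bo Chen", "Cy Diaz"))
grades   <- tibble(id = c(1, 2, 4), grade = c("A", "B", "A"))

# 13 Relational data: mutating and filtering joins
joined  <- left_join(students, grades, by = "id")  # keeps every student
missing <- anti_join(students, grades, by = "id")  # students with no grade

# 14 Strings: length, pattern detection, replacement
str_length(students$name)
str_detect(students$name, "^B")
str_replace(students$name, " ", "_")

# 15 Factors: reorder levels by how often each one occurs
f <- factor(c("B", "A", "B", "C"))
levels(fct_infreq(f))

# 16 Dates and times: parsing and arithmetic
due <- ymd("2020-02-15")   # hypothetical date
due + days(7)
wday(due, label = TRUE)
```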
Homework Exercise Submission
Share the code (copy it into PowerPoint)
Submit plots in PowerPoint format
Use the existing PowerPoint slide format
Put the exercise number in the slide title
Include your name and ID number
Upload
Questions?