week 2_discussion- D2
Week 1_ CR
Wood Wrote:
The minimum is the smallest value in a data set.
Ordering the data from least to greatest, we get:
18 27 58 62 64 67 68 70 74 78 78 79 82 82 82 83 83 85 93 93 93 96 98 99
So, the minimum is 18.
The first quartile (or lower quartile or 25th percentile) is the median of the bottom half of the numbers. So, to find the first quartile, we need to place the numbers in value order and find the bottom half.
18 27 58 62 64 67 68 70 74 78 78 79 82 82 82 83 83 85 93 93 93 96 98 99
So, the bottom half is
18 27 58 62 64 67 68 70 74 78 78 79
The median of these numbers is 67.5.
planation
The median is the middle number in a sorted list of numbers. So, to find the median, we need to place the numbers in value order and find the middle number.
Ordering the data from least to greatest, we get:
18 27 58 62 64 67 68 70 74 78 78 79 82 82 82 83 83 85 93 93 93 96 98 99
As you can see, we do not have just one middle number, but we have a pair of middle numbers, so the median is the average of these two numbers:
Median = 79 + 82 / 2 = 80.5
The third quartile (or upper quartile or 75th percentile) is the median of the upper half of the numbers. So, to find the third quartile, we need to place the numbers in value order and find the upper half.
18 27 58 62 64 67 68 70 74 78 78 79 82 82 82 83 83 85 93 93 93 96 98 99
So, the upper half is
82 82 82 83 83 85 93 93 93 96 98 99
The median of these numbers is 89.
The maximum is the greatest value in a data set.
Ordering the data from least to greatest, we get:
18 27 58 62 64 67 68 70 74 78 78 79 82 82 82 83 83 85 93 93 93 96 98 99
So, the maximum is 99.
Santos Wote:
The mean is 75.5
The standard deviation is 19.99
The 5 number summary is:
Minimum: 18
Q1: 67.75
Median: 80.5
Q3: 87
Maximum: 99
Does the data set have outliers? If so which one(s)?
Yes, the data set does have outliers and they are 18 and 27.
Would you prefer to use the mean or the medians this dataset measure of central tendency? Why?
Based on the data I would prefer to use the median over the mean as the favored central tendency because it is more representative of the data. Because the number of individuals with the higher average test scores show how serious, the test was taken or studied for. For those who studied and put in the work reflects on their scores and for those whom did not it show on the scores as well. It is also difficult to know why exactly the outliers are so low without having the knowledge of where the data came from or its origin. In the Plotbox we will see all the formulas and outliers as well.