Data science and big data

profileharsha518
Lab3-K-Means-R.docx

Advanced Analytics – Theory and Methods

Advanced Analytics – Theory and Methods-Lab#3

Narender Reddy Kudumula

University of Cumberlands

Data Science & Big Data Analysis (ITS-836)

Prof. Dr. Gasan Elkhodari

09/29/2019

Advanced Analytics – Theory and Methods-Lab#3

1) Study the learning materials “Advanced Analytics – Theory and Methods.

2) Find the included “Data Set” under this week’s content “grades_km_input.csv”

3) Using R, find the optimum number of clusters “K” by using “Elbows” algorithm.

4) Find and display the students’ Cluster Means for English, Math and Science.

5) Discuss the results.

I can be seen, the WCSS is greatly reduced when i increases from one to two. Another substantial reduction in WCSS occurs at i = 3. However, the improvement in WCSS is fairly linear for i > 3. Therefore, the k-means analysis will be conducted for i = 3. The process of identifying the appropriate value of i is referred to as finding the “elbow” of the WCSS curve.