Data mining assignment
Reading: Tan, Steinbach, & Kumar, Introduction to Data Mining - https://www.pearsonhighered.com/assets/preface/0/1/3/3/0133128903.pdf
1. The following attributes are measured for members of a herd of Asian elephants: weight, height, tusk length, trunk length, and ear area. Based on these measurements, what sort of similarity measure from Section 2.4 (measure of similarity and dissimilarity) would you use to compare or group these elephants? Justify your answer and explain any special circumstances. (Chapter 2)
2. Consider the training examples shown in Table 3.5 (page 185) for a binary classification problem. (Chapter 3)
(a) Compute the Gini index for the overall collection of training examples.
(b) Compute the Gini index for the Customer ID attribute.
(c) Compute the Gini index for the Gender attribute.
(d) Compute the Gini index for the Car Type attribute using multiway split.
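As a way to check hand calculations for question 2, the Gini index can be sketched in a few lines of Python. This is a minimal sketch, assuming the standard definition (1 minus the sum of squared class proportions) and a size-weighted average over child nodes for a multiway split. The function names `gini` and `gini_split` and the toy data are hypothetical; the data is not Table 3.5.

```python
from collections import Counter


def gini(labels):
    """Gini index of a collection of class labels: 1 - sum(p_i ** 2)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def gini_split(values, labels):
    """Weighted Gini index of a multiway split on one attribute.

    Each distinct attribute value forms one child node; the result is the
    average of the children's Gini indices, weighted by child size.
    """
    n = len(labels)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    return sum(len(group) / n * gini(group) for group in groups.values())


# Hypothetical toy data for illustration only (NOT the contents of Table 3.5).
toy_labels = ["C0", "C0", "C1", "C1"]
toy_attr = ["M", "M", "F", "F"]      # separates the two classes perfectly
toy_ids = [1, 2, 3, 4]               # unique per record, like an ID column

overall = gini(toy_labels)
by_attr = gini_split(toy_attr, toy_labels)
by_id = gini_split(toy_ids, toy_labels)
```

On the toy data, an attribute with a unique value per record puts every record in its own pure child node, which is worth keeping in mind when interpreting part (b).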
3. Consider the data set shown in Table 4.9 (page 348). (Chapter 4)
(a) Estimate the conditional probabilities for P(A|+), P(B|+), P(C|+), P(A|-), P(B|-), and P(C|-).
(b) Use the estimates of the conditional probabilities from part (a) to predict the class label for a test sample (A = 0, B = 1, C = 0) using the naïve Bayes approach.
(c) Estimate the conditional probabilities using the m-estimate approach, with p = 1/2 and m = 4.
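The calculations in question 3 can be checked with a short Python sketch. It assumes the usual m-estimate form, (n_c + m * p) / (n + m), where n is the number of training records in the class and n_c is the number of those records with the given attribute value, and it assumes the naïve Bayes score for a class is the class prior times the product of the conditional probabilities. The function names and the counts below are hypothetical; they are not taken from Table 4.9.

```python
from math import prod


def m_estimate(n_c, n, m, p):
    """m-estimate of a conditional probability: (n_c + m * p) / (n + m)."""
    return (n_c + m * p) / (n + m)


def naive_bayes_score(prior, conditionals):
    """Unnormalized naive Bayes score: prior times the product of P(x_i | class)."""
    return prior * prod(conditionals)


# Hypothetical counts for illustration only (NOT the contents of Table 4.9):
# 3 of 5 records in a class have the attribute value of interest.
smoothed = m_estimate(n_c=3, n=5, m=4, p=0.5)

# With a zero count the m-estimate stays strictly positive,
# unlike the raw fraction 0 / 5.
smoothed_zero = m_estimate(n_c=0, n=5, m=4, p=0.5)

# Hypothetical prior and conditionals for one class.
score = naive_bayes_score(0.5, [0.6, 0.2, 0.4])
```

To predict a label, compute the score for each class from the table's actual counts and choose the class with the larger score.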