Data Mining Basic Assignment
1. The following attributes are measured for members of a herd of Asian elephants: weight, height, tusk length, trunk length, and ear area. Based on these measurements, what sort of similarity measure from Section 2.4 (measure of similarity and dissimilarity) would you use to compare or group these elephants? Justify your answer and explain any special circumstances.
2. Consider the training examples shown in Table 3.5 (please find the table from the Attached screenshot) for a binary classification problem.
(a) Compute the Gini index for the overall collection of training examples.
(b) Compute the Gini index for the Customer ID attribute.
(c) Compute the Gini index for the Gender attribute.
(d) Compute the Gini index for the Car Type attribute using multiway split.
3. Consider the data set shown in Table 4.9 (please find the table in the attached screenshot).
(a) Estimate the conditional probabilities for P(A|+), P(B|+), P(C|+), P(A|-), P(B|-), and P(C|-).
(b) Use the estimate of conditional probabilities given in the previous question to predict the class label for a test sample (A = 0, B = 1, C = 0) using the naıve Bayes approach.
(c) Estimate the conditional probabilities using the m-estimate approach, with p = 1/2 and m = 4.
6 years ago 10
Purchase the answer to view it
- DataMining.docx
- Consumer needs and wants are what drive marketers to succeed in selling their products or services through a variety of methods. Describe the process from consumer need to purchase behavior. Then, discuss how each of the following items impacts the proces
- ANT.101, WK4, DS1 Monumental Architecture
- Risk Management Help Pleaseeeeeeeeee
- Help with computer discussion homework
- i need it soon please
- Need a 2-3 pages paper by Friday
- lab report
- wizard kim
- You are assigned the task of computing the variable capital and labor costs for Cost Cutters production level. Below is...
- ACC 490 Week 3 Individual Assignment Ch. 5, 6, & 7 Textbook Exercises