hw8.pdf

Assignment 8 Due: 3/29 Note: Show all your work. Problem 1 (10 points). Consider the following transactional database for sequential pattern mining.

CID Day Items 1 1 B, D, H 14 A, C, D 24

31 B, D, F E, F, G

2 4 A, B, G, H 9 C, D, E, G 14 C, D, H

3 1 B, G, H 24 A, C, D, E 51 A, D, G, H

4 2 12

B, G A, B, C, H

25 B, C, D, E. G Determine the supports of the following sequences:

<{H}, {B}>, <{A, C}, {E}>, <{C}, {D, G}> Problem 2 (20 points). Consider the following contingency table.

C (buys coffee = Yes) C (buys coffee = No) T (buys tea = Yes) 337 582

T (buys tea = No) 729 648

(1). Compute the lift, all-confidence, cosine, Kulczynski and imbalance ratio measure, and

determine whether buying coffee and buying tea are positively correlated, negatively correlated, or not correlated.

(2). Perform the chi-square test with 5% significance level and determine whether they are correlated or not.

Submission Include all answers in a single file and name it LastName_FirstName_HW8.EXT. Here, “EXT” is an appropriate file extension (e.g., docx or pdf). If you have multiple files, then combine all files into a single archive file. Name the archive file as LastName_FirstName_HW8.EXT. Here, “EXT” is an appropriate archive file extension (e.g., zip or rar). Upload the file to Blackboard.