996sc
Assignment 8 Due: 3/29 Note: Show all your work. Problem 1 (10 points). Consider the following transactional database for sequential pattern mining.
CID Day Items 1 1 B, D, H 14 A, C, D 24
31 B, D, F E, F, G
2 4 A, B, G, H 9 C, D, E, G 14 C, D, H
3 1 B, G, H 24 A, C, D, E 51 A, D, G, H
4 2 12
B, G A, B, C, H
25 B, C, D, E. G Determine the supports of the following sequences:
<{H}, {B}>, <{A, C}, {E}>, <{C}, {D, G}> Problem 2 (20 points). Consider the following contingency table.
C (buys coffee = Yes) C (buys coffee = No) T (buys tea = Yes) 337 582
T (buys tea = No) 729 648
(1). Compute the lift, all-confidence, cosine, Kulczynski and imbalance ratio measure, and
determine whether buying coffee and buying tea are positively correlated, negatively correlated, or not correlated.
(2). Perform the chi-square test with 5% significance level and determine whether they are correlated or not.
Submission Include all answers in a single file and name it LastName_FirstName_HW8.EXT. Here, “EXT” is an appropriate file extension (e.g., docx or pdf). If you have multiple files, then combine all files into a single archive file. Name the archive file as LastName_FirstName_HW8.EXT. Here, “EXT” is an appropriate archive file extension (e.g., zip or rar). Upload the file to Blackboard.