HW
1. For a binary classification problem, describe the possible values of entropy. Under what conditions does entropy reach its minimum and maximum values?
2. In a decision tree, how does the algorithm pick the attributes for splitting?
3. John went to see the doctor about a severe headache. The doctor selected John at random to have a blood test for swine flu, which is suspected to affect 1 in 5,000 people in this country. The test is 99% accurate, in the sense that the probability of a false positive is 1%. The probability of a false negative is zero. John's test came back positive. What is the probability that John has swine flu?
4. Which classifier is considered computationally efficient for high-dimensional problems? Why?
5. A data science team is working on a classification problem in which the dataset contains many correlated variables, and most of them are categorical variables. Which classifier should the team consider using? Why?
6. A data science team is working on a classification problem in which the dataset contains many correlated variables, and most of them are continuous. The team wants the model to output the probabilities in addition to the class labels. Which classifier should the team consider using? Why?
7. Why use autocorrelation instead of autocovariance when examining stationary time series?
8. Provide an example showing that cov(X, Y) = 0 does not necessarily imply that the two random variables X and Y are independent.
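
The quantities asked about in questions 1, 3, and 8 can be checked numerically. The following is a minimal Python sketch, not a model answer: the function names (binary_entropy, posterior_positive, covariance) are hypothetical helpers introduced here for illustration. It assumes entropy is measured in bits, and that X in question 8 is uniform on {-1, 0, 1} with Y = X^2, which is one common choice of counterexample.

```python
import math


def binary_entropy(p):
    """Entropy in bits of a binary label with P(positive class) = p."""
    if p in (0.0, 1.0):
        return 0.0  # a pure node carries no uncertainty
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def posterior_positive(prior, false_positive_rate, false_negative_rate=0.0):
    """P(condition | positive test) from Bayes' theorem."""
    sensitivity = 1.0 - false_negative_rate
    p_positive = sensitivity * prior + false_positive_rate * (1.0 - prior)
    return sensitivity * prior / p_positive


def covariance(xs, ys):
    """Population covariance of equally likely paired outcomes."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n


if __name__ == "__main__":
    # Question 1: entropy at a pure split and at an even split.
    print(binary_entropy(0.0), binary_entropy(0.5), binary_entropy(1.0))

    # Question 3: prior of 1 in 5,000, 1% false positives, no false negatives.
    print(posterior_positive(1 / 5000, 0.01))

    # Question 8: Y is a function of X, yet the covariance is zero.
    xs = [-1, 0, 1]
    ys = [x * x for x in xs]
    print(covariance(xs, ys))
```

The posterior in question 3 comes out small because the false positives among the 4,999 unaffected people far outnumber the single true positive expected per 5,000 tested.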
