R studio
a year ago
20
HW5.pdf
HW5.pdf
MthStat 568/768 �Multivariate Statistical Analysis �Spring 2025
Homework 5
Due Wednesday, April 9
1. Consider the spambase data set, where emails are classi�ed as spam or not, and 57 feature variables are measured on each of them (see full description on p. 259 of the book).
(a) Split the data set into training and test sets (roughly a 70/30 split). Com- pute a logistic classi�er using the training data. (There might be perfect separation between the groups, but that should not matter as long as you don�t get NA coe¢ cients.)
(b) Find the misclassi�cation table for the test data and compute the mis- classi�cation rate.
2. Consider the pendigits data set, which are samples of handwritten digits 0; 1; : : : ; 9. The feature variables in this case are the (x; y) coordinates of the pen tip, dis- cretized at eight time points (see section 7.2.1 of the book for more details).
(a) Split the data set into training and test sets (roughly a 70/30 split). Com- pute the multinomial logistic classi�er using the training data.
(b) Construct the misclassi�cation table for the test data and compute the misclassi�cation rate. Which digit seems to be the hardest to classify correctly?
- BGMT Disc ***Professor Anthony ONLY***
- shift in supply and demand
- COM 323 Week 2 Persuasion, Manipulation, and Seduction
- HCS 438 Week 4 Individual Assignment - Analysis of Research Report Paper
- The Green Buffet has sales of 428,000, depreciation of 26,500, interest of 1,800, net income of 21,400, and a tax...
- Determine the basic components of a strategic information system (IT) plan within health care organizations. Next, specify the main roles of leadership team—including Chief Information Officer (CIO) and Chief Financial Officer (CFO)—in the process of IT a
- eco 372 week 5 International Trade Speech Assume that the team has been appointed as speech writers for the Speaker of the House. The team must write a speech which the Speaker must deliver about the current state of the U.S. macroeconomy to a number of a
- operation management
- PSY 103 Learning Experience
- laws and regulations in health care