data science
(a) Read the notes on classification.
(b) Modify the classification code to run the same analysis but on the data set titanic.csv Download titanic.csv .
Some information on the data set: - data on the Titanic disaster - The goal is to predict if a given person survived or not the Titanic disaster. - each row represents one person (passenger) on the Titanic - target: survived (0), did not survive (1). This is the first column. - features: Pclass, Sex, Age, Siblings/Spouses Aboard, Parents/Children Aboard, Fare
Use the same classification algorithm GaussianNB, and split the data set into 700 rows for training, and the rest for testing. Use a random seed equal to 12.
(c) Repeat the previous step, but use the following classification algorithms. You can learn how to use these classifiers on the scikit-learn website.
KNeighborsClassifier RandomForestClassifier ExtraTreesClassifier SVC
(d) Upload your entire code to Canvas in a Python file, i.e. the extension of the file should be py.