python data analysis project
Introduction
Use five-fold mode and separate the data into training and testing sets.
Data Dictionary description
Variable Definition Keysurvival Survival 0 = No, 1 = Yespclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rdsex Sex Age Age in years sibsp # of siblings/spousesparch # of parents/childrenfare Passenger fare embarked Port of Embarkation C = Cherbourg, Q = Queenstown, S = Southampton Variable Notes
pclass: A proxy for socio-economic status (SES)1st = Upper2nd = Middle3rd = Lowerage: Age is fractional if less than 1. sibsp: The dataset defines family relations in this way...Sibling = brother, sister, stepbrother, stepsisterSpouse = husband, wife (mistresses and fiancés were ignored)parch: The dataset defines family relations in this way...Parent = mother, fatherChild = daughter, son, stepdaughter, stepsonSome children travelled only with a nanny, therefore parch=0 for them. Analysis requirement:
1. How is the survival chance related to the gender?2. How is the survival chance related to age?3. How is the survival chance related to socio-economic status?5. What are the first three most important factors related to the survival chance?6. What is average prediction accuracy?
9 years ago 30
Answer(0)
Bids(0)
other Questions(10)
- Process safety unit viii
- PHI-413V Topic 1 DQ 1
- Lindo Optimization HW *NEED IN 10 HOURS*
- Problem Description The previous assignment assumes its input includes spaces between every token in an expression, but many programmers tend not...
- Assignment 2: Income Support Policies
- Week 2 Assignment
- Human rights assignment
- 300 words HW
- Seminar
- Project