Data Wrangling
For this portion of the project, you will examine your dataset for incorrect data. Any incorrect data should be removed, corrected, or imputed. Follow these steps:
- Remove irrelevant data. If you are unsure whether something is relevant, keep it.
- Remove duplicate records.
- Make sure numbers are interpreted as numerical data types.
- Fix typos.
- Standardize values (for example, consistent capitalization, spacing, and units).
- Investigate outliers.
- Check and manage missing values.
- Format and normalize data if needed.
- Change categorical values into numbers if needed.
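The steps above can be sketched in Python with pandas. This is only a minimal illustration under stated assumptions, not a required method: the toy dataset, its column names (`id`, `city`, `age`, `income`, `notes`), the `wrangle` helper, the typo map, the 1.5 × IQR outlier rule, median imputation, and min-max scaling are all choices made for the example and should be adapted to your own data.

```python
import pandas as pd

# Hypothetical toy dataset; every column name and value is invented for illustration.
raw = pd.DataFrame({
    "id": [1, 2, 2, 3, 4, 5],
    "city": ["Boston", "boston ", "boston ", "Bostn", "Denver", "Denver"],
    "age": ["34", "41", "41", "29", None, "38"],
    "income": [57000.0, 61000.0, 61000.0, 58000.0, 60000.0, 990000.0],
    "notes": ["a", "b", "b", "c", "d", "e"],
})


def wrangle(df):
    """Apply the pre-processing steps in order (hypothetical helper)."""
    # 1. Remove irrelevant data (assumes "notes" is not needed for the analysis).
    df = df.drop(columns=["notes"])

    # 2. Remove duplicate records.
    df = df.drop_duplicates().reset_index(drop=True)

    # 3. Make sure numbers are numeric; unparseable values become NaN.
    df["age"] = pd.to_numeric(df["age"], errors="coerce")

    # 4-5. Standardize spacing and capitalization first, so the typo map
    #      only needs to list misspellings in their standardized form.
    df["city"] = df["city"].str.strip().str.title()
    df["city"] = df["city"].replace({"Bostn": "Boston"})

    # 6. Investigate outliers: flag (do not delete) values outside 1.5 * IQR.
    q1, q3 = df["income"].quantile([0.25, 0.75])
    iqr = q3 - q1
    df["income_outlier"] = (df["income"] < q1 - 1.5 * iqr) | (
        df["income"] > q3 + 1.5 * iqr
    )

    # 7. Manage missing values: impute age with the median (one option of many).
    df["age"] = df["age"].fillna(df["age"].median())

    # 8. Normalize: min-max scale income to the range [0, 1].
    span = df["income"].max() - df["income"].min()
    df["income_scaled"] = (df["income"] - df["income"].min()) / span

    # 9. Change categorical values into integer codes.
    df["city_code"] = df["city"].astype("category").cat.codes
    return df


clean = wrangle(raw)
print(clean)
```

Flagging outliers in a separate column, rather than dropping them, keeps the decision visible so it can be justified in the written summary.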
Once you have completed these steps, provide a Word document summarizing the pre-processing steps you performed on your dataset.