Examining Data Formats
Part 1: Multiple Formats
Download the four zipcodeDemographics files. These files contain the same data but in different formats. Using a text editor examine the files and note differences in their structure. Perform some research to obtain brief definitions of each format, write your definitions stating your referenced source(s). Is any format easier to read compared to the others? If so, why?
Examine the file and characterize it (i.e., pick a format to analyze). Imagine that you must present your findings to an executive committee that wants to know what is important in the data. Prepare a single PowerPoint slide to succinctly characterize the data.
Part 2: Data Wrangling
Data wrangling is the process of manipulating data to change it into a format that can be used for analysis so that the data can be used in automated/computerized analysis tools. There are two major stages: data tidying and data cleaning. Data tidying is the process of transforming data so that it is in a format where it can be readily processed by automated tools. Data cleaning is the process of transforming the data to ensure to the greatest extent possible that it accurately reflects the subject that it pertains to.
Download the University returns_for_figshare_FINAL.csv file. Using a tool such as Excel examine the file. Identify as many issues as you can with the dataset. For issues that you identify develop a strategy (or strategies) as to how they can be dealt with, report the issues and strategies.
Submit an MS Word document with your findings from parts 1 & 2. Also, submit a printed copy at the beginning of the class on the date the assignment is due.
DOWN BELOW ARE THE FILES NEEDED FOR THE ASSIGNMENT!!!!!!!!!!
Attachments
- zipcodeDemographics.xml
- zipcodeDemographics.rdf
- zipcodeDemographics.json
- zipcodeDemographics.csv
- University returns_for_figshare_FINAL.csv
6 years ago
15
- Public Administration Event Analysis
- POL 443 Week 1-5 Complete Course grade A++ Wealth and Power in America.
- Interpreting Financial Results
- Behavior Change Assignment
- FIN 534 Week 1 Chapter 2 Solution 3#
- ENG 225 Week 2 -- Elements of Design Paper;
- BIS 375 Week 2 DQs
- BIS 303 Week 4 Individual Assignment - Internal and External Customers Paper
- ACC 460 Week 3 DQs
- Walgreens Case Study