Analyzing and Vizualizing Studio
ITS 530 Midterm
Name: _______________________________
Email Address: _________________________
Student ID: ____________________________
Section A
Explain if the following statements are true or false in details with scholar references.
1. All data science investigations start with an existing dataset. [10 points]
2. Data scientists do most of their work in Python and are unlikely to use other tools. [10 points]
3. Most data scientists spend the majority of their time developing new models. [10 points]
4. The use of historical data to make decisions about the future can reinforce historical biases. [10 points]
Section B
Each question in this section must be answered in detail (at least 2 paragraph) using references (APA style)
1. Explain with references the differences between Information visualization and visual analytics. (2 paragraphs) [10 points]
2. Why would you want to use data visualization for Big Data? (2 paragraphs) [10 points]
3. Data used in visual design comes in a “messy” form and as a result the data must go through a data cleansing process.
a. Explain what is a “messy” data. (2 paragraphs) [10 points]
b. Explain why a data may come in a “messy” data format. (2 paragraphs) [10 points]
c. Explain methods that maybe used to cleansed the data. (2 paragraphs) [10 points]
i. Give examples of data sets that lack structure. (1 paragraphs) [5 points]
ii. Give examples of data sets that includes structure. (1 paragraphs) [5 points]