Project proposal
Students will turn in a 500-word final project proposal detailing the question or hypothesis they wish to explore, the dataset(s) they intend to analyze, and an outline of the data methods and visualization strategies they intend to use in analyzing and presenting the data.
Final project: use data to examine a question of social relevance
The project will be displayed either in R Markdown or in Tableau, and should present a complete analysis of a question of social relevance through data. The exact format of the presentation can be determined in consultation with the instructors, but it should display (a) the ability to formulate an interesting and important research question of relevance to an identified audience; (b) competence in finding and obtaining dataset(s) appropriate to the question; (c) rigorous, accurate, and appropriate analysis of the data, showing sophistication and depth; and (d) creative, appealing, and informative use of data visualization to display the results.
Datasets for Video Game Sales:
https://www.kaggle.com/sidtwr/videogames-sales-dataset
https://www.kaggle.com/arslanali4343/sales-of-video-games
Explanations for variables in these datasets:
Rank - Ranking of overall sales
Name - The game's name
Platform - Platform of the game's release (e.g., PC, PS4)
Year - Year of the game's release
Genre - Genre of the game
Publisher - Publisher of the game
NA_Sales - Sales in North America (in millions)
EU_Sales - Sales in Europe (in millions)
JP_Sales - Sales in Japan (in millions)
Other_Sales - Sales in the rest of the world (in millions)
Global_Sales - Total worldwide sales (in millions)
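As a quick sanity check on the column layout listed above, the following minimal Python sketch parses rows with the standard csv module and verifies that Global_Sales roughly equals the sum of the four regional columns. The two rows are hypothetical placeholders, not real sales figures. The "N/A" year value, the helper names, and the 0.02 rounding tolerance are assumptions made for illustration; the actual files may encode missing values differently.

```python
import csv
import io

# Hypothetical placeholder rows in the column layout described above.
# These are NOT real sales figures.
SAMPLE_CSV = """Rank,Name,Platform,Year,Genre,Publisher,NA_Sales,EU_Sales,JP_Sales,Other_Sales,Global_Sales
1,Game A,PS4,2015,Sports,Publisher X,2.0,1.5,0.1,0.4,4.0
2,Game B,PC,N/A,Shooter,Publisher Y,1.0,0.5,0.0,0.2,1.7
"""

REGION_COLUMNS = ["NA_Sales", "EU_Sales", "JP_Sales", "Other_Sales"]


def parse_row(row):
    """Convert numeric columns of one CSV row; a non-numeric Year becomes None."""
    parsed = dict(row)
    parsed["Rank"] = int(row["Rank"])
    year_text = row["Year"].strip()
    # Assumption: missing years appear as non-numeric text such as "N/A".
    is_numeric = year_text.replace(".", "", 1).isdigit()
    parsed["Year"] = int(float(year_text)) if is_numeric else None
    for column in REGION_COLUMNS + ["Global_Sales"]:
        parsed[column] = float(row[column])
    return parsed


def load_games(file_obj):
    """Read every row from an open CSV file object into a list of dicts."""
    return [parse_row(row) for row in csv.DictReader(file_obj)]


def regional_total_matches(game, tolerance=0.02):
    """Check that the regional sales add up to Global_Sales, within rounding."""
    regional_total = sum(game[column] for column in REGION_COLUMNS)
    return abs(regional_total - game["Global_Sales"]) <= tolerance


games = load_games(io.StringIO(SAMPLE_CSV))
mismatches = [g["Name"] for g in games if not regional_total_matches(g)]
print(f"Loaded {len(games)} rows; rows with mismatched totals: {mismatches}")
```

To run this against a downloaded file, the io.StringIO call could be replaced with an open file handle, for example open("vgsales.csv", newline=""), where the file name is a hypothetical stand-in for whichever CSV the Kaggle pages provide.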
Hypotheses:
H1: Sports games are more popular than shooter games in North America.
H2: Role-playing games are the most popular genre (have the highest sales) in Japan.
H3: Action games have become increasingly popular in recent years (year after year).
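One possible way to examine these hypotheses is to aggregate sales by genre and by year. The sketch below does this in plain Python over a handful of hypothetical placeholder rows (not real sales figures), so its results say nothing about the actual data. It assumes that popularity is measured by total sales in millions and that the genre labels are spelled "Sports", "Shooter", "Role-Playing", and "Action"; both assumptions should be checked against the datasets.

```python
from collections import defaultdict

# Hypothetical placeholder rows; these are NOT real sales figures.
games = [
    {"Genre": "Sports", "Year": 2014, "NA_Sales": 2.0, "JP_Sales": 0.1, "Global_Sales": 4.0},
    {"Genre": "Shooter", "Year": 2014, "NA_Sales": 1.0, "JP_Sales": 0.0, "Global_Sales": 1.7},
    {"Genre": "Role-Playing", "Year": 2015, "NA_Sales": 0.5, "JP_Sales": 1.2, "Global_Sales": 2.0},
    {"Genre": "Action", "Year": 2014, "NA_Sales": 0.8, "JP_Sales": 0.2, "Global_Sales": 1.5},
    {"Genre": "Action", "Year": 2015, "NA_Sales": 1.1, "JP_Sales": 0.3, "Global_Sales": 2.1},
]


def sales_by_genre(rows, sales_column):
    """Sum one sales column (in millions) for each genre."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["Genre"]] += row[sales_column]
    return dict(totals)


def yearly_sales_for_genre(rows, genre, sales_column="Global_Sales"):
    """Sum one sales column per release year for a single genre, ordered by year."""
    totals = defaultdict(float)
    for row in rows:
        # Rows without a known release year are skipped.
        if row["Genre"] == genre and row["Year"] is not None:
            totals[row["Year"]] += row[sales_column]
    return dict(sorted(totals.items()))


# H1: compare total North American sales of sports and shooter games.
na_totals = sales_by_genre(games, "NA_Sales")
h1_supported = na_totals["Sports"] > na_totals["Shooter"]

# H2: find the genre with the highest total sales in Japan.
jp_totals = sales_by_genre(games, "JP_Sales")
top_genre_in_japan = max(jp_totals, key=jp_totals.get)

# H3: look at how action game sales change from one release year to the next.
action_by_year = yearly_sales_for_genre(games, "Action")

print(h1_supported, top_genre_in_japan, action_by_year)
```

Total sales is only one possible measure of popularity. Counting titles per genre or averaging sales per title would be alternatives, and H3 in particular may need care because the Year column records a game's release year rather than the year in which its sales occurred.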
Explanations for each hypothesis:
Outline of the data methods:
Visualization strategies:
Charts/plots/gapminder/Scatterplot
Number
Length
Distance
Size: area and volume
Color
Saturation
Small multiples / “facets”