InfoTech
Subject Name: - Data Science and Big Data Analytics
Textbook Name: - EMC Education Services (Eds.). (2015). Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing, and Presenting Data. Indianapolis, IN: John Wiley & Sons, Inc.
R for Data Science, Garrett Grolemund & Hadley Wickham https://r4ds.had.co.nz/introduction.html
Question 1: -
Review the Data Analytics Lifecycle in the text.
a. Summarize your understanding of how potential data sources are identified during the Discovery phase.
b. Give your opinion on the point at which the team has enough information to create an analytics plan.
Note: Use your own words, describe your own preferences, and cite and reference as appropriate.
Question 2: -
Review the information on R in this week's readings, lecture, and supplemental material.
a. Describe what you believe are the pros and cons of using R rather than Python, another popular tool.
b. Discuss how R and Python differ from tools used for data preparation, such as Hadoop, Alpine Miner, OpenRefine, and Data Wrangler.
Note: Use your own words, describe your own preferences, and cite and reference as appropriate.