Project final

profilemezz
BigDataProject-Template2.docx

https://www.seu.edu.sa/sites/ar/SitePages/images/logo.png

College of Computing and Informatics

2020/2021 Second Semester

Course Code

DS520

Course Name

Big Data Processing and Analytics

CRN

Assignment type

Critical Thinking Project

Module

All modules

Assignment Points

10

Student ID

Student Name

Project Template

Task 1:

1.1 Literature Review:

-----------------------------------------------------------

1.2 References:

-----------------------------------------------------------

Task 2:

2.1 Introduction

Provide a short description of your project and an overview about the data you are analysing.

2.2 Body section

2.2.1 Data

This section should include a description of the data being analyse (include number of samples in the dataset, features and their types, descriptive statistics of the data, etc).

2.2.2 Steps:

In this section, write the steps and commands you used to import the data and.

Task 3:

3.1 MapReduce Algorithm (Comment your Code)

Write the complete code you applied.

3.2 Results

Include a written description of the statistical results, and its meaning based on the dataset you have chosen.

Task 4:

4.1 Steps:

In this section, write the steps and commands you used to import the data and.

Task 5:

5.1 Applied Queries on MongoDB

Write the complete code you applied with describing the function of each query.

5.2 Results

Include a written description of the results. Discuss the meaning of the results based on the data set.

Task 6:

6.1 Applied Code on Hive/Pig

Write the complete code you applied with describing the function of each query.

6.2 Results

Include a written description of the results. Discuss the meaning of the results based on the data set.

Task 7:

6.1 Applied Code on SparkSQL

Write the complete code you applied with describing the function of each query.

6.2 Results

Include a written description of the results. Discuss the meaning of the results based on the data set (Include visualization of the results) Figures must be added.

Task 8:

8.1 Applied Code on Spark (Using MLib)

Write the complete code you applied with describing the machine learning algorithm and why you choose it.

8.2 Results

Include a written description of the results. Discuss the meaning of the results based on the data set.

Conclusion

Restate the main results of your analysis and provide any future recommendations.