Distributed Hadoop Mapreduce - Course recommendation
Title: Design and develop a Distributed Recommendation System on Hadoop
Problem statement:
Given 2 CSV data sets:
(a) A course dataset containing details of courses offered
(b) A job description dataset containing a list of job descriptions
(Note: Each field of a job description record is demarcated by " ")
You have to design and implement a distributed recommendation system using the data sets, which will recommend the best courses for up-skilling based on a given job description. You can use the data set to train the system and pick some job descriptions not in the training set to test.
It is left up to you how you pick necessary features and build the training that creates matching courses for job profiles.
Use Map Reduce and Python
NOTE: Combine all data_job_posts.csv into a single CSV file. Due to size limitation, I had to split the file
4 years ago
100
Purchase the answer to view it

- Hadoop-recomendationsystemUpdated.zip
- Assignment 2: Required Assignment 2—Genesis Energy Capital Plan
- There are 4 aces and 4 kings in a standard deck of 52 cards. You pick one card at rendom....
- Can someone do this for me?
- What external role players need info on the growth of an organisation?
- ACC 205 Week 5 Journal Most Important Ratio Journal
- The bussiness law
- do you know the answer
- ACC 290 Week 3 WileyPLUS Assignment Week Three
- Warehouse-1 Solution
- tests with a very short description Solution