data storage (project proposal)

Project Proposal

Project Proposal and Final Project Overview

During this course, you will complete a data mining proposal and final project. Below is a list of sample projects you may choose from. Instructions for the Project Proposal follow the samples. The Project Proposal is due in Module 5, and the final project is due in Module 10. An overview of the Final Project is also provided here for your review.

Sample Data Projects:

· Overall Car Evaluation Based On Various Specifications

· Personal Spam Filter using Decision Trees

· Predicting Favored Contraceptive Methods

· Negative Effect of Pruning under Low Recall

· Objective Empirical Comparison of Decision Tree Classifiers

· Who is Most Likely to Smoke

· Importance of Attribute Replacement to Misclassification Rate Reduction

· Comparative Study of C4.5, C5.0, and SAS Enterprise Miner

· Associative Analysis of Caffeine Intake and Lifestyle of Fordham University Students

· Efficient Market Theory versus Enterprise Miner

· Gender and Generations: predicting Age and Income based on Social and Political Opinions

· Mining Major League Baseball

· Predicting Breast Cancer Recurrence with Data Mining

· A Comparison of Selected Data Mining Algorithms

· Predicting Quantifiable Forest Fires using Data Mining

· Scooby-Doo: Where are You? : Examining the Dogs of New York City

· Drafting the Perfect Running Backs - Predicting Seasonal Performance Changes

· Predicting Meteorite Landings

· Predicting who Survived on the Titanic

· Micro-Loans: Predicting if Loan is Approved and Predicting Interest Rate of Loan

· Are there differences between religious and non-religious US Universities?

· Predicting Shelter Animal Adoption: a look at characteristics that predict pet adoption

· Predicting AirBnb Prices

· Predicting Undergraduate Student GPA

· Classifying Myoelectric Signals into Hand Movements for Control of a 3D Printed Hand

· Predicting E-sport Team Performance at the League of Legends World Championship

· IMDB Movie Review Sentiment Analysis

Project Proposal

Your project proposal must be typed, and it should be approximately one page long, single spaced. The purpose of the proposal is to make sure that you are on the right track and to give me enough information so that I can give you useful feedback.

You may work alone on this project or with another student. You do not need my permission to work with one other student, but if you want to work with two or more other students, you must get my approval (which I will only grant for very ambitious projects). If you wish to work in a group, you must determine your group members by the time you submit your proposal in Module 5. Each member of the group must submit both the proposal and the final project individually. If you need to request permission for a larger group, please reach out to me via your NLU email no later than Module 4 so that you have time to choose another topic if necessary.

Provide the following items in your proposal:

· Preliminary title and list of students working on the project (if you are working with someone else)

· Abstract: This should be similar to the abstract that will ultimately appear in your paper. It should be one paragraph long, but for now perhaps only 5-15 lines. It should provide a high-level summary of your project and outline your main goals.

· Brief description of what you plan to do

· What problem are you trying to solve?

· How do you formulate the problem as a data mining problem? (E.g., is it classification, association rule mining, etc.?) What exactly are you trying to predict (for prediction tasks) and how will you evaluate your results? How will you know if your results are good? What can you compare them to? It is critical that your problem is well defined.

· What data sets do you plan to use? If you must do significant work to get the data or convert it into the proper format, then describe the process and approximate effort required. How many examples are in the data set? How many features?
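To illustrate the last two items, here is a minimal Python sketch of one way to report the size of a data set and to compute a simple point of comparison for a classification task: the accuracy you would get by always predicting the most common class. The toy records, the "smoker" label, and the function names are hypothetical and only for illustration; your own data, task, and evaluation metric may call for something different.

```python
from collections import Counter


def describe_dataset(rows, label_key):
    """Return (number of examples, number of features), not counting the label column."""
    n_examples = len(rows)
    n_features = len([key for key in rows[0] if key != label_key]) if rows else 0
    return n_examples, n_features


def majority_baseline_accuracy(labels):
    """Accuracy obtained by always predicting the most common class."""
    if not labels:
        raise ValueError("labels must not be empty")
    most_common_count = Counter(labels).most_common(1)[0][1]
    return most_common_count / len(labels)


# Hypothetical toy data; replace with your own data set.
rows = [
    {"age": 34, "income": 52000, "smoker": "no"},
    {"age": 51, "income": 61000, "smoker": "no"},
    {"age": 29, "income": 38000, "smoker": "yes"},
    {"age": 45, "income": 47000, "smoker": "no"},
]
labels = [row["smoker"] for row in rows]

n_examples, n_features = describe_dataset(rows, "smoker")
baseline = majority_baseline_accuracy(labels)
print(f"{n_examples} examples, {n_features} features")
print(f"Majority-class baseline accuracy: {baseline:.2f}")
```

A model that does not clearly beat this kind of baseline is probably not producing good results, so stating the baseline in your proposal gives you something concrete to compare against.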

Your proposal is due in Module 5 by 11:59 PM CT on Sunday.

Final Project

The actual write-up of your project paper should be roughly three to seven pages, single spaced. The paper need not be organized exactly as described below, but it should be quite similar, since the outline below is fairly standard.

· Abstract: This should summarize the paper and the goals of the work (required, approximately 500 words).

· Introduction: Introduce the project and what you are trying to do. This may include some background.

· Background: Depending on the project and how much background you want to include, you may want a separate background section. For example, this section may provide domain information for the domain that you are studying. If the domain is not complex, then this section may not be needed.

· Experiment Methodology: Describe the experiments and the experiment methodology. Describe the data sets, evaluation metrics, data mining algorithms used, the precise methodology related to the setup of experiments, and any other details related to the experiments. There will usually be a subsection for each of the sub-topics just mentioned.

· Results: Present the experiment results and a discussion and analysis of the results. Normally, a separate discussion section is not necessary.

· Conclusion: Provide your conclusion (perhaps summarize your main results). Normally, it will also discuss limitations and avenues for future work.

Your final project is due in Module 10 by 11:59 PM CT on Sunday.