Statistics Project - Help!
Hello,
I am seeking assistance with a statistics project.
3 years ago
NationalSummaryStatisticsandGraphsRealEstateData2.pdf
MAT240Module4ProjectOneVideo.htm
MAT240ProjectOneTemplate.docx
MAT240ProjectOneGuidelinesandRubric.docx
- MAT240RealEstateData1.xlsx
NationalSummaryStatisticsandGraphsRealEstateData2.pdf
Summary Statistics for MAT 240 Real Estate Data (for dataset in Modules 2, 3, and 4)
Variable                  n      Mean     Median   Std. Dev.  Min      Q1       Q3       Max
Listing price ($)         1,000  342,365  318,000  125,914    135,300  265,250  381,600  987,600
Cost per square foot ($)  1,000  169      166      41         71       139      191      344
Square feet               1,000  2,111    1,881    921        1,101    1,626    2,215    6,516
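One way to read the table before looking at the histograms is to compare each mean with its median and to check the minimum and maximum against outlier fences. The sketch below does that in Python using only the figures from the table. The 1.5 x IQR fence rule is a common convention assumed here, not something the course materials specify, and the names `national`, `iqr_fences`, and `describe` are illustrative.

```python
# National summary statistics copied from the table above.
national = {
    "Listing price ($)": {
        "mean": 342_365, "median": 318_000,
        "min": 135_300, "q1": 265_250, "q3": 381_600, "max": 987_600,
    },
    "Cost per square foot ($)": {
        "mean": 169, "median": 166,
        "min": 71, "q1": 139, "q3": 191, "max": 344,
    },
    "Square feet": {
        "mean": 2_111, "median": 1_881,
        "min": 1_101, "q1": 1_626, "q3": 2_215, "max": 6_516,
    },
}


def iqr_fences(q1, q3):
    """Return (lower, upper) fences using the 1.5 x IQR rule (an assumed convention)."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def describe(stats):
    """Summarize spread and flag possible outliers and skew for one variable."""
    lower, upper = iqr_fences(stats["q1"], stats["q3"])
    return {
        "iqr": stats["q3"] - stats["q1"],
        "lower_fence": lower,
        "upper_fence": upper,
        "has_low_outliers": stats["min"] < lower,
        "has_high_outliers": stats["max"] > upper,
        "mean_above_median": stats["mean"] > stats["median"],
    }


if __name__ == "__main__":
    for name, stats in national.items():
        print(name, describe(stats))
```

Under this assumption, all three variables have a maximum above the upper fence and a mean above the median, which is consistent with right-skewed distributions.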
This graph shows the frequency for listing price.
This graph shows the frequency for square feet.
MAT240ProjectOneTemplate.docx
Median Housing Price Model for D. M. Pan National Real Estate Company
[ Note: To complete this template, replace the bracketed text with your own content. Remove this note before you submit your outline.]
Report: Housing Price Prediction Model for D. M. Pan National Real Estate Company
[Your Name]
Southern New Hampshire University
Introduction
[ Describe the report: Define the question your report is trying to answer.]
[ Describe the report: Explain when using linear regression is the most appropriate.]
[ Describe the report: Explain what you would expect the scatterplot to look like when linear regression is appropriate.]
[ Describe the report: Explain the difference between predictor (x) and response (y) variables in a linear regression to justify the selection of variables.]
Data Collection
[ Sampling the data: Select a random sample of 50 houses. Describe how you obtained your sample data (provide Excel formulas as appropriate).]
[ Sampling the data: Identify your predictor and response variables.]
[ Scatterplot: Create and insert a correctly labeled scatterplot of your predictor and response variables to ensure they are appropriate for developing a linear model.]
Data Analysis
[ Histogram: Create and insert a histogram for the first variable. Be sure to include appropriate labels.]
[ Histogram: Create and insert a histogram for the second variable. Be sure to include appropriate labels.]
[ Summary statistics: Create and insert a table to show the summary statistics (mean, median, standard deviation) for both variables.]
[ Interpret the graphs and statistics: Interpret the center, spread, shape, and any unusual characteristic (outliers, gaps, etc.) for house sales and square footage.]
[ Interpret the graphs and statistics: Compare and contrast center, spread, shape, and any unusual characteristic for your sample of house sales with the national population. Also, determine whether your sample is representative of the national housing market sales. Note: In the learning management system, under Supporting Materials, see National Summary Statistics and Graphs Real Estate Data PDF.]
Develop Regression Model
[ Scatterplot: Create and insert the scatterplot of the variables with a line of best fit and the regression equation. Based on your scatterplot, explain whether a regression model is appropriate.]
[ Discuss associations: Discuss the associations in the scatterplot, including the direction, strength, and form, in the context of your model.]
[ Discuss associations: Identify any possible outliers or influential points and discuss their effect on correlation.]
[ Discuss associations: Discuss keeping or removing outlier data points and what impact your decision would have on your model.]
[ Calculate r: Calculate the correlation coefficient and explain how the calculated r value supports what was noticed in your scatterplot.]
Determine the Line of Best Fit
[ Regression equation: Write the regression equation (i.e., line of best fit) and clearly define your variables.]
[ Interpret regression equation: Interpret the slope and intercept in context. For example, answer the questions: What does the slope represent in this situation? What does the intercept represent? Revisit the Scenario section in the learning management system.]
[ Strength of the equation: Provide and interpret R-squared. Determine the strength of the linear regression equation you developed.]
[ Use regression equation to make predictions: Use the regression equation to predict how much you should list your home for based on the assumed square footage of your home at 1500 square feet.]
Conclusions
[ Summarize findings: Summarize your findings in clear and concise plain language for the CEO to understand.]
[ Summarize findings: Did you see the results you expected, or was anything different from your expectations or experiences?]
[ Summarize findings: What changes could support different results, or help to solve a different problem?]
[ Summarize findings: Provide at least one question that would be interesting for follow-up research.]
MAT240ProjectOneGuidelinesandRubric.docx
MAT 240 Project One Guidelines and Rubric
Competencies
In this project, you will demonstrate your mastery of the following competencies:
· Apply statistical techniques to address research problems
· Perform regression analysis to address an authentic problem
Overview
The purpose of this project is to have you complete all of the steps of a real-world linear regression research project starting with developing a research question, then completing a comprehensive statistical analysis, and ending with summarizing your research conclusions.
Scenario
You have been hired by the D. M. Pan National Real Estate Company to develop a model to predict housing prices for homes sold in 2019. The CEO of D. M. Pan wants to use this information to help their real estate agents better determine the use of square footage as a benchmark for listing prices on homes. Your task is to provide a report predicting housing prices based on square footage. To complete this task, use the provided real estate data set for all U.S. home sales as well as the national descriptive statistics and graphs provided.
Directions
Using the Project One Template located in the What to Submit section, generate a report including your tables and graphs to determine if the square footage of a house is a good indicator for what the listing price should be. Reference the National Statistics and Graphs document for national comparisons and the Real Estate Data spreadsheet (both found in the Supporting Materials section) for your statistical analysis.
Note: Present your data in clearly labeled tables and graphs.
Specifically, include the following in your report:
Introduction
A. Describe the report: Give a brief description of the purpose of your report.
a. Define the question your report is trying to answer.
b. Explain when using linear regression is most appropriate.
i. When using linear regression, what would you expect the scatterplot to look like?
c. Explain the difference between predictor (x) and response (y) variables in a linear regression to justify the selection of variables.
Data Collection
A. Sampling the data: Select a random sample of 50 houses. Describe how you obtained your sample data (provide Excel formulas as appropriate).
a. Identify your predictor and response variables.
B. Scatterplot: Create a scatterplot of your predictor and response variables to ensure they are appropriate for developing a linear model.
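For the sampling step, one common Excel approach is to fill a helper column with =RAND(), sort the sheet by that column, and keep the first 50 rows. The same idea can be sketched in Python as drawing 50 row numbers without replacement. The population size of 1,000 is an assumption taken from the n in the national summary table, and `sample_row_numbers` and the seed value are hypothetical names and settings chosen for illustration.

```python
import random


def sample_row_numbers(population_size, sample_size, seed=None):
    """Return sorted 1-based row numbers drawn without replacement."""
    rng = random.Random(seed)  # a fixed seed makes the draw reproducible
    return sorted(rng.sample(range(1, population_size + 1), sample_size))


if __name__ == "__main__":
    # Assumed: the data sheet has 1,000 houses, one per row.
    rows = sample_row_numbers(population_size=1000, sample_size=50, seed=240)
    print(rows)
```

Recording the seed (or, in Excel, pasting the RAND() column as values before sorting) keeps the sample from changing each time the file recalculates.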
Data Analysis
A. Histogram: Create a histogram for each of the two variables.
B. Summary statistics: For your two variables, create a table to show the mean, median, and standard deviation.
C. Interpret the graphs and statistics:
a. Based on your graphs and sample statistics, interpret the center, spread, shape, and any unusual characteristic (outliers, gaps, etc.) for house sales and square footage.
b. Compare and contrast the center, shape, spread, and any unusual characteristic for your sample of house sales with the national population (under Supporting Materials, see the National Summary Statistics and Graphs House Listing Price by Region PDF). Determine whether your sample is representative of national housing market sales.
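The summary statistics and the national comparison can be sketched as follows. In Excel the equivalent functions would be AVERAGE, MEDIAN, and STDEV.S. The five listing prices below are made-up values for illustration only (a real sample would have 50), and `summarize` and `compare` are hypothetical helper names. The national figures are the listing price row of the national summary table.

```python
import statistics

# National listing price figures, copied from the national summary table.
NATIONAL_PRICE = {"mean": 342_365, "median": 318_000, "stdev": 125_914}


def summarize(values):
    """Return the mean, median, and sample standard deviation of a list."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample (n - 1) standard deviation
    }


def compare(sample_stats, national_stats):
    """Return sample minus national for each statistic."""
    return {key: sample_stats[key] - national_stats[key] for key in national_stats}


# Made-up listing prices, for illustration only.
sample_prices = [210_000, 250_000, 275_000, 310_000, 455_000]
sample_stats = summarize(sample_prices)
differences = compare(sample_stats, NATIONAL_PRICE)

if __name__ == "__main__":
    print(sample_stats)
    print(differences)
```

Large differences in center or spread would suggest the sample is not representative of the national market. How large counts as "large" is a judgment call the report should explain.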
Develop Your Regression Model
A. Scatterplot: Provide a scatterplot of the variables with a line of best fit and regression equation.
a. Based on your scatterplot, explain if a regression model is appropriate.
B. Discuss associations: Based on the scatterplot, discuss the association (direction, strength, form) in the context of your model.
a. Identify any possible outliers or influential points and discuss their effect on the correlation.
b. Discuss keeping or removing outlier data points and what impact your decision would have on your model.
C. Calculate r: Calculate the correlation coefficient (r).
a. Explain how the r value you calculated supports what you noticed in your scatterplot.
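The correlation coefficient asked for here is usually Pearson's r, which Excel exposes as CORREL. A minimal Python sketch of the calculation follows. The data points are made-up values for illustration only, and `pearson_r` is a hypothetical helper name.

```python
import math


def pearson_r(xs, ys):
    """Return Pearson's correlation coefficient for paired lists xs and ys."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Made-up (square feet, listing price) pairs, for illustration only.
square_feet = [1200, 1500, 1800, 2400, 3000]
listing_price = [210_000, 250_000, 275_000, 310_000, 455_000]

if __name__ == "__main__":
    print(round(pearson_r(square_feet, listing_price), 3))
```

A value near +1 or -1 indicates a strong linear association and a value near 0 a weak one. The sign should match the direction seen in the scatterplot.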
Determine the Line of Best Fit. Clearly define your variables. Find and interpret the regression equation. Assess the strength of the model.
A. Regression equation: Write the regression equation (i.e., line of best fit) and clearly define your variables.
B. Interpret regression equation: Interpret the slope and intercept in context. For example, answer the questions: What does the slope represent in this situation? What does the intercept represent? Revisit the Scenario above.
C. Strength of the equation: Provide and interpret R-squared.
a. Determine the strength of the linear regression equation you developed.
D. Use regression equation to make predictions: Use your regression equation to predict how much you should list your home for based on the assumed square footage of your home at 1500 square feet.
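The steps in this section can be sketched as a least-squares fit, an R-squared calculation, and a prediction at 1,500 square feet. The three data points below are made up so that the arithmetic is easy to follow and lie exactly on a line, which real housing data will not. The names `fit_line`, `r_squared`, and `predict` are hypothetical.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Return the share of variation in ys explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


def predict(square_feet, slope, intercept):
    """Return the predicted listing price for a given square footage."""
    return intercept + slope * square_feet


# Made-up data lying exactly on price = 50,000 + 100 * square feet.
xs = [1000, 2000, 3000]
ys = [150_000, 250_000, 350_000]
slope, intercept = fit_line(xs, ys)

if __name__ == "__main__":
    print(slope, intercept)
    print(r_squared(xs, ys, slope, intercept))
    print(predict(1500, slope, intercept))
```

In this toy example the slope means each additional square foot adds $100 to the predicted listing price. The intercept is the predicted price at zero square feet, which is an extrapolation outside the data and usually has no practical meaning on its own.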
Conclusions
A. Summarize findings: In one paragraph, summarize your findings in clear and concise plain language for the CEO to understand.
a. Did you see the results you expected, or was anything different from your expectations or experiences?
b. What changes could support different results, or help to solve a different problem?
c. Provide at least one question that would be interesting for follow-up research.