assignment excel

profilealdennim
examples.zip

RealEstateSP18_..Bayan.xlsx

Description of Data

Description: The datafile contains 266 observations on 23 variables sampled from homes in Chester County in 2007-2010
Variable names in order from left to right:
MLS - Multiple Listing Service - serves as a house ID in the real estate system
DOM - Days on Market - the number of days from when a house was listed until it was sold
PRICE - actual selling price of the house
COUNTY - county the house is located in
SUBDIV / NEI - if the house is in a named neighborhood the name is given - if the house is not in a neighborhood it might be blank of have an N/A
SCHOOL DISTRICT - either the West Chester of Unionville-Chadds Ford school district
-HIGH - students living in this house would go to this high school - please note some school districts have multiple high schools and some only have one
-MIDDLE - students living in this house would go to this middle school - please note some school districts have multiple middle schools and some only have one
-ELEM - students living in this house would go to this elementary school
TYPE - whether the structure is single or attached
DESIGN - number of stories/floors
STYLE - style of the home
BED - number of bedrooms
BATH - number of full bathrooms - note a full bathroom contains a shower/tub
1/2 BATH - number of 1/2 bathrooms or powder rooms - note a 1/2 bath does not contain a shower/tub
AGE - age of home in years
SQ FOOT - interior square footage
RETAXES - real estate taxes
ASSESS - assessed value - this is used to calculate real estate taxes
ACRE - the acerage of the land
GARAGE - number of garage bays
CONDITIONG - condition of the house - condition does not have a standard definition
LDt - listing date - the day the house was put on the market

Freq 1

Row Labels Count of SCHOOL DISTRICT Count of SCHOOL DISTRICT2
Unionvil-chadds 106 40.46%
West Chester 156 59.54%
Grand Total 262 100.00%

Freq2

Values
Row Labels Count of TYPE Count of TYPE2
Row/townhouse 32 12.17%
single 124 47.15%
Single/detac 97 36.88%
Twin/Semi-De 10 3.80%
Grand Total 263 100.00%

CorrelationMatrix

DOM PRICE BED BATH 1/2 BATH AGE SQ FOOT RETAXES ASSESS ACRE GARAGE
DOM 1.00
PRICE 0.14 1.00
BED 0.13 0.62 1.00
BATH 0.12 0.56 0.59 1.00
1/2 BATH 0.12 0.41 0.31 0.21 1.00
AGE 0.08 0.13 0.08 -0.24 -0.28 1.00
SQ FOOT 0.09 0.65 0.66 0.74 0.47 -0.23 1.00
RETAXES 0.07 0.58 0.46 0.60 0.43 -0.22 0.66 1.00
ASSESS 0.08 0.70 0.56 0.66 0.51 -0.27 0.80 0.83 1.00
ACRE 0.15 0.62 0.25 0.07 0.03 0.53 0.10 0.10 0.18 1.00
GARAGE -0.07 0.56 0.41 0.54 0.42 -0.40 0.61 0.55 0.68 0.04 1.00

DiscrriptiveStatic

DOM PRICE BED BATH 1/2 BATH AGE SQ FOOT RETAXES ASSESS ACRE GARAGE
Mean 78.3854961832 556644.690839695 3.9122137405 2.465648855 0.9770992366 30.5992366412 3090.4910675629 6420.7813408421 271399.139220202 1.6522137405 1.965648855
Standard Error 5.9147943555 19049.0989223115 0.053942965 0.0531523372 0.0350197668 2.3878455396 82.7578016648 264.8239485694 8643.7151654096 0.3361023602 0.0570663738
Median 47.5 495000 4 2 1 19.5 2800 6016.0373134328 244985 0.92 2
Mode 3 675000 4 2 1 0 2723.9029850746 6016.0373134328 259104.402985075 1 2
Standard Deviation 95.7393104961 308336.602554784 0.8731431662 0.8603457375 0.5668444461 38.6506566066 1339.5520441302 4286.5500835516 139910.752651509 5.4402919673 0.9236999556
Sample Variance 9166.0155742739 95071460475.0267 0.7623789886 0.7401947881 0.3213126261 1493.87325612 1794399.67893351 18374511.6187959 19575018707.5117 29.5967766898 0.853221608
Kurtosis 12.5554308452 18.7753229323 3.9554221095 0.4927937782 1.5738737377 11.1950133676 2.6086056096 28.7255790763 3.657254878 89.2009250701 0.2162616439
Skewness 2.8798352803 3.1542622809 0.8675737115 0.6163955793 0.3765768362 3.0896852532 1.3364090407 3.9683268454 1.5252391155 8.9771890895 -0.4900831347
Range 756 2940000 7 4 3 240 8118 44442 898490 61.6 4
Minimum 1 125000 2 1 0 0 882 1386 55000 0 0
Maximum 757 3065000 9 5 3 240 9000 45828 953490 61.6 4
Sum 20537 145840909 1025 646 256 8017 809708.659701493 1682244.71130064 71106574.475693 432.88 515
Count 262 262 262 262 262 262 262 262 262 262 262

Correlation 2

WC=1..UC=0 Single=0..Others=1 s1=1 s1.5=2 s2=3,s3=4 Colonial=0..Others=1
WC=1..UC=0 1.00
Single=0..Others=1 -0.15 1.00
s1=1 s1.5=2 s2=3,s3=4 0.07 0.18 1.00
Colonial=0..Others=1 -0.16 0.01 -0.27 1.00

Sheet2

Average of AGE
Row Labels Total
2 25.25
3 35.2428571429
4 25.3383458647
5 31.9285714286
6 69.75
9 79
Grand Total 30.5992366412

Data

MLS SUBDIV / NEI SCHOOL DISTRICT TYPE DESIGN STYLE DOM PRICE BED BATH 1/2 BATH AGE SQ FOOT RETAXES ASSESS ACRE GARAGE
4813991 N/A 1 0 1 1 237 280000 3 2 0 38 1540 2249 118790 1.2 0
4964729 N/A 1 0 3 1 13 293500 4 2 1 20 2863 3908 205330 1.2 2
4808620 N/A 1 0 3 1 182 296000 4 2 0 27 1700 2587 136640 1.2 1
5103499 Norbro Mhp 1 0 1 1 3 298000 3 1 1 50 1421 2038 103630 1.7 2
4957362 N/A 1 0 4 0 23 325000 3 2 0 240 1798 2957 155330 1.2 2
4931445 N/A 1 0 2 1 11 342000 3 2 0 24 2139 3099 162800 1 1
4926813 N/A 1 0 3 0 34 352000 4 2 1 45 1932 3564 187220 0.76 2
4971204 N/A 1 0 3 0 54 371200 4 2 1 30 1934 2710 142390 1.3 1
4971016 N/A 1 0 3 0 76 399000 4 2 1 28 2056 3375 177320 1 2
5086945 Black Horse Run 1 0 3 1 4 429900 4 2 1 21 2287 4276 217440 1.2 2
4986839 Deerfield Greene 1 0 3 1 53 434000 4 3 1 28 3898 4841 254320 1.2 2
4972299 N/A 1 0 3 1 15 435000 4 2 1 26 2249 3264 171480 1 2
4963670 Sconnelltown 1 0 3 0 55 450000 4 2 1 20 2492 3867 203140 0.48 2
4891676 N/A 1 0 3 0 108 471000 4 2 1 19 2416 4594 242660 1 2
4944754 Sconnelltown 1 0 3 0 85 483000 4 2 1 21 2558 3867 203160 0.51 4
5021459 Courtfield Streams 1 0 3 1 9 495100 4 2 2 17 3602 4481 235430 1 2
4969410 Brookmead Farms 1 0 3 0 6 504000 4 2 1 20 2776 4386 230420 1 2
4929943 Squires Lea 1 0 3 1 33 520000 4 2 1 29 2693 4368 229460 1 2
5046985 Meadowcroft 1 0 3 0 17 555000 4 3 0 19 3056 5225 274480 1 2
4996341 Carroll Hill 1 0 3 1 22 568000 4 3 0 8 2723.9029850746 6408 336630 1.6 3
4947419 Royalwood Ests 1 0 3 0 27 575000 4 2 1 17 3640 6588 346100 1 3
4890768 Folkestone 1 0 3 0 80 585000 4 2 1 16 3466 5530 282140 1.7 2
5103937 N/A 1 0 2 1 34 599000 3 2 1 20 3300 5215 265210 2.2 2
4973637 Royalwood Ests 1 0 3 1 27 625000 4 3 1 17 4790 6749 354540 1 3
4944802 Blue Rock Meadows 1 0 3 0 63 645000 4 2 1 13 3276 5473 287510 1 2
4957271 Brandywine Overloo 1 0 3 1 7 653000 4 3 1 7 3692 6960 365660 0.36 2
5012961 Highland Farm 1 0 3 1 43 675000 4 2 2 14 3137 5416 284510 1.4 3
4940159 N/A 1 0 3 0 82 679000 5 3 1 20 3676 6053 318000 4.15 3
4968233 Highland 1 0 3 1 25 710000 5 4 1 12 4911 7698 404420 2.2 2
5042081 Marshallton Chase 1 0 3 1 3 710000 4 4 1 10 4402 7794 409460 1.5 3
4979669 Brandywine River E 1 0 3 0 54 725000 4 3 1 10 4098 7814 410530 0.5 3
4953113 Kenmara 1 0 3 0 66 725000 4 2 1 9 3698 6580 345690 0.73 3
4945610 Brandywine River E 1 0 3 0 126 730000 4 3 1 9 4400 7403 388910 0.74 3
4964289 Brandywine River E 1 0 3 0 46 745000 4 3 1 7 5388 7185 377480 0.53 3
4968367 Steeplechase 1 0 3 0 7 780000 4 3 1 8 4103 7635 401080 0.7 3
4812286 Brandywine River E 1 0 3 0 230 786000 4 3 1 8 4886 8402 464440 0.75 3
4946623 Bala 1 0 3 0 89 800000 5 4 1 18 4615 7616 400120 1 3
5012039 Steeplechase 1 0 4 0 3 830000 5 4 1 8 6169 9852 517580 0.96 3
4969770 Foulke Manor 1 0 3 1 101 835000 4 4 2 4 5200 10422 547500 2.03 3
5032049 Mount Bradford Far 1 0 2 0 1 850000 3 2 1 16 5175 7359 386620 1.3 3
4798340 Harmony Hills 1 0 3 0 307 930000 6 5 2 15 9000 9023 476630 4.9 3
4995595 Shenandoah 1 0 3 1 42 940000 5 3 2 4 5007 12242 643120 1.12 4
4942826 Shenandoah 1 0 3 0 3 980000 5 3 1 3 4856 13591 714010 1.14 3
4933988 Heritage At Parke Fa 1 0 3 0 35 1000000 4 3 2 10 1553 10298 544010 1.4 3
5020145 Lenape 0 0 1 1 14 249000 3 2 0 44 1334 2658 107960 0.5 0
4996075 Lenape 0 0 1 1 46 302500 3 1 1 45 1380 3082 116340 1.06 0
4995521 N/A 0 0 1 1 26 375000 3 2 0 45 1802 6395 241370 4.9 2
4815732 Brandywine Hills 0 0 4 1 193 378500 4 3 0 35 2129 3930 154800 0.69 1
4970470 Brandywine Hills 0 0 3 0 6 396000 5 3 0 50 3521.619047619 3875 146270 0.69 0
4964748 High Woods 0 0 1 1 20 425000 3 2 0 22 3144 7618 300070 2 2
4806526 Red Bridge Farm 0 0 3 0 255 450000 4 2 1 25 2723.9029850746 4673 197090 0.94 2
4955826 N/A 0 0 3 0 19 494000 4 2 1 22 2723.9029850746 7491 282750 2 2
4969740 Red Bridge Farm 0 0 3 0 41 495000 4 2 1 28 2912 6414 242080 1.1 2
4925440 Red Bridge Farm 0 0 3 0 43 506900 5 3 1 26 3184 6373 251010 1 2
5149343 Four Streams 0 0 3 1 1 600000 4 2 2 15 3727 6016.0373134328 336670 1 3
4968052 N/A 0 0 3 1 19 625000 3 2 1 20 3129 7330 276680 3.09 2
4956895 Ridgeview 0 0 1 1 98 670000 5 4 0 50 3521.619047619 6496 245170 4 2
4950867 Miles Spring 0 0 3 0 12 720000 4 3 1 12 3992 11398 430200 2 3
5015915 Overlook 0 0 3 0 8 735000 5 3 2 0 3521.619047619 11000 71450 1 3
4960072 Courts At Longwood 0 0 3 1 89 770000 4 3 1 67 3952 6535 246660 1.4 2
4958045 Olmsted 0 0 4 1 57 850000 5 4 1 2 5500 12928 487960 1.15 3
4819720 Olmsted 0 0 3 0 239 862500 5 5 2 4 7346 15693 618110 1 3
5057965 N/A 0 0 3 1 9 925000 6 3 0 200 3874 4442 161090 12.8 0
4920727 Osborne Place 0 0 3 1 2 999500 4 4 1 5 4368 45828 597420 1.21 3
4848096 Winterwood 0 0 3 1 1 1200000 5 4 2 0 5000 6016.0373134328 355245.928571429 1 3
4941695 N/A 0 0 2 1 3 269900 2 1 1 78 1445 2821 110000 0.28 1
5011635 Dilworthtown Oak E 0 0 3 0 45 475000 4 2 1 33 2505 6007 234240 1 2
4963203 Dilworthtown Oak E 0 0 3 1 43 590000 4 2 2 33 2723.9029850746 6692 260950 1.1 2
4971141 Radley Run 0 0 1 1 28 599900 4 2 1 33 3278 7402 288650 1.6 2
4944685 Knolls Of Birmingh 0 0 3 0 4 626500 4 2 1 10 3304 7022 273830 0.43 2
4969918 Birmingham 0 0 3 1 27 675000 4 4 1 19 3648 8377 326670 1.2 2
4826733 Heartsease 0 0 3 1 139 685000 4 3 1 20 3720 8637 338170 1.2 2
4954840 Radley Run 0 0 3 1 5 695000 4 3 1 12 3858 10200 397750 1.7 3
4949773 Hamilton Place 0 0 3 0 73 716000 4 2 1 12 3596 9312 363130 0.69 3
5045469 Roundelay 0 0 3 1 20 742500 6 3 1 25 4474 9219 359500 2.9 2
4967815 Radley Run 0 0 3 1 85 750000 4 3 1 36 2723.9029850746 7676 299340 1.2 2
4883251 Hamilton Place 0 0 3 1 55 765000 4 3 1 11 3759 10042 393220 0.74 3
4954612 Fieldpoint 0 0 3 1 52 769900 4 3 1 10 3677 9989 389520 0.82 3
4886417 N/A 0 0 3 0 49 830000 4 3 1 6 4008 13368 521300 3.18 3
4710933 N/A 0 0 3 0 368 880000 5 3 2 9 5711 13240 534540 2 3
4948143 Revolutionary Farm 0 0 3 1 96 1100000 5 3 2 13 7904 15042 586570 4 3
4901900 Dilworthtown Oak E 0 0 3 0 168 1185000 6 5 3 3 6978 20027 784170 3.9 3
4846255 Longview At Whlie 0 0 3 1 129 1275000 5 5 2 6 8007 22705 889040 1 3
5084777 Birmingham Ests 0 0 3 0 14 1800000 6 3 3 10 7098 18039 675000 3.5 3
4831824 Grandview Acres 1 0 4 0 220 269000 3 1 1 49 2474 3136 154040 0.46 1
4919030 Westtown, Hillside 1 0 1 1 180 295000 3 2 0 23 1632 3597 176700 0.6 2
4880750 Westview Acres 1 0 1 1 114 304500 3 2 0 38 1367 3016 148120 0.92 2
4957172 Wild Goose Farms 1 0 3 0 75 320000 3 2 1 12 1600 2869 140210 0.09 1
4802653 Hummingbird Farm 1 0 1 1 165 330000 3 2 1 44 1500 2850 140000 0.91 0
4962715 N/A 1 0 1 1 66 355500 3 2 1 50 1575 2630 128520 1.21 2
5082503 N/A 1 0 3 0 39 360000 4 1 1 67 2431 3359 159230 0.34 2
4963944 West Lynn 1 0 4 0 28 365000 4 2 1 47 1965 3313 161870 0.92 1
4921987 N/A 1 0 3 1 14 372500 5 3 1 57 2821 3188 155810 0.34 1
4935197 South Hills 1 0 1 1 6 390000 4 3 0 45 1864 3310 161750 1 2
4990661 Oadbourne 1 0 3 0 71 395000 4 2 1 44 2548 4199 206240 1 2
5027087 N/A 1 0 3 0 66 395000 4 2 1 38 2412 4467 218280 1 2
4922490 New South Hills 1 0 3 1 74 400000 4 2 1 18 3204 4844 237940 1 2
4960494 Osborne Hill 1 0 1 1 23 400000 4 2 1 34 1957 3708 181200 1.1 2
5010885 N/A 1 0 3 1 65 405000 4 2 1 37 3000 4679 228650 1.1 2
5062817 N/A 1 0 1 1 29 411900 3 2 1 35 2596 3574 169430 1 2
4976795 N/A 1 0 3 1 16 425000 3 2 1 23 2402 4531 221420 0.45 2
4953426 Pleasant Grove 1 0 3 0 106 425000 4 2 1 31 2442 4227 206560 0.42 2
4992713 Windy Hill 1 0 4 0 36 434500 5 3 0 42 2800 4074 199080 1.5 2
4934907 Hummingbird Farm 1 0 3 0 30 443000 5 3 0 40 2245 3890 190110 0.73 2
4930847 Pleasant Grove 1 0 3 0 11 445000 4 2 1 26 2386 4277 208980 0.47 2
5043235 Westmount 1 0 3 1 36 445000 3 2 1 23 1860.6285714286 3697 180670 0.41 2
4955647 Westtown Park 1 0 4 0 5 447500 4 2 1 46 2626 3940 192550 0.92 2
5075549 Pennwood 1 0 1 1 7 452000 3 2 1 48 2420 4690 222340 1.31 2
5026703 Sycamore Springs 1 0 3 0 11 460000 4 2 1 30 2352 4723 230780 1.1 2
4962358 Land Grant Farms 1 0 3 1 1 466000 4 2 1 30 3066 5342 261040 1 2
5069795 Pleasant Grove 1 0 3 0 3 479900 5 3 1 26 3047 4602 218170 0.46 2
4944988 Plumly Farms 1 0 3 1 107 505000 4 3 1 23 2466 5000 244350 1.1 2
4934557 West Glen 1 0 3 0 1 530000 4 2 1 17 3130 5393 263520 0.34 2
4943097 Land Grant Farms 1 0 2 1 7 531250 4 2 1 29 3126 6328 309220 2.1 4
4923073 West Chester 1 0 3 0 54 570000 4 2 2 12 2928 5758 281390 1.2 2
5013721 Arborview 1 0 3 1 15 625000 4 2 1 40 2723.9029850746 4611 226500 1.07 2
4932486 Arborview 1 0 3 0 1 729115 4 2 1 0 3079 6016.0373134328 259104.402985075 0.41 3
4961881 Pleasant Grove 1 0 3 0 26 754000 4 4 1 8 4489 9324 455650 0.52 3
4947023 Enclav Pleasant Wood 1 0 3 0 26 769868 5 3 1 7 3521.619047619 10431 509700 0.6 3
4877806 Arborview 1 0 3 0 137 797865 4 3 1 0 2723.9029850746 6016.0373134328 259104.402985075 0.41 3
4965442 Avonlea 1 0 3 0 18 870000 5 3 1 10 3843 8227 402040 1 3
4958236 Arborview 1 0 3 0 296 943172 4 3 2 0 2723.9029850746 6016.0373134328 259104.402985075 0.41 3
4955038 Arborview 1 0 3 0 289 1214800 4 3 2 0 2723.9029850746 6016.0373134328 259104.402985075 0.41 3
4940903 N/A 1 0 4 1 15 1273000 6 5 1 138 5875 8445 491660 3.69 4
5393482 Radley Run IV 1 1 3 1 17 505000 4 2 1 20 2723.9029850746 5577 283590 1 2
5152451 Radley Run IV 1 1 3 1 34 595000 4 3 2 19 3608 7100 361070 0.93 2
5321248 Brandywine River E 1 1 3 1 97 638500 4 3 1 10 2723.9029850746 7029 357450 0.58 3
5229179 Marshallton 1 1 3 0 142 640000 4 3 1 9 4216 7657 402240 1.3 3
5296017 Brandywine Overloo 1 1 3 0 11 675000 4 3 1 7 3920 6016.0373134328 387510 0.35 3
5245937 Brandywine Overloo 1 1 3 0 181 678000 4 4 1 8 5056 7640 388530 0.3 2
5283395 Marshallton Chase 1 1 3 1 51 685000 4 3 1 11 4235 8000 406790 2.02 3
5041743 Blue Rock Meadows 1 1 3 1 93 715000 4 2 1 17 3722 6022 316350 1.4 3
5298021 Blue Rock Meadows 1 1 3 1 4 742500 4 2 1 18 3722 6221 316350 1.4 3
5161897 Applegate 1 1 3 1 100 537500 4 2 1 8 3418 6834 323970 0.43 2
5150041 Applegate 1 1 3 0 26 582000 5 4 1 10 3382 6795 322110 0.43 2
5159665 Applegate 1 1 3 1 238 610000 5 4 1 8 3521.619047619 6639 314730 0.42 2
5356393 Applegate 1 1 3 1 59 632500 4 3 1 8 4276 8702 412520 0.47 2
5138335 Birmingham 0 1 3 0 110 333500 3 3 1 13 1860.6285714286 4944 184990 0.08 1
53420185 Birmingham hunt 0 1 3 1 113 365000 3 3 1 11 2800 5671 212200 0.08 2
5084811 Birmingham hunt 0 1 3 1 167 379000 3 3 1 8 2855 5906 220990 0.08 2
5175509 Knolls of Birmingh 0 1 3 0 112 385000 3 2 1 9 2288 6311 236160 0.21 2
5182875 Knolls of Birmingh 0 1 4 1 7 392500 4 3 1 8 2500 4772 186080 0.04 2
5300803 Knolls of Birmingh 0 1 3 0 34 392500 3 2 2 15 3184 6294 235500 0.04 2
4919785 Knolls of Birmingh 0 1 3 1 383 405000 3 2 2 9 1860.6285714286 7267 283370 0.21 2
5296537 Birmingham hunt 0 1 3 1 31 412000 3 2 1 10 2362 6085 227690 0.14 2
5311745 Birmingham hunt 0 1 3 1 8 415000 3 2 1 11 2362 6044 226160 0.16 2
5318992 Knolls of Birmingh 0 1 3 1 20 455000 4 3 1 10 2723.9029850746 6874 257210 0.22 2
4947359 Radley Run 0 1 1 1 173 500000 4 3 0 42 3028 7856 306360 1 2
5353826 Radley Run 0 1 3 0 30 530000 4 2 2 42 2723.9029850746 6505 243400 1.5 2
5155127 Radley Run 0 1 3 1 106 542000 4 2 1 23 2929 8230 307980 0.72 2
5168007 Radley Run 0 1 3 1 167 585000 4 2 1 23 2813 7020 262670 1.5 2
5117677 Radley Run 0 1 3 0 3 595000 4 2 2 36 2676 6733 251960 1 2
5217199 Radley Run 0 1 3 0 22 680000 4 3 0 31 2723.9029850746 7923 217210 1 2
5181467 Radley Run 0 1 3 1 1 692000 5 4 0 41 3209 8045 301050 1 2
5328970 Thornbury Estates 1 1 4 1 60 230000 3 1 1 50 1748 2977 145680 1 2
5198517 Brandywine Thorn 1 1 3 1 86 292000 2 2 1 6 2112 2565 165770 0.05 1
5152647 Thornbury Estates 1 1 4 0 35 292000 3 1 1 49 1824 3078 150620 1.03 2
5338866 Brandywine Thorn 1 1 3 1 90 395000 2 2 1 6 2112 3389 165850 0.06 1
5307231 Brandywine Thorn 1 1 3 1 170 299000 2 2 1 8 2124 3504 171500 0.05 1
5340293 Thornbury Estates 1 1 4 0 5 305200 3 1 1 51 1512 3104 151900 1 2
5241865 Brandywine Thorn 1 1 3 0 116 310000 3 2 1 10 2112 3920 191850 0.05 1
5268167 Brandywine Thorn 1 1 3 0 174 310000 3 2 1 10 2112 3285 160780 0.05 1
5066251 Brandywine Thorn 1 1 3 1 207 312500 3 2 1 7 2660 4235 207259 0.08 1
5325388 Thornbury Estates 1 1 1 1 20 330000 3 2 0 50 1442 2772 135660 0.98 2
5208673 Brandywine Thorn 1 1 3 1 161 351000 3 2 1 9 2180 4237 207370 0.09 1
4987251 Thornbury Estates 1 1 4 0 178 368000 4 2 1 50 2312 3477 175580 1 2
5359319 Brandywine Thorn 1 1 3 0 34 375000 3 2 1 10 2810 4905 240040 0.12 2
5108749 Brandywine Thorn 1 1 3 0 16 495000 4 2 1 9 3424 4644 234510 0.2 2
5139903 Brandywine Thorn 1 1 3 0 51 495000 4 2 1 7 3545 5629 284240 0.29 2
5283389 Brandywine Thorn 1 1 3 0 69 520000 4 2 1 9 2723.9029850746 5890 294150 0.25 2
5278663 Brandywine Thorn 1 1 3 0 5 525000 4 2 1 8 3584 5836 285610 0.23 2
5014547 Brandywine Thorn 1 1 3 0 67 530000 4 2 1 8 4019 5366 270950 0.29 2
5162131 Brandywine Thorn 1 1 3 0 18 535000 6 3 1 7 3672 6529 319510 0.26 2
5312990 Brinton Village 1 1 3 0 52 623730 4 3 1 0 3400 6016.0373134328 259104.402985075 0.25 2
5313014 Brinton Village 1 1 3 0 2 624888 3 2 1 0 3306 3984.5571428571 168895.128571429 0.25 2
5315218 Brinton Village 1 1 3 0 140 635000 4 3 1 0 3400 6016.0373134328 259104.402985075 0.25 2
5313006 Brinton Village 1 1 3 0 46 652000 3 2 1 0 3306 3984.5571428571 168895.128571429 0.25 2
5200033 Brinton Village 1 1 3 0 112 654000 4 3 1 0 3468 6016.0373134328 259104.402985075 0.25 2
5187039 Brinton Village 1 1 3 0 3 654530 4 3 1 0 3468 6016.0373134328 259104.402985075 0.25 2
5261447 Brinton Village 1 1 3 0 9 770000 4 3 1 0 3468 6016.0373134328 259104.402985075 0.25 2
5650248 N/A 0 1 2 1 186 670000 4 2 1 37 2723.9029850746 7227 255420 7.9 2
5664263 Pocopson Creek 0 1 3 0 1 760000 4 3 1 0 3720 6016.0373134328 259104.402985075 1 3
5669155 Newlin Greene 0 1 3 0 104 760000 5 3 2 6 5118 14831 524160 0.75 3
5653066 Newlin Greene 0 1 3 1 166 800000 5 4 2 4 6311 14601 496960 1.12 3
5649779 Pocopson Creek 0 1 3 0 1 1008117 5 3 1 0 4302 6016.0373134328 355245.928571429 1 3
5661644 Taggerts Crossing 0 1 3 1 133 260750 3 3 0 6 1800 3780 130460 0.12 2
5676977 n/a 0 1 2 1 87 285000 4 1 1 60 1764 4536 156520 0.98 2
5667567 The Villages 0 1 3 1 82 325000 3 3 0 6 2769 6438 222180 0.15 2
5659707 Cedarcroft 0 1 3 1 111 340000 3 2 0 48 1860.6285714286 5620 193950 1.25 2
5672904 Traditions 0 1 2 1 111 358000 3 2 0 7 1860.6285714286 7022 251430 0.16 2
5536098 Traditions 0 1 3 1 375 375000 3 2 1 7 1860.6285714286 6837 244800 0.18 2
5672335 Traditions 0 1 2 1 89 380000 3 3 0 5 2815 7600 262270 0.21 2
5697182 Beversrede 0 1 4 1 116 450000 4 3 0 26 3584 8693 300000 1.6 2
5727283 Beversrede 0 1 3 0 13 467000 4 2 1 24 2704 7586 261790 1.1 2
5717057 n/a 0 1 3 1 16 525000 2 2 0 25 2405 3913 274550 12.66 2
5717612 Merrymet Farms 0 1 3 1 16 703000 5 4 2 5 7000 10597 365700 1.18 3
5690123 Merrymet Farms 0 1 3 0 58 785000 5 4 1 7 5522 9983 344500 1.14 3
5328860 N/a 0 1 3 1 757 1390000 4 2 1 179 3276 3810 208620 21.2 0
5713936 n/a 0 1 3 1 90 1650000 5 1 1 230 2520 4338 443730 61.6 0
5676029 n/a 0 1 3 0 45 3065000 5 4 1 231 3521.619047619 11875 409780 53.8 4
5721216 Brandywine Hills 0 1 1 1 11 229000 3 1 1 56 1400 3332 112030 0.52 1
5473962 Brandywine Hills 0 1 3 0 497 270000 4 3 0 38 2129 4442 154800 0.69 1
5713275 Riverside At 0 1 3 0 25 580000 4 3 1 3 2723.9029850746 8888 398790 0.24 2
5555991 n/a 0 1 3 0 397 605000 5 3 1 0 4500 6016.0373134328 355245.928571429 2.27 2
5705066 Riverside At 0 1 3 0 25 627500 4 2 1 2 4493 9823 330240 0.3 2
5674194 Ponds Edge 0 1 3 0 65 312000 3 3 1 14 2640 5094 175450 0.03 1
5604550 Chadds Ford Knoll 0 1 3 0 272 355000 5 2 1 47 2381 5207 181460 1.05 2
5666097 Chadds Ford Knoll 0 1 3 0 70 375000 5 3 2 41 3071 6083 211990 0.64 2
5663347 Old Oak 0 1 2 1 78 555000 4 5 0 17 6150 11355 391090 1.7 3
5696370 n/a 0 1 3 1 73 600000 4 3 1 30 3798 8523 293530 10.63 3
5692915 Fair Hill 0 1 3 0 64 635000 4 2 1 25 3304 9672 333110 2.2 2
5602931 n/a 0 1 3 0 257 675000 4 3 1 9 2723.9029850746 14282 497720 1 2
5457877 n/a 0 1 4 0 515 2000000 9 4 2 79 6700 16462 598950 20 4
5661988 n/a 0 1 3 1 139 315000 4 2 1 160 2648 7005 240340 1.9 0
5708693 Knolls of Birmingh 0 1 3 1 67 317500 2 3 1 17 2194 6155 211180 0.04 2
5684256 Birmingham Hunt 0 1 3 0 108 335000 3 2 1 14 3130 7013 240640 0.17 2
5715393 Knolls of Birmingh 0 1 3 0 26 475000 4 2 1 13 3602 8162 280040 0.29 2
5736107 Spring Mdws 0 1 3 1 64 475000 4 3 1 28 3132 8417 288810 2.2 2
5698596 Radley Run 0 1 1 1 115 480000 4 3 1 39 3212 7093 243360 1 2
5689233 n/a 0 1 3 0 43 563000 5 3 0 43 2702 7597 260660 2.2 2
5703499 Silverwood 0 1 3 1 43 567000 4 2 1 23 3670 8681 297850 0.77 2
5711314 Hamilton Place 0 1 3 0 18 692500 4 3 2 15 4193 12529 429870 0.79 3
5733824 Fieldpoint 0 1 3 1 5 725000 5 3 1 18 3771 11871 407300 0.71 2
5685352 Fieldpoint 0 1 3 1 39 750000 4 3 2 15 4413 12133 416300 0.69 3
5655310 n/a 0 1 4 1 176 1130000 6 4 1 160 4711 12070 415570 3.6 2
5716765 Southpoint 0 1 4 1 16 232500 2 2 1 23 1544 2918 109580 0 1
5738039 n/a 0 1 3 1 24 370000 4 2 1 41 2300 6524 173720 2 2
5662192 n/a 0 1 4 1 170 850000 5 3 3 4 3521.619047619 25393 953490 2.35 3
5678492 Ests. At Chadds Ford 0 1 3 0 102 850000 5 4 2 2 5787 13651 512569 0.48 4
5428126 Colonial Meadows 1 1 3 0 55 125000 2 1 1 39 882 1517 57770 0.02 0
5441934 n/a 1 1 1 1 68 150000 3 1 0 56 1178 2704 102980 0.22 0
5373593 n/a 1 1 4 1 114 165000 4 1 0 108 1008 1386 55000 0.05 0
5423541 n/a 1 1 3 0 6 182000 3 1 1 108 1469 2442 93010 0.09 2
5435928 n/a 1 1 3 0 4 260000 3 1 1 49 2220 2432 96530 0.08 0
5343391 n/a 1 1 3 0 185 260000 3 1 1 24 1340 2931 116330 0.11 1
5416675 n/a 1 1 4 1 62 269000 4 1 1 158 2723.9029850746 3010 114650 0.05 0
5231997 West Chester 1 1 4 1 323 290000 4 2 1 88 2048 3300 133050 0.03 0
5435814 West Chester 1 1 4 1 52 300000 5 2 0 108 1690 2424 92330 0.05 0
5446068 Bradford Pointe 1 1 4 1 53 306000 3 2 1 7 1901 4842 184440 0.03 1
5442722 N/a 1 1 3 0 24 310000 3 2 0 58 1860.6285714286 3378 128680 0.17 0
5422023 West Chester 1 1 3 1 3 330000 3 1 0 158 1326 2451 93370 0.05 0
5435871 West Chester 1 1 4 1 6 348500 5 2 0 108 3521.619047619 3469 132140 0.05 0
5422784 West Chester 1 1 4 0 12 355000 3 1 0 150 1592 2850 108550 0.04 0
5373831 West Chester 1 1 4 0 79 360000 3 2 1 108 1860.6285714286 2064 81920 0.05 0
5370899 n/a 1 1 4 1 12 721024 4 2 1 0 2723.9029850746 6016.0373134328 259104.402985075 0.28 2
5453398 Evian 1 1 3 0 15 252000 3 2 1 11 1560 2734 128640 0.02 1
5411117 Exton Station 1 1 3 0 40 253000 3 2 1 11 1308 2324 109350 0.01 0
5370494 Sunset Grove 1 1 4 0 143 257500 3 1 1 47 1319 2656 131510 0.47 1
5365299 Fox Run 1 1 4 1 136 268300 3 2 1 20 2062 2648 131110 0.05 0
5445827 n/a 1 1 1 1 6 279900 3 2 0 27 1248 3647 171580 0.7 2
5425004 Evian 1 1 3 0 31 280000 3 2 1 12 2142 3013 141780 0.02 1
5392995 n/a 1 1 3 0 119 281000 4 2 1 45 1920 2778 130700 0.46 2
5393011 Sunset Grove 1 1 4 0 115 283000 4 1 1 52 2080 1993 93760 0.46 0
5433431 N/a 1 1 3 1 22 286000 4 1 1 52 1666 2513 118220 0.65 1
5453470 n/a 1 1 4 1 43 289000 3 2 0 54 1800 2789 131210 0.37 1
5432406 Sunset Grove 1 1 1 1 23 292000 3 2 0 48 1064 2578 121280 0.47 0
5373315 Whiteland Crest 1 1 1 1 65 298000 3 1 1 57 1392 2413 119510 0.36 1
5420966 Evian 1 1 4 0 55 312000 3 2 1 16 2182 3065 144230 0.02 1
5430202 Evian 1 1 4 1 12 324900 3 2 1 10 2100 3119 146730 0.02 1
5263531 Lynetree 1 1 3 0 127 329900 5 3 1 21 3521.619047619 2896 143390 0.03 0
5355896 Brandywine 1 1 3 0 194 330000 4 2 1 39 2133 3288 162830 0.46 2
5367827 N/a 1 1 3 0 175 335000 3 2 1 32 1896 3478 172240 0.69 2
5376217 Whitford Hills 1 1 3 0 112 340000 4 2 1 37 1704 3198 158360 0.69 3

TestData

MLS DOM PRICE COUNTY SUBDIV / NEI SCHOOL DISTRICT -HIGH -MIDDLE -ELEM TYPE DESIGN STYLE BED BATH 1/2 BATH AGE SQ FOOT RETAXES ASSESS ACRE GARAGE CONDITIONG LDt
5085775 8 240000 Chester N/A West Chester N/A N/A N/A single 1 story ranch 2 1 1 52 1340 3096 157440 1 2
4939319 56 270000 Chester N/A West Chester N/A N/A N/A single 1 story ranch 3 1 0 47 1188 2524 132620 1 1

data_mining_.._bayan (1).docx

Bayan Basham

Data Mining 1

Variables That Were Dropped: The following are the variables that were dropped from the data with the explanation of why they were dropped:

1) County: This variable was dropped, because everything was in Chester.

2) High School: This variable was dropped, because there were missing values in it. In fact, there were 76 values missing for high school, which could impact the data.

3) Middle School: This variable was deleted because there where 44 missing values.

4) Elementary School: This variable was also removed for the same reason where 63 values were missing.

Categorical Variables:

1) School District: School district is a categorical type of data, where it was also coded as binary. Unionville-Chadds was coded as 0 and West Chester was coded as 1. Below is a frequency statistics table for this variable that was done manually by looking at the total numbers of houses in each school district and then calculating their percentages:

Frequency Statistics Table:

Therefore, by looking at the table, we can see that there are more than half of the houses are in West Chester.

2) Type: There are 4 types of houses in the data. Single, Single/detac, Twin/Semi-De, Row/Townhouse. To make things easier, I chose to code this variable as a binary one, where Single and Single/detac houses were coded 0 and the other 2 houses (since they are not single) were coded 1. Below is a frequency statistics table for this variable that was done manually by looking at the total numbers of single houses in contrast to the “others”:

Frequency Statistics Table:

Type

Total

Percentage

Single and Single/detac =0

221

84.03%

Others=1

42

15.97%

Total

263

100%

Therefore, by analyzing the data we can see how close the percentages of single and Single/detac houses compared to other types are. In addition, by observing the data clients can get a clearer picture of what their houses should sell based on

Continuous Variables:

Continuous variables are variables that cannot be coded due to the fact that we can take the average of them. In this data, the continuous variables are: DOM, PRICE, BEDS, BATHS, ½ BATH, AGE, SQ FEET, RETAXES, ASSESS, ACRE, and GARAGE. Therefore, to analyze these variables independently and compare their means and standard deviations, a descriptive statistics table was created. Below is a descriptive statistics table consisting of all the continuous variables:

Using the table above, we can see that the average price of a house (highlighted above) in Chester county is $555820٫94. In addition, the standard deviation of price is 308037٫41, which is the largest out of all the data meaning that there is no consistency in the data and that it is spread also, that's mean the scores will fall in the range between 555820+308037 or 555820-308037. Furthermore, the average number of bedrooms in the houses was 3.91.

However, there are some of missing and bad variables such as the data in RETAXES and SQ FEET; I used the Pivot table to get the average based on the Beds to fill out the missing values

Descriptive Statistics Table:

Correlation Table:

Below is a correlation table that includes all variables (categorical and continuous) in order to see if there are correlations in the data

· There is a strong positive correlation between ASSESS and RETAXES (0.83).

and that’s means whenever the number of assess increases the retaxes increases either.

· There is a negative correlation between BATH and AGE (-0.24) as well as between ½ BATH and AGE (-0.28)

· As it shown in the table there are negative correlation between SQ FOOT, RETAXES, ASSESS GARAGE and AGE.

Categorical variables

· The negative correlation is between Type, Style and school district and between Type and Design.