subject

Predicting housing median prices. The file BostonHousing. xls contains information on over 500 census tracts in Boston, where for each tract 14 variable values are recorded. The last column (CAT. MEDV) was derived from MEDV, such that it obtains the value 1 if MEDV>30 and 0 otherwise. Consider the goal of predicting and classifying the median value (MEDV and CAT. MEDV) of a tract, given the information in the first 13 columns (input variables) in the column list. Partition the data into training (60%) and validation (40%) sets.
a) Perform a k-nearest neighbors prediction with all 13 predictors (the CAT. MEDV column is the outcome or decision variable), trying values of k from 1 to 10. Make sure to normalize the data (click "normalize input data"). What is the best k chosen? What does it mean?
b) Why is the validation data error overly optimistic compared to the error rate when applying this kNNpredictor to new data?

ansver
Answers: 3

Another question on Computers and Technology

question
Computers and Technology, 23.06.2019 10:30
How would you categorize the software that runs on mobile devices? break down these apps into at least three basic categories and give an example of each.
Answers: 1
question
Computers and Technology, 24.06.2019 00:00
Afashion designer wants to increase awareness about her brand. which network can she use and why she can use the blank to blank her products online. answers for the first blank: internet, extranet, or intranet answers for the second blank: market, design, and export
Answers: 1
question
Computers and Technology, 24.06.2019 16:00
Your is an example of personal information that you should keep private.
Answers: 1
question
Computers and Technology, 24.06.2019 16:30
Which program can damage your computer?
Answers: 1
You know the right answer?
Predicting housing median prices. The file BostonHousing. xls contains information on over 500 cens...
Questions
question
Biology, 15.12.2020 20:20