@@ -1524,10 +1524,10 @@ set the number of neighbors $K$ to 1, 7, 20, and 300.
### Evaluating on the test set
- Now that we have tuned the KNN classifier and set $K =$ {glue:text}`best_k_unique`,
+ Now that we have tuned the K-NN classifier and set $K =$ {glue:text}`best_k_unique`,
we are done building the model and it is time to evaluate the quality of its predictions on the held out
test data, as we did earlier in {numref}`eval-performance-clasfcn2`.
- We first need to retrain the KNN classifier
+ We first need to retrain the K-NN classifier
on the entire training data set using the selected number of neighbors.
Fortunately we do not have to do this ourselves manually; `scikit-learn` does it for
us automatically. To make predictions and assess the estimated accuracy of the best model on the test data, we can use the
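The automatic retraining described above comes from `GridSearchCV`'s default `refit=True` behavior: after cross-validation picks the best number of neighbors, the best model is refit on the full training set, and scoring on the test set uses that refitted model. A minimal illustrative sketch with synthetic data (not the book's own code or dataset):

```python
# Sketch: GridSearchCV with refit=True (the default) retrains the best
# K-NN model on the entire training set automatically. Data is synthetic.
import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

grid = GridSearchCV(KNeighborsClassifier(), {"n_neighbors": [1, 7, 20]}, cv=5)
grid.fit(X_train, y_train)             # refit=True: best K retrained on all of X_train
test_acc = grid.score(X_test, y_test)  # predictions come from the refitted best model
print(grid.best_params_, round(test_acc, 3))
```

Calling `grid.predict` or `grid.score` delegates to `grid.best_estimator_`, so no manual retraining step is needed.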
@@ -1654,7 +1654,7 @@ The overall workflow for performing K-nearest neighbors classification using `sc
In these last two chapters, we focused on the K-nearest neighbors algorithm,
but there are many other methods we could have used to predict a categorical label.
All algorithms have their strengths and weaknesses, and we summarize these for
- the $K$-NN here.
+ the K-NN here.
**Strengths:** K-nearest neighbors classification
@@ -1927,7 +1927,7 @@ In particular, you
2. tune each one using cross-validation, and
3. pick the subset of predictors that gives you the highest cross-validation accuracy.
- Best subset selection is applicable to any classification method ($K$-NN or otherwise).
+ Best subset selection is applicable to any classification method (K-NN or otherwise).
However, it becomes very slow when you have even a moderate
number of predictors to choose from (say, around 10). This is because the number of possible predictor subsets
grows very quickly with the number of predictors, and you have to train the model (itself
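The numbered steps above can be sketched directly: enumerate every non-empty subset of predictors, cross-validate a classifier on each, and keep the best. This is a minimal illustrative sketch with synthetic data and an arbitrary K-NN classifier with $K=7$ (not the book's own code); note the loop visits $2^p - 1$ subsets for $p$ predictors, which is why the procedure slows down quickly.

```python
# Sketch of best subset selection: try every non-empty predictor subset,
# score each with 5-fold cross-validation, and keep the best-scoring one.
from itertools import combinations
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(1)
X = rng.normal(size=(120, 4))            # 4 candidate predictors (2**4 - 1 = 15 subsets)
y = (X[:, 0] - X[:, 2] > 0).astype(int)  # label depends only on columns 0 and 2

best_subset, best_acc = None, -np.inf
for r in range(1, X.shape[1] + 1):
    for subset in combinations(range(X.shape[1]), r):
        cols = list(subset)
        acc = cross_val_score(KNeighborsClassifier(n_neighbors=7),
                              X[:, cols], y, cv=5).mean()
        if acc > best_acc:
            best_subset, best_acc = subset, acc
print(best_subset, round(best_acc, 3))
```

Even in this toy case the model is trained $5 \times 15 = 75$ times; with 10 predictors the subset count alone exceeds a thousand.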