-
Notifications
You must be signed in to change notification settings - Fork 1
Description
I'm opening this issue because I find the wording of exercise 10.1 unclear, and i'm not sure of the intended solution. The text of the exercise states: "Compute the Euclidean distance matrix on the standardized data for the first three cities"
This phrasing seems to suggest that one should compute the standardized euclidean distances only on the first three cities. In the official solution, however, the distances are computed using the full dataset of cities, and only the first three results of the resulting distance matrix are displayed.
Since standardized Euclidean distances takes into account the context in which the variables are placed (e.g. it is different to consider two provinces in the context of northern Italy compared to all of Italy) these two procedures will generally yield different results.
Could you please clarify which interpretation is intended for this exercise?
If the goal is to standardize using all cities and compute the full distance matrix, perhaps the wording could be revised to something like:
“Compute the Euclidean distance matrix on the standardized data for all cities, and then show the distances for the first three cities in table format"
If instead the goal is to work only with the first three cities, the solution code may need to be updated accordingly.
Also, on a less important note, as this is the english version of the book, i suggest translating the variable name "nomiPrime3" to "namesFirst3"
Thank you in advance for your time.