Hi,
I have been trying your script for learning purposes, and it is very useful. I would like to raise a few points and requests.
The original data has around 20532 columns, including Cancer_type, so you must have done some preprocessing to produce your .csv input file. If possible, could you please upload that .csv input to GitHub so others can use it easily, or provide steps or instructions for building such a dataset?
The major issue is that plot_confusion_matrix is missing from recent scikit-learn versions; I am using 1.5.2.
There are not many details about package versions on your page. It would be good practice to list the package versions or give installation instructions.
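For example, something like the sketch below could be run once in your environment and the output pasted into the README. The package list here is only my guess at what the script needs, so please adjust it.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Return a dict mapping each package name to its installed version."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # Package is not present in the current environment
            found[name] = "not installed"
    return found


# Assumed package list; edit to match the script's actual imports
packages = ["scikit-learn", "pandas", "numpy", "matplotlib", "seaborn"]
for name, ver in installed_versions(packages).items():
    print(f"{name}=={ver}")
```

The printed lines follow the name==version form used in a requirements.txt file, so they can be copied there directly (after dropping any "not installed" entries).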
If possible, please add more description of the ML-related steps, e.g. KNeighbors and so on.
Thank you