Here's a breakdown of how the code works
-
ExplorAnalysis+FeatureCreation.py is the first file for some initial exploratory analysis I also use it to create the engineered features
-
NonTextFeatureClassifiers.py is the file I used for training models on just the engineered features not the actual text itself.
-
TDFVectorClassifiers.py This is classifiers using TDF Vectorization for text, it does not have the engineered features
-
CustomFeatuers+TDF.py this file is has the custom features plus TDF vectorizations but we might need to do feature selectino with this cause it has a lot of features to make it better not sure
5.HyperParemeterTuning.py pretty self explanatory the two model hyper paremeter tuning. Probably could make this easier