Predicting user's demographic information and personality through their browsing history
Data pre-processing:
- all_user_csv_out_2.py (browsing history to web categories)
- update_all_user_v3.py (merge some web categories)
- user_daily_v4.py (output users feature with categories ratio)
- user_daily_v5.py (output users feature with time session frequency)
Demographic information prediction:
- supervise_demo_KNN.py (predicting user's demographic information in k-NN)
- supervise_demo_RF.py (predicting user's demographic information in random forests)
- supervise_demo_LR.py (predicting user's demographic information in logistic regression)
- supervise_demo_SVM.py (predicting user's demographic information in SVM)
- kms_demo_KNN.py (predicting user's demographic information in clustering with k-NN)
- kms_demo_RF.py (predicting user's demographic information in clustering with random forests)
- kms_demo_LR.py (predicting user's demographic information in clustering with logistic regression)
- kms_demo_SVM.py (predicting user's demographic information in clustering with SVM)
Big-six personality prediction:
- supervise_pr_SVM.py (predicting user's big-six personality in SVR)
- supervise_pr_Lasso.py (predicting user's big-six personality in Lasso regression)
- supervise_pr_Ridge.py (predicting user's big-six personality in Ridge regression)
- supervise_pr_EN.py (predicting user's big-six personality in Elastic net regression)
- kms_pr_pred.py (predicting user's big-six personality in clustering with some regression models)