Solving Kaggle's New York City Taxi Trip Duration challenge
The project runs on Python 3.6 and uses the following packages. With the exception of XGBoost, all of them are included in the Anaconda3 distribution:
- Pandas
- Scikit-Learn
- Seaborn
- Matplotlib
- Numpy
- XGBoost
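Before opening the notebook, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the project: the helper name `missing_packages` and the mapping `REQUIRED` are hypothetical, and the import names are the ones these packages are commonly imported under.

```python
import importlib.util

# Hypothetical mapping: package name as listed above -> module name used at import time.
REQUIRED = {
    "Pandas": "pandas",
    "Scikit-Learn": "sklearn",
    "Seaborn": "seaborn",
    "Matplotlib": "matplotlib",
    "Numpy": "numpy",
    "XGBoost": "xgboost",
}


def missing_packages(required=REQUIRED):
    """Return the names of packages whose module cannot be found."""
    return [
        name
        for name, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are available.")
```

If XGBoost is reported as missing, it can usually be installed with `pip install xgboost`.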
The project is implemented as a Jupyter Notebook (report.ipynb), which is also provided with the submission file.
To run the project, download the data from the challenge's website after accepting Kaggle's Terms of Use and Conditions. Once downloaded, the CSV files must be unzipped into the data folder.
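The unzip step can be done by hand or scripted. Below is a minimal sketch that assumes the downloaded archives are `.zip` files sitting in a single directory. The function name `extract_archives` and the `download_dir` argument are hypothetical and not part of the project.

```python
import zipfile
from pathlib import Path


def extract_archives(download_dir, data_dir="data"):
    """Extract every .zip archive found in download_dir into data_dir.

    Returns the list of file names that were extracted.
    """
    target = Path(data_dir)
    target.mkdir(parents=True, exist_ok=True)  # create the data folder if needed

    extracted = []
    for archive in sorted(Path(download_dir).glob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
            extracted.extend(zf.namelist())
    return extracted
```

For example, `extract_archives("downloads")` would unpack all archives from a hypothetical `downloads` directory into `data`.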
The Jupyter Notebook trains each model on the training dataset, predicts the values for the test dataset, and saves the predictions in the out folder. Each file in the out folder corresponds to the predictions of one model.
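The one-file-per-model layout of the out folder can be sketched as follows. This is an illustration only, not the notebook's actual code: the function name `save_predictions`, the file naming scheme (`<model name>.csv`), and the column names `id` and `trip_duration` are assumptions.

```python
import csv
from pathlib import Path


def save_predictions(predictions_by_model, ids, out_dir="out"):
    """Write one CSV file per model into out_dir.

    predictions_by_model maps a model name to its list of predicted values,
    in the same order as ids. Returns the paths of the files written.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)

    written = []
    for model_name, preds in predictions_by_model.items():
        if len(preds) != len(ids):
            raise ValueError(f"{model_name}: expected {len(ids)} predictions, got {len(preds)}")
        target = out_path / f"{model_name}.csv"
        with target.open("w", newline="") as fh:
            writer = csv.writer(fh)
            writer.writerow(["id", "trip_duration"])  # assumed column names
            writer.writerows(zip(ids, preds))
        written.append(target)
    return written
```

Calling `save_predictions({"xgboost": preds_a, "linear": preds_b}, test_ids)` with hypothetical prediction lists would produce `out/xgboost.csv` and `out/linear.csv`.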