Machine Learning Master Class Project: Final Project

Team Members

  • Hugo Dutra
  • Zuzanna Rybok
  • Mateusz Janowski

Overview

This project implements advanced machine learning techniques, including Generalized Additive Models (GAMs), Decision Trees, Random Forests, Support Vector Machines (SVMs), Principal Component Analysis (PCA), and Reinforcement Learning with Q-Learning. The focus is on combining feature selection and dimensionality reduction with these methods to build predictive models for vegetation classification.

Table of Contents

  1. Project Description
  2. Datasets
  3. Installation
  4. Usage
  5. Methods
  6. Project Structure

Project Description

The project explores various machine learning approaches for vegetation classification. It implements and compares different feature selection techniques, dimensionality reduction methods, and classification algorithms to establish a comprehensive methodology for environmental data analysis.

Datasets

  • The project works with three primary datasets: df_1, df_2, and df_4
  • Target variable: Vegetation_Type
  • Features include topographic data and soil characteristics
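
A minimal loading sketch, assuming the datasets live in hypothetical CSV files named after the notebook's df_1/df_2/df_4 variables; the real paths and file formats may differ.

import pandas as pd

df_1 = pd.read_csv("data/df_1.csv")          # hypothetical path
X = df_1.drop(columns=["Vegetation_Type"])   # topographic and soil features
y = df_1["Vegetation_Type"]                  # classification target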

Installation

Required Python packages:

pip install pandas numpy scikit-learn scipy matplotlib seaborn

Usage

To run the project:

git clone [repository-url]
cd [project-directory]
jupyter notebook Project2.ipynb

Methods

Feature Selection Techniques

Reinforcement Learning Approach

  • Q-Learning implementation for feature selection
  • Two versions of Q-Learning agents:
    • Baseline agent with standard Q-table
    • Improved agent with feature importance integration
  • Epsilon-greedy exploration strategy
  • Custom reward function based on model performance
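
A minimal sketch of the Q-learning selection loop described above, assuming X is a NumPy feature array and y the Vegetation_Type labels. The state/action design, reward function, and hyperparameters here are illustrative, not the notebook's exact agent.

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

def q_learning_feature_selection(X, y, n_episodes=50, alpha=0.1, gamma=0.9, epsilon=0.2):
    n_features = X.shape[1]
    # One row per feature; actions: 0 = skip, 1 = include.
    Q = np.zeros((n_features, 2))
    best_subset, best_score = None, -np.inf
    for _ in range(n_episodes):
        subset = []
        for f in range(n_features):
            # Epsilon-greedy action selection.
            if np.random.rand() < epsilon:
                action = np.random.randint(2)
            else:
                action = int(np.argmax(Q[f]))
            if action == 1:
                subset.append(f)
        if not subset:
            continue
        # Reward: cross-validated accuracy of the selected feature subset.
        score = cross_val_score(RandomForestClassifier(n_estimators=50),
                                X[:, subset], y, cv=3).mean()
        # Update Q-values for the actions taken in this episode.
        for f in range(n_features):
            action = 1 if f in subset else 0
            Q[f, action] += alpha * (score + gamma * Q[f].max() - Q[f, action])
        if score > best_score:
            best_score, best_subset = score, subset
    return best_subset, best_score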

Traditional Feature Selection

  • Random Forest feature importance analysis
  • Correlation analysis
  • Feature ranking based on importance scores
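
A short sketch of the traditional selection steps, assuming a pandas DataFrame X of numeric features and a Series y of Vegetation_Type labels; the estimator settings are assumptions.

import pandas as pd
from sklearn.ensemble import RandomForestClassifier

def rank_features(X: pd.DataFrame, y: pd.Series, top_k: int = 10):
    # Random Forest feature importance analysis.
    rf = RandomForestClassifier(n_estimators=200, random_state=42).fit(X, y)
    importances = pd.Series(rf.feature_importances_, index=X.columns)
    ranked = importances.sort_values(ascending=False)
    # Correlation analysis among the features.
    corr = X.corr()
    return ranked.head(top_k), corr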

Classification Models

Linear Models

  • Generalized Additive Models (GAMs)
  • Lasso Classifier
  • ElasticNet Classifier
  • Logistic Regression with CV
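
A hedged sketch of the penalized linear classifiers: scikit-learn's Lasso and ElasticNet are regressors, so the classification analogue shown here is logistic regression with L1 and elastic-net penalties, tuned by cross-validation. The GAMs would typically come from a separate library (e.g. pyGAM) and are not shown.

from sklearn.linear_model import LogisticRegressionCV
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Lasso-style (L1) penalty, regularization strength chosen by CV.
lasso_clf = make_pipeline(
    StandardScaler(),
    LogisticRegressionCV(penalty="l1", solver="saga", max_iter=5000, cv=5),
)

# ElasticNet-style penalty with a small grid of L1 ratios.
enet_clf = make_pipeline(
    StandardScaler(),
    LogisticRegressionCV(penalty="elasticnet", solver="saga",
                         l1_ratios=[0.2, 0.5, 0.8], max_iter=5000, cv=5),
)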

Tree-based Models

  • Decision Trees
  • Random Forest Classifier
    • Hyperparameter tuning
    • Feature importance analysis
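
A sketch of the Random Forest hyperparameter tuning step; the grid below is illustrative, not the exact search space used in the notebook.

from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

param_grid = {
    "n_estimators": [100, 300],
    "max_depth": [None, 10, 20],
    "min_samples_leaf": [1, 5],
}
rf_search = GridSearchCV(RandomForestClassifier(random_state=42),
                         param_grid, cv=5, scoring="accuracy")
# rf_search.fit(X_train, y_train)
# importances = rf_search.best_estimator_.feature_importances_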

Support Vector Machines

  • RBF kernel implementation
  • Grid search for hyperparameter optimization
  • Cross-validation strategy
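
A sketch of the RBF-kernel SVM with grid search and cross-validation; the C and gamma ranges are assumptions.

from sklearn.svm import SVC
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import GridSearchCV

svm_pipe = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
svm_grid = GridSearchCV(
    svm_pipe,
    {"svc__C": [0.1, 1, 10, 100], "svc__gamma": ["scale", 0.01, 0.1]},
    cv=5,
)
# svm_grid.fit(X_train, y_train)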

Dimensionality Reduction

Principal Component Analysis (PCA)

  • Two implementations:
    • 18-feature dataset
    • 55-feature dataset
  • Variance retention analysis
  • Component selection methodology
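
A sketch of the variance-retention analysis used to pick the number of components; the 0.95 threshold is an illustrative assumption and applies equally to the 18- and 55-feature datasets.

import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

def pca_components_for_variance(X, threshold=0.95):
    # Standardize, fit PCA, and find the smallest number of components
    # whose cumulative explained variance reaches the threshold.
    X_scaled = StandardScaler().fit_transform(X)
    pca = PCA().fit(X_scaled)
    cumulative = np.cumsum(pca.explained_variance_ratio_)
    n_components = int(np.argmax(cumulative >= threshold)) + 1
    return n_components, cumulative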

Project Structure

  • GAMs
  • Decision Trees
  • Random Forest
  • PCA
  • Reinforcement Learning

Development Workflow

  1. Data preprocessing and feature engineering
  2. Implementation of feature selection methods
  3. Model development and training
  4. Dimensionality reduction analysis
  5. Model evaluation and comparison
  6. Documentation and code organization
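
A small sketch of the evaluation-and-comparison step (item 5 above), assuming X and y are the preprocessed feature matrix and Vegetation_Type labels; the models and split settings are illustrative.

from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y)
for name, model in [("Decision Tree", DecisionTreeClassifier(random_state=42)),
                    ("Random Forest", RandomForestClassifier(random_state=42))]:
    model.fit(X_train, y_train)
    print(name, accuracy_score(y_test, model.predict(X_test)))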

Contributing

To contribute to this project:

  1. Fork the repository
  2. Create a feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a Pull Request

License

MIT