This repository contains a PyTorch implementation of a U-Net architecture for vehicle segmentation using the Carvana Image Masking Challenge dataset.
The Carvana Image Masking Challenge involves automatically identifying the boundaries of cars in images. This project uses a U-Net architecture to generate high-quality segmentation masks that separate vehicles from their backgrounds.
The dataset is from the Carvana Image Masking Challenge on Kaggle. It contains pairs of car images and their corresponding segmentation masks.
├── data/
│   ├── train/            # Training images
│   ├── train_masks/      # Training masks
│   └── processed/        # Processed and split data
│       ├── train_img_/   # Training split images
│       ├── train_mask_/  # Training split masks
│       ├── val_img_/     # Validation split images
│       └── val_mask_/    # Validation split masks
├── outputs/              # Model predictions
├── data_preprocess.py    # Data preprocessing script
├── dataset.py            # Dataset loading utilities
├── model_unet.py         # U-Net model implementation
├── train.py              # Training script
├── utils.py              # Utility functions
└── README.md             # Project documentation
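dataset.py pairs each training image with its mask. A minimal sketch of how that loading could look (the class name, mask-file suffix, and albumentations-style transform interface are assumptions, not the exact contents of the script):

```python
import os
import numpy as np
from PIL import Image
from torch.utils.data import Dataset

class CarvanaDataset(Dataset):
    """Loads (image, mask) pairs from parallel directories."""

    def __init__(self, image_dir, mask_dir, transform=None):
        self.image_dir = image_dir
        self.mask_dir = mask_dir
        self.transform = transform
        self.images = sorted(os.listdir(image_dir))

    def __len__(self):
        return len(self.images)

    def __getitem__(self, index):
        img_path = os.path.join(self.image_dir, self.images[index])
        # Carvana masks typically share the image name with a "_mask.gif" suffix (assumption).
        mask_path = os.path.join(
            self.mask_dir, self.images[index].replace(".jpg", "_mask.gif")
        )
        image = np.array(Image.open(img_path).convert("RGB"))
        mask = np.array(Image.open(mask_path).convert("L"), dtype=np.float32)
        mask[mask == 255.0] = 1.0  # binary mask: 1 = car, 0 = background

        if self.transform is not None:
            augmented = self.transform(image=image, mask=mask)
            image, mask = augmented["image"], augmented["mask"]

        return image, mask
```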
Split the dataset into training and validation sets:
python data_preprocess.py

Train the U-Net model:

python train.py

Hyperparameters such as the learning rate, number of epochs, batch size, and image height and width can be modified in train.py.
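These values typically live as module-level constants near the top of train.py. The exact names and defaults below are illustrative assumptions:

```python
import torch

# Hypothetical hyperparameter block in train.py; names and values are illustrative.
LEARNING_RATE = 1e-4
BATCH_SIZE = 16
NUM_EPOCHS = 10
IMAGE_HEIGHT = 160   # original Carvana images are much larger; resized for speed
IMAGE_WIDTH = 240
DEVICE = "cuda" if torch.cuda.is_available() else "cpu"
```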
The U-Net architecture implemented in this project consists of:
- Encoder path (contracting): Series of double convolution blocks followed by max pooling
- Bottleneck: Double convolution at the bottom
- Decoder path (expanding): Series of up-convolutions and concatenations with skip connections
- Final 1x1 convolution to map to output segmentation
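A condensed sketch of a U-Net with these components (feature sizes, module names, and the use of batch norm are assumptions; see model_unet.py for the actual implementation):

```python
import torch
import torch.nn as nn
import torchvision.transforms.functional as TF

class DoubleConv(nn.Module):
    """Two 3x3 convolutions, each followed by batch norm and ReLU."""
    def __init__(self, in_channels, out_channels):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_channels, out_channels, 3, 1, 1, bias=False),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_channels, out_channels, 3, 1, 1, bias=False),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.conv(x)

class UNet(nn.Module):
    def __init__(self, in_channels=3, out_channels=1, features=(64, 128, 256, 512)):
        super().__init__()
        self.downs = nn.ModuleList()
        self.ups = nn.ModuleList()
        self.pool = nn.MaxPool2d(2, 2)

        # Encoder (contracting path): double conv blocks followed by max pooling
        for feature in features:
            self.downs.append(DoubleConv(in_channels, feature))
            in_channels = feature

        # Bottleneck: double convolution at the bottom
        self.bottleneck = DoubleConv(features[-1], features[-1] * 2)

        # Decoder (expanding path): up-convolution, then double conv after concatenation
        for feature in reversed(features):
            self.ups.append(nn.ConvTranspose2d(feature * 2, feature, 2, 2))
            self.ups.append(DoubleConv(feature * 2, feature))

        # Final 1x1 convolution to the output segmentation map
        self.final_conv = nn.Conv2d(features[0], out_channels, kernel_size=1)

    def forward(self, x):
        skip_connections = []
        for down in self.downs:
            x = down(x)
            skip_connections.append(x)
            x = self.pool(x)

        x = self.bottleneck(x)
        skip_connections = skip_connections[::-1]

        for idx in range(0, len(self.ups), 2):
            x = self.ups[idx](x)                  # up-convolution
            skip = skip_connections[idx // 2]
            if x.shape != skip.shape:             # handle inputs with odd spatial sizes
                x = TF.resize(x, size=skip.shape[2:])
            x = self.ups[idx + 1](torch.cat((skip, x), dim=1))

        return self.final_conv(x)
```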
Model performance is evaluated using:
- Pixel-wise accuracy
- Dice coefficient (F1 score)
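A minimal sketch of how these two metrics are commonly computed over a validation loader (the function name and the binary-segmentation threshold of 0.5 are assumptions):

```python
import torch

@torch.no_grad()
def check_accuracy(loader, model, device="cuda"):
    """Computes pixel-wise accuracy and Dice coefficient over a data loader."""
    num_correct, num_pixels, dice_score = 0, 0, 0.0
    model.eval()

    for images, masks in loader:
        images = images.to(device)
        masks = masks.to(device).unsqueeze(1)            # (N, 1, H, W)
        preds = (torch.sigmoid(model(images)) > 0.5).float()

        num_correct += (preds == masks).sum()
        num_pixels += torch.numel(preds)
        # Dice = 2|A ∩ B| / (|A| + |B|); small epsilon avoids division by zero
        dice_score += (2 * (preds * masks).sum()) / ((preds + masks).sum() + 1e-8)

    model.train()
    print(f"Pixel accuracy: {num_correct / num_pixels:.4f}")
    print(f"Dice score: {dice_score / len(loader):.4f}")
```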
The model outputs predicted masks in the outputs/ directory. Each prediction includes a timestamp for tracking experiments.
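One way the timestamped outputs could be written (the exact naming scheme used by utils.py is an assumption):

```python
import os
import time
import torch
import torchvision

@torch.no_grad()
def save_predictions(loader, model, folder="outputs/", device="cuda"):
    """Writes predicted masks to disk with a timestamp in each filename."""
    os.makedirs(folder, exist_ok=True)
    timestamp = time.strftime("%Y%m%d-%H%M%S")
    model.eval()
    for idx, (images, _) in enumerate(loader):
        images = images.to(device)
        preds = (torch.sigmoid(model(images)) > 0.5).float()
        torchvision.utils.save_image(preds, f"{folder}/pred_{timestamp}_{idx}.png")
    model.train()
```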
PyTorch, torchvision, albumentations, OpenCV, scikit-image, PIL, tqdm
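Assuming the standard PyPI package names, the dependencies can be installed with:

pip install torch torchvision albumentations opencv-python scikit-image Pillow tqdm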