Assignment: Dataset Creation and Augmentation

1. Collect Two Suitable Videos

Capture two short videos using your camera or smartphone.
These videos will be used to create a training dataset for a binary classification model.
One video should contain positive examples, the other negative examples.
Move the camera while recording so that every frame is slightly different.
A few seconds of video are enough (aim for 100+ frames total).
Example:
- Positive samples video: Camera circling around an apple (object) on a desk (background).
- Negative samples video: Camera circling around an orange (object) on the same desk (background).

Randomly split the images into two groups:
- Training (~80%)
- Validation (~20%)
Store them in separate folders for clarity.
Verify balance between positive and negative samples by visually checking a few images.

Study possible augmentations in the Albumentations library.
Select reasonable augmentations that reflect real-world variations in your dataset.
Think carefully about transformations that make sense for your task (e.g., flipping, color changes, noise).

Your final report should include:

Dataset goal description
- Define what counts as positive and negative samples.
- Explain the purpose of your binary classification task.
Video description
- Provide details such as number of frames, resolution, objects, and background.
Preview of images
- Include thumbnails showing the diversity of both training and validation subsets.
Chosen augmentations
- List selected augmentations with before/after image examples.
- Choose at least 5 augmentations.
Conclusion
- Explain why you chose certain augmentations and why you excluded others.

Example workflow notebook: dataset_creation.ipynb
Think carefully about the goal of your binary classification.
While selecting augmentations, ask yourself questions like:
- Can the object realistically be flipped upside down or mirrored?
- Can the object naturally vary in color?
- Could there be lighting or brightness changes in real-world conditions?