Kitchen Assistant V2 is a computer vision project for kitchen cookware detection and classification, featuring YOLO-based object detection and temporal classification models.
README.md- Project documentation (English)README_CN.md- Project documentation (Chinese)FINE_TUNING_JOURNEY.md- Model fine-tuning process documentation (English)FINE_TUNING_JOURNEY_CN.md- Model fine-tuning process documentation (Chinese)
yolo_training/kitchenware_detector2/weights/best.pt- Step 1: YOLOv8n object detection model for detecting pans and pots- 497 training images, 2 classes (cooking-pot, frying-pan)
- Performance: 99.9% precision, 100% recall
pan_pot_classifier_temporal.pth- Step 2: MobileNet v2 temporal classification model for cooking state detection- 143 temporal groups (3 frames each), 4 classes (boiling, normal, on_fire, smoking)
- Performance: 93.10% validation accuracy, 92.9% test accuracy
verify_yolo.py- YOLO model verification scriptverify_temporal_on_originals.py- Temporal model verification on original images
YOLO model training results and artifacts.
kitchenware_detector/- First training run results- Training metrics and curves (F1, PR, P, R)
- Confusion matrices
- Training batch visualizations
- Model weights (best.pt, last.pt, epoch checkpoints)
- Configuration (args.yaml)
kitchenware_detector2/- Second training run results- Same structure as kitchenware_detector
- Improved model with more training data
Model verification outputs on original images.
- Marked/annotated images showing detection results
verification_results_on_originals.json- Verification metrics and results- Categories tested: boiling, on_fire, smoking
- YOLO Object Detection - Real-time kitchenware detection using YOLOv8/v11
- Temporal Classification - Enhanced classification using temporal information
- Multi-class Detection - Detects and classifies various kitchen cookware and states
- Comprehensive Training Metrics - Detailed evaluation with F1, Precision, Recall curves
- Verification Pipeline - Automated testing on diverse image sets
See training results in yolo_training/kitchenware_detector/ and yolo_training/kitchenware_detector2/:
- Training curves and metrics
- Confusion matrices
- Validation batch predictions
- Model checkpoints at various epochs
- Model:
yolo_training/kitchenware_detector2/weights/best.pt - Classes: cooking-pot, frying-pan
- Performance: 99.9% precision, 100% recall
- Model:
pan_pot_classifier_temporal.pth - Classes: boiling, normal, on_fire, smoking
- Verification results:
verification_results_on_originals/ - Performance: 93.10% validation accuracy, 92.9% test accuracy
python verify_yolo.pypython verify_temporal_on_originals.pyFor detailed information about the fine-tuning process:
- English: FINE_TUNING_JOURNEY.md
- Chinese: FINE_TUNING_JOURNEY_CN.md
This repository contains:
- ✅ Trained models (PyTorch weights)
- ✅ Training results and metrics
- ✅ Verification scripts and results
- ✅ Documentation (English & Chinese)
Note: Training data, annotation tools (X-AnyLabeling, ChatRex), and utility scripts are not included in version control.
# Core dependencies
pip install -r requirements.txt# Optional: create and activate a virtual environment
python -m venv venv
# Windows
venv\Scripts\activate
# Linux/macOS
source venv/bin/activate
pip install -r requirements.txt# Install PyTorch wheels (ARM64)
pip3 install https://github.com/KumaTea/pytorch-aarch64/releases/download/v2.3.0/torch-2.3.0a0+gitc8f7e6d-cp311-cp311-linux_aarch64.whl
pip3 install https://github.com/KumaTea/pytorch-aarch64/releases/download/v2.3.0/torchvision-0.18.0a0+gitc8f7e6d-cp311-cp311-linux_aarch64.whl
# OpenCV from apt (more reliable on Pi)
sudo apt update
sudo apt install -y python3-opencv
# Install the rest
pip3 install -r requirements.txtSee individual component licenses for details.