This project aims to reduce bias, promote fairness, and highlight underrepresentation in the maritime industry by developing gender identification models using TensorFlow. By analyzing maritime-related imagery, the project addresses the gender disparity and brings attention to the limited visibility of women in this traditionally male-dominated sector. Through the innovative use of computer vision technology, the goal is to foster a more equitable and inclusive representation of female professionals in maritime.
Dr. Padmapriya Jayaraman
Dr. T. Sasilatha
-
Dynamic Hyperparameter Tuning
- Uses Keras Tuner to identify optimal hyperparameter configurations for model training. This allows the model to achieve better accuracy by exploring a wide range of possible configurations efficiently.
-
Transfer Learning
- Leverages MobileNetV2 for efficient base feature extraction, reducing the time and computational resources needed for training while maintaining high performance.
-
Augmented Training
- Applies data augmentation techniques like rotation, zoom, brightness adjustments, and flips to enhance model generalization and ensure robustness against real-world variations.
-
Regularization Techniques
- Implements advanced techniques such as dropout and L2 regularization to prevent overfitting and maintain model generalization across diverse datasets.
-
False Image Analysis
- Stores misclassified images in designated directories for further analysis, enabling developers to identify patterns in errors and refine the model for improved accuracy.
-
Data Diversity
- Ensures datasets are representative of the diverse maritime workforce, incorporating images from various ethnic backgrounds and industrial contexts to minimize biases.
Begin by cloning the repository to your local system:
git clone <repository-url>
cd <repository-name>Create and activate a virtual environment to manage project dependencies:
python3 -m venv venv
source venv/bin/activate # For Windows: venv\Scripts\activateInstall the required libraries using the provided requirements file:
pip install -r requirements.txtOrganize your image dataset in the following structure for training and validation:
data/
female/
image1.jpg
image2.jpg
male/
image1.jpg
image2.jpg
The dataset should include a balanced mix of images across genders, with annotations reflecting the diversity of the maritime industry.
Customize the config.yaml file to set key parameters such as image size, learning rate, batch size, and dataset paths. These configurations allow for flexibility and adaptation to different datasets and training environments.
Train the model using the command:
python train.pyThe training process will utilize the specified parameters in config.yaml and include real-time feedback on accuracy and loss metrics.
Evaluate the trained model and analyze the results:
python test.pyThis step provides insights into the model's performance and highlights areas for further refinement.
-
Hyperparameter Optimization
- Dynamically determines the best configurations through Bayesian optimization using Keras Tuner, reducing the manual effort in parameter selection and improving model outcomes.
-
Misclassification Debugging
- Provides a mechanism to identify and analyze false positives and false negatives, offering actionable insights for improving model performance.
-
Extensive Data Augmentation
- Introduces robust variability to the dataset through transformations such as rotations, zooms, flips, and brightness adjustments, ensuring better model resilience.
-
Cross-Domain Applicability
- The methodologies employed in this project can be adapted to other industries facing similar challenges with underrepresentation and bias.
Track and monitor the training and validation process using TensorBoard. This tool provides real-time visualizations of key metrics:
tensorboard --logdir logs/TensorBoard graphs help identify trends in accuracy, loss, and other performance metrics, enabling data-driven model refinement.
- Python 3.8 or later
- TensorFlow 2.6 or later
- PyYAML for configuration management
- NumPy for numerical computations
- TensorBoard for visualization
- Misclassified images are saved in
false-images/female_falseandfalse-images/male_falsefor detailed analysis and debugging. - The implementation currently supports RGB images; modify the input shape to
(height, width, 1)for grayscale support if required. - Future updates may include support for additional languages and improved preprocessing pipelines to handle edge cases more effectively.
This project is released under the MIT License. Refer to the LICENSE file for more details. By adopting this open-source approach, the project encourages collaboration and innovation within the maritime and technology communities.

