Merge branch 'main' of github.com:SFI-Visual-Intelligence/Collaborative-Coding-Exam into mag-branch

Seilmast · Seilmast · commit 4dcb63f53ee7 · 2025-03-02T11:38:28.000+01:00
diff --git a/README.md b/README.md
@@ -3,19 +3,21 @@
 # Collaborative-Coding-Exam
 Repository for final evaluation in the FYS-8805 Reproducible Research and Collaborative coding course
 
-## Citation
-Several citations can be found under "cite this repository" under the about section. 
-You can also include this in your BibTex file
-```
-@software{Thrun_Collaborative_Coding_Exam_2025,
-author = {Thrun, Solveig and Salomonsen, Christian and Størdal, Magnus and Zavadil, Jan and Mylius-Kroken, Johan},
-month = feb,
-title = {{Collaborative Coding Exam}},
-url = {https://github.com/SFI-Visual-Intelligence/Collaborative-Coding-Exam},
-version = {1.1.0},
-year = {2025}
-}
-```
+## **Table of Contents**  
+1. [Project Description](#project-description)  
+2. [Installation](#installation)  
+3. [Usage](#usage)  
+4. [Results](#results)  
+5. [Citing](#citing)  
+
+## Project Description
+This project involves collaborative work on a digit classification task, where each participant works on distinct but interconnected components within a shared codebase. <br>
+The main goal is to develop and train digit classification models collaboratively, with a focus on leveraging shared resources and learning efficient experimentation practices.
+### Key Aspects of the Project:
+- **Individual and Joint Tasks:** Each participant has separate tasks, such as implementing a digit classification dataset, a neural network model, and an evaluation metric. However, all models and datasets must be compatible, as we can only train and evaluate using partners' models and datasets.
+- **Shared Environment:** Alongside working on our individual tasks, we collaborate on joint tasks like the main file, and training and evaluation loops. Additionally, we utilize a shared Weights and Biases environment for experiment management.
+- **Documentation and Package Management:** To ensure proper documentation and ease of use, we set up Sphinx documentation and made the repository pip-installable
+- **High-Performance Computing:** A key learning objective of this project is to gain experience with running experiments on high-performance computing (HPC) resources. To this end, we trained all models on a cluster
 
 ## Installation
 
@@ -39,9 +41,37 @@ python -c "import CollaborativeCoding"
 
 ## Usage
 
-TODO: Fill in
+To train a classification model using this code, follow these steps:
+
+### 1) Create a Directory for the reuslts
+Before running the training script, ensure the results directory exists:
+
+ `mkdir -p "<RESULTS_DIRECTORY>"`
+
+### 2) Run the following command for training, evaluation and testing
+
+ `python3 main.py --modelname "<MODEL_NAME>" --dataset "<DATASET_NAME>" --metric "<METRIC_1>" "<METRIC_2>" ... "<METRIC_N>" --resultfolder "<RESULTS_DIRECTORY>" --run_name "<RUN_NAME>" --device "<DEVICE>"`
+<br> Replace placeholders with your desired values:
+
+- `<MODEL_NAME>`: You can choose from different models ( `"MagnusModel", "ChristianModel", "SolveigModel", "JanModel", "JohanModel"`).
+
+
+- `<DATASET_NAME>`: The following datasets are supported (`"svhn", "usps_0-6", "usps_7-9", "mnist_0-3", "mnist_4-9"`)
+
+
+- `<METRIC_1> ... <METRIC_N>`: Specify one or more evaluation metrics (`"entropy", "f1", "recall", "precision", "accuracy"`)
+
+
+- `<RESULTS_DIRECTORY>`: Folder where all model outputs, logs, and checkpoints are saved 
+
 
-### Running on a k8s cluster
+- `<RUN_NAME>`: Name for WANDB project
+
+
+- `<DEVICE>`: `"cuda", "cpu", "mps"`
+
+
+## Running on a k8s cluster
 
 In your job manifest, include:
 
@@ -62,14 +92,31 @@ to pull the latest build, or check the [packages](https://github.com/SFI-Visual-
 > The container is build for a `linux/amd64` architecture to properly build Cuda 12. For other architectures please build the docker image locally.
 
 
-# Results 
-## JanModel & MNIST_0-3
+## Results 
+### JanModel & MNIST_0-3
 This section reports the results from using the model "JanModel" and the dataset MNIST_0-3 which contains MNIST digits from 0 to 3 (Four classes total). 
 For this experiment we use all five available metrics, and train for a total of 20 epochs.
 
 We achieve a great fit on the data. Below are the results for the described run:
+
 | Dataset Split | Loss  | Entropy | Accuracy | Precision | Recall | F1    |
 |---------------|-------|---------|----------|-----------|--------|-------|
 | Train         | 0.000 | 0.000   | 1.000    | 1.000     | 1.000  | 1.000 |
 | Validation    | 0.035 | 0.006   | 0.991    | 0.991     | 0.991  | 0.991 |
-| Test          | 0.024 | 0.004   | 0.994    | 0.994     | 0.994  | 0.994 |
+| Test          | 0.024 | 0.004   | 0.994    | 0.994     | 0.994  | 0.994 |
+
+
+### MagnusModel & SVHN 
+The MagnusModel was trained on the SVHN dataset, utilizing all five metrics.   
+Employing micro-averaging for the calculation of F1 score, accuracy, recall, and precision, the model was fine-tuned over 20 epochs.   
+A learning rate of 0.001 and a batch size of 64 were selected to optimize the training process. 
+
+The table below presents the detailed results, showcasing the model's performance across these metrics.
+
+
+| Dataset Split | Loss  | Entropy | Accuracy | Precision | Recall | F1    |
+|---------------|-------|---------|----------|-----------|--------|-------|
+| Train         | 1.007 | 0.998   | 0.686    | 0.686     | 0.686  | 0.686 |
+| Validation    | 1.019 | 0.995   | 0.680    | 0.680     | 0.680  | 0.680 |
+| Test          | 1.196 | 0.985   | 0.634    | 0.634     | 0.634  | 0.634 |
+