Merge pull request #9 from MaikBastian/readme

MaikBastian · web-flow · commit f178d09db4fe · 2025-11-14T11:43:18.000+01:00
Improve readme with clearer description and training arguments of existing models
diff --git a/README.md b/README.md
@@ -206,7 +206,7 @@ Following are our training results from a DGX-1 with 2 GPUs on the models with l
 
 ## Extended models trained for recognition of ACA and rotor ciphers
 
-The models are trained on variable length ciphertexts in between 100 and 1000 characters. This was done to improve the recognition of the models towards rotor ciphers. For resonable recognitions of rotor ciphers longer ciphertexts are needed.
+The extended models are trained with ciphertexts of variable length between 100 and 1000 characters. Using variable-length ciphertexts helps the recognition of rotor ciphers, as longer ciphertexts include more distinct features.
 
 | Model Name                    | Accuracy in % | Iterations in Mio. |
 | :---------------------------- | :-----------: | :----------------: |
@@ -216,13 +216,35 @@ The models are trained on variable length ciphertexts in between 100 and 1000 ch
 | nb_var_10000000               |     53.50     |         10         |
 | ffnn_var_10000000             |     72.98     |         10         |
 
-These models are always part of an ensemble model with a SVM trained only on rotor ciphers. When the main models recognize rotor ciphers, the SVM is used to differentiate
-between the rotor ciphers. This helps with the results since the original models can differentiate between ACA and rotor ciphers but are bad at differentiating rotor ciphers from each other.
+These models are always part of an ensemble model augmented by an SVM that is trained only on rotor ciphers. When the main models recognize a rotor cipher, the SVM is used to differentiate between the five types of rotor ciphers. This improves the results, as the original models are quite good at distinguishing between ACA and rotor ciphers but have trouble differentiating the rotor ciphers from each other.
 
 | Model Name                    | Accuracy in % | Iterations in Mio. | Training Time |
 | :---------------------------- | :-----------: | :----------------: | :-----------: |
 | svm_rotor_only_1000_16000     |     61.50     |       0.016        |  0d 01h 01m   |
 
+## Arguments used for training
+
+The following arguments were used to train these models:
+
+```
+python train.py --architecture=FFNN \
+    --download_dataset=False \
+    --plaintext_input_directory=../data/gutenberg_en \
+    --rotor_input_directory=../data/rotor_ciphertexts \
+    --train_dataset_size=976 \
+    --dataset_workers=16 \
+    --batch_size=64 \
+    --max_iter=10000000 \
+    --min_train_len=100 \
+    --max_train_len=1000 \
+    --min_test_len=100 \
+    --max_test_len=1000 \
+    --epochs=1 \
+    --ciphers=all
+```
+
+where the argument `FFNN` can be replaced by the actual architecture to train. Note that `max_iter` is set to `1000000` when the RF model is trained.
+
 
 # Publications