- Create
expsdirectory at project's root - install requirements using
pip install -r requirments.txt
-
The main entrypoint is
main_transformer.py
ChaningEXPvariable to run the desired architecture - the options aretransformerororg
All the visualiztions (training curves, intermediate results and the best result during the training will be underexps/YYYY_MM_DD_HH_mm/ -
Our architecture is in
new_nets.py
Where all transformer related blocks are invit_model.py
To run the code:
python main_transformer.py