run_pipeline.sh only works if the trained embeddings cover every word present in the dataset (i.e. no minimum number of occurrences is required for a word to get an embedding).
If the threshold is higher than 1, the pipeline crashes because the script that replaces the removed words with tags has not been called.
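
For illustration only, the substitution step described above could look roughly like the sketch below. It is not the repository's actual script: the function name `replace_rare_words`, the `min_count` parameter, and the `<UNK>` tag are all assumptions made for this example. Words that occur fewer than `min_count` times are replaced with the tag, so every remaining token would have an embedding.

```python
from collections import Counter


def replace_rare_words(sentences, min_count, tag="<UNK>"):
    """Replace words occurring fewer than min_count times with a tag.

    Hypothetical helper: the name, signature, and "<UNK>" tag are
    assumptions for illustration, not taken from the pipeline itself.
    """
    # Count occurrences of every word across the whole dataset.
    counts = Counter(word for sentence in sentences for word in sentence)
    # Keep frequent words; substitute the tag for the rare ones.
    return [
        [word if counts[word] >= min_count else tag for word in sentence]
        for sentence in sentences
    ]


if __name__ == "__main__":
    sentences = [["the", "cat", "sat"], ["the", "dog", "sat"]]
    print(replace_rare_words(sentences, min_count=2))
    # prints [['the', '<UNK>', 'sat'], ['the', '<UNK>', 'sat']]
```

With `min_count=1` the data is returned unchanged, which corresponds to the only setting in which run_pipeline.sh currently works.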