It would be great if you could customize a dataset to use for training, for instance with CoNLL files (similar to [this](https://huggingface.co/datasets/Rodrigo1771/drugtemist-fasttext-8-ner/tree/main)).