Can't merge non-disjoint spans. '🇦' is already part of tokens to merge. #53

@SiirRandall

Description

I downloaded and updated all the modules that armchair requires, then imported my chat log using your text import tool, which worked fine. But after I start the bot, I get this:

INFO:ArmchairExpert:Training_Preprocessing_Markov(Import): 25.458330%
Traceback (most recent call last):
  File "armchair_expert.py", line 344, in <module>
    ae.start(retrain_structure=args.retrain_structure, retrain_markov=args.retrain_markov)
  File "armchair_expert.py", line 98, in start
    self.train(retrain_structure=True, retrain_markov=retrain_markov)
  File "armchair_expert.py", line 273, in train
    self._train_markov(retrain_markov)
  File "armchair_expert.py", line 223, in _train_markov
    spacy_preprocessor = self._preprocess_markov_data(all_training_data=retrain)
  File "armchair_expert.py", line 180, in _preprocess_markov_data
    doc = self._nlp(MarkovFilters.filter_input(message[0].decode()))
  File "/opt/rh/rh-python36/root/usr/lib64/python3.6/site-packages/spacy/language.py", line 402, in __call__
    doc = proc(doc, **component_cfg.get(name, {}))
  File "/opt/rh/rh-python36/root/usr/lib/python3.6/site-packages/spacymoji/__init__.py", line 87, in __call__
    retokenizer.merge(span)
  File "_retokenize.pyx", line 56, in spacy.tokens._retokenize.Retokenizer.merge
ValueError: [E102] Can't merge non-disjoint spans. '🇦' is already part of tokens to merge.
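
One possible reading of the error, not confirmed in this thread: spaCy raises E102 when two spans passed to `retokenizer.merge` within the same retokenize block share a token. That could happen if a flag emoji (two regional indicator symbols such as 🇦 plus a second letter) is matched both as a whole and by one of its parts. The sketch below shows, in plain Python with no spaCy dependency, how overlapping candidate spans might be reduced to a disjoint set before merging. The function name `filter_overlapping` and the sample offsets are hypothetical and only for illustration. spaCy ships a helper with similar behavior, `spacy.util.filter_spans`, which may be the simpler choice if the installed version includes it.

```python
from typing import List, Tuple

# A span is a (start, end) pair of token offsets, with end exclusive.
Span = Tuple[int, int]


def filter_overlapping(spans: List[Span]) -> List[Span]:
    """Drop spans that share a token with a longer (or earlier) span.

    Hypothetical helper: prefers longer spans, and on ties the span
    that starts first.
    """
    # Longest spans first; ties broken by the earliest start offset.
    ordered = sorted(spans, key=lambda s: (-(s[1] - s[0]), s[0]))
    kept: List[Span] = []
    seen_tokens = set()
    for start, end in ordered:
        # Skip any span that touches a token already claimed.
        if any(i in seen_tokens for i in range(start, end)):
            continue
        kept.append((start, end))
        seen_tokens.update(range(start, end))
    # Return the surviving spans in document order.
    return sorted(kept)


# Made-up matches: a two-token flag at tokens 3-4, a second match on
# its first regional indicator alone, and an unrelated emoji at token 7.
candidates = [(3, 5), (3, 4), (7, 8)]
disjoint = filter_overlapping(candidates)
```

With the spans filtered this way, each token belongs to at most one span, which is the condition `retokenizer.merge` checks. Whether the overlap here actually comes from spacymoji's matcher or from the imported chat data would still need to be verified against the message that triggers the failure.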
