update_model #9846
-
Hello, I'm trying to train a new language model starting from data coming from other language models i.e. macedonian [nlp]
lang = "mk"
pipeline = ["tok2vec","tagger","morphologizer"]
batch_size = 1000
disabled = []
before_creation = null
after_creation = null
after_pipeline_creation = null
tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
[components]
[components.morphologizer]
source = "mk_core_news_md"
component = "morphologizer"
[components.morphologizer.model]
@architectures = "spacy.Tagger.v1"
nO = null
[components.morphologizer.model.tok2vec]
@architectures = "spacy.Tok2VecListener.v1"
width = ${components.tok2vec.model.encode.width}
upstream = "*" But during training the scores remain stable (don't update):
what could be wrong? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 8 replies
-
When sourcing components, leave out all the other settings, so just: [components.morphologizer]
source = "mk_core_news_md"
[components.tok2vec]
source = "mk_core_news_md" Since the [components.morphologizer]
source = "mk_core_news_md"
replace_listeners = ["model.tok2vec"] Our Macedonian training data doesn't contain |
Beta Was this translation helpful? Give feedback.
-
Thanks for the answer!! I followed your suggestion, but I got this error: KeyError: "[E944] Can't copy pipeline component 'tok2vec' from source 'mk_core_news_md': not found in pipeline. Available components: morphologizer, parser, senter, attribute_ruler, lemmatizer, ner" aha I see, ok even if that's so, I can still train PoS prediction based on that information; actually the idea is to get a baseline for the target language based on a similar language |
Beta Was this translation helpful? Give feedback.
Thanks for the answer!! I followed your suggestion, but I got this error:
KeyError: "[E944] Can't copy pipeline component 'tok2vec' from source 'mk_core_news_md': not found in pipeline. Available components: morphologizer, parser, senter, attribute_ruler, lemmatizer, ner"
aha I see, ok even if that's so, I can still train PoS prediction based on that information; actually the idea is to get a baseline for the target language based on a similar language