Caused by: org.apache.spark.api.python.PythonException: 'ValueError: [E109] Component 'tok2vec' could not be run. Did you forget to call initialize()
?'.
#11847
-
How to reproduce the behaviourI'm trying to apply lemmatization on several languages on azure databricks in a dataframe, however, when I use this to do the lemmatiztion:
I keep getting this error (in azure databricks notebook): What does this mean? I can't even access the lemmatized text by doing Also, I'm downloading the language on the same notebook doing this:
It seems like this is a spacy issue? Or am I not loading something on the azure databricks? Your Environment
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
Normally I would expect to see that error if you were using an uninitialized tok2vec, like one that hadn't been trained. How are you creating your Also, I'm not familiar with databricks. Is your code running in a Windows environment there, or is that your local dev environment? Probably not relevant either way, just wanted to check. |
Beta Was this translation helpful? Give feedback.
-
It worked after I installed the spacy languages using the |
Beta Was this translation helpful? Give feedback.
It worked after I installed the spacy languages using the
.whl
on the azure databricks