Skip to content
Discussion options

You must be logged in to vote

Hello everyone,

I have found a workaround for my use case, and as forecasted in the discussion thread (by both @koaning and me), there is no need to pickle a .spacy file, in order to use it as input / output of a component, inside a Vertex AI Pipeline. If the project you are currently using, is the same to access both Vertex AI and Cloud Storage (as in my case), you can access ANY GCS Bucket from Vertex AI, by adding gcs/ first in your search path, as explained here.

I will include a very simple example about how to do it, borrowing the code from this post in Stack Overflow:

import spacy
from spacy.training import Example
from spacy.tokens import DocBin

td = [["Who is Shaka Khan?", {"ent…

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
2 replies
@dave-espinosa
Comment options

@koaning
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by dave-espinosa
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / serialize Feature: Serialization, saving and loading
2 participants