Skip to content
Discussion options

You must be logged in to vote

Sorry for the delayed response!

There are many general concerns about ML models leaking sensitive information, so in general we'd suggest trying to find an alternative to training directly on sensitive data.

That said, a technical answer:

For the built-in pipeline components, I think everything should keep working even if you remove all the strings from the string store.

The general setup between a pipeline component and the string store is that a pipeline component can expect strings that it added to the string store to be there in the future, so some components might break if their labels are removed from the string store. As far as I can tell all the built-in components do re-add their…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / training Feature: Training utils, Example, Corpus and converters
2 participants