Where are the attribute_ruler patterns defined for each language? #8508
-
The different language models have an |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Right now the pretrained pipelines are the main reference. You can see the patterns using: nlp.get_pipe("attribute_ruler").patterns The You can also always source an existing attribute ruler into a new pipeline and all the patterns will be copied with it. We've considered having a separate repo where some of the pipeline-specific settings like this are stored, so it would be easy to refer to them in new configs, but this doesn't exist yet. |
Beta Was this translation helpful? Give feedback.
-
Thanks for the answer! Yeah, it'd be cool if the pipeline-specific settings are open source, in case people want to contribute to them. This came just out of curiosity, as I was traversing the code, trying to understand different parts. Related, are the commands used to create the different language models publicly available somewhere? I mean the training part, when they take parts from other pipelines, packaging, etc. |
Beta Was this translation helpful? Give feedback.
Right now the pretrained pipelines are the main reference. You can see the patterns using:
The
patterns
file is just this same data inmsgpack
format, which you can read directly withsrsly.read_msgpack
(or other msgpack libraries) if you'd prefer.You can also always source an existing attribute ruler into a new pipeline and all the patterns will be copied with it.
We've considered having a separate repo where some of the pipeline-specific settings like this are stored, so it would be easy to refer to them in new configs, but this doesn't exist yet.