StringStore : Adding strings with a "." impacts across all stringstores #5059
-
How to reproduce the behaviour
Info about spaCyRunning on Amazon Linux EC2 instance. Also replicated same issue on a Macbook Pro (anaconda)
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
There are some hard-coded symbols that are always accessible through the
The full list is here: https://github.com/explosion/spaCy/blob/4890db63399d24f088ff6978aa157a0e4672e2eb/spacy/symbols.pxd This is clearly confusing and hopefully at some point it will be possible to treat all of them the same way as any other string and avoid this hard-coded enum, but for now that's why you see The
Whether |
Beta Was this translation helpful? Give feedback.
-
Thanks for the explanation |
Beta Was this translation helpful? Give feedback.
There are some hard-coded symbols that are always accessible through the
StringStore
even though they aren't actually saved in theStringStore
:The full list is here: https://github.com/explosion/spaCy/blob/4890db63399d24f088ff6978aa157a0e4672e2eb/spacy/symbols.pxd
This is clearly confusing and hopefully at some point it will be possible to treat all of them the same way as any other string and avoid this hard-coded enum, but for now that's why you see
root
in everyStringStore
.The
StringStore
doesn't tokenize or analyzeroot.abc.com
in any way, so you should be able to see: