
First, note that you can't join Spans or Docs from different languages. That doesn't seem to be an issue here, but I want to be clear.

What you can do is convert your Spans to Docs with Span.as_doc and then combine the Docs with Doc.from_docs. If you run into speed problems, check the documentation of those functions for tips.
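A minimal sketch of that approach, assuming spaCy v3 and a blank English pipeline (tokenizer only, so no trained model is required). The example sentences and variable names are made up for illustration:

```python
import spacy
from spacy.tokens import Doc

# Assumption: a blank English pipeline stands in for your real pipeline.
nlp = spacy.blank("en")

doc_a = nlp("Berlin is a city in Germany.")
doc_b = nlp("Paris is a city in France.")

# Hypothetical spans you want to join; both come from the same language/vocab.
span_a = doc_a[0:4]  # tokens: Berlin, is, a, city
span_b = doc_b[0:4]  # tokens: Paris, is, a, city

# Convert each Span into a standalone Doc, then concatenate the Docs.
docs = [span_a.as_doc(), span_b.as_doc()]
merged = Doc.from_docs(docs, ensure_whitespace=True)

tokens = [t.text for t in merged]
print(tokens)
```

Because both Docs were created by the same `nlp` object, they share a vocab, which `Doc.from_docs` requires.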

That said, I'm not sure the above has much advantage over just joining the strings and running the pipeline again, provided your language detection pipeline is configured to run only the components it needs. See the speed FAQ #8402 for notes about disabling components you aren't using.
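For comparison, here is a rough sketch of the string-joining alternative, again assuming spaCy v3. The blank pipeline with a `sentencizer` is only a stand-in for a real pipeline, and which components you can safely disable depends on your own setup:

```python
import spacy

# Assumption: a blank pipeline plus a sentencizer stands in for your pipeline.
nlp = spacy.blank("en")
nlp.add_pipe("sentencizer")

doc_a = nlp("Berlin is a city in Germany.")
doc_b = nlp("Paris is a city in France.")
spans = [doc_a[0:4], doc_b[0:4]]  # hypothetical spans to combine

# Join the span texts into one string instead of merging Doc objects.
joined_text = " ".join(span.text for span in spans)

# Temporarily disable components that are not needed for this pass.
with nlp.select_pipes(disable=["sentencizer"]):
    active_pipes = list(nlp.pipe_names)
    rejoined = nlp(joined_text)

rejoined_tokens = [t.text for t in rejoined]
print(joined_text)
print(rejoined_tokens)
```

Note that re-running the pipeline produces fresh annotations for the joined text, whereas `Doc.from_docs` carries over the annotations already on each Doc.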

Answer selected by riZZZhik
Labels
feat / doc Feature: Doc, Span and Token objects