Skip to content
Discussion options

You must be logged in to vote

nlp.pipe() returns the documents in the same order as the input texts.

as_tuples is just to pair some external context with each returned doc, so if you only have input texts, you shouldn't need to use it. (The as_tuples option isn't really needed as of v3.2 because you can pass docs with custom attributes to the pipeline instead.)

Unless you have really limited RAM or really long texts, 100 is a pretty small batch size for en_core_web_lg. It might well be faster to have larger batch sizes and fewer processes.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@svonava
Comment options

@adrianeboyd
Comment options

@svonava
Comment options

Answer selected by adrianeboyd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / pipeline Feature: Processing pipeline and components
2 participants
Converted from issue

This discussion was converted from issue #10353 on February 22, 2022 16:22.