Measuring inference time in spaCy pipeline #8238
-
Is there anything out-of-the-box in spaCy that helps measure performance in terms of time taken by each component in the pipeline, words processed per second, or something similar? I am looking to process a large number of documents and would like to gather some processing-time metrics, both for benchmarking and for identifying potential bottleneck components. I am aware of the benchmarking mentioned here and the associated benchmarking project here, but I am looking for something that helps measure timings for each component in the pipeline, not the entire pipeline.
Replies: 1 comment
-
There's nothing out-of-the-box, no. I would recommend that you try either using the Python debugger or wrapping components in some kind of timer. Since you can get the components from the language pipeline, and since they're executed just using their `__call__` method at inference time, you should be able to wrap them in a timer function. Something like this:
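(A rough sketch, assuming the `en_core_web_sm` model is installed; the `timer` implementation below is just one possible wrapper based on `time.perf_counter`, not a spaCy API.)

```python
import time

import spacy

def timer(name, component):
    # Wrap a pipeline component so each call is timed, passing the Doc
    # through and returning the component's result unchanged.
    def wrapped(doc):
        start = time.perf_counter()
        result = component(doc)
        print(f"{name}: {time.perf_counter() - start:.4f}s")
        return result
    return wrapped

nlp = spacy.load("en_core_web_sm")  # assumes this model is installed

# Tokenize once with make_doc, then run each component's __call__ with timing.
doc = nlp.make_doc("Apple is looking at buying U.K. startup for $1 billion.")
for name, component in nlp.pipeline:
    doc = timer(name, component)(doc)
```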
Where `timer` is some kind of function that times calls while passing through arguments and return values.