Batch Size for NER #8301
Replies: 1 comment 7 replies
-
Great to hear you're seeing such a performance boost when switching to spaCy 3! A 20-point jump is unexpectedly large, though; perhaps your dataset isn't very big? In that case, larger variations are expected. As for the training batch size, these defaults have simply worked well for us when benchmarking our pretrained pipelines on standard evaluation sets. It's always difficult to recommend good default settings in general, as every use case and dataset is different. But you can try out some different runs, varying the batch size in the config, and see whether that makes any difference in your specific project. Feel free to report the results here, as I'm sure others would be interested as well!
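For anyone wanting to try this, the batch size discussed here is controlled by the compounding schedule in the training config. A sketch of the relevant section is below; the values match the defaults mentioned in this thread, but your generated config may differ depending on spaCy version and the options you chose, so check your own `config.cfg` rather than copying this verbatim:

```ini
[training.batcher]
@batchers = "spacy.batch_by_words.v1"
discard_oversize = false
tolerance = 0.2

[training.batcher.size]
# The effective batch size starts at `start` and grows toward `stop`,
# multiplied by `compound` after each batch.
@schedules = "compounding.v1"
start = 100
stop = 1000
compound = 1.001
```

To experiment with smaller batches, lowering `start` and `stop` (say, to 32 and 256) and rerunning `spacy train` should be all that's needed; no code changes are required.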
-
First off, thank you! After changing the architecture from spaCy 2.0 to 3.0, we saw a 20-point increase in F1-score!
My question is about the training batch size for NER: a start of 100 and a stop of 1000 seems large. Why is the default that high, and have you seen success with smaller values?