Skip to content
Discussion options

You must be logged in to vote

Since it applies attention over all the hidden representations in the document, there is no theoretical limit to the input/doc size. Of course there are practical limits (e.g., you'll want to adjust the batch size to the amount of GPU memory available if you run on the GPU).

Replies: 2 comments 9 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
9 replies
@vsocrates
Comment options

@danieldk
Comment options

@vsocrates
Comment options

@danieldk
Comment options

Answer selected by vsocrates
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / textcat Feature: Text Classifier feat / transformer Feature: Transformer
4 participants