Skip to content
Discussion options

You must be logged in to vote

It's a bit tricky to match up the implementation against the algorithm description on a line-by-line basis, because of the bulk computation.

For each token in the input batch, we precompute the pre-activation hidden layer values for each feature-position it could be in, where the feature-positions are things like "first item of the stack", "first item of the buffer", etc. This precomputation happens before we start stepping through the state.

Once we're stepping through the state, we calculate the feature tokens, and resolve them to the array indices from the precomputed values. We then sum together those items, and apply whatever activation we need to for the result. This produces the hi…

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@j-frei
Comment options

@honnibal
Comment options

@j-frei
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer
2 participants