Skip to content
Discussion options

You must be logged in to vote

After removing the first column of token indices, training an NER model from this data seems to work as expected:

============================= Training pipeline =============================
ℹ Pipeline: ['tok2vec', 'ner']
ℹ Initial learn rate: 0.001
E    #       LOSS TOK2VEC  LOSS NER  ENTS_F  ENTS_P  ENTS_R  SCORE 
---  ------  ------------  --------  ------  ------  ------  ------
  0       0          0.00    109.67    1.36    0.74    7.75    0.01
  0     500       2027.59   8985.10   52.63   54.92   50.53    0.53
  0    1000        513.54   6050.69   65.29   66.87   63.78    0.65
  0    1500        676.13   5320.49   68.34   72.32   64.77    0.68
  0    2000       6945.42   5176.05   …

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@eschaffn
Comment options

Answer selected by adrianeboyd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lang / ru Russian language data and models feat / ner Feature: Named Entity Recognizer feat / transformer Feature: Transformer
2 participants