Hi, the problem is that this format doesn't support spaces within tokens. The converter splits each line into columns on whitespace and expects the IOB annotation to land in the same column on every line after splitting, typically always the 2nd column or always the 4th column, depending on the dataset.
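To find the offending lines before running the converter, a quick check like the one below may help. It is only a rough sketch, not part of spaCy: the function name `find_inconsistent_lines` and the sample data are made up for illustration, and it assumes that the most common column count in the file is the intended one.

```python
from collections import Counter


def find_inconsistent_lines(text):
    """Return (line_number, column_count) for each non-blank line whose
    whitespace-split column count differs from the most common count.

    Hypothetical helper for illustration; it assumes the majority of
    lines in the file are well-formed.
    """
    rows = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        columns = line.split()  # split on any run of whitespace
        if not columns:
            continue  # blank lines separate sentences, skip them
        rows.append((line_number, len(columns)))
    if not rows:
        return []
    # Treat the most frequent column count as the expected layout.
    expected = Counter(count for _, count in rows).most_common(1)[0][0]
    return [(n, count) for n, count in rows if count != expected]


# Made-up two-column sample: the last line has a token containing a space,
# so splitting yields three columns instead of two.
sample = "New B-LOC\nYork I-LOC\nis O\nNew York B-LOC\n"
print(find_inconsistent_lines(sample))  # [(4, 3)]
```

Any line it reports has a token with whitespace in it (or a missing column), so the IOB tag ends up in a different column than on the other lines. Those tokens need to be split into separate lines or have the space replaced before converting.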

Answer selected by polm
Labels
feat / cli Feature: Command-line interface v2 spaCy v2.x
This discussion was converted from issue #8737 on July 16, 2021 11:34.