Open
Description
As discussed here, comments in reported.tsv for Belarusian, which were filled in by the contributors, are not displayed correctly: all Cyrillic characters have been replaced with question marks (probably an encoding issue at some stage of the data pipeline).
Steps to reproduce:
- Download the Belarusian dataset, unpack it and open cv-corpus-7.0-2021-07-21/be/reported.tsv.
- Filter by reason, hiding all sentences with the reason grammar-or-spelling.
- Observe that most of the remaining reasons are not displayed correctly.
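The steps above could also be checked with a script. Below is a minimal sketch, assuming reported.tsv is a tab-separated file with a header row that contains a reason column (the only column name taken from this report). The function name garbled_reasons and the "two or more consecutive question marks" heuristic are my own assumptions for illustration, not part of the Common Voice tooling.

```python
import csv
import io
import re

# Value named in the reproduction steps; every other reason is treated as free text.
PREDEFINED_REASON = "grammar-or-spelling"

# Heuristic (an assumption): text whose Cyrillic letters were replaced
# tends to contain runs of two or more consecutive question marks.
QUESTION_RUN = re.compile(r"\?{2,}")


def garbled_reasons(tsv_file):
    """Return free-text reasons that look like they lost their original characters."""
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    suspicious = []
    for row in reader:
        reason = (row.get("reason") or "").strip()
        # Skip empty cells and the predefined reason, as in the filter step above.
        if not reason or reason == PREDEFINED_REASON:
            continue
        if QUESTION_RUN.search(reason):
            suspicious.append(reason)
    return suspicious


# Small in-memory sample standing in for the real file (made-up rows).
sample = io.StringIO(
    "sentence\treason\n"
    "a\tgrammar-or-spelling\n"
    "b\t??????? ?????\n"
    "c\tпамылка\n"
)
print(garbled_reasons(sample))
```

To run it against the real data, the unpacked file could be opened with open("cv-corpus-7.0-2021-07-21/be/reported.tsv", encoding="utf-8", newline="") and passed to the function. A non-empty result would suggest the question marks are already present in the published file rather than introduced by the viewer.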