Skip to content

Help needed in updating the eng.tagged corpus #21

@AMR-KELEG

Description

@AMR-KELEG

I have found that some tags are marked as unknown * despite getting analysed by the compiled dictionary.

Theses cases can be discovered easily but I need help in manually inspecting them.

The tagging doesn't seem to be that easy as for example:
The token bloody is located in lines 11 and 11145
https://github.com/apertium/apertium-eng/blob/master/texts/eng.tagged#L11
https://github.com/apertium/apertium-eng/blob/master/texts/eng.tagged#L11145

line analysis
11 ^bloody/*bloody
11145 ^bloody/bloody<adj><sint>$

What do you think is the better way to fix such cases?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions