This repository was archived by the owner on May 30, 2025. It is now read-only.

Normalize entities and relations from annotated data with Wikidata #3

@apiad


In the data/output folder, two files have been created:

  • entities.tsv contains all keyphrases annotated with their corresponding label.
  • relations.tsv contains all relation triplets.

Both files are TSV (tab-separated values), so there should be no problems with commas, double quotes, etc. Simply opening each file and splitting each line by \t should do.
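As a rough illustration of the "open and split by \t" approach, here is a minimal Python sketch. The function names `parse_tsv` and `load_tsv` are hypothetical, and the column layout of each file (for example, keyphrase then label in entities.tsv) is an assumption that should be checked against the actual files.

```python
from pathlib import Path
from typing import Iterable, Iterator, List


def parse_tsv(lines: Iterable[str]) -> Iterator[List[str]]:
    """Yield each non-empty line as a list of tab-separated fields."""
    for line in lines:
        line = line.rstrip("\r\n")  # drop the line ending, keep inner tabs
        if line:
            yield line.split("\t")


def load_tsv(path: Path) -> List[List[str]]:
    """Read a whole TSV file into memory (hypothetical helper)."""
    with path.open(encoding="utf-8") as handle:
        return list(parse_tsv(handle))
```

For example, `load_tsv(Path("data/output/entities.tsv"))` would return one list of fields per annotated keyphrase, assuming the file exists at that path.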

The idea is to normalize these mentions against their matching entries in Wikidata. For that, I propose creating two more files, data/output/entities-normalization.tsv and data/output/relations-normalization.tsv, in which all the matches found are logged together with their corresponding Wikidata metadata (i.e., IDs, etc.).
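One possible way to look up a mention is the public `wbsearchentities` action of the Wikidata API. The sketch below is only a starting point under stated assumptions: the helper names, the User-Agent string, and the output columns (mention, Wikidata ID, label, description) are all hypothetical, and nothing here decides how ambiguous matches should be resolved.

```python
import json
import urllib.parse
import urllib.request
from typing import Dict, List, Tuple

WIKIDATA_API = "https://www.wikidata.org/w/api.php"

# Assumed output row: (mention, wikidata_id, label, description)
Match = Tuple[str, str, str, str]


def build_search_url(mention: str, language: str = "en",
                     entity_type: str = "item", limit: int = 5) -> str:
    """Build a wbsearchentities URL; use entity_type="property" for relations."""
    params = {
        "action": "wbsearchentities",
        "search": mention,
        "language": language,
        "type": entity_type,
        "limit": limit,
        "format": "json",
    }
    return WIKIDATA_API + "?" + urllib.parse.urlencode(params)


def extract_matches(mention: str, payload: Dict) -> List[Match]:
    """Turn a decoded API response into rows for the normalization file."""
    return [
        (mention, hit["id"], hit.get("label", ""), hit.get("description", ""))
        for hit in payload.get("search", [])
    ]


def search_wikidata(mention: str, entity_type: str = "item") -> List[Match]:
    """Query Wikidata over the network and return candidate matches."""
    request = urllib.request.Request(
        build_search_url(mention, entity_type=entity_type),
        headers={"User-Agent": "normalization-sketch/0.1"},  # placeholder value
    )
    with urllib.request.urlopen(request) as response:
        return extract_matches(mention, json.load(response))


def to_tsv_line(match: Match) -> str:
    """Serialize one match as a tab-separated line."""
    return "\t".join(match)
```

Each mention from entities.tsv could be passed to `search_wikidata`, and every returned row written with `to_tsv_line` to the proposed normalization file. Searching with `entity_type="property"` is one option for relation labels, although whether the annotated relations map cleanly onto Wikidata properties is an open question.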
