Normalize entities and relations from annotated data with Wikidata #3

In the `data/output` folder, two files have been created:

* `entities.tsv` contains all keyphrases annotated with their corresponding label.
* `relations.tsv` contains all relation triplets.

Both files are TSV (tab-separated values), so there should be no problems with `,`, `"`, etc. Simply opening each file and splitting every line by `\t` should do.
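A minimal sketch of that reading step, assuming UTF-8 files with one record per line. The helper names `parse_tsv_line` and `read_tsv` are hypothetical, and the exact column layout of each file is not assumed here:

```python
def parse_tsv_line(line):
    """Split one TSV line into its fields, dropping the trailing newline."""
    return line.rstrip("\r\n").split("\t")


def read_tsv(path):
    """Read a TSV file into a list of rows, skipping blank lines.

    Assumption: the file is UTF-8 encoded and no field contains a tab
    or a line break.
    """
    with open(path, encoding="utf-8") as handle:
        return [parse_tsv_line(line) for line in handle if line.strip()]
```

For example, `read_tsv("data/output/entities.tsv")` would return one list of fields per annotated keyphrase. Commas and quotes are kept as-is because only the tab acts as a separator.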

The idea is to normalize these mentions against their entries in Wikidata. For that, I propose creating two more files (`data/output/entities-normalization.tsv` and `data/output/relations-normalization.tsv`) in which every match found is logged together with its corresponding Wikidata metadata (IDs, etc.).
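One possible way to look up a mention is the `wbsearchentities` action of the public Wikidata API. The sketch below is only an illustration under assumptions: the function names, the output columns (mention, Wikidata ID, label, description), the result limit, and the User-Agent string are all hypothetical, and searching with `type="property"` for relations is just one option for matching relation names.

```python
import json
import urllib.parse
import urllib.request

WIKIDATA_API = "https://www.wikidata.org/w/api.php"


def build_search_url(mention, entity_type="item", language="en", limit=5):
    """Build a wbsearchentities URL; use entity_type="property" for relations."""
    params = {
        "action": "wbsearchentities",
        "search": mention,
        "language": language,
        "type": entity_type,
        "limit": limit,
        "format": "json",
    }
    return WIKIDATA_API + "?" + urllib.parse.urlencode(params)


def search_wikidata(mention, entity_type="item"):
    """Query Wikidata for a mention and return the decoded JSON payload."""
    request = urllib.request.Request(
        build_search_url(mention, entity_type),
        # Hypothetical User-Agent; replace it with one identifying the project.
        headers={"User-Agent": "normalization-sketch/0.1"},
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)


def matches_to_rows(mention, payload):
    """Turn a wbsearchentities payload into rows: mention, ID, label, description."""
    return [
        [mention, hit.get("id", ""), hit.get("label", ""), hit.get("description", "")]
        for hit in payload.get("search", [])
    ]


def write_rows(path, rows):
    """Write rows as TSV (assumes that no field contains a tab or a newline)."""
    with open(path, "w", encoding="utf-8") as handle:
        for row in rows:
            handle.write("\t".join(row) + "\n")
```

A run over the entities would then call `search_wikidata` for each keyphrase, collect the rows from `matches_to_rows`, and pass them to `write_rows("data/output/entities-normalization.tsv", rows)`. Mentions with no match simply produce no rows, so logging them separately may be worth considering.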
