Releases: TIBHannover/cross-modal_entity_consistency
Releases · TIBHannover/cross-modal_entity_consistency
Datasets
This repository contains the TamperedNews (Link) and News400 (Link) dataset used in the paper. The datasets include:
dataset.jsonlcontaining:- Web links to the news texts
- Web links to the news image
- Outputs of the named entity linking and disambiguation (NERD) approach
- Untampered and tampered entities
<entity>.jsonlfile for each entity type containing the following information for each entity:- Wikidata ID
- Wikidata label
- Meta information used for tampering
- Web links to all reference images crawled from Google, Bing, and Wikidata
- splits for testing and validation