Crowdsourced ground truth dataset for 1,204 sentences and 7,778 event pairs covering 22 news topics. The corpus was created using the CrowdTruth methodology, as described in the following paper:
- Tommaso Caselli and Oana Inel: Crowdsourcing StoryLines: Harnessing the Crowd for Causal Relation Annotation. Events and Stories in the News Workshop, COLING 2018
Crowdsourcing results and evaluation against expert data are available in folder:
|--data/results/
Expert ground truth data is available in folder:
|--data/ground_truth/
Aggregated raw crowdsourcing data is available in folder:
|--data/aggregated_input/
Raw crowdsourcing data is available in folder:
|--data/input/
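
To check which of the folders listed above are present in a local copy, a minimal Python sketch such as the following may help. It assumes the script is run from the repository root. The helper name `list_data_files` is hypothetical and not part of this repository, and no assumption is made about file names or formats inside the folders.

```python
from pathlib import Path

# Folder paths are taken from the listing above.
# The labels on the left are illustrative names chosen for this sketch.
DATA_FOLDERS = {
    "results": "data/results",
    "ground_truth": "data/ground_truth",
    "aggregated_input": "data/aggregated_input",
    "input": "data/input",
}


def list_data_files(repo_root="."):
    """Map each folder label to a sorted list of the files found under it.

    A folder that does not exist yields an empty list rather than an error.
    """
    root = Path(repo_root)
    found = {}
    for label, relative_path in DATA_FOLDERS.items():
        folder = root / relative_path
        if folder.is_dir():
            found[label] = sorted(p for p in folder.rglob("*") if p.is_file())
        else:
            found[label] = []
    return found


if __name__ == "__main__":
    # Assumes the current working directory is the repository root.
    for label, files in list_data_files().items():
        print(f"{label}: {len(files)} file(s)")
```

The sketch only reports file counts per folder; loading the files themselves depends on their format, which is not described here.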