@@ -118,8 +118,6 @@ This results in:
 
 Even if the mentions `Princess Liana` and `She` are not in the same chunk, hierarchical merging still resolves this case correctly.
 
-*Note that, at the time of writing, the performance of the hierarchical merging feature has not been benchmarked*.
-
 
 ## Training a model
 
@@ -174,24 +172,13 @@ Several work make use of additional features. For now, only the distance between
 
 # Results
 
-The following table presents the results we obtained by training this model (for now, it has only one entry !). Note that:
-
-- the reported results use `max_span_size=5` instead of `max_span_size=10` as in training.
-- the reported results were obtained by splitting documents for performance reasons, with subdocuments having a maximum length of 11 sentences. They may not be accurate with the performance on full documents.
-- the reported results can not be directly compared to the performance in [the original Litbank paper](https://arxiv.org/abs/1912.01140) since we only compute performance on one split of the datas
-
-| Dataset | Base model | MUC | B3 | CEAF | CoNLL F1 |
The following table reports our results on the full Litbank documents (~2000 tokens each). We use `max_span_size=10`. HM stands for "Hierarchical Merging":
+The following table presents the results we obtained on Litbank by training this model. We evaluate on 10% of Litbank documents, each of which consists of ~2000 tokens. The *split* column indicates whether documents were split into blocks of 512 tokens. The *HM* column indicates whether we use hierarchical merging.
 
-| Dataset | Base model | HM | MUC | B3 | CEAF | BLANC | LEA |