Yes, we applied the normalizer to both the ground truth (references) and the predictions (hypotheses). This means the WER numbers are not directly comparable with those in the literature. We haven't used the fstalign tool. The benchmark runs included some nondeterministic settings, such as the temperature fallback, so it might be difficult to replicate the WER exactly, but it should land in the ballpark.
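As a minimal sketch of what "normalizing both sides" means in practice: the `normalize` function below is a hypothetical stand-in for the actual text normalizer (lowercasing, stripping punctuation, collapsing whitespace), and `wer` is a plain edit-distance WER, not the benchmark's alignment tooling.

```python
import re

def normalize(text: str) -> str:
    # Hypothetical stand-in for the real normalizer:
    # lowercase, drop punctuation, collapse whitespace.
    text = text.lower()
    text = re.sub(r"[^\w\s]", "", text)
    return " ".join(text.split())

def wer(reference: str, hypothesis: str) -> float:
    """Word error rate via Levenshtein distance over word tokens."""
    ref, hyp = reference.split(), hypothesis.split()
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / len(ref)

ref = "Hello, world! This is a test."
hyp = "hello world this is a test"
print(wer(ref, hyp))                        # punctuation/case count as substitutions
print(wer(normalize(ref), normalize(hyp)))  # both sides normalized: 0.0
```

Applying `normalize` to only one side would still count case and punctuation mismatches as errors, which is why skipping it on either the references or the hypotheses inflates the WER relative to the reported numbers.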
In the paper, the authors report a 9.72% Word Error Rate (WER) on the Earnings21 dataset. However, when attempting to replicate this result, I ran into some difficulties.
Could you please provide me with more information on your normalization process?
Specifically, do you normalize both the output files and the golden files?
Additionally, I'm curious to know whether you used the fstalign tool provided in the Earnings21 benchmark.