This repository was archived by the owner on Mar 8, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 20
LIST OF AUDIO+TEXT DATASETS #114
Copy link
Copy link
Open
Labels
Description
LIST OF ALL ITALIAN DATASETS FOUND
From issue #90 I'm putting here all the datasets that have been discovered.
Some of them are plug-and-play for Deepspeech others instead need to be created from scratch (splits up audio by sentences)
Feel free to pickup one that has not been done for checking it out.
NOTE
If one of this dataset needs a deeper analysis please do not start a discussion here but open a new issue and I will update this table with the issue reference.
DATASETS
| dataset | hrs | url | plug-n-play | TODOs | doing | done | note |
|---|---|---|---|---|---|---|---|
| MLS | 279.43 h | ↗ | HOT!!!! | ||||
| VoxForge #111 | 20h | ↗ | ✔ |
|
✔ | ||
| MAILABS | 127h40m | ↗ | ✔ | ✔ | |||
| Evalita2009 | 5h | ↗ | ✔ | ||||
| MSPKA | 3h | ↗ | ✔ | ||||
| SIWIS | 4.5h | ↗ | ✔ | ||||
| SUGAR | 1.5h | ↗ | sentences are not useful | ||||
| VociParlateWikipedia #34 | ? | ↗ |
|
||||
| EMOVO | ~12m | ↗ |
|
interesting for emotions (disgust, happy..) | |||
| ZIta | <1hr | ↗ | transcriptions do not follow recordings (eg: Lett_Z_Sp1_zero.wav) | ||||
| LIM_Veneti | <1hr | ↗ | no audio files? | ||||
| split-MDb | ~46m | ↗ |
|
based on CLIPS | |||
| tg60 | 1h30m | ↗ |
|
maybe among the info files there are some timings that could be useful for splitting up? | |||
| PraTiD | 1h12m | ↗ |
|
From CLIPS; maybe among the info files there are some timings that could be useful for splitting up? | |||
| ParlatoCinematografico | ? | ↗ |
|
.lab files with speakers timings | |||
| PerugiaCorpusPEC | ? | ↗ | a login is needed. License? |
Reactions are currently unavailable