asr-toolkit/README.md at main · dan-ringwald/asr-toolkit

d EDL ASR Toolkit

A toolkit to perform online deep speech recognition

Modules

This Toolkit contains:

A modular rule based formater deformater
Some phonetization tools
An audio split-on-silence tool
A phonetic alignment tool
An acoustic model
A linguistic model
An ASR engine

Setup

The environment is managed with poetry

git clone edl_asr_toolkit
cd edl_asr_toolkit
poetry install

The modules need some assets and data to work properly (tokenizers, models, etc...) You will need to provide an assets directory and a data directory with expected models, tokenizers, data etc... These are available on S3

Demos

Functionalities

jupyter lab
# Open notebooks in notebook folders

Local Server

TBD

Dev

Codebase linted with flake8
Codebase autofixed with black
Codebase tested with unittest, each submodule is tested separately

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modules

Setup

Demos

Functionalities

Local Server

Dev

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Modules

Setup

Demos

Functionalities

Local Server

Dev