Skip to content

Audio transcription

cesine edited this page May 24, 2011 · 23 revisions

Table of Contents

Google Speech Recognition

Sphinx

 CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

Language independent phonetic transcription

  • Our goal is to support a bit of bootstrapping, even for non-standard languages so that experiments on any language provide at least a bit of audio analysis.

Related Requirements

Audio Chunking based on Silence

  • The MARF project has some libraries for audio analysis. Not sure how complete and which goals have been realized yet.
 MARF is an open-source research platform and a collection of voice/sound/speech/text and natural language processing (NLP) algorithms written in Java and arranged into a modular and extensible framework facilitating addition of new algorithms.

References

Clone this wiki locally