Releases: bootphon/shennong
shennong-1.0
shennong-1.0
breaking changes
-
Deletion of
processor.rastaplp.RastaPlpProcessor. Rasta filtering is now an
option of the standardPlpProcessor, so as to use both Rasta and VTLN. -
Pitch, delta and CMVN are now deactivated by default in pipeline configuration
generation (concerns bothspeech-featuresbinary and theshennong.pipeline
module). -
Features serialization in JSON is no more supported (replaced by CSV)
-
Code reorganization (renamed
shennong.features.*toshennong.*, new
shennong.logger, import processors directly from
shennong.processor, renamedPitchProcessortoKaldiPitchProcessor,
etc...) -
When defining utterances for use with a pipeline, the format
<audio-file>
is no more supported, it must be superseeded by<utterance-id> <audio-file>. -
processor.process_all()now takes ashennong.Utterancesinstead of a
dict(name, audio).
new models
-
Vocal Tract Length Normalization (VTLN) implemented using Kaldi:
shennong.processor.vtln. -
CREPE pitch extraction:
shennong.processor.pitch_crepe.
improvments
-
new
shennong.Utterancesclass to encapsulate and manage utterances to be
feeded to a pipeline. -
shennong.Audiocan now read/write more than wav files: flac, mp3, etc...
(anything supported by pydub/ffmpeg). -
Optionnally ignore features properties when saving them.
-
New example code in
examples.
shennong-0.1.1
v0.1.1 release shennong-0.1.1
shennong-0.1
v0.1 release shennong-0.1