SpeechUnderstandingMajorProject

Feature Extraction

For feature extraction, use:
(A) F0:
    f0_extract.py in-dir out-dir
(B) Energy:
    energy_extract.py in-dir out-dir
(C) Duration HuBERT:
    encode.py --extension discrete in-dir out-dir
(D) XLS-R:
    xlsr_extract.py data-manifest-folder --path model-path
        --task audio_classification --batch-size 90
        --infer-manifest /fsx/data/VoxLingua107/manifest/test.tsv
        --infer-xtimes 10 --infer-max-sample-size 160000 --output-path out-dir
    Download the checkpoint from https://dl.fbaipublicfiles.com/fairseq/wav2vec/xlsr_300m_voxlingua107_ft.pt and save it at model-path.
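To run the F0 and energy extractors in one go, a small driver script can help. The following is a minimal sketch, not part of this repository: the function names and the per-feature output subfolders (out-dir/f0, out-dir/energy) are assumptions made for illustration. Only the script names and their `in-dir out-dir` arguments come from the commands above.

```python
import os
import subprocess
import sys

# Script names are taken from the README; the subfolder names are hypothetical.
EXTRACTORS = {"f0": "f0_extract.py", "energy": "energy_extract.py"}


def build_extract_commands(in_dir, out_dir, python=sys.executable):
    """Return one argv list per extractor, each writing to its own subfolder."""
    commands = []
    for name, script in EXTRACTORS.items():
        feature_dir = os.path.join(out_dir, name)  # assumption: keep features apart
        commands.append([python, script, in_dir, feature_dir])
    return commands


def run_all(in_dir, out_dir):
    """Create the output folders and run each extractor, stopping on failure."""
    for cmd in build_extract_commands(in_dir, out_dir):
        os.makedirs(cmd[-1], exist_ok=True)
        subprocess.run(cmd, check=True)


if __name__ == "__main__" and len(sys.argv) == 3:
    run_all(sys.argv[1], sys.argv[2])
```

If the scripts should write to a single shared out-dir instead, pass out_dir directly rather than a subfolder.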

For training use:
    main.py [options]

For testing use:
Store the trained model in project/baseline_LA/__pretrained, then run:
    main.py --inference --model-forward-with-file-name --trained-model ${trained_model} > ${log_name}.txt 2> ${log_name}_err.txt
IMPORTANT: Change the loss function in config.py. The last function in that file defines the loss, which is currently BCE; change it as required.
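For orientation, here is what swapping the loss might look like. This is a plain-Python sketch and not the code in config.py: the names `bce_loss`, `mse_loss`, `LOSSES`, and `get_loss` are hypothetical, and the real project most likely computes the loss on tensors through its deep learning framework. MSE is included only as an example of an alternative.

```python
import math


def bce_loss(probs, labels, eps=1e-7):
    """Mean binary cross-entropy for predicted probabilities and 0/1 labels."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(probs)


def mse_loss(probs, labels):
    """Mean squared error, shown only as an example alternative to BCE."""
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(probs)


# Hypothetical registry: switching the loss means changing one name.
LOSSES = {"bce": bce_loss, "mse": mse_loss}


def get_loss(name="bce"):
    """Look up a loss function by name; raises KeyError for unknown names."""
    return LOSSES[name]
```

With a lookup like this, changing the loss is a one-word edit, for example `loss_fn = get_loss("mse")`.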

