soumengoroi/Language-Identification-LID

Language_Identification-LID

Environment Setup

The models are implemented in TensorFlow. To use all of the functionality of the library, you should have:

tensorflow==2.4.1
tensorflow-gpu==2.4.1
tensorflow-addons==0.15.0
matplotlib==3.5.0
numpy==1.19.5
scikit-learn==1.0.1
librosa==0.8.1
SoundFile==0.10.3.post1
PyYAML==6.0

Download the codebase and open a terminal in the root directory. Make sure Python 3.7 is installed in the current environment, then run:

pip install -r requirements.txt

Dataset

Download the dataset from https://huggingface.co/datasets/common_language/resolve/main/data/CommonLanguage.zip, then unzip it and save the contents in the dataset folder.
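The download and unzip step can also be scripted. The following is a minimal sketch using only the Python standard library, not part of the repository itself. The helper names download_file and extract_zip are hypothetical, and it assumes the dataset folder sits in the repository root.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL taken from the Dataset section of this README.
DATASET_URL = (
    "https://huggingface.co/datasets/common_language/"
    "resolve/main/data/CommonLanguage.zip"
)


def download_file(url, target):
    """Download url to target unless the file already exists (hypothetical helper)."""
    target = Path(target)
    target.parent.mkdir(parents=True, exist_ok=True)
    if not target.exists():
        urllib.request.urlretrieve(url, str(target))
    return target


def extract_zip(zip_path, dest_dir):
    """Extract zip_path into dest_dir and return the archive's member names."""
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
        return archive.namelist()
```

From the repository root, calling extract_zip(download_file(DATASET_URL, "dataset/CommonLanguage.zip"), "dataset") would fetch the archive once and unpack it into the dataset folder. The archive name used here is an assumption.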

About

Using a pre-trained model, we can identify 13 languages: chinese, english, french, german, indonesian, italian, japanese, korean, portuguese, russian, spanish, turkish, and vietnamese. We can also report duration-wise accuracy for each language.
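To illustrate what duration-wise accuracy per language could look like, here is a minimal sketch in plain Python. It is not the repository's evaluation code: the function name duration_wise_accuracy and the bin edges (in seconds) are assumptions chosen for the example. It groups utterances by true language and by duration bin, then reports the fraction predicted correctly in each group.

```python
from bisect import bisect_right
from collections import defaultdict


def duration_wise_accuracy(durations, y_true, y_pred,
                           bin_edges=(0.0, 3.0, 6.0, 10.0)):
    """Return {language: {duration_bin: accuracy}}.

    bin_edges are illustrative values in seconds (an assumption, not taken
    from the repository). Each bin is half-open: [low, high).
    """
    edges = [float(e) for e in bin_edges]
    # (language, bin label) -> [number correct, number of utterances]
    counts = defaultdict(lambda: [0, 0])
    for dur, true, pred in zip(durations, y_true, y_pred):
        # Index i such that edges[i-1] <= dur < edges[i].
        i = bisect_right(edges, dur)
        low = edges[i - 1] if i > 0 else float("-inf")
        high = edges[i] if i < len(edges) else float("inf")
        key = (true, "[{}, {})".format(low, high))
        counts[key][0] += int(pred == true)
        counts[key][1] += 1

    results = defaultdict(dict)
    for (lang, label), (correct, total) in counts.items():
        results[lang][label] = correct / total
    return dict(results)
```

The inputs are three parallel sequences: utterance durations in seconds, true language labels, and predicted labels. Bins with no utterances for a language are simply absent from the result.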
