insight-stream/crabnlp

Crabnlp Speaker Diarization Using OpenAI Whisper

Crabnlp Transcription based on OpenAI Whisper and Custom Speaker Diarization

Please star the project on GitHub (see top-right corner) if you appreciate my contribution to the community!

What is it

Crabnlp is a transcription and speaker diarization pipeline based on OpenAI Whisper and custom speaker diarization.
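
To illustrate what combining transcription with speaker diarization can look like, here is a minimal sketch that labels each transcript segment with the speaker whose turn overlaps it the most. This is a common approach, not necessarily how Crabnlp does it internally. The `Segment` and `SpeakerTurn` types and the `assign_speakers` function are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Segment:
    """A transcribed chunk of audio (hypothetical type, times in seconds)."""
    start: float
    end: float
    text: str


@dataclass
class SpeakerTurn:
    """A diarization result: who spoke when (hypothetical type)."""
    start: float
    end: float
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment],
    turns: List[SpeakerTurn],
    unknown: str = "UNKNOWN",
) -> List[Tuple[str, str]]:
    """Pair each segment's text with the speaker whose turn overlaps it most."""
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            ov = overlap(seg.start, seg.end, turn.start, turn.end)
            if ov > best_overlap:
                best_speaker, best_overlap = turn.speaker, ov
        labeled.append((best_speaker, seg.text))
    return labeled
```

A segment that straddles two turns is given to a single speaker, which is one reason approaches like this tend to struggle with overlapping speech.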

Installation

Install the requirements

cd crabnlp
pip install -e crabnlp

Example using transcribe

from crabnlp.transcribe import transcribe

transcription_result = transcribe('path/to/file')
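
If you need to process several recordings, a small wrapper like the one below may help. It is only a sketch: `transcribe_many` is a hypothetical helper, not part of Crabnlp, and because the structure of the value returned by `transcribe` is not documented here, the result is treated as an opaque object. The transcription function is passed in as an argument so the helper does not depend on anything beyond the single-argument call shown above.

```python
from pathlib import Path
from typing import Any, Callable, Dict, Iterable


def transcribe_many(
    paths: Iterable[str],
    transcribe_fn: Callable[[str], Any],
) -> Dict[str, Any]:
    """Run transcribe_fn on every path that is an existing file.

    Returns a dict mapping each processed path to whatever transcribe_fn
    returned. Paths that do not point to a file are skipped.
    """
    results: Dict[str, Any] = {}
    for p in paths:
        path = Path(p)
        if not path.is_file():
            # Skip missing files instead of failing the whole batch.
            continue
        results[str(path)] = transcribe_fn(str(path))
    return results
```

Assuming Crabnlp is installed, usage would look like `transcribe_many(['a.wav', 'b.wav'], transcribe)`, with `transcribe` imported as in the example above.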

Known Limitations

  • The system may not perform well in scenarios with overlapping speech or rapid speech.
  • The system may not accurately identify speakers in scenarios with
  • There might be some errors; please raise an issue if you encounter any.

Future Improvements

  • Improve the quality of sound recognition

Acknowledgements

This work is based on OpenAI's Whisper, Faster Whisper, Nvidia NeMo, and Facebook's Demucs.
