Wrong Burmese/Myanmar output #2205

MattLiutt · 2024-06-07T06:27:10Z

I was trying to transcribe some Burmese audio clips I found online, so I followed this code:

import whisper

# load the model (assumption: the original post does not show this step;
# n_mels=128 below implies a 128-mel checkpoint such as "large-v3")
model = whisper.load_model("large-v3")

# load audio and pad/trim it to fit 30 seconds
audio = whisper.load_audio("sample.wav")
audio = whisper.pad_or_trim(audio)

print(model.device)
# make log-Mel spectrogram and move to the same device as the model
mel = whisper.log_mel_spectrogram(audio, n_mels=128).to(model.device)

# detect the spoken language
_, probs = model.detect_language(mel)
print(f"Detected language: {max(probs, key=probs.get)}")

# decode the audio
options = whisper.DecodingOptions()
result = whisper.decode(model, mel, options)

# print the recognized text
print(result.text)

The language is correctly detected as "my". However, the text output is not in Burmese script; it looks like this:

Myanmar Nanyang Nguyen Niseng Chinpaare Poiro Meea Ne
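
To make "not Burmese" concrete, here is a minimal sketch that measures how much of an output string is actually written in Myanmar script. The helper name `myanmar_ratio` is hypothetical and not part of whisper. It only checks the main Myanmar Unicode block (U+1000 to U+109F) and ignores the extended blocks, so treat it as a rough check rather than a full script detector.

```python
def myanmar_ratio(text: str) -> float:
    """Fraction of non-space characters in the Myanmar block (U+1000-U+109F).

    Hypothetical helper for illustration; not part of the whisper package.
    """
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    in_block = sum(1 for c in chars if "\u1000" <= c <= "\u109f")
    return in_block / len(chars)


# The output reported above contains only Latin letters.
output = "Myanmar Nanyang Nguyen Niseng Chinpaare Poiro Meea Ne"
print(myanmar_ratio(output))  # 0.0
```

A ratio near 0 for audio detected as "my" would indicate that the decoder is emitting romanized text instead of Burmese script.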

Has anyone run into the same or a similar issue? Any feedback would be helpful. Thanks!

Replies: 0 comments
