You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was trying to transcribe some audio clip found online which are Burmese, so I follow the code
# load audio and pad/trim it to fit 30 secondsaudio=whisper.load_audio("sample.wav")
audio=whisper.pad_or_trim(audio)
print(model.device)
# make log-Mel spectrogram and move to the same device as the modelmel=whisper.log_mel_spectrogram(audio, n_mels=128).to(model.device)
# detect the spoken language_, probs=model.detect_language(mel)
print(f"Detected language: {max(probs, key=probs.get)}")
# decode the audiooptions=whisper.DecodingOptions()
result=whisper.decode(model, mel, options)
# print the recognized textprint(result.text)
It indeed gives correct detection for language which is "my". However, the text output is not burmese but like below,
Myanmar Nanyang Nguyen Niseng Chinpaare Poiro Meea Ne
May I know if anyone got same or similar issue? It would be helpful if can get some feedback! Thanks!
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
I was trying to transcribe some audio clip found online which are Burmese, so I follow the code
It indeed gives correct detection for language which is "my". However, the text output is not burmese but like below,
Myanmar Nanyang Nguyen Niseng Chinpaare Poiro Meea Ne
May I know if anyone got same or similar issue? It would be helpful if can get some feedback! Thanks!
Beta Was this translation helpful? Give feedback.
All reactions