Replies: 3 comments 2 replies
-
Is the synthesised voice understandable at all? If so, then it's probably a sampling rate problem. If it were an incorrect audio format or codec, it would just sound like noise. From what I can gather the |
-
Hi, thanks for the reply. I also believe it is a sampling rate problem. The words are distinguishable, though distorted. To illustrate the problem, I saved the same sentence ("Hello World"), read by the same voice, both locally (with SetOutputToWaveFile) and by capturing it from the phone. I am attaching the two mp3 files. Thanks again
-
Hi, I found that by default the synth.SetOutputToWaveStream() method produces a 22 kHz stream, but you can change the format this way: synth.SetOutputToAudioStream(streamAudio, new SpeechAudioFormatInfo(EncodingFormat.Pcm, 16000, 16, 1, 16000, 2, null)); ...and indeed the voice improves. In your opinion, what are the best parameters for await voipMediaSession.AudioExtrasSource.SendAudioFromStream(streamAudio, AudioSamplingRatesEnum.Rate16KHz); Thanks a lot
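One detail in the SpeechAudioFormatInfo call above that may be worth checking: the fifth constructor argument is averageBytesPerSecond and the sixth is blockAlign. For uncompressed PCM these normally follow from the other fields, so 16 kHz, 16-bit, mono would give 32000 bytes per second rather than 16000. The sketch below only does that arithmetic. PcmFormat is a hypothetical helper written for illustration, not part of System.Speech or SIPSorcery, and whether the synthesizer actually cares about the mismatch is an assumption I have not tested.

```csharp
using System;

// Hypothetical helper (not part of System.Speech or SIPSorcery):
// derives the dependent fields of an uncompressed PCM format.
public static class PcmFormat
{
    // Bytes per sample frame: one sample for each channel.
    public static int BlockAlign(int bitsPerSample, int channelCount)
        => channelCount * bitsPerSample / 8;

    // Bytes consumed per second of audio.
    public static int AverageBytesPerSecond(int samplesPerSecond, int bitsPerSample, int channelCount)
        => samplesPerSecond * BlockAlign(bitsPerSample, channelCount);
}

// For 16 kHz, 16-bit, mono PCM:
//   BlockAlign            = 1 * 16 / 8 = 2
//   AverageBytesPerSecond = 16000 * 2  = 32000
// A self-consistent format would therefore be (untested on a real call):
//   new SpeechAudioFormatInfo(EncodingFormat.Pcm, 16000, 16, 1, 32000, 2, null)
```

If the phone leg ends up negotiating a narrowband codec, the same arithmetic applies with 8000 in place of 16000; which rate is appropriate depends on the codec the call actually uses.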
-
Hello,
First of all, thanks for the work done! It is very valuable to me. I wanted to ask you for help.
I am trying to implement an IVR with your library.
I want to use the Windows SpeechSynthesizer with the built-in Windows voices.
I am able to place the call correctly and have my text read aloud;
however, the voice on the phone is distorted and quite unpleasant.
If I instead record the same message to a file or play it locally, it sounds fine.
I think it's a sampling problem or something similar, but I'm not an expert at all.
This is my code: