Transcription Errors When Using Nova-3 #1502
-
|
Hi, We have a voice agent service built in Python, where we use Deepgram’s Speech-to-Text WebSocket API for real-time speech transcription. During live sessions, the user’s speech is transcribed via Flux. After the session ends, we make an HTTP request with the session’s recording to get the transcription of the whole session where we utilize the Nova-3 model. Also note that in all our recordings, we have 2 channels: the user and the AI agent. The agent channel includes a synthetic voice generated by a Text-to-Speech model, and the user channel is organic. Our question is related to the second step where we try to get the transcription of the whole recording. In the sample whose request ID is given below, the user’s audio is not transcribed at all, and when we listen to it carefully, the user seems to be saying something like ”How to create a fantasy plan of best doing.” Request ID: 3a10e1fc-c98f-4729-9d03-a6d6b8d32af5 It is a relatively inaudible speech but we were wondering if there is anything we can do to improve this. Thanks in advance. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments
-
|
Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently. |
Beta Was this translation helpful? Give feedback.
-
|
Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com - being verified through this process will allow our team to help you in a much more streamlined fashion. |
Beta Was this translation helpful? Give feedback.
-
|
This will be the same answer as #1501 , the user's audio needs to be increased and the user will likely need to be transferred to a human agent since the audio is so low. |
Beta Was this translation helpful? Give feedback.
This will be the same answer as #1501 , the user's audio needs to be increased and the user will likely need to be transferred to a human agent since the audio is so low.