[Nova 3] Ability to add context to an audio file to improve WER #1411
Replies: 3 comments
-
|
Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently. |
Beta Was this translation helpful? Give feedback.
-
|
Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com - being verified through this process will allow our team to help you in a much more streamlined fashion. |
Beta Was this translation helpful? Give feedback.
-
|
It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Originally posted here: deepgram/deepgram-js-sdk#433
Proposed changes
Context
I have a feature request for Nova 3, not sure where I should send it over:
We found great improvements to word/sentence detection by adding context to the audio file. Example, the previous conversation, or notes about the topic discussed.
We found this in Whisper and Qwen and as a language learning platform, it helps detect discussions with strong accents better (think Duolingo, Usefluently etc)
I am making a comparison on our dataset between Google, Nova, vs. Whisper/Qwen with context.
This may have more impact than fine tuning!
People with strong accents, learners of a new language, have trouble being understood by LLMs. By adding the context/topic of the conversation, it could narrow down the set of words to detect, like a human would.
We are looking to train models directly with a dataset of users from another language practicing French, but we think we can get a better return on investment by just adding it as context to the LLM.
See Qwen - https://huggingface.co/spaces/Qwen/Qwen3-ASR-Demo
Possible Implementation
Pass via the API a new param "context", see Qwen here - https://huggingface.co/spaces/Qwen/Qwen3-ASR-Demo
Other information
Beta Was this translation helpful? Give feedback.
All reactions