File tree Expand file tree Collapse file tree 1 file changed +1
-1
lines changed
Expand file tree Collapse file tree 1 file changed +1
-1
lines changed Original file line number Diff line number Diff line change @@ -76,7 +76,7 @@ Fine-tuning state-of-the-art TTS models with SynParaSpeech delivers significant
7676
7777### 🎯 Paralinguistic Event Detection
7878Prompt tuning with SynParaSpeech enhances MLLMs' ability to detect paralinguistic events:
79- - ** Optimal Context** : 5-shot prompts per catergory yield best performance (avoids overload from redundant context).
79+ - ** Optimal Context** : 5-shot prompts per category yield best performance (avoids overload from redundant context).
8080- ** Key Improvements** :
8181 - Qwen 2.5 Omni: Accuracy increases from 21.5% (no context) to 47.3% (5-shot), macro F1 from 18.9% to 47.1%.
8282 - Kimi Audio: Accuracy reaches 38.2% (5-shot), with CER (character error rate) reduced to 11.11%.
You can’t perform that action at this time.
0 commit comments