Skip to content

asyrovprog/voice_chat_demo

Repository files navigation

Voice Chat

This example demonstrates a voice chat application using Semantic Kernel and OpenAI's API for speech-to-text, chat, and text-to-speech functionalities. The application captures audio from the microphone, processes it through a pipeline, and plays back the AI-generated responses with the following flow:

Microphone → VAD → STT → Chat → TTS → Speaker

API Key

Use .NET user-secrets to securely store your API key:

dotnet user-secrets set "OpenAI:ApiKey" "your-openai-api-key"

Extending the Sample

The sample can be further extended by improving VAD, STT and other components. Some suggestions include:

  • Use local CPU ML model based Voice Activity Detector, such as Silero VAD
  • Use audio streaming, such as supported by Azure AI Speech, Deepgram and other providers.
  • Connect Semantic Kernel plugins or tools for richer, task-oriented conversations.

About

Voice Chat Demo for Semantic Kernel

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages