Voice Chat

This example demonstrates a voice chat application built with Semantic Kernel and the OpenAI API for speech-to-text (STT), chat, and text-to-speech (TTS). The application captures audio from the microphone, gates it with voice activity detection (VAD), passes it through the pipeline below, and plays back the AI-generated responses:

Microphone → VAD → STT → Chat → TTS → Speaker
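
As a rough illustration of the flow above, the stages can be chained with System.Threading.Channels so that each stage runs concurrently and hands its output to the next. This is a minimal sketch, not the sample's actual implementation: the PipelineSketch class, the Stage helper, and the string-based stand-ins for audio and text are all hypothetical, and a real VAD would drop non-speech chunks rather than map every chunk one-to-one.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Channels;
using System.Threading.Tasks;

// Hypothetical sketch: every name here is illustrative, not taken from the sample.
public static class PipelineSketch
{
    // Starts a background task that reads items from `input`, applies `transform`,
    // and writes the results to a new channel whose reader is returned.
    private static ChannelReader<TOut> Stage<TIn, TOut>(
        ChannelReader<TIn> input, Func<TIn, TOut> transform)
    {
        var output = Channel.CreateUnbounded<TOut>();
        _ = Task.Run(async () =>
        {
            await foreach (var item in input.ReadAllAsync())
            {
                await output.Writer.WriteAsync(transform(item));
            }
            // Propagate end-of-stream so downstream stages can finish.
            output.Writer.Complete();
        });
        return output.Reader;
    }

    // Pushes the given "microphone" chunks through all stages and
    // returns what would be sent to the speaker.
    public static async Task<List<string>> RunAsync(IEnumerable<string> chunks)
    {
        var mic = Channel.CreateUnbounded<string>();

        // Placeholder stages; real ones would call a VAD, STT, chat, and TTS service.
        var speech = Stage(mic.Reader, (string chunk) => $"vad({chunk})");
        var text = Stage(speech, (string s) => $"stt({s})");
        var reply = Stage(text, (string t) => $"chat({t})");
        var audio = Stage(reply, (string r) => $"tts({r})");

        foreach (var chunk in chunks)
        {
            await mic.Writer.WriteAsync(chunk);
        }
        mic.Writer.Complete();

        var played = new List<string>();
        await foreach (var item in audio.ReadAllAsync())
        {
            played.Add(item);
        }
        return played;
    }
}
```

Calling `await PipelineSketch.RunAsync(new[] { "chunk1" })` returns a single item, `tts(chat(stt(vad(chunk1))))`, which shows the order in which the stages are applied.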

API Key

Use .NET user secrets to store your OpenAI API key outside the repository (the secret is kept in your user profile, not encrypted):

dotnet user-secrets set "OpenAI:ApiKey" "your-openai-api-key"
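
For reference, a key stored this way can be read back through the standard .NET configuration API. The snippet below is a sketch under assumptions: it presumes the Microsoft.Extensions.Configuration.UserSecrets package is referenced and that the project file declares a UserSecretsId (which `dotnet user-secrets init` adds). How the sample itself loads the key may differ.

```csharp
using System;
using Microsoft.Extensions.Configuration;

// Build configuration from the user-secrets store tied to this assembly.
// Program is the entry-point class generated for top-level statements.
IConfiguration config = new ConfigurationBuilder()
    .AddUserSecrets<Program>()
    .Build();

// The key name matches the one used in the command above.
string? apiKey = config["OpenAI:ApiKey"];

if (string.IsNullOrWhiteSpace(apiKey))
{
    throw new InvalidOperationException(
        "OpenAI:ApiKey is not set. Run the user-secrets command above first.");
}

Console.WriteLine("API key loaded from user secrets.");
```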

Extending the Sample

The sample can be extended by improving the VAD, STT, and other components. Some suggestions:

Use a local, CPU-based ML voice activity detector, such as Silero VAD.
Use streaming audio, as supported by Azure AI Speech, Deepgram, and other providers.
Connect Semantic Kernel plugins or tools for richer, task-oriented conversations.
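
To make the first suggestion more concrete, here is a sketch of a very simple energy-based detector, the kind of baseline a model-based VAD such as Silero VAD would replace. It is an assumption for illustration only and says nothing about how the sample currently detects speech; the EnergyVad class, the 16-bit PCM frame format, and the 0.02 threshold are all hypothetical choices.

```csharp
using System;

// Hypothetical baseline detector; not part of the sample.
public static class EnergyVad
{
    // Returns true when the RMS level of a 16-bit PCM frame,
    // normalized to the range [0, 1], exceeds the threshold.
    public static bool IsSpeech(ReadOnlySpan<short> frame, double threshold = 0.02)
    {
        if (frame.IsEmpty)
        {
            return false;
        }

        double sumSquares = 0;
        foreach (short sample in frame)
        {
            double normalized = sample / 32768.0;
            sumSquares += normalized * normalized;
        }

        double rms = Math.Sqrt(sumSquares / frame.Length);
        return rms > threshold;
    }
}
```

A frame of silence such as `new short[480]` is classified as non-speech, while a frame of loud samples is classified as speech. An energy threshold cannot tell speech from other loud sounds, which is the main reason to move to a trained detector.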
