Kapso + Pipecat voice agent starter

Answer WhatsApp voice calls with AI. This repo connects Kapso's WhatsApp infrastructure to Pipecat's voice pipeline using OpenAI for speech and text.

The demo agent speaks neutral Latin American Spanish and introduces Kapso's platform.

Prerequisites

Python 3.10+ with uv installed
Pipecat Cloud account
Kapso account with a WhatsApp number (calls enabled)
OpenAI API key
Docker Hub account (or another registry)

Deploy to Pipecat Cloud

Authenticate

uv run pcc auth login

Set OpenAI key

Create a Pipecat Cloud secret set:

uv run pcc secrets set kapso-voice-secrets OPENAI_API_KEY=sk-...

Store Docker credentials

uv run pcc credentials docker create my-docker-secret \
  --username YOUR_DOCKERHUB_USERNAME \
  --password YOUR_DOCKER_TOKEN

Build and push

Edit pcc-deploy.toml and update the image tag to YOUR_DOCKERHUB_USERNAME/agent-name:VERSION, then:

uv run pcc docker build-push

Deploy

uv run pcc deploy kapso-voice YOUR_DOCKERHUB_USERNAME/agent-name:VERSION --credentials my-docker-secret

Update pcc-deploy.toml with your agent name (kapso-voice) and secret set name (kapso-voice-secrets).

Connect in Kapso

Full setup guide: Kapso voice agent quickstart

Sign in to app.kapso.ai
Go to Voice agents → New voice agent
Set provider to Pipecat
Paste your Pipecat public API key and agent name (kapso-voice)
Assign a WhatsApp number and mark it Primary + Enabled
Call the number to test

Customize the agent

Edit bot.py:

System prompt: Change SYSTEM_PROMPT_FALLBACK or set SYSTEM_PROMPT env var
Voice models: Swap OpenAI services in run_voice_pipeline for other Pipecat-supported providers
Idle timeout: Adjust idle_timeout_secs in the PipelineTask constructor

Local development

uv sync

Copy .env.example to .env and add your OPENAI_API_KEY.

How it works

Kapso receives WhatsApp voice call webhook from Meta
Kapso forwards webhook to Pipecat Cloud with {kind: "whatsapp_connect", webhook, whatsapp_token, phone_number_id, context}
Pipecat launches bot.py and connects to WhatsApp via SmallWebRTC transport
Audio flows: caller speech → OpenAI STT → GPT-4 → OpenAI TTS → caller
Call ends on 30s idle timeout or disconnect

Context payload

Kapso includes a context object with each call containing:

project: Your Kapso project info (name, ID)
config: WhatsApp number details (display name, phone number ID, mode)
contact: Caller profile (name, wa_id)
call: Call metadata (direction, status, timestamps)
call_permission: Permission status and expiry
conversation: Full message history (all messages with timestamps and content)

The agent uses this context to personalize greetings and responses. Check build_context_prompt() in bot.py to see how it's formatted.

Troubleshooting

No audio back: Check OPENAI_API_KEY is set in Pipecat secrets and view Pipecat logs for TTS errors

Build fails: Verify Docker credentials with docker login and retry with --debug flag

Call doesn't connect: Confirm WhatsApp number has calls enabled in Kapso and voice agent assignment is marked Primary

License

BSD 2-Clause

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
bot.py		bot.py
pcc-deploy.toml		pcc-deploy.toml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kapso + Pipecat voice agent starter

Prerequisites

Deploy to Pipecat Cloud

Authenticate

Set OpenAI key

Store Docker credentials

Build and push

Deploy

Connect in Kapso

Customize the agent

Local development

How it works

Context payload

Troubleshooting

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Kapso + Pipecat voice agent starter

Prerequisites

Deploy to Pipecat Cloud

Authenticate

Set OpenAI key

Store Docker credentials

Build and push

Deploy

Connect in Kapso

Customize the agent

Local development

How it works

Context payload

Troubleshooting

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages