AWS + Deepgram Voice AI Hackathon: Pipecat Quickstart Project

This is the quickest way to get started building a Pipecat voice AI agent for the AWS + Deepgram Voice AI Hackathon.

It was built with the Pipecat CLI, and customized to make it as fast as possible to get a working bot.

Dependencies

Python 3.10+
uv
docker (to deploy to Pipecat Cloud)

Run your bot locally

git clone git@github.com:pipecat-ai/aws-deepgram-sa-hackathon.git
cd aws-deepgram-sa-hackathon

Server

In your first terminal window:

cd server
cp env.example .env # and fill in the values
uv sync
uv run bot.py --transport daily

You should see something like:

INFO     | pipecat:<module>:14 - ᓚᘏᗢ Pipecat 0.0.102 (Python 3.12.0 (main, Oct  2 2023, 20:56:14) [Clang 16.0.3 ]) ᓚᘏᗢ

🚀 Bot ready!
   → Open http://localhost:7860 in your browser to start a session

INFO:     Started server process [91430]
INFO:     Waiting for application startup.
INFO:     Application startup complete.
INFO:     Uvicorn running on http://localhost:7860 (Press CTRL+C to quit)

Client

Then, in another terminal window:

cd ../client
cp env.example .env.local # and fill in the values
npm i
npm run dev

You should see something like:

  VITE v7.3.1  ready in 206 ms

  ➜  Local:   http://localhost:5173/
  ➜  Network: http://192.168.0.16:5173/
  ➜  Network: http://100.115.25.125:5173/
  ➜  press h + enter to show help

Now visit http://localhost:5173 in your browser and click Connect to start talking to your bot!

Architecture Diagram

graph TD
    subgraph Users
        A["👤 <b>End-user</b>"] --> B["React Web Client<br/><i>Vite + RTVI</i>"]
    end

    subgraph Transport["Daily Server - WebRTC Transport"]
        C[Audio In]
        D[Audio Out]
    end

    subgraph Pipeline["Conversational Voice Agent - Orchestrated by Pipecat"]
        E["Krisp Noise Cancellation<br/><i>Cloud only</i>"]
        F["Silero VAD<br/><i>Voice Activity Detection</i>"]
        G["Deepgram Nova or Flux<br/><b>Speech to Text (STT)</b><br/><i>Direct API or SageMaker</i>"]
        H["Amazon Bedrock<br/><b>Claude Haiku 4.5</b>"]
        I["Tool Use<br/><i>get_current_weather()</i>"]
        J["Deepgram Aura<br/><b>Text to Speech (TTS)</b><br/><i>Direct API or SageMaker</i>"]
    end

    B <-->|"Real-time Communication (WebRTC)"| Transport

    C -->|Voice input| E
    E --> F
    F --> G
    G -->|NLU query| H
    H -->|Function call| I
    I -->|Result| H
    H -->|NLG response| J
    J -->|Voice output| D

    style Users fill:#fafafa,stroke:#ddd,color:#888
    style Transport fill:#fafafa,stroke:#ddd,color:#888
    style Pipeline fill:#fafafa,stroke:#ddd,color:#888
    style A fill:#333,color:#fff
    style B fill:#fff,stroke:#ddd
    style C fill:#fff,stroke:#ddd
    style D fill:#fff,stroke:#ddd
    style E fill:#fff,stroke:#ddd
    style F fill:#fff,stroke:#ddd
    style G fill:#fff,stroke:#ddd
    style H fill:#fff,stroke:#ddd
    style I fill:#fff,stroke:#ddd
    style J fill:#fff,stroke:#ddd

Customize the bot

The server/bot.py file contains your Pipecat bot. To customize it, look for the comment #### Customize bot prompt here! Update "content". That messages variable is used by the bot's context manager, which stores the conversation between the bot and the user. Change the content property of that first message to update your bot's system prompt.

Next, you'll almost certainly want to use function calling to extend your bot's functionality. Search for the comments #### Customize function here! to see how this bot can answer questions about the weather (using fake data). Read more about function calling in the Pipecat docs page about it.

Use Deepgram Flux STT

Deepgram Flux is an advanced STT model with improved turn detection, so it handles the end-of-turn decision instead of relying solely on VAD silence detection.

To enable Flux, set USE_FLUX=true in your server/.env:

USE_FLUX=true

Flux works with both the direct Deepgram API and SageMaker — just combine USE_FLUX=true with USE_SAGEMAKER=true to use Flux via SageMaker.

Use Deepgram on AWS SageMaker

Instead of calling the Deepgram API directly, you can run Deepgram models on your own AWS SageMaker endpoints. This keeps all audio data within your AWS account.

To switch to SageMaker mode, update your server/.env:

USE_SAGEMAKER=true
SAGEMAKER_STT_ENDPOINT_NAME=my-deepgram-stt-endpoint
SAGEMAKER_TTS_ENDPOINT_NAME=my-deepgram-tts-endpoint

You'll need:

An AWS account with SageMaker access
A deployed SageMaker endpoint with a Deepgram STT model
A deployed SageMaker endpoint with a Deepgram TTS model

The existing AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_REGION variables are shared with Bedrock and will be used for SageMaker as well.

Set USE_SAGEMAKER=false (the default) to go back to using the Deepgram API directly.

Deploy to Pipecat Cloud

For the hackathon, you can perform your live demo by running the bot locally on your computer. To deliver a hosted version, use Pipecat Cloud.

To deploy your bot, first you'll want to install the Pipecat CLI if you haven't already, and authenticate with Pipecat Cloud:

uv tool install pipecat-ai-cli
pipecat cloud auth login

You can edit server/pcc-deploy.toml if you want to change any Pipecat Cloud settings, but the defaults are fine to get started.

Next, copy the secrets from your .env file to a secret set in Pipecat Cloud, and deploy your bot:

cd /server
pipecat cloud secrets set --file .env aws-deepgram-sa-hackathon-secrets # assuming you didn't change the name in pcc-deploy.toml
pipecat cloud deploy

The min_agents = 1 setting in pcc-deploy.toml ensures that there's always a bot instance ready to accept a new session. This minimizes session startup time, but also incurs a small cost. After you're done testing, you can set minimum agents to 0 in the Pipecat Cloud dashboard.

To talk to your agent, create a Pipecat Cloud public key, then start a session. The second command will return a URL you can click to talk to your agent.

# create a public API key so you can start bot sessions
pipecat cloud organizations keys create # answer "yes" to make it your default key
pipecat cloud agent start aws-deepgram-sa-hackathon --use-daily

Or start a session with your agent in the Pipecat Cloud Sandbox:

https://pipecat.daily.co/<your-org-name>/agents/aws-deepgram-sa-hackathon/sandbox

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
client		client
server		server
.gitignore		.gitignore
README.md		README.md
sandbox.jpg		sandbox.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AWS + Deepgram Voice AI Hackathon: Pipecat Quickstart Project

Dependencies

Run your bot locally

Server

Client

Architecture Diagram

Customize the bot

Use Deepgram Flux STT

Use Deepgram on AWS SageMaker

Deploy to Pipecat Cloud

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AWS + Deepgram Voice AI Hackathon: Pipecat Quickstart Project

Dependencies

Run your bot locally

Server

Client

Architecture Diagram

Customize the bot

Use Deepgram Flux STT

Use Deepgram on AWS SageMaker

Deploy to Pipecat Cloud

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages