This project aims to create a realistic, low-latency chatbot that functions as an AI sales assistant for Nooks, an AI-powered sales development platform. The chatbot responds when the user falls silent for some time, simulating a natural conversation flow.
The current implementation is relatively slow to respond to the user - the goal is to make it faster.
Demo of the current implementation: Loom
Demo of a reference solution that responds faster: Loom
The system consists of three main components:
- Speech-to-text (STT) using AssemblyAI's hosted API for real-time transcription
- A sales chatbot powered by OpenAI's GPT-4 model (note: not GPT-4o, which is faster but less accurate in some cases)
- Text-to-speech (TTS) using ElevenLabs for voice output
The chatbot listens to user input, transcribes it in real-time, and generates a response when the user stops speaking. The AI's response is then converted to speech and played back to the user.
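For orientation, here is a minimal runnable sketch of that serial turn loop with toy stand-ins for the three services (the real handlers live in `main.py`); it shows why the stages' latencies add up:

```python
import time

# Toy stand-ins for the three services; the real code calls AssemblyAI,
# OpenAI, and ElevenLabs (see main.py). These stubs just make the flow runnable.
def transcribe_utterance() -> str:      # STT: returns the finished user turn
    return "Tell me what Nooks does."

def generate_reply(text: str) -> str:   # LLM: GPT-4 chat completion
    return f"Happy to explain - you asked: {text}"

def speak(text: str) -> None:           # TTS: ElevenLabs synthesis + playback
    print(f"[bot speaks] {text}")

# The serial turn loop: each stage waits for the previous one, so perceived
# latency = (500ms silence endpointing) + (LLM time) + (TTS time).
utterance = transcribe_utterance()
start = time.time()
speak(generate_reply(utterance))
print(f"bot-side latency this turn: {time.time() - start:.2f}s")
```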
Assume that you are not allowed to modify the services used (you must use Assembly's hosted model for STT, OpenAI's GPT-4 for the chatbot, and ElevenLabs with this voice setting for TTS).
In addition, you are not allowed to tinker with certain configuration settings that affect the bot's realism. For example, the chatbot waits for 500ms of silence from the user before responding. This adds to the perceived latency (the chatbot can only respond 500ms after the user stops speaking), but it is necessary to keep the bot realistic and prevent it from interrupting you mid-phrase. Your solution must preserve the property of waiting for 500ms of silence from the user before the bot responds, and should NOT configure this to be lower (for lower latency) or higher (for more realism).
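If the silence wait is wired through AssemblyAI's `end_utterance_silence_threshold` setting (an assumption - verify how `main.py` actually configures its transcriber), keeping it pinned might look like the sketch below:

```python
import assemblyai as aai

# Hard requirement from the spec: do not raise or lower this value.
END_OF_UTTERANCE_SILENCE_MS = 500

# Assumption: the 500ms wait maps to AssemblyAI's
# end_utterance_silence_threshold. Check the SDK version pinned in
# requirements.txt before relying on these exact calls.
transcriber = aai.RealtimeTranscriber(
    sample_rate=16_000,
    on_data=lambda transcript: None,   # the real handler feeds the chatbot
    on_error=lambda error: None,
)
transcriber.configure_end_utterance_silence_threshold(END_OF_UTTERANCE_SILENCE_MS)
```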
How would you modify the code to make the chatbot lower latency & respond faster?
Your solution will be evaluated based on:
- Reduction in overall latency (comparable to the "reference solution" above). Please share a demo video so we can easily evaluate.
- Maintenance of conversation quality and realism (i.e., the chatbot doesn't interrupt the human speaker while they're mid-utterance, and it must say the same things the reference solution would have said)
- Code quality and clarity of explanation in README.md
- Review the existing code in `main.py`, `lib/sales_chatbot.py`, and `lib/elevenlabs_tts.py`
- Install the requirements by running `pip install -r requirements.txt` (or use a virtual environment if you prefer)
- Set your OpenAI, AssemblyAI, and ElevenLabs API keys in `.env` - you should have received them via email (see the key-loading sketch after this list)
- Run the current implementation to understand its behavior by running `python3 main.py`
- Begin your optimization process. Document your changes and reasoning in this README.md file when done.
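For reference, loading the keys from `.env` typically looks like this (assuming the project uses python-dotenv; the exact variable names are a guess - match them to what `main.py` reads):

```python
import os
from dotenv import load_dotenv  # pip install python-dotenv

load_dotenv()  # reads .env from the project root

# Variable names are assumptions - check main.py for the real ones.
openai_key = os.getenv("OPENAI_API_KEY")
assemblyai_key = os.getenv("ASSEMBLYAI_API_KEY")
elevenlabs_key = os.getenv("ELEVENLABS_API_KEY")
assert all((openai_key, assemblyai_key, elevenlabs_key)), "missing keys in .env"
```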
If you're getting stuck with installation issues, we offer an alternative Poetry-based installation method.
- Install Poetry
- Install all requirements by running `poetry install`. You will need to `brew install ffmpeg`, `brew install portaudio`, and `pip3 install "assemblyai[extras]"` (macOS) if you haven't already.
- Run the current implementation by running `poetry run python3 main.py`
Good luck!
Right now a lot of the latency comes from external services (TTS, LLM inference, STT). An easy way to reduce latency would be to use local models. For example you could use:
- NeMo ASR for STT
- Llama for the chatbot
- Bark or Tortoise for TTS

Try building a version of this chatbot that is local-only and see what speedup you achieve!
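A very rough sketch of what that local pipeline could look like - the model names, weight paths, and prompt format below are all placeholders, and the exact return types vary by library version, so expect to adapt it:

```python
# Placeholders throughout; Bark itself is slow without a GPU, so measure first.
import nemo.collections.asr as nemo_asr          # pip install "nemo_toolkit[asr]"
from llama_cpp import Llama                      # pip install llama-cpp-python
from bark import preload_models, generate_audio  # install from suno-ai/bark

asr = nemo_asr.models.ASRModel.from_pretrained("stt_en_conformer_ctc_small")
llm = Llama(model_path="llama-2-7b-chat.Q4_K_M.gguf")  # path to local GGUF weights
preload_models()  # download/cache Bark's models up front

text = asr.transcribe(["utterance.wav"])[0]  # STT on one recorded user turn
reply = llm(f"User: {text}\nAssistant:", max_tokens=64)["choices"][0]["text"]
audio = generate_audio(reply)                # numpy waveform from Bark
```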
I used async functions to parallelize the process. The main parallelization is in the `respond` function: I restructured it so that response generation and text-to-speech do not block each other. I used Python's `asyncio` library, scheduling the two stages as concurrent tasks and `await`ing their results so they overlap instead of running back to back.
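Below is a condensed, runnable sketch of that producer/consumer pattern with stand-in delays (the helper names are hypothetical - the real code lives in `lib/sales_chatbot.py` and `lib/elevenlabs_tts.py`):

```python
import asyncio

async def generate_sentences(prompt: str, queue: asyncio.Queue) -> None:
    """Producer: stand-in for streaming GPT-4 output sentence by sentence."""
    for sentence in ["Thanks for asking!", "Nooks automates your dialing."]:
        await asyncio.sleep(0.4)   # stand-in for LLM generation time
        await queue.put(sentence)
    await queue.put(None)          # sentinel: generation finished

async def speak_sentences(queue: asyncio.Queue) -> None:
    """Consumer: stand-in for ElevenLabs synthesis + playback."""
    while (sentence := await queue.get()) is not None:
        await asyncio.sleep(0.3)   # stand-in for TTS time
        print(f"[bot speaks] {sentence}")

async def respond(prompt: str) -> None:
    queue: asyncio.Queue = asyncio.Queue()
    # Run producer and consumer concurrently: the first sentence is spoken
    # while later sentences are still being generated.
    await asyncio.gather(generate_sentences(prompt, queue), speak_sentences(queue))

asyncio.run(respond("What does Nooks do?"))
```

The `None` sentinel lets the consumer drain the queue and exit cleanly once generation finishes, so `asyncio.gather` returns when both stages are done.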
Here is a recording of my implementation: Loom