This project is an open-source sample chatbot application built with HonoJS, utilizing Retrieval-Augmented Generation (RAG) powered by Google Gemini and Pinecone. It features real-time streaming responses and persists chat history in MongoDB.
- Framework: Built on HonoJS for a lightweight and fast web standard-based server.
- LLM: Uses Google's Gemini 2.5 Flash for fast and efficient text generation.
- Embeddings: Uses Gemini Text Embedding 004 for high-quality vector embeddings.
- Vector Database: Integrates with Pinecone for efficient similarity search and context retrieval.
- Database: Stores chat sessions and history in MongoDB.
- Streaming: Supports streaming responses for a better user experience.
- RAG: Implements a complete RAG pipeline:
  - Document ingestion from text files.
  - Chunking and embedding.
  - Context-aware response generation.
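Streaming works by writing model tokens into the response body as they are produced, rather than waiting for the full completion. The sketch below is framework-agnostic and illustrative only: it uses the standard Fetch `Response`/`ReadableStream` APIs available in Node 18+, and the token generator is a stub standing in for the Gemini stream. In the real app, the equivalent logic would live inside a Hono route handler; none of these names come from the codebase.

```typescript
// Stub token source standing in for the Gemini streaming API.
async function* fakeTokens(): AsyncGenerator<string> {
  for (const t of ["Hello", ", ", "world"]) yield t;
}

// Wrap an async token stream in a plain-text streaming Response.
// Each token is encoded and enqueued as soon as it arrives, so the
// client can render partial output before generation finishes.
function streamingResponse(tokens: AsyncIterable<string>): Response {
  const encoder = new TextEncoder();
  const body = new ReadableStream<Uint8Array>({
    async start(controller) {
      for await (const t of tokens) controller.enqueue(encoder.encode(t));
      controller.close();
    },
  });
  return new Response(body, {
    headers: { "Content-Type": "text/plain; charset=utf-8" },
  });
}
```

Hono also ships streaming helpers that wrap this pattern; the raw `ReadableStream` version is shown here only because it is self-contained.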
Before you begin, ensure you have the following:
- Node.js (v18 or higher)
- MongoDB: A running MongoDB instance (local or Atlas).
- Pinecone Account: An API key and an Index created in Pinecone.
- Google Gemini API Key: Access to Google's Generative AI models.
- Clone the repository:

  ```sh
  git clone <repository-url>
  cd hono-chatbot-rag
  ```

- Install dependencies:

  ```sh
  npm install
  ```

- Create a `.env` file in the root directory:

  ```sh
  touch .env
  ```

  Refer to `.env.example` for the required environment variables.
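Given the services used (MongoDB, Pinecone, Gemini), a filled-in `.env` will look roughly like the sketch below. The variable names here are hypothetical placeholders; `.env.example` is the authoritative list for this project.

```env
# Hypothetical variable names — check .env.example for the real ones.
PORT=3000
MONGODB_URI=mongodb://localhost:27017/hono-chatbot-rag
GEMINI_API_KEY=your-gemini-api-key
PINECONE_API_KEY=your-pinecone-api-key
PINECONE_INDEX=your-pinecone-index
```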
To use the RAG capabilities, you need to ingest documents into your Pinecone vector store.
- Place your text documents (`.txt` files) in the `.bin/docs` directory.
  - The script looks for files in `.bin/docs` relative to the project root.
  - Create the directory if it doesn't exist:

    ```sh
    mkdir -p .bin/docs
    ```

- Run the ingestion script:

  ```sh
  npm run ingest:embeddings
  ```

  This script will:

  - Load text files from `.bin/docs`.
  - Split them into chunks (1000 chars, 200 overlap).
  - Generate embeddings using Gemini.
  - Upload the vectors to your Pinecone index.
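The chunking step above can be sketched as a simple sliding window. This is an illustrative stand-alone function, not the project's actual implementation (a text-splitter utility is more likely in practice); `chunkText` and its defaults simply mirror the 1000-char / 200-overlap figures quoted above.

```typescript
// Sliding-window chunker: each chunk is at most `chunkSize` characters
// and shares `overlap` characters with the previous chunk, matching the
// 1000/200 settings used by the ingestion script.
function chunkText(text: string, chunkSize = 1000, overlap = 200): string[] {
  if (overlap >= chunkSize) throw new Error("overlap must be < chunkSize");
  const chunks: string[] = [];
  let start = 0;
  while (start < text.length) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // last window reached the end
    start += chunkSize - overlap; // step forward, keeping the overlap
  }
  return chunks;
}
```

With a 4-char window and 2-char overlap, `chunkText("abcdefghij", 4, 2)` yields `["abcd", "cdef", "efgh", "ghij"]` — the overlap gives the embedding model shared context across chunk boundaries.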
Run the server with hot-reloading:

```sh
npm run dev
```

Build and start the production server:

```sh
npm run build
npm start
```

The server will start on http://localhost:3000 (or your configured `PORT`).
Initialize a new chat session.

- Endpoint: `POST /chats`
- Response:

  ```json
  { "id": "65f..." }
  ```

  The returned `id` is the chat ID.

Send a user message and receive a streaming response.

- Endpoint: `PUT /chats/:id`
- Body:

  ```json
  { "content": "What is the name of chapter one?" }
  ```

- Response: A text stream of the assistant's response.
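Assuming the server is running locally on the default port, the two endpoints can be exercised with `curl` (the chat ID below is a placeholder to replace with a real one):

```sh
# Create a chat session; the response body contains the chat ID.
curl -X POST http://localhost:3000/chats

# Send a message to an existing chat and stream the reply.
# Replace <chat-id> with the id returned above.
curl -N -X PUT http://localhost:3000/chats/<chat-id> \
  -H "Content-Type: application/json" \
  -d '{"content": "What is the name of chapter one?"}'
```

The `-N` flag disables curl's output buffering so streamed tokens print as they arrive.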
- `src/app.ts`: Main application entry point and server setup.
- `src/api/`: Route handlers (chat creation, message handling).
- `src/services/`: External service integrations (Gemini, Pinecone).
- `src/models/`: Mongoose data models.
- `.bin/ingest.ts`: Script for processing and ingesting documents.
- `.bin/docs`: Directory for source documents for RAG.