AI Research Assistant

A full-stack AI research assistant powered by HuggingFace models and Pinecone vector database. This is a monolith application with a FastAPI backend serving a React frontend.

Project Structure

├── app/                   # Python backend
│   ├── agents/            # AI agents (RAG agent)
│   ├── core/              # Core utilities (logging, config)
│   ├── llm/               # LLM integrations (HuggingFace)
│   ├── retriever/         # Document retrieval (Pinecone)
│   ├── schemas/           # Pydantic models
│   └── main.py            # FastAPI application
├── frontend/              # React frontend
│   ├── src/               # Source files
│   ├── public/            # Static assets
│   ├── package.json       # Node dependencies
│   └── vite.config.ts     # Vite configuration
├── static/                # Built frontend (generated)
├── cloudbuild.yaml        # GCP Cloud Build config
├── Dockerfile             # Multi-stage container build
└── requirements.txt       # Python dependencies

Local Development

Prerequisites

Python 3.12+
Node.js 20+
HuggingFace API token
Pinecone API key

Backend Setup

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Set environment variables
export HUGGINGFACE_TOKEN=your_token_here
export PINECONE_API_KEY=your_pinecone_key_here

# Run the backend server
uvicorn app.main:app --host 0.0.0.0 --port 8080 --reload

Frontend Setup

cd frontend

# Install dependencies
npm install

# Run the dev server (proxies API requests to backend)
npm run dev

The frontend dev server runs on http://localhost:5173 and proxies /api/* requests to the backend at http://localhost:8080.

Production Build

cd frontend
npm run build

This builds the frontend and outputs to the static/ directory, which FastAPI serves automatically.

API Endpoints

POST /chat - Send a query and receive an AI-generated response with source documents
GET /health - Health check endpoint
GET /test-retriever - Test the Pinecone retriever

GCP Cloud Build Deployment

This project includes a cloudbuild.yaml for automated deployment to Google Cloud Run.

Prerequisites

GCP Project with the following APIs enabled:
- Cloud Build API
- Cloud Run API
- Artifact Registry API
- Secret Manager API

Artifact Registry Repository (create if not exists):

gcloud artifacts repositories create cloud-run-images \
  --repository-format=docker \
  --location=us-central1 \
  --description="Docker images for Cloud Run"

Secrets in Secret Manager:

# HuggingFace token
echo -n "your_huggingface_token" | gcloud secrets create HUGGINGFACE_TOKEN --data-file=-

# Pinecone API key
echo -n "your_pinecone_api_key" | gcloud secrets create PINECONE_API_KEY --data-file=-

IAM Permissions for Cloud Build service account:

PROJECT_ID=$(gcloud config get-value project)
PROJECT_NUMBER=$(gcloud projects describe $PROJECT_ID --format='value(projectNumber)')

# Grant Cloud Run Admin role
gcloud projects add-iam-policy-binding $PROJECT_ID \
  --member="serviceAccount:${PROJECT_NUMBER}@cloudbuild.gserviceaccount.com" \
  --role="roles/run.admin"

# Grant Service Account User role
gcloud projects add-iam-policy-binding $PROJECT_ID \
  --member="serviceAccount:${PROJECT_NUMBER}@cloudbuild.gserviceaccount.com" \
  --role="roles/iam.serviceAccountUser"

# Grant Secret Manager Accessor role
gcloud projects add-iam-policy-binding $PROJECT_ID \
  --member="serviceAccount:${PROJECT_NUMBER}@cloudbuild.gserviceaccount.com" \
  --role="roles/secretmanager.secretAccessor"

Manual Trigger

gcloud builds submit --config cloudbuild.yaml

With Custom Substitutions

gcloud builds submit --config cloudbuild.yaml \
  --substitutions=_REGION=us-west1,_SERVICE_NAME=my-ai-assistant

Set Up Continuous Deployment

Connect your repository to Cloud Build for automatic deployments on push:

gcloud builds triggers create github \
  --repo-name=ai-research-assistant \
  --repo-owner=YOUR_GITHUB_USERNAME \
  --branch-pattern="^main$" \
  --build-config=cloudbuild.yaml

Configuration

The Cloud Build configuration uses the following substitution variables (can be overridden):

Variable	Default	Description
`_REGION`	`us-central1`	GCP region for deployment
`_SERVICE_NAME`	`ai-research-assistant`	Cloud Run service name
`_REPOSITORY`	`cloud-run-images`	Artifact Registry repository

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
app		app
frontend		frontend
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
FRONTEND_SPEC.md		FRONTEND_SPEC.md
README.md		README.md
cloudbuild.yaml		cloudbuild.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Research Assistant

Project Structure

Local Development

Prerequisites

Backend Setup

Frontend Setup

Production Build

API Endpoints

GCP Cloud Build Deployment

Prerequisites

Manual Trigger

With Custom Substitutions

Set Up Continuous Deployment

Configuration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Research Assistant

Project Structure

Local Development

Prerequisites

Backend Setup

Frontend Setup

Production Build

API Endpoints

GCP Cloud Build Deployment

Prerequisites

Manual Trigger

With Custom Substitutions

Set Up Continuous Deployment

Configuration

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages