In this chapter you will learn how to deploy multiple Haystack pipelines as REST API services that interact with one another using Hayhooks, providing production-ready endpoints for document indexing and intelligent question answering. This is Part II of Chapter 7, covering an advanced case: deploying an indexing pipeline and an advanced retrieval pipeline that can dynamically ingest PDFs and web URLs and answer questions about the uploaded material. In Part I we covered a simpler case, deploying a single Haystack pipeline as a REST API with FastAPI and dockerizing the application.
This deployment includes an nginx reverse proxy with authentication, so your Hayhooks endpoints are secured rather than exposed to anyone on the network.
See SECURITY.md for complete security setup and configuration details.
- Setup Environment:
```
# Install dependencies
uv sync

# Copy .env.example
cp .env.example .env
```

Edit .env and add your OpenAI API key:

```
# OpenAI API Configuration
OPENAI_API_KEY=your_actual_openai_api_key_here

# Hayhooks Configuration
HAYHOOKS_HOST=0.0.0.0
HAYHOOKS_PORT=1416
HAYHOOKS_PIPELINES_DIR=./pipelines
HAYHOOKS_SHOW_TRACEBACKS=true
```

- Setup Authentication (Required for production):
```
# Generate credentials for API access
./scripts/generate_password.sh
```

- Run Hayhooks:
Option A: Local Development (no security)

```
# Runs on http://localhost:1416 - open to anyone with network access
# Use this for local testing only, NOT for production
uv run hayhooks run
```

You can then visit http://localhost:1416/docs#/ and test the endpoints.
Option B: Using Docker Compose with nginx Security (🔒 Recommended for production)

```
# Start both Hayhooks and nginx with authentication
docker-compose up -d

# Check logs
docker-compose logs -f

# Test the secured API
./scripts/test_secured_api.sh

# Access protected endpoints with authentication
curl -u username:password http://localhost:8080/status
```

You can then visit http://localhost:8080/docs, enter your username and password, and test the endpoints.
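For context, `curl -u username:password` simply attaches an HTTP Basic `Authorization` header, which nginx checks against the credentials created by `generate_password.sh`. A minimal stdlib sketch of how that header is built (the credentials here are placeholders, not real ones):

```python
import base64

def basic_auth_header(username: str, password: str) -> dict:
    """Build the HTTP Basic Authorization header that `curl -u` sends."""
    token = base64.b64encode(f"{username}:{password}".encode()).decode()
    return {"Authorization": f"Basic {token}"}

headers = basic_auth_header("username", "password")
print(headers["Authorization"])  # Basic dXNlcm5hbWU6cGFzc3dvcmQ=
```

You can pass these headers to any HTTP client, or simply use the `auth=(username, password)` parameter of `requests` instead of building the header yourself.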
```
# Stop services
docker-compose down
```

Option C: Using Docker (Single Container, No Security)
- Build the Docker image:

```
docker build -t hayhooks-index-rag .
```

- Run the container:

```
docker run -d \
  --name hayhooks-rag \
  -p 1416:1416 \
  -e OPENAI_API_KEY=your_actual_key_here \
  -v $(pwd)/qdrant_storage:/app/qdrant_storage \
  hayhooks-index-rag
```

- Check logs:

```
docker logs -f hayhooks-rag
```

- Test the API:

```
# Verify the server is running (unsecured)
curl http://localhost:1416/status
```

You can then visit http://localhost:1416/docs#/ and test the endpoints.

- Stop the container:

```
docker stop hayhooks-rag
docker rm hayhooks-rag
```

The Docker container packages all dependencies and pipelines for easy deployment:
- Port Mapping: Container port `1416` maps to host `localhost:1416`
- Volume Mounting: Local `qdrant_storage` directory persists vector data
- Environment Variables: OpenAI API key passed via the `-e` flag
```
# Build the image
docker build -t hayhooks-index-rag .

# Run container (basic)
docker run -d \
  --name hayhooks-rag \
  -p 1416:1416 \
  -e OPENAI_API_KEY=your_key \
  -v $(pwd)/qdrant_storage:/app/qdrant_storage \
  hayhooks-index-rag

# View logs in real-time
docker logs -f hayhooks-rag

# View last 100 lines of logs
docker logs --tail 100 hayhooks-rag

# Check container status
docker ps

# Stop container
docker stop hayhooks-rag

# Start existing container
docker start hayhooks-rag

# Remove container
docker rm hayhooks-rag

# Access container shell for debugging
docker exec -it hayhooks-rag /bin/bash

# Rebuild image (after code changes)
docker build --no-cache -t hayhooks-index-rag .
```

Run with a custom port:
```
docker run -d \
  --name hayhooks-rag \
  -p 8000:1416 \
  -e OPENAI_API_KEY=your_key \
  -v $(pwd)/qdrant_storage:/app/qdrant_storage \
  hayhooks-index-rag
```

Run with additional environment variables:
```
docker run -d \
  --name hayhooks-rag \
  -p 1416:1416 \
  -e OPENAI_API_KEY=your_key \
  -e QDRANT_API_KEY=your_qdrant_key \
  -e QDRANT_HOST_URL=your_qdrant_url \
  -v $(pwd)/qdrant_storage:/app/qdrant_storage \
  hayhooks-index-rag
```

Issue: Port already in use
```
# Find process using port 1416
lsof -i :1416

# Kill the process
kill -9 <PID>

# Or use a different port
docker run -p 8080:1416 ...
```

Issue: Container exits immediately
```
# Check logs for errors
docker logs hayhooks-rag

# Common causes:
# - Missing OPENAI_API_KEY
# - Port conflict
# - Invalid pipeline configuration
```

Issue: Changes not reflected after rebuild
```
# Stop and remove container
docker stop hayhooks-rag && docker rm hayhooks-rag

# Rebuild without cache
docker build --no-cache -t hayhooks-index-rag .

# Run new container
docker run -d --name hayhooks-rag -p 1416:1416 \
  -e OPENAI_API_KEY=your_key \
  -v $(pwd)/qdrant_storage:/app/qdrant_storage \
  hayhooks-index-rag
```

Issue: Permission denied on qdrant_storage
```
# Create directory with proper permissions
mkdir -p qdrant_storage
chmod 755 qdrant_storage
```

Once deployed, your pipelines are available at:
| Endpoint | Method | Description |
|---|---|---|
| `/indexing/run` | POST | Index documents from URLs or files |
| `/hybrid_rag/run` | POST | Answer questions using RAG |
| `/v1/chat/completions` | POST | OpenAI-compatible chat interface |
| `/docs` | GET | Interactive API documentation |
| `/status` | GET | Server and pipeline status |
```
curl -X POST "http://localhost:1416/indexing/run" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://example.com/article"]}'
```

```
curl -X POST "http://localhost:1416/hybrid_rag/run" \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the benefits of AI?"}'
```

```
curl -X POST "http://localhost:1416/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "hybrid_rag",
    "messages": [{"role": "user", "content": "Explain machine learning"}]
  }'
```

You can also call your deployed pipelines programmatically from Python, for example with the `requests` library:
```
import requests

# Base URL for your Hayhooks server
BASE_URL = "http://localhost:1416"

# Index documents
response = requests.post(
    f"{BASE_URL}/indexing/run",
    json={"urls": ["https://example.com/article"]}
)
print(response.json())

# Query the RAG pipeline
response = requests.post(
    f"{BASE_URL}/hybrid_rag/run",
    json={"query": "What are the benefits of AI?"}
)
print(response.json())

# OpenAI-compatible chat completions
response = requests.post(
    f"{BASE_URL}/v1/chat/completions",
    json={
        "model": "hybrid_rag",
        "messages": [{"role": "user", "content": "Explain machine learning"}]
    }
)
print(response.json())
```

You can also run pipelines directly from the command line:
```
# Run a query through the RAG pipeline
hayhooks pipeline run hybrid_rag --param 'query="What is AI?"'

# Index documents with file upload
hayhooks pipeline run indexing --file document.pdf

# Index multiple files
hayhooks pipeline run indexing --file file1.pdf --file file2.pdf

# Index a directory of files
hayhooks pipeline run indexing --dir ./documents
```

This project consists of two main pipelines:
- Indexing Pipeline: Processes documents (PDFs, web content) and stores them with embeddings in Qdrant
- Retrieval Pipeline: Performs hybrid RAG (BM25 + embeddings) to answer questions based on indexed documents
Both pipelines are exposed through Hayhooks as REST API endpoints, making them easy to integrate into web applications or other systems.
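To build intuition for what "hybrid" retrieval means, here is a toy sketch of reciprocal rank fusion (RRF), one common way to merge a keyword-based (BM25) ranking with an embedding-based ranking. This is an illustration only, not Haystack's internal implementation; the document IDs are made up:

```python
def rrf(rankings, k=60):
    """Fuse several ranked lists of doc ids into one ranking.

    Each document scores 1 / (k + rank + 1) per list it appears in,
    so documents ranked well by BOTH retrievers rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

bm25_ranking = ["doc_a", "doc_b", "doc_c"]       # keyword-match order
embedding_ranking = ["doc_c", "doc_a", "doc_d"]  # semantic-similarity order

fused = rrf([bm25_ranking, embedding_ranking])
print(fused)  # ['doc_a', 'doc_c', 'doc_b', 'doc_d']
```

Documents appearing high in both lists (`doc_a`, `doc_c`) outrank documents found by only one retriever, which is exactly the robustness hybrid search buys you.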
- Python 3.11+
- uv package manager
- Docker and Docker Compose
- OpenAI API key
```
hayhooks-mcp/
├── docker-compose.yml            # Docker services
├── pipelines/                    # Pipeline wrappers
│   ├── indexing/
│   │   └── pipeline_wrapper.py   # Indexing API wrapper
│   └── hybrid_rag/
│       └── pipeline_wrapper.py   # RAG API wrapper
```
- URL Indexing: Fetch and index web content
- File Upload: Index PDF, HTML, and text files
- Batch Processing: Handle multiple documents at once
- Smart Chunking: Sentence-based splitting with overlap
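The "smart chunking" idea can be sketched in a few lines. This toy version splits on periods and keeps a one-sentence overlap between chunks so context is not lost at chunk boundaries; the real pipeline uses Haystack's document splitter with proper sentence detection, so treat this as an illustration only:

```python
def chunk_sentences(text, chunk_size=3, overlap=1):
    """Split text into sentence windows of `chunk_size`, overlapping by `overlap`."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    step = max(1, chunk_size - overlap)  # how far the window advances each time
    chunks = []
    for start in range(0, len(sentences), step):
        window = sentences[start:start + chunk_size]
        chunks.append(". ".join(window) + ".")
        if start + chunk_size >= len(sentences):
            break
    return chunks

text = "One. Two. Three. Four. Five."
print(chunk_sentences(text))  # ['One. Two. Three.', 'Three. Four. Five.']
```

Note that "Three" appears in both chunks: the overlap ensures a sentence near a boundary can still be retrieved with some of its surrounding context.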
- Intelligent Retrieval: BM25 + embedding-based search
- Reranking: Advanced relevance scoring
- Contextual Answers: GPT-powered response generation
- Source Attribution: Track document sources
- Chat Completions: `/v1/chat/completions` endpoint
- Easy Integration: Works with existing OpenAI clients
- Streaming Support: Real-time response streaming
- Production Ready: Docker deployment with health checks
- Auto Documentation: Swagger/OpenAPI docs generation
- Smart Retrieval: Hybrid BM25 + embedding search
- Fast Processing: Optimized pipeline execution
- Error Handling: Comprehensive error reporting
- Scalable: Horizontal scaling support
- Extensible: Easy to add new endpoints
Error: 'dict' object has no attribute 'resolve_value' in DocumentWriter
- Cause: Configuration issues with document store
- Solution: Ensure document store is properly configured in pipeline YAML files
Error: ModuleNotFoundError: No module named 'nltk' or similar import errors
- Cause: Missing optional dependencies
- Solution: Install the missing packages (quote the version specifiers so the shell does not treat `>` as output redirection):

```
uv add "nltk>=3.9.1" lxml_html_clean "pypdf>=6.1.3"
```
Error: Connection or storage issues
- Solution: Ensure the Qdrant storage directory exists and has proper permissions:

```
mkdir -p ./qdrant_storage
```
Error: Invalid API key or authentication errors
- Solution: Verify your OpenAI API key is correctly set in `.env`
Error: Storage folder ./qdrant_storage is already accessed by another instance of Qdrant client

- Cause: Running the indexing pipeline and then attempting to run the RAG pipeline causes a concurrent access error, because both pipelines try to open the same local Qdrant storage folder
- Impact: You'll see this error when trying to use the `/hybrid_rag/run` endpoint after indexing documents
- Solution: Modify both pipeline serialization scripts to use cloud-based Qdrant instead of local storage:

In `pipelines/indexing_pipeline_serialization.py` and `pipelines/rag_pipeline_serialization.py`, replace:

```
# Initialize document store (same path as indexing)
document_store = QdrantDocumentStore(
    path="./qdrant_storage",
    index="documents",
    embedding_dim=1536,  # text-embedding-3-small dimension
    recreate_index=False,
    use_sparse_embeddings=True  # Enable sparse embeddings for BM25-like retrieval
)
```

With the cloud-based configuration:

```
# Initialize document store (cloud-based for concurrent access)
document_store = QdrantDocumentStore(
    url="your_qdrant_url",  # e.g., "https://xyz-example.eu-central.aws.cloud.qdrant.io:6333"
    api_key=Secret.from_env_var("QDRANT_API_KEY"),
    index="documents",
    embedding_dim=1536,
    recreate_index=False,
    use_sparse_embeddings=True
)
```

Then add to your `.env` file:

```
QDRANT_API_KEY=your_qdrant_api_key_here
QDRANT_HOST_URL=your_qdrant_url_here
```
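Before starting the pipelines, it can help to fail fast if the Qdrant variables from `.env` are missing. A small, hypothetical preflight helper using only the standard library (the variable values below are placeholders):

```python
import os

REQUIRED_VARS = ("QDRANT_API_KEY", "QDRANT_HOST_URL")

def missing_env(required=REQUIRED_VARS):
    """Return the names of required environment variables that are unset or empty."""
    return [name for name in required if not os.environ.get(name)]

# Simulate a configured environment for the sake of the example
os.environ["QDRANT_API_KEY"] = "demo-key"
os.environ["QDRANT_HOST_URL"] = "https://qdrant.example:6333"
print(missing_env())  # [] -> safe to start the pipelines
```

Calling such a check at the top of the serialization scripts turns a confusing mid-run authentication failure into an immediate, readable error.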
Warning: MPS backend is not available or errors related to device configuration

- Cause: The serialized pipeline YAML files may have `device: mps` set for the ranker component, but MPS (Metal Performance Shaders) is only available on Apple Silicon Macs and may not be supported in all environments
- Impact: Pipeline fails to load or run with device-related errors
- Solution: Modify the device configuration in the YAML files to use CPU instead:

In `pipelines/hybrid_rag/rag.yml`, find the ranker component's device configuration and change it to:

```
device:
  device: cpu
  type: single
```
This ensures compatibility across all environments including Docker containers and cloud deployments.
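The reasoning behind defaulting to CPU can be expressed as a tiny device-selection sketch. The availability flags are passed in explicitly here for illustration; a real implementation would query something like `torch.backends.mps` or `torch.cuda` (an assumption, not Haystack's actual code):

```python
def pick_device(mps_available=False, cuda_available=False):
    """Pick the fastest available accelerator, falling back to CPU."""
    if mps_available:
        return "mps"   # Apple Silicon GPUs only
    if cuda_available:
        return "cuda"  # NVIDIA GPUs only
    return "cpu"       # works everywhere, incl. Docker containers and cloud VMs

print(pick_device())  # cpu
```

Hard-coding `mps` in a serialized YAML skips this fallback entirely, which is why the pipeline breaks the moment it runs on non-Apple hardware.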
For detailed instructions on:
- Pipeline wrapper development
- YAML serialization
- Docker deployment
- Performance tuning
- Custom endpoints
- Production configuration