Skip to content

AI-on-Power-DACH/rag-on-ppc64le-workshop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸš€ RAG on IBM POWER Systems Workshop

License Python Platform

A comprehensive hands-on workshop for building and deploying Retrieval Augmented Generation (RAG) applications on IBM POWER Systems (ppc64le architecture).

πŸ“‹ Overview

This repository provides everything you need to deploy a RAG application on IBM POWER Systems, featuring:

  • πŸ€– LLM Runtime: llama.cpp with IBM Granite 4.0-H-Tiny model
  • πŸ—„οΈ Vector Database: ChromaDB for efficient document retrieval
  • 🎨 User Interface: Gradio-based chat interface with IBM theming
  • πŸ“š Knowledge Base: IBM RedBooks content on POWER systems, OpenShift, and Ansible

πŸ—οΈ Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Gradio UI      β”‚ ← User queries IBM RedBooks content
β”‚  (Port 7860)    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜
         β”‚
         β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
         β”‚                  β”‚
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  ChromaDB       β”‚  β”‚  llama.cpp      β”‚
β”‚  Vector Store   β”‚  β”‚  LLM Server     β”‚
β”‚  (Port 8000)    β”‚  β”‚  (Port 8080)    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

πŸ“‚ Repository Structure

.
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ app.py                # Gradio frontend application
β”‚   β”œβ”€β”€ ibm_theme.py          # IBM Design theme for Gradio
β”‚   β”œβ”€β”€ insert_documents.py   # Document ingestion script
β”‚   └── db_files/             # Knowledge base documents
β”‚       β”œβ”€β”€ Ansible.md        # Ansible automation on POWER
β”‚       β”œβ”€β”€ E1050.md          # IBM Power E1050 server
β”‚       β”œβ”€β”€ E1080.md          # IBM Power E1080 server
β”‚       β”œβ”€β”€ Openshift.md      # Red Hat OpenShift on POWER
β”‚       └── Scale_OUT.md      # POWER Scale-Out servers
β”œβ”€β”€ instructions.md           # Detailed deployment guide
β”œβ”€β”€ LICENSE                   # Apache 2.0 License
└── README.md                 # This file

πŸš€ Quick Start

Prerequisites

  • IBM POWER system running RHEL 9/10 (ppc64le)
  • Root or sudo access
  • Internet connectivity for package downloads

Deployment

Follow the comprehensive step-by-step instructions in instructions.md to:

  1. Deploy the LLM: Set up llama.cpp with IBM Granite model
  2. Configure Vector DB: Build and deploy ChromaDB from source
  3. Launch the UI: Start the Gradio-based chat interface

The complete deployment takes approximately 30-45 minutes.

πŸ’‘ Use Cases

This workshop demonstrates practical RAG applications for:

  • πŸ“– Technical Documentation Search: Query IBM RedBooks and technical manuals
  • πŸ” Knowledge Management: Build searchable knowledge bases from markdown documents
  • 🀝 IT Support: Provide AI-assisted answers from internal documentation
  • πŸŽ“ Training & Education: Interactive learning from technical content

About

Supporting material for a "RAG on IBM POWER Systems" workshop

Resources

License

Stars

Watchers

Forks

Languages