YouTube RAG Q&A with Google Gemini

🎬 YouTube Video Q&A powered by Retrieval-Augmented Generation (RAG) and Google Gemini LLM.
Ask questions about YouTube videos and get answers directly from the transcript.

Introduction

The YouTube RAG Q&A with Google Gemini project is an interactive web application that allows users to ask questions about YouTube videos and get precise answers based solely on the video transcript.

Using a Retrieval-Augmented Generation (RAG) approach, the app splits video transcripts into smaller chunks, converts them into embeddings, and stores them in a vector database (FAISS). When a user asks a question, the system retrieves the most relevant chunks and uses Google Gemini LLM to generate context-aware, accurate responses.

This project demonstrates the combination of video processing, natural language understanding, and large language models to create an intelligent and interactive video assistant.

Live Demo

🌐 Try the app

📸 Screenshot

Features

Automatic video validation and transcript fetching from YouTube.
RAG-based retrieval: answers are generated using only the video transcript.
Powered by Google Gemini LLM for high-quality answers.
Supports multiple languages in transcripts (e.g., English & Hindi).
Built with Streamlit for an interactive web interface.

How It Works

Video Validation: The app checks if the YouTube URL is valid and accessible.
Transcript Fetching: Captions are fetched using YouTubeTranscriptApi.
Text Chunking: The transcript is split into smaller chunks for better retrieval.
Embeddings & Vector Store: Each chunk is converted into embeddings using Google Gemini, and stored in FAISS.
Retrieval-Augmented Generation (RAG): When you ask a question, relevant chunks are retrieved from FAISS.
LLM Answering: Google Gemini LLM generates answers based only on the retrieved transcript chunks.

How to Use

Enter your Google AI Studio API key in the input field.
- To get a free API key: Google AI Studio.
Paste the full YouTube video URL you want to analyze.
Type your question about the video.
Click Submit Question to get the AI-generated answer.

Installation

Clone the repository:

git clone https://github.com/tahirkorma/rag-youtube-assistant.git
cd <rag-youtube-assistant>

Create a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # Linux/macOS
venv\Scripts\activate     # Windows

Install dependencies:

pip install -r requirements.txt

Run the App

streamlit run app.py

Requirements

Python 3.8+
Streamlit
Pytube
youtube-transcript-api
LangChain
FAISS
langchain-google-genai
(All dependencies are listed in requirements.txt)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.devcontainer		.devcontainer
.streamlit		.streamlit
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
screenshot.png		screenshot.png
youtube_rag_chatbot.ipynb		youtube_rag_chatbot.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YouTube RAG Q&A with Google Gemini

Introduction

Live Demo

📸 Screenshot

Features

How It Works

How to Use

Installation

Requirements

About

Uh oh!

Releases

Packages

Languages

tahirkorma/rag-youtube-assistant

Folders and files

Latest commit

History

Repository files navigation

YouTube RAG Q&A with Google Gemini

Introduction

Live Demo

📸 Screenshot

Features

How It Works

How to Use

Installation

Requirements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages