PodVibe.fm - AI-Powered Podcast Summarizer

Skip the fluff. Get the wisdom.
An intelligent agentic AI system that extracts the 20% of podcast content that delivers 80% of the learning value, powered by Google Gemini.

🎯 Overview

PodVibe.fm is a complete agentic AI system that transforms long-form YouTube podcast videos into concise, high-value summaries. Built for the Agentic AI Hackathon, it demonstrates a production-ready architecture with planning, execution, and observability components.

Key Features

🧠 Agentic AI Architecture - Modular design with Planner, Executor, and Memory components
🎙️ Intelligent Summarization - Uses Google Gemini to extract the 20% of content that delivers 80% of value
🔍 Semantic Keyword Extraction - Identifies 10 core concepts for quick navigation
⏱️ Timestamp Navigation - Click keywords to jump to where topics are discussed
📊 Full Observability - Complete memory logging of all agent decisions and actions
🎨 Modern UI - React frontend with Streamlit interface for development
🔄 ReAct Pattern - Demonstrates Reasoning + Acting workflow

🏗️ Architecture

This agent follows a clean agentic AI pattern with three core modules:

User Input → PLANNER → EXECUTOR → MEMORY
                ↓          ↓          ↓
           Sub-tasks   Tool Calls  Logging
                         ↓
                   [YouTube API]
                   [Gemini API]

Core Components

Planner (src/planner.py) - Breaks down user goals into executable sub-tasks
Executor (src/executor.py) - Executes tasks using appropriate tools (YouTube API, Gemini API)
Memory (src/memory.py) - Logs all agent activities for full observability

For detailed architecture documentation, see ARCHITECTURE.md.

🚀 Quick Start

Prerequisites

Python 3.8+
Google Gemini API Key
YouTube API Key (Optional) - For trending videos feature

Installation

Clone the repository

git clone https://github.com/euglopi/PodVibe.fm.git
cd PodVibe.fm

Install Python dependencies
```
pip install -r requirements.txt
```

Set up environment variables

Create a .env file in the project root:

GEMINI_API_KEY=your-gemini-api-key-here
YOUTUBE_API_KEY=your-youtube-api-key-here  # Optional

Running the Application

Option 1: Streamlit UI (Recommended for Testing)

cd src
streamlit run app.py

Open your browser to http://localhost:8501

Option 2: Flask API Server

cd src
python api.py

API will be available at http://localhost:8000

Available Endpoints:

GET /api/health - Health check
POST /api/summarize - Summarize a YouTube video
GET /api/trending - Get trending podcasts by category
POST /api/find-timestamp - Find where keywords are discussed
GET /api/models - List available Gemini models

Option 3: React Frontend

# Terminal 1: Start Flask backend
cd src
python api.py

# Terminal 2: Start React frontend
cd frontend
npm install
npm run dev

Option 4: Direct Python Usage

cd src
python youtube_summarizer.py

📖 How It Works

1. Planning Phase (ReAct: Reasoning)

The Planner creates an execution plan by breaking down the summarization task:

Extract video ID from YouTube URL
Fetch transcript from YouTube
Generate AI summary using Gemini (80/20 principle)
Extract 10 semantic keywords
Store results with metadata

2. Execution Phase (ReAct: Acting)

The Executor runs each task using appropriate tools:

URL Parser - Extracts video ID from various YouTube URL formats
YouTube API - Fetches transcripts with timestamps
Gemini API - Generates intelligent summaries focusing on high-value content
Keyword Extractor - Identifies core concepts using semantic analysis
Timestamp Finder - Locates where topics are discussed in the video

3. Memory & Observability

The Memory component logs every step:

User inputs
Execution plans
Task start/completion/failure events
Tool calls and results
Final outputs
Error details

All logs are available via API or exported to JSON files in logs/ directory.

For detailed technical explanation, see EXPLANATION.md.

🎯 Hackathon Requirements

✅ Google Gemini API Integration - Core AI engine for intelligent summarization
✅ Agentic Architecture - Modular design with Planner, Executor, and Memory
✅ ReAct Pattern - Reasoning + Acting workflow demonstrated
✅ Tool Integration - YouTube Transcript API + Gemini API
✅ Full Observability - Complete memory logging of all agent decisions
✅ Complete Documentation - Architecture, explanation, and demo included

📁 Project Structure

PodVibe.fm/
├── src/                          # Backend source code
│   ├── planner.py               # Planning module
│   ├── executor.py              # Execution module
│   ├── memory.py                # Memory/observability module
│   ├── youtube_summarizer.py   # Main agent orchestrator
│   ├── api.py                   # Flask REST API
│   ├── app.py                   # Streamlit UI
│   └── trending.py              # Trending videos API
│
├── frontend/                     # React frontend
│   ├── src/
│   │   ├── pages/              # Page components
│   │   │   ├── Home.tsx        # Landing page
│   │   │   ├── Browse.tsx      # Browse trending
│   │   │   └── Player.tsx      # Video player
│   │   └── components/         # UI components
│   └── package.json
│
├── logs/                         # Memory logs (auto-generated)
├── podcasts_out/                 # Saved summaries
│
├── ARCHITECTURE.md              # System architecture documentation
├── EXPLANATION.md               # Technical explanation
├── DEMO.md                      # Demo video link
├── README.md                    # This file
└── requirements.txt             # Python dependencies

🔧 Configuration

Environment Variables

Variable	Required	Description
`GEMINI_API_KEY`	Yes	Google Gemini API key for summarization
`YOUTUBE_API_KEY`	No	YouTube Data API key for trending videos (falls back to sample data)

Summary Types

The system supports three summary types:

comprehensive - Detailed summary with full context (default)
brief - Concise 2-3 paragraph summary
key_points - Bulleted list of main takeaways

📊 API Usage Examples

Summarize a Video

curl -X POST http://localhost:8000/api/summarize \
  -H "Content-Type: application/json" \
  -d '{"url": "https://www.youtube.com/watch?v=VIDEO_ID"}'

Find Keyword Timestamp

curl -X POST http://localhost:8000/api/find-timestamp \
  -H "Content-Type: application/json" \
  -d '{
    "video_id": "VIDEO_ID",
    "keyword": "artificial intelligence"
  }'

Get Trending Podcasts

curl http://localhost:8000/api/trending

🧪 Testing

Manual Testing

Streamlit UI - Interactive testing with visual feedback
Flask API - Test endpoints with curl or Postman
Direct Python - Run python src/youtube_summarizer.py

Test Scenarios

✅ Valid YouTube URLs with transcripts
✅ Invalid URLs (error handling)
✅ Videos without transcripts
✅ Different summary types
✅ Keyword extraction accuracy
✅ Memory logging completeness

📚 Documentation

ARCHITECTURE.md - Detailed system architecture and component design
EXPLANATION.md - Technical explanation of agent workflow and implementation
DEMO.md - Video demonstration

🤝 Contributing

This is a hackathon submission. For questions or feedback, please open an issue.

📄 License

See LICENSE file for details.

🙏 Acknowledgments

Google Gemini - Powerful AI for intelligent summarization
YouTube Transcript API - Reliable transcript fetching
Agentic AI Hackathon - Inspiration and framework

Built with ❤️ for the Agentic AI Hackathon

Skip the fluff. Get the wisdom.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
__pycache__		__pycache__
frontend		frontend
podcasts_out		podcasts_out
src		src
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
DEMO.md		DEMO.md
EXPLANATION.md		EXPLANATION.md
LICENSE		LICENSE
README.md		README.md
YOUTUBE_API_SETUP.md		YOUTUBE_API_SETUP.md
bun.lockb		bun.lockb
gemini_api_quickstart.py		gemini_api_quickstart.py
idea-guy-overview-12-12-25.md		idea-guy-overview-12-12-25.md
index.html		index.html
index.ts		index.ts
package-lock.json		package-lock.json
podcast_api_example.py		podcast_api_example.py
requirements.txt		requirements.txt
setup_youtube_api.sh		setup_youtube_api.sh
start.sh		start.sh
test_youtube_api.sh		test_youtube_api.sh
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Folders and files

Latest commit

History

Repository files navigation

PodVibe.fm - AI-Powered Podcast Summarizer

🎯 Overview

Key Features

🏗️ Architecture

Core Components

🚀 Quick Start

Prerequisites

Installation

Running the Application

Option 1: Streamlit UI (Recommended for Testing)

Option 2: Flask API Server

Option 3: React Frontend

Option 4: Direct Python Usage

📖 How It Works

1. Planning Phase (ReAct: Reasoning)

2. Execution Phase (ReAct: Acting)

3. Memory & Observability

🎯 Hackathon Requirements

📁 Project Structure

🔧 Configuration

Environment Variables

Summary Types

📊 API Usage Examples

Summarize a Video

Find Keyword Timestamp

Get Trending Podcasts

🧪 Testing

Manual Testing

Test Scenarios

📚 Documentation

🤝 Contributing

📄 License

🙏 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages