Commit 2a6f980

Merge pull request #64 from sudoleg/feat/chat
feat: introduce Q&A
2 parents 6ff01db + d02ecc7 commit 2a6f980

File tree

24 files changed: +1875 −237 lines changed

.assets/home.md

Lines changed: 21 additions & 0 deletions

```diff
@@ -0,0 +1,21 @@
+# YTAI - Your personal YouTube AI
+
+Get insights from YouTube videos with YTAI, an LLM-based app that lets you summarize videos or ask questions about them and receive answers. It tailors YouTube video summaries to your needs, offering a custom prompt feature so summaries come out exactly how you want them.
+
+Check out the project on [GitHub](https://github.com/sudoleg/ytai) for more information! Also, if you like the app, I would be very happy about a star :star:
+
+## Summary
+
+If you have a relatively short video (<30 minutes), you can use the `Summary` part of the application. It allows you to provide **a custom prompt** to tailor the summary to your needs. As the video is short anyway, you could, for example, provide your questions here. Moreover, you can save the responses (if you are running the app locally).
+
+Or you could ask it to list all the topics in the video and then ask specific questions about them using the other part of the application (see below) 😉
+
+## Q&A - Chat
+
+The `Chat` part of the application is best suited when you have a longer video that you have specific questions about. It is more efficient and less costly, as only the relevant parts of the video are provided to the language model. Moreover, once you process a video, you can run Q&A on it whenever you want (as long as you don't accidentally remove the Docker volume 😅).
+
+I've found that the chat function works especially well for videos with manual transcripts, but it's also really good for videos with autogenerated ones. Personally, I use the chat feature for lengthy podcasts with timestamps, like those from [Andrew Huberman](https://www.youtube.com/@hubermanlab), [Lex Fridman](https://www.youtube.com/@lexfridman) or [Chris Williamson](https://www.youtube.com/@ChrisWillx). You can just copy the title of a section/timestamp and get a good overview of the topics discussed.
+
+## WIP: FAQ
+
+Some day, an FAQ section will be here...
```
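The retrieval idea behind the `Chat` part — sending only the most relevant transcript chunks to the model instead of the whole transcript — can be sketched in plain Python. This is an illustrative toy using word overlap as the relevance score; the actual app uses embeddings and a vector store, so treat every name below as hypothetical:

```python
# Toy sketch of retrieval-based Q&A: split a transcript into chunks,
# score each chunk against the question by naive word overlap,
# and keep only the top-k chunks as context for the LLM.
# (Illustration only -- the real app uses embeddings, not word overlap.)

def chunk_text(text: str, chunk_size: int = 50) -> list[str]:
    """Split text into chunks of roughly `chunk_size` words each."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]

def top_k_chunks(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Rank chunks by how many question words they share, best first."""
    q_words = set(question.lower().split())
    ranked = sorted(chunks,
                    key=lambda c: len(q_words & set(c.lower().split())),
                    reverse=True)
    return ranked[:k]

transcript = "caffeine blocks adenosine receptors ... sleep consolidates memory"
chunks = chunk_text(transcript, chunk_size=4)
context = top_k_chunks("How does caffeine affect adenosine?", chunks, k=1)
# Only `context` (not the full transcript) would be sent to the model,
# which is why Q&A on long videos stays cheap.
```

This also explains why specific questions work better than broad ones: a vague question matches many chunks about equally well, so the retrieved context is less focused.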

.assets/rag_quidelines.md

Lines changed: 15 additions & 0 deletions

```diff
@@ -0,0 +1,15 @@
+# Guidelines
+
+You can either provide a question or just a topic from the video. If you ask a question, try to be as specific as possible. Your question should address a specific topic or passage from the video.
+
+**Don't ask for a summary! This part of the application is not designed for that! Use the `Summary` part instead!**
+
+## General tips
+
+- be specific
+- be concise
+- don't include instructions, like 'explain in detail' or 'answer from the perspective of X', etc.
+
+## Important notice ❗
+
+While this part of the application is designed to assist with your questions using advanced technology, it relies on a large language model (LLM) and may therefore produce inaccurate responses. LLMs may reflect biases present in their training data, leading to potentially inappropriate or unfair answers. The model might hallucinate, misinterpret questions, or provide vague responses, and it does not possess true understanding. Responses might inadvertently include sensitive or inappropriate content. Please acknowledge these limitations and use the provided information with caution and critical judgment.
```

.dockerignore

Lines changed: 2 additions & 1 deletion

```diff
@@ -7,4 +7,5 @@ test/
 .vscode/
 video_meta/*
 docs/
-.git/
+.git/
+data/*
```

.github/workflows/test.yaml

Lines changed: 3 additions & 2 deletions

```diff
@@ -16,7 +16,8 @@ jobs:
       - uses: actions/checkout@v4
       - uses: actions/setup-python@v5
         with:
-          python-version: "3.12.3"
+          python-version: "3.12.4"
       - uses: streamlit/streamlit-app-action@v0.0.3
         with:
-          app-path: main.py
+          app-path: main.py
+          skip-smoke: 'true'
```

.gitignore

Lines changed: 1 addition & 1 deletion

```diff
@@ -5,11 +5,11 @@ video_meta/
 design/
 test/
 certs/
-*.ipynb
 transcripts*/
 /data/chroma/*
 !/data/chroma/.gitkeep
 *.sqlite3
+scripts/test.ipynb
 
 # Byte-compiled / optimized / DLL files
 __pycache__/
```

.streamlit/config.toml

Lines changed: 3 additions & 0 deletions

```diff
@@ -1,3 +1,6 @@
+[client]
+showSidebarNavigation = false
+
 [browser]
 gatherUsageStats = false
 
```

Dockerfile

Lines changed: 3 additions & 3 deletions

```diff
@@ -1,5 +1,4 @@
-# Base stage for shared environment setup
-FROM python:3.12.3-slim
+FROM python:3.12.4
 
 # Set working directory
 WORKDIR /app
@@ -8,7 +7,7 @@ WORKDIR /app
 COPY requirements.txt .
 
 # Install Python dependencies
-RUN pip3 install -r requirements.txt
+RUN pip3 install --upgrade pip && pip3 install -r requirements.txt
 
 # Copy application's code
 COPY . /app/
@@ -17,6 +16,7 @@ COPY . /app/
 ENV PYTHONPATH="/app"
 ENV STREAMLIT_CLIENT_TOOLBAR_MODE="viewer"
 ENV STREAMLIT_SERVER_PORT=8501
+ENV ENVIRONMENT=production
 
 # Expose port for the application
 EXPOSE 8501
```

README.md

Lines changed: 20 additions & 7 deletions

````diff
@@ -6,12 +6,14 @@
 
 ## Features :sparkles:
 
-YTAI summarizes YouTube videos and is not the first project to do that. However, it offers some features that other similar projects and AI summarizers on the internet don't:
+YTAI lets you **summarize and chat (Q&A)** with YouTube videos. Its features include:
 
-- **provide a custom prompt** :writing_hand:
-  - you can tailor the response to your needs by providing a custom prompt or just use the default summarization
+- **provide a custom prompt for summaries** :writing_hand:
+  - you can tailor the summary to your needs by providing a custom prompt or just use the default summarization
 - **automatically save summaries** :open_file_folder:
   - the summaries can be automatically saved in the directory where you run the app. The summaries will be available under `<YT-channel-name>/<video-title>.md`
+- **create your own knowledge base** :floppy_disk:
+  - once you process a video, you can chat with it at any time!
 - **choose from different OpenAI models** :robot:
   - currently available: gpt-3.5-turbo, gpt-4 (turbo), gpt-4o
   - by choosing a different model, you can summarize even longer videos and potentially get better responses
@@ -24,13 +26,23 @@
 
 No matter how you choose to run the app, you will first need to get an OpenAI API key. This is very straightforward and free. Have a look at [their instructions](https://platform.openai.com/docs/quickstart/account-setup) to get started.
 
-### build & run with Docker
+### build & run with Docker (or docker-compose)
+
+1. make sure to provide an OpenAI API key (l. 43 in [docker-compose.yml](docker-compose.yml))
+2. adjust the path to save the summaries (l. 39 in [docker-compose.yml](docker-compose.yml))
+3. execute the following command:
 
 ```bash
-# build locally
-docker build --tag=ytai:latest .
-# or pull from Docker Hub
+docker-compose up --build -d
+```
+
+### if you are only interested in summaries
+
+```bash
+# pull from Docker Hub
 docker pull sudoleg/ytai:latest
+# or build locally
+docker build --tag=ytai:latest .
 docker run -d -p 8501:8501 -v $(pwd):/app/responses -e OPENAI_API_KEY=<your-openai-api-key> --name yt-summarizer sudoleg/ytai:latest
 ```
@@ -64,6 +76,7 @@ The project is built using some amazing libraries:
 - The project uses [YouTube Transcript API](https://github.com/jdepoix/youtube-transcript-api) for fetching transcripts.
 - [LangChain](https://github.com/langchain-ai/langchain) is used to create a prompt, submit it to an LLM and process its response.
 - The UI is built using [Streamlit](https://github.com/streamlit/streamlit).
+- [ChromaDB](https://docs.trychroma.com/) is used as a vector store for embeddings.
 
 ## License
````
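The new README steps refer to specific lines of `docker-compose.yml` without showing the file. For orientation, a compose file wired up this way might look roughly like the sketch below; the service name, volume path, and layout are assumptions for illustration only, and the actual file in the repository is authoritative:

```yaml
# Hypothetical sketch of the relevant docker-compose.yml settings.
# Keys and paths here are assumptions, not the repo's actual file.
services:
  ytai:
    build: .
    ports:
      - "8501:8501"
    volumes:
      - ./responses:/app/responses   # step 2: adjust the host path where summaries are saved
    environment:
      - OPENAI_API_KEY=<your-openai-api-key>   # step 1: provide your OpenAI API key
```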

config.json

Lines changed: 4 additions & 1 deletion

```diff
@@ -16,6 +16,9 @@
     "model": "The OpenAI API is powered by a diverse set of models with different capabilities and price points. Read more at https://platform.openai.com/docs/models/overview",
     "temperature": "In short, the lower the temperature, the more deterministic the results, in the sense that the highest-probability next token is always picked. Increasing the temperature leads to more randomness, which encourages more diverse or creative outputs. Read more at https://platform.openai.com/docs/guides/text-generation/how-should-i-set-the-temperature-parameter.",
     "top_p": "With top_p, only the tokens comprising the top_p probability mass are considered for responses, so a low top_p value selects the most confident responses, while a high top_p value lets the model consider more possible words, including less likely ones, leading to more diverse outputs. Read more at https://www.promptingguide.ai/introduction/settings",
-    "saving_responses": "Whether to save responses in the directory where you run the app. The responses will be saved under '<YT-channel-name>/<video-title>.md'."
+    "saving_responses": "Whether to save responses in the directory where you run the app. The responses will be saved under '<YT-channel-name>/<video-title>.md'.",
+    "chunk_size": "A larger chunk size increases the amount of context provided to the model to answer your question. However, it may be less relevant than with a small chunk size, as smaller chunks can encapsulate more semantic meaning. I would recommend a smaller chunk size for shorter videos and a larger one for longer videos (> 1h).",
+    "preprocess_checkbox": "By enabling this, the original transcript gets preprocessed. This can greatly improve the results, especially for videos with automatically generated transcripts. However, it results in higher costs, as the whole transcript gets processed by gpt-3.5-turbo. Also, the preprocessing will take a substantial amount of time.",
+    "selected_video": "Once you process a video, it gets saved in a database. You can chat with it at any time, without processing it again! Tip: you may also search for videos by typing (parts of) its title."
   }
 }
```
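The temperature tooltip added above can be made concrete with a small self-contained demo, independent of the app's code: temperature rescales the logits before the softmax, so low values concentrate probability on the top token and high values flatten the distribution.

```python
import math

def softmax_with_temperature(logits: list[float], temperature: float) -> list[float]:
    """Convert logits to probabilities; temperature divides the logits first."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]
cold = softmax_with_temperature(logits, temperature=0.2)  # sharply peaked: near-deterministic sampling
hot = softmax_with_temperature(logits, temperature=2.0)   # flatter: more diverse sampling
```

With `temperature=0.2` almost all probability mass lands on the first (highest-logit) token, while `temperature=2.0` spreads it across all three, which matches the "deterministic vs. creative" trade-off the tooltip describes.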

data/chroma/.gitkeep

Whitespace-only changes.
