|
1 | 1 | {
|
2 | 2 | "cells": [
|
| 3 | + { |
| 4 | + "cell_type": "markdown", |
| 5 | + "id": "d0b5beda", |
| 6 | + "metadata": {}, |
| 7 | + "source": [ |
| 8 | + "## Notebook 3: Transcript Re-writer\n", |
| 9 | + "\n", |
| 10 | + "In the previous notebook, we got a great podcast transcript from the raw file we uploaded earlier. \n", |
| 11 | + "\n", |
| 12 | + "In this one, we will use the `Llama-3.1-8B-Instruct` model to re-write the output from the previous pipeline and make it more dramatic or realistic." |
| 13 | + ] |
| 14 | + }, |
| 15 | + { |
| 16 | + "cell_type": "markdown", |
| 17 | + "id": "fdc3d32a", |
| 18 | + "metadata": {}, |
| 19 | + "source": [ |
| 20 | + "We will again set the `SYSTEM_PROMPT` and remind the model of its task. \n", |
| 21 | + "\n", |
| 22 | + "Note: We can even prompt the model like so to encourage creativity:\n", |
| 23 | + "\n", |
| 24 | + "> Your job is to use the podcast transcript written below to re-write it for an AI Text-To-Speech Pipeline. A very dumb AI had written this so you have to step up for your kind.\n" |
| 25 | + ] |
| 26 | + }, |
| 27 | + { |
| 28 | + "cell_type": "markdown", |
| 29 | + "id": "c32c0d85", |
| 30 | + "metadata": {}, |
| 31 | + "source": [ |
| 32 | + "Note: We will prompt the model to return a list of tuples, which will make our life easier in the next stage, when we use them for Text-To-Speech generation" |
| 33 | + ] |
| 34 | + }, |
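As an illustration of the format this note asks for (not part of the notebook itself), a response following the list-of-tuples convention could be parsed like so; the speaker names and lines here are hypothetical:

```python
import ast

# Hypothetical model output following the requested list-of-tuples format
model_output = """[
    ("Speaker 1", "Welcome to the show! Today we are talking about Llama models."),
    ("Speaker 2", "Umm, wait, what exactly is a Llama model?"),
]"""

# ast.literal_eval safely parses the string into a Python list of tuples,
# ready to be iterated over by a Text-To-Speech stage.
transcript = ast.literal_eval(model_output)

for speaker, line in transcript:
    print(f"{speaker}: {line}")
```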
3 | 35 | {
|
4 | 36 | "cell_type": "code",
|
5 | 37 | "execution_count": 1,
|
|
51 | 83 | "\"\"\""
|
52 | 84 | ]
|
53 | 85 | },
|
| 86 | + { |
| 87 | + "cell_type": "markdown", |
| 88 | + "id": "8ee70bee", |
| 89 | + "metadata": {}, |
| 90 | + "source": [ |
| 91 | + "This time, we will use the smaller 8B model" |
| 92 | + ] |
| 93 | + }, |
54 | 94 | {
|
55 | 95 | "cell_type": "code",
|
56 | 96 | "execution_count": 2,
|
|
61 | 101 | "MODEL = \"meta-llama/Llama-3.1-8B-Instruct\""
|
62 | 102 | ]
|
63 | 103 | },
|
| 104 | + { |
| 105 | + "cell_type": "markdown", |
| 106 | + "id": "f7bc794b", |
| 107 | + "metadata": {}, |
| 108 | + "source": [ |
| 109 | + "Let's import the necessary libraries" |
| 110 | + ] |
| 111 | + }, |
64 | 112 | {
|
65 | 113 | "cell_type": "code",
|
66 | 114 | "execution_count": 3,
|
|
79 | 127 | "warnings.filterwarnings('ignore')"
|
80 | 128 | ]
|
81 | 129 | },
|
| 130 | + { |
| 131 | + "cell_type": "markdown", |
| 132 | + "id": "8020c39c", |
| 133 | + "metadata": {}, |
| 134 | + "source": [ |
| 135 | + "We will load the pickle file saved from the previous notebook\n", |
| 136 | + "\n", |
| 137 | + "This time, the `INPUT_PROMPT` to the model will be the output from the previous stage" |
| 138 | + ] |
| 139 | + }, |
82 | 140 | {
|
83 | 141 | "cell_type": "code",
|
84 | 142 | "execution_count": 4,
|
|
92 | 150 | " INPUT_PROMPT = pickle.load(file)"
|
93 | 151 | ]
|
94 | 152 | },
|
| 153 | + { |
| 154 | + "cell_type": "markdown", |
| 155 | + "id": "c4461926", |
| 156 | + "metadata": {}, |
| 157 | + "source": [ |
| 158 | + "We can again use the Hugging Face `pipeline` method to generate text from the model" |
| 159 | + ] |
| 160 | + }, |
95 | 161 | {
|
96 | 162 | "cell_type": "code",
|
97 | 163 | "execution_count": null,
|
|
140 | 206 | ")"
|
141 | 207 | ]
|
142 | 208 | },
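For reference, the generation cell above (largely elided in this diff) presumably follows the standard `transformers` chat-pipeline pattern; a minimal sketch, where the prompt placeholders and the sampling parameters are assumptions:

```python
# Placeholders standing in for the notebook's actual prompts
SYSTEM_PROMPT = "Re-write the podcast transcript below for an AI TTS pipeline."  # placeholder
INPUT_PROMPT = "Speaker 1: Welcome to the show ..."  # transcript loaded from the pickle file (placeholder)

# Chat-style messages: system instructions plus the transcript to re-write
messages = [
    {"role": "system", "content": SYSTEM_PROMPT},
    {"role": "user", "content": INPUT_PROMPT},
]

# The actual call downloads the 8B model's weights, so it is shown commented out;
# max_new_tokens here is an assumption, not necessarily the notebook's value.
# from transformers import pipeline
# generator = pipeline(
#     "text-generation",
#     model="meta-llama/Llama-3.1-8B-Instruct",
#     device_map="auto",
# )
# outputs = generator(messages, max_new_tokens=8126)
# rewritten = outputs[0]["generated_text"][-1]["content"]
```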
|
| 209 | + { |
| 210 | + "cell_type": "markdown", |
| 211 | + "id": "612a27e0", |
| 212 | + "metadata": {}, |
| 213 | + "source": [ |
| 214 | + "We can verify the output from the model" |
| 215 | + ] |
| 216 | + }, |
143 | 217 | {
|
144 | 218 | "cell_type": "code",
|
145 | 219 | "execution_count": null,
|
|
160 | 234 | "save_string_pkl = outputs[0][\"generated_text\"][-1]['content']"
|
161 | 235 | ]
|
162 | 236 | },
|
| 237 | + { |
| 238 | + "cell_type": "markdown", |
| 239 | + "id": "d495a957", |
| 240 | + "metadata": {}, |
| 241 | + "source": [ |
| 242 | + "Let's save the output as a pickle file to be used in Notebook 4" |
| 243 | + ] |
| 244 | + }, |
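The save step in the cell below presumably uses the standard `pickle.dump` pattern; a minimal sketch, where the filename and the placeholder string are assumptions for illustration:

```python
import pickle

# Placeholder for the re-written transcript extracted from the model output above
save_string_pkl = "Speaker 1: Welcome back ..."

# Persist the re-written transcript so Notebook 4 can load it for TTS generation.
# The filename is an assumption, not necessarily the one the notebook uses.
with open("podready_data.pkl", "wb") as file:
    pickle.dump(save_string_pkl, file)
```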
163 | 245 | {
|
164 | 246 | "cell_type": "code",
|
165 | 247 | "execution_count": null,
|
|