Commit ca19fd6

Fix spellings
1 parent 46efd25 commit ca19fd6

2 files changed: 9 additions and 9 deletions

recipes/quickstart/NotebookLlama/README.md

Lines changed: 5 additions & 5 deletions
@@ -31,7 +31,7 @@ pip install -r requirements.txt
 
 - Notebook 1:
 
-This notebook is used for processing the PDF and processing it using the new Featherlight model into a `.txt` file.
+This notebook is used for processing the PDF and processing it using the new Feather light model into a `.txt` file.
 
 Update the first cell with a PDF link that you would like to use. Please decide on a PDF to use for Notebook 1, it can be any link but please remember to update the first cell of the notebook with the right link.
 
@@ -45,26 +45,26 @@ Please try experimenting with the System prompts for the model and see if you ca
 
 - Notebook 3:
 
-This notebook takes the transcript from earlier and prompts `Llama-3.1-8B-Instruct` to add more dramatisation and interruptions in the conversations.
+This notebook takes the transcript from earlier and prompts `Llama-3.1-8B-Instruct` to add more dramatization and interruptions in the conversations.
 
 There is also a key factor here: we return a tuple of conversation which makes our lives easier later. Yes, studying Data Structures 101 was actually useful for once!
 
 For our TTS logic, we use two different models that behave differently with certain prompts. So we prompt the model to add specifics for each speaker accordingly.
 
-Please again try changing the system prompt and see if you can imporve the results. We encourage testing the featherlight 3B and 1B models as well at this stage
+Please again try changing the system prompt and see if you can improve the results. We encourage testing the feather light 3B and 1B models as well at this stage
 
 - Notebook 4:
 
 Finally, we take the results from last notebook and convert them into a podcast. We use the `parler-tts/parler-tts-mini-v1` and `bark/suno` models for a conversation.
 
-The speakers and the prompt for parler model were decided based on experimentation and suggestions from the model authors. Please try experimentating, you can find more details in the resources section.
+The speakers and the prompt for parler model were decided based on experimentation and suggestions from the model authors. Please try experimenting, you can find more details in the resources section.
 
 
 #### Note: Right now there is one issue: Parler needs transformers 4.43.3 or earlier and for steps 1 to 3 of the pipeline you need latest, so we just switch versions in the last notebook.
 
 ### Next-Improvements/Further ideas:
 
-- Speech Model experimentation: The TTS model is the limitation of how natural this will sound. This probably be improved with a better pipeline and with the help of somone more knowledgable-PRs are welcome! :)
+- Speech Model experimentation: The TTS model is the limitation of how natural this will sound. This probably be improved with a better pipeline and with the help of someone more knowledgable-PRs are welcome! :)
 - LLM vs LLM Debate: Another approach of writing the podcast would be having two agents debate the topic of interest and write the podcast outline. Right now we use a single LLM (70B) to write the podcast outline
 - Testing 405B for writing the transcripts
 - Better prompting

recipes/quickstart/NotebookLlama/TTS_Notes.md

Lines changed: 4 additions & 4 deletions
@@ -54,11 +54,11 @@ Similar to parler models, suno has a [library](https://suno-ai.notion.site/8b8e8
 
 v9 from their library sounded robotic so we use Parler for our first speaker and the best one from bark.
 
-The incredible thing about Bark model is being able to add sound effects: `[Laugh]`, `[Gasps]`, `[Sigh]`, `[clears throat]`, making words capital causes the model to emphasise them.
+The incredible thing about Bark model is being able to add sound effects: `[Laugh]`, `[Gasps]`, `[Sigh]`, `[clears throat]`, making words capital causes the model to emphasize them.
 
-Adding `-` gives a break in the text. We utilise this knowledge when we re-write the transcript using the 8B model to add effects to our transcript.
+Adding `-` gives a break in the text. We utilize this knowledge when we re-write the transcript using the 8B model to add effects to our transcript.
 
-Note: Authors suggest using `...`. However, this didn't work as effectively as adding a hypen during trails.
+Note: Authors suggest using `...`. However, this didn't work as effectively as adding a hyphen during trails.
 
 #### Hyper-parameters:
 
@@ -105,7 +105,7 @@ Promising directions to explore in future:
 
 #### Some more models that weren't tested:
 
-In other words, we leave this as an excercise to readers :D
+In other words, we leave this as an exercise to readers :D
 
 - [Fish-Speech](https://huggingface.co/fishaudio/fish-speech-1.4)
 - [MMS-TTS-Eng](https://huggingface.co/facebook/mms-tts-eng)
