Description: LLM-based kitchen assistant that scrapes & summarizes recipe articles (e.g. from Pinterest).
-
Use a large LLM (llama3-8B-Instruct) to curate an instruct finetuning dataset.
- Develop a dataset generator class that handles scraping the article, prompting the large LLM, and writing out the examples (rough sketch below)
- will need to manually curate recipe article URLs to use
- in version 1, just provide a bulleted list of the recipe ingredients and instructions.
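A minimal sketch of what that generator class could look like, assuming requests + BeautifulSoup for scraping and the large LLM served behind an OpenAI-compatible endpoint; the class name, system prompt, and server URL are placeholders, not settled design:

```python
import json

import requests
from bs4 import BeautifulSoup
from openai import OpenAI  # any OpenAI-compatible client pointed at a local llama3 server

SYSTEM_PROMPT = (
    "You are a recipe summarizer. Given the text of a recipe article, return only "
    "a bulleted list of ingredients followed by a bulleted list of instructions."
)

class RecipeDatasetGenerator:
    """Scrapes recipe article URLs and asks the large LLM for the target summaries."""

    def __init__(self, base_url="http://localhost:8000/v1", model="llama3-8B-Instruct"):
        self.client = OpenAI(base_url=base_url, api_key="not-needed")
        self.model = model

    def scrape(self, url: str) -> str:
        # Crude text extraction; real recipe pages will need per-site cleanup.
        html = requests.get(url, timeout=10).text
        return BeautifulSoup(html, "html.parser").get_text(separator="\n", strip=True)

    def summarize(self, article_text: str) -> str:
        resp = self.client.chat.completions.create(
            model=self.model,
            messages=[
                {"role": "system", "content": SYSTEM_PROMPT},
                {"role": "user", "content": article_text},
            ],
        )
        return resp.choices[0].message.content

    def build(self, urls: list[str], out_path: str = "recipes.jsonl") -> None:
        # One instruct example per URL: article text in, bulleted summary out.
        with open(out_path, "w") as f:
            for url in urls:
                article = self.scrape(url)
                record = {"instruction": SYSTEM_PROMPT, "input": article,
                          "output": self.summarize(article)}
                f.write(json.dumps(record) + "\n")
```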
-
Finetune a smaller LLM (tinyllama?) on the generated dataset (via LoRA / qLoRA? rough sketch below)
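A rough QLoRA sketch for this step, assuming the HF transformers/peft/bitsandbytes stack, the recipes.jsonl format from the generator above, and a TinyLlama chat checkpoint; the checkpoint name, prompt layout, and hyperparameters are guesses, not decisions:

```python
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import (AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # assumption: which TinyLlama checkpoint

# QLoRA: load the base model in 4-bit, then train low-rank adapters on top of it.
bnb = BitsAndBytesConfig(load_in_4bit=True,
                         bnb_4bit_quant_type="nf4",
                         bnb_4bit_compute_dtype=torch.bfloat16)
model = AutoModelForCausalLM.from_pretrained(model_name, quantization_config=bnb,
                                             device_map="auto")
model = prepare_model_for_kbit_training(model)
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                                         target_modules=["q_proj", "v_proj"],
                                         task_type="CAUSAL_LM"))

tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token

def to_text(example):
    # Flatten one instruct example into a single training string; exact format TBD.
    return {"text": f"{example['instruction']}\n\n{example['input']}\n\n{example['output']}"}

ds = load_dataset("json", data_files="recipes.jsonl")["train"].map(to_text)
ds = ds.map(lambda e: tokenizer(e["text"], truncation=True, max_length=2048))

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="tinyllama-recipes-qlora",
                           per_device_train_batch_size=1,
                           gradient_accumulation_steps=8,
                           num_train_epochs=3,
                           learning_rate=2e-4,
                           logging_steps=10),
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```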
-
Deploy the finetuned small LLM to iOS (conversion sketch below)
- INT4?
- leverage the Apple Neural Engine (ANE)
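One possible deployment path is converting the merged finetuned model to Core ML with coremltools and letting the runtime schedule ops onto the ANE; 4-bit weight palettization stands in for "INT4" here. This only sketches the API shape, assuming the model traces cleanly with torch.jit and ignoring the KV-cache / stateful-decoding work a real on-device LLM needs; the paths and deployment target are placeholders:

```python
import numpy as np
import torch
import coremltools as ct
from coremltools.optimize.coreml import (OpPalettizerConfig, OptimizationConfig,
                                         palettize_weights)
from transformers import AutoModelForCausalLM

# Assumption: the LoRA weights have already been merged into the base model at this path.
model = AutoModelForCausalLM.from_pretrained("tinyllama-recipes-merged", torchscript=True).eval()

# Trace with a fixed-shape dummy input; real decoding needs KV-cache handling.
example_ids = torch.randint(0, 32000, (1, 128))
traced = torch.jit.trace(model, example_ids)

mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="input_ids", shape=example_ids.shape, dtype=np.int32)],
    compute_units=ct.ComputeUnit.ALL,          # lets Core ML place ops on the ANE where it can
    minimum_deployment_target=ct.target.iOS17,
)

# 4-bit weight palettization as a stand-in for "INT4" quantization.
opt_config = OptimizationConfig(global_config=OpPalettizerConfig(nbits=4, mode="kmeans"))
mlmodel = palettize_weights(mlmodel, opt_config)
mlmodel.save("RecipeSummarizer.mlpackage")
```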
-
Develop the frontend iOS app in Swift
- User either copy/pastes a URL into the app, or
- User opens a Pinterest article in the app directly from Pinterest and the LLM runs on it automatically
-
Can a large LLM reliably create an accurate instruct finetuning dataset?
-
What finetuning methods work best? Do I need qLoRA with a small model?
-
Can I deploy a huggingface model directly? Do I need to write a custom model / custom model components?
-
Can iOS handle the model size? (i.e. performance & memory limitations)
-
How much of a hurdle is it to target the ANE?
-
Can an article be opened directly from Pinterest?
- test training [x]
- test logging [x]
- add other metrics to log (e.g. perplexity? example generations? see the sketch after this list) [...]
- Generate full dataset [x]
- train :-) [ ]
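A possible shape for those extra metrics, assuming the HF Trainer from the finetuning sketch above; the callback name and sample prompt are placeholders. Perplexity falls out of the eval loss as exp(loss), and one example generation per evaluation pass gives a qualitative check:

```python
import math

import torch
from transformers import TrainerCallback

class RecipeEvalCallback(TrainerCallback):
    """Prints perplexity and one sample generation after each evaluation pass."""

    def __init__(self, tokenizer, sample_prompt):
        self.tokenizer = tokenizer
        self.sample_prompt = sample_prompt  # e.g. one held-out recipe article

    def on_evaluate(self, args, state, control, model=None, metrics=None, **kwargs):
        # Perplexity is just exp of the mean eval cross-entropy loss.
        if metrics is not None and "eval_loss" in metrics:
            print(f"perplexity: {math.exp(metrics['eval_loss']):.2f}")

        # One qualitative sample so the logs show what the model actually writes.
        if model is not None:
            inputs = self.tokenizer(self.sample_prompt, return_tensors="pt").to(model.device)
            with torch.no_grad():
                out = model.generate(**inputs, max_new_tokens=256)
            print(self.tokenizer.decode(out[0], skip_special_tokens=True))
```

Register it with trainer.add_callback(RecipeEvalCallback(tokenizer, held_out_article)) before calling trainer.train(); swapping print for whatever logging tool the project settles on is straightforward.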