Merge vLLM deployer project to llm-finetuning (#163)
* use the official huggingface integration
* add option to deploy to vllm
* add vllm deployer step
* fix readme and add deployment target option
* test config
* test
* fix vllm integration
* rm llm-vllm project
* uncomment config file
* Update dependencies and GCP library versions in requirements.txt
* stop overwriting the config for dataset generation
* Configure Hugging Face cache directories for dataset preparation
* Update dataset name from htahir1 to zenml namespace in configuration files
* Update README with ZenML namespace dataset and repository links
* change hf repo to zenml
* use uv
* fix syntax
* update deprecated log metadata command
---------
Co-authored-by: Alex Strick van Linschoten <[email protected]>
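The "Configure Hugging Face cache directories for dataset preparation" item in the commit message can be illustrated with a minimal sketch. This is not the code from the commit: the function name `configure_hf_cache`, the directory layout, and the choice of variables are assumptions made for illustration. `HF_HOME`, `HF_HUB_CACHE`, and `HF_DATASETS_CACHE` are environment variables read by the Hugging Face libraries, and they generally need to be set before those libraries are imported.

```python
import os
import tempfile
from pathlib import Path


def configure_hf_cache(base_dir: str) -> dict[str, str]:
    """Point the Hugging Face caches at subdirectories of base_dir.

    Hypothetical helper: the directory names below are illustrative,
    not the ones used by the project.
    """
    base = Path(base_dir)
    cache_dirs = {
        "HF_HOME": base / "hf_home",
        "HF_HUB_CACHE": base / "hf_home" / "hub",
        "HF_DATASETS_CACHE": base / "hf_home" / "datasets",
    }
    for env_var, path in cache_dirs.items():
        # Create the directory up front so a step running in a
        # read-only or ephemeral container fails early if it cannot write.
        path.mkdir(parents=True, exist_ok=True)
        os.environ[env_var] = str(path)
    return {name: str(path) for name, path in cache_dirs.items()}


if __name__ == "__main__":
    # Use a throwaway directory for the demo run.
    scratch = tempfile.mkdtemp(prefix="hf-cache-")
    for name, path in configure_hf_cache(scratch).items():
        print(f"{name}={path}")
```

Calling such a helper at the top of a dataset preparation step, before `datasets` or `huggingface_hub` is imported, is one way to keep downloads inside a writable location.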
The `feature_engineering` and `deployment` pipeline can be run simply with the `default` stack, but the training pipelines [stack](https://docs.zenml.io/user-guide/production-guide/understand-stacks) will depend on the config.
A working prototype has been trained and deployed as of Jan 19 2024. The model is using minimal data and finetuned using QLoRA and PEFT. The model was trained using 1 A100 GPU on the cloud:
-- Training dataset [Link](https://huggingface.co/datasets/htahir1/zenml-codegen-v1)
+- Training dataset [Link](https://huggingface.co/datasets/zenml/zenml-codegen-v1)
- PEFT Model [Link](https://huggingface.co/htahir1/peft-lora-zencoder15B-personal-copilot/)
- Fully merged model (Ready to deploy on HuggingFace Inference Endpoints) [Link](https://huggingface.co/htahir1/peft-lora-zencoder15B-personal-copilot-merged)
@@ -147,7 +147,7 @@ The [ZenML Pro](https://zenml.io/pro) was used to manage the pipelines, models,
This project recently did a [call of volunteers](https://www.linkedin.com/feed/update/urn:li:activity:7150388250178662400/). This TODO list can serve as a source of collaboration. If you want to work on any of the following, please [create an issue on this repository](https://github.com/zenml-io/zenml-projects/issues) and assign it to yourself!
--[x] Create a functioning data generation pipeline (initial dataset with the core [ZenML repo](https://github.com/zenml-io/zenml) scraped and pushed [here](https://huggingface.co/datasets/htahir1/zenml-codegen-v1))
+-[x] Create a functioning data generation pipeline (initial dataset with the core [ZenML repo](https://github.com/zenml-io/zenml) scraped and pushed [here](https://huggingface.co/datasets/zenml/zenml-codegen-v1))
-[x] Deploy the model on a HuggingFace inference endpoint and use it in the [VS Code Extension](https://github.com/huggingface/llm-vscode#installation) using a deployment pipeline.
-[x] Create a functioning training pipeline.
-[ ] Curate a set of 5-10 repositories that are using the ZenML latest syntax and use data generation pipeline to push dataset to HuggingFace.
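Both hunks change only dataset links, moving them from the `htahir1` namespace to `zenml`; the PEFT and merged model links stay under `htahir1`. A minimal sketch of that rewrite rule follows. The helper `migrate_dataset_links` is hypothetical and not part of the commit; it only shows the pattern the diff applies by hand.

```python
import re

OLD_NAMESPACE = "htahir1"
NEW_NAMESPACE = "zenml"

# Match only dataset URLs; model URLs (no "/datasets/" segment) are left alone.
_DATASET_LINK = re.compile(
    r"(https://huggingface\.co/datasets/)" + re.escape(OLD_NAMESPACE) + r"/"
)


def migrate_dataset_links(text: str) -> str:
    """Rewrite Hugging Face dataset links from the old namespace to the new one."""
    return _DATASET_LINK.sub(lambda m: m.group(1) + NEW_NAMESPACE + "/", text)


if __name__ == "__main__":
    line = "- Training dataset [Link](https://huggingface.co/datasets/htahir1/zenml-codegen-v1)"
    print(migrate_dataset_links(line))
```

Restricting the pattern to the `/datasets/` path keeps the model links untouched, which matches what the diff shows.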