Skip to content

Commit e9480dc

Browse files
authored
Merge branch 'main' into bugfix/fixgame
2 parents d66ba58 + 5ad4548 commit e9480dc

File tree

97 files changed

+6883
-50
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

97 files changed

+6883
-50
lines changed

.github/workflows/staging_run_complete_llm.yml

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -45,6 +45,18 @@ jobs:
4545
run: |
4646
zenml init
4747
48+
- name: List and describe ZenML projects
49+
working-directory: ./llm-complete-guide
50+
run: |
51+
zenml project list || echo "Could not list projects"
52+
zenml project describe || echo "Failed to describe project"
53+
54+
- name: Register Set ZenML project
55+
working-directory: ./llm-complete-guide
56+
run: |
57+
zenml project register llm-complete-guide || echo "Failed to register project"
58+
zenml project set llm-complete-guide || echo "Failed to set project"
59+
4860
- name: Set stack (Staging)
4961
working-directory: ./llm-complete-guide
5062
run: |

.typos.toml

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,7 @@ extend-exclude = [
55
"*.ipynb",
66
"llm-finetuning/*",
77
"end-to-end-computer-vision/*",
8+
"research-radar/data/*.jsonl",
89
]
910

1011
[default.extend-identifiers]
@@ -17,6 +18,8 @@ MDEyOk9yZ2FuaXphdGlvbjg4Njc2OTU1 = "MDEyOk9yZ2FuaXphdGlvbjg4Njc2OTU1"
1718

1819
[default.extend-words]
1920
# Don't correct the surname "Teh"
21+
preprocesser = "preprocesser"
22+
Preprocesser = "Preprocesser"
2023
aks = "aks"
2124
hashi = "hashi"
2225
womens = "womens"
@@ -39,6 +42,12 @@ colours = "colours"
3942
initialised = "initialised"
4043
visualisation = "visualisation"
4144
customise = "customise"
45+
modernbert = "modernbert"
46+
deepseek = "deepseek"
47+
huggingface = "huggingface"
48+
answerdotai = "answerdotai"
49+
preprocessor = "preprocessor"
50+
logits = "logits"
4251

4352
[default]
44-
locale = "en-us"
53+
locale = "en-us"

README.md

Lines changed: 17 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
<!-- PROJECT LOGO -->
2-
<img referrerpolicy="no-referrer-when-downgrade" src="https://static.scarf.sh/a.png?x-pxid=03d804c9-e44a-471e-b56d-81085bc925ec" />
2+
<img referrerpolicy="no-referrer-when-downgrade" src="https://static.scarf.sh/a.png?x-pxid=03d804c9-e44a-471e-b56d-81685bc925ec" />
33

44
<br />
55
<div align="center">
@@ -55,23 +55,20 @@ you can adapt for your application. We maintain a growing list of projects
5555
from various ML domains including time-series, tabular data, computer vision,
5656
etc.
5757

58-
# 🧱 Project List
59-
60-
A list of updated and maintained projects by the ZenML team and the community:
61-
62-
| Project | Tags | Tools |
63-
|---------|------|-------|
64-
| [LLM Agents](llm-agents) | `NLP` `LLM Agents` `Conversational AI` `RAG` `Vector Stores` `Production MLOps` | `langchain` `llama_index` `faiss` `openai` |
65-
| [LLM Finetuning](llm-finetuning) | `NLP` `LLM` `Model Fine-tuning` `Transfer Learning` `Parameter Optimization` | `huggingface` `pytorch` `wandb` |
66-
| [Complete Guide to LLMs](llm-complete-guide) | `NLP` `LLM` `RAG` `Fine-tuning` `Model Evaluation` `Embeddings` `Synthetic Data` | `openai` `supabase` `huggingface` `argilla` `gradio` `anthropic` `litellm` |
67-
| [LLM LoRA Finetuning](llm-lora-finetuning) | `NLP` `Parameter-Efficient Fine-tuning` `LoRA` `LLM` `Distributed Training` | `huggingface` `pytorch` `accelerate` `peft` `phi-2` |
68-
| [End-to-end Computer Vision](end-to-end-computer-vision) | `Computer Vision` `Object Detection` `Data Labeling` `Human-in-the-Loop` | `pytorch` `label_studio` `fiftyone` `vertex-ai` `gcp` `yolov8` |
69-
| [Flux Dreambooth](flux-dreambooth) | `Image Generation` `Fine-tuning` `Stable Diffusion` `LoRA` `Video Generation` | `modal` `kubernetes` `huggingface` `flux` `stable-video-diffusion` |
70-
| [Huggingface to Sagemaker](huggingface-sagemaker) | `Model Deployment` `NLP` `Sentiment Analysis` `Model Training` `CI/CD` | `pytorch` `mlflow` `huggingface` `aws` `sagemaker` `s3` `kubeflow` `slack` `github` |
71-
| [Databricks Production QA Demo](databricks-production-qa-demo) | `Quality Assurance` `CI/CD` `Model Monitoring` `Model Explainability` `Data Drift` | `databricks` `mlflow` `evidently` `shap` `slack` |
72-
| [ECB Interest Rate Prediction with GCP Cloud Composer](airflow-cloud-composer-etl-feature-train) | `ETL` `Time Series` `Feature Engineering` `Regression` `Workflow Orchestration` | `cloud-composer` `airflow` `vertex-ai` `bigquery` `xgboost` `gcp` |
73-
| [Supabase OpenAI Summary](supabase-openai-summary) | `NLP` `Text Summarization` `Database Integration` `LLM` `Automated Reporting` | `openai` `supabase` `slack` `github-actions` `gcp` |
74-
| [Sign Language Detection with YOLOv5](sign-language-detection-yolov5) | `Computer Vision` `Object Detection` `Real-time Processing` `Model Deployment` | `mlflow` `gcp` `bentoml` `vertex-ai` `docker` |
58+
| Project | Domain | Key Features | Core Technologies |
59+
|---------|--------|-------------|-------------------|
60+
| [ZenML Support Agent](zenml-support-agent) | 🤖 LLMOps | 🔍 RAG, 📊 Vector DB, 💬 Conversational | langchain, llama_index, openai |
61+
| [ZenCoder](zencoder) | 🤖 LLMOps | 🧠 Fine-tuning, 📈 Transfer Learning | huggingface, pytorch, wandb |
62+
| [Complete Guide to LLMs](llm-complete-guide) | 🤖 LLMOps | 🔍 RAG, 🧠 Fine-tuning, 📊 Evaluation | openai, huggingface, anthropic |
63+
| [Gamesense](gamesense) | 🤖 LLMOps | 🧠 LoRA, ⚡ Efficient Training | pytorch, peft, phi-2 |
64+
| [Nightwatch AI](nightwatch-ai) | 🤖 LLMOps | 📝 Summarization, 📊 Reporting | openai, supabase, slack |
65+
| [ResearchRadar](research-radar) | 🤖 LLMOps | 📝 Classification, 📊 Comparison | anthropic, huggingface, transformers |
66+
| [End-to-end Computer Vision](end-to-end-computer-vision) | 👁️ Vision | 🎯 Object Detection, 🏷️ Labeling | pytorch, label_studio, yolov8 |
67+
| [Magic Photobooth](magic-photobooth) | 👁️ Vision | 🖼️ Image Gen, 🎬 Video Gen | stable-diffusion, huggingface |
68+
| [Sign Language Detection](sign-language-detection-yolov5) | 👁️ Vision | 🎯 Object Detection, ⚡ Real-time | mlflow, bentoml, vertex-ai |
69+
| [Huggingface to Sagemaker](huggingface-sagemaker) | 🚀 MLOps | 🔄 CI/CD, 📦 Deployment | mlflow, sagemaker, kubeflow |
70+
| [Databricks Production QA](databricks-production-qa-demo) | 🚀 MLOps | 📊 Monitoring, 🔍 Quality Assurance | databricks, evidently, shap |
71+
| [Eurorate Predictor](eurorate-predictor) | 📊 Data | ⏱️ Time Series, 🔄 ETL | airflow, bigquery, xgboost |
7572

7673
# 💻 System Requirements
7774

@@ -80,7 +77,7 @@ Read [our docs](https://docs.zenml.io/getting-started/installation) for
8077
installation details.
8178

8279
- Linux or macOS.
83-
- Python 3.7, 3.8, 3.9 or 3.10
80+
- Python >=3.9
8481

8582
# 🪃 Contributing
8683

@@ -131,21 +128,19 @@ the Apache License Version 2.0.
131128
| 🗳 **[Vote for Features]** | Pick what we work on next! |
132129
| 📓 **[Docs]** | Full documentation for creating your own ZenML pipelines. |
133130
| 📒 **[API Reference]** | Detailed reference on ZenML's API. |
134-
| 👨‍🍳 **[MLStacks]** | Terraform-based infrastructure recipes for pre-made ZenML stacks. |
135131
| ⚽️ **[Examples]** | Learn best through examples where ZenML is used. We've got you covered. |
136132
| 📬 **[Blog]** | Use cases of ZenML and technical deep dives on how we built it. |
137133
| 🔈 **[Podcast]** | Conversations with leaders in ML, released every 2 weeks. |
138134
| 💬 **[Join Slack]** | Need help with your specific use case? Say hi on Slack! |
139135
| 🗺 **[Roadmap]** | See where ZenML is working to build new features. |
140136
| 🙋‍♀️ **[Contribute]** | How to contribute to the ZenML project and code base. |
141137

142-
[ZenML 101]: https://docs.zenml.io/user-guide/starter-guide
138+
[ZenML 101]: https://docs.zenml.io/user-guides/starter-guide
143139
[Core Concepts]: https://docs.zenml.io/getting-started/core-concepts
144140
[Our latest release]: https://github.com/zenml-io/zenml/releases
145141
[Vote for Features]: https://zenml.io/discussion
146142
[Docs]: https://docs.zenml.io/
147143
[API Reference]: https://apidocs.zenml.io/
148-
[MLStacks]: https://github.com/zenml-io/mlops-stacks
149144
[Examples]: https://github.com/zenml-io/zenml/tree/main/examples
150145
[Blog]: https://blog.zenml.io/
151146
[Podcast]: https://podcast.zenml.io/

databricks-production-qa-demo/steps/deployment/deployment_deploy.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ def deployment_deploy() -> Annotated[
4545
In this example, the step can be configured to use different input data.
4646
See the documentation for more information:
4747
48-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
48+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
4949
5050
Args:
5151
dataset_inf: The inference dataset.

databricks-production-qa-demo/steps/etl/train_data_splitter.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@ def train_data_splitter(
4141
In this example, the step can be configured to use different test
4242
set sizes. See the documentation for more information:
4343
44-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
44+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
4545
4646
Args:
4747
dataset: Dataset read from source.

databricks-production-qa-demo/steps/hp_tuning/hp_tuning_single_search.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ def hp_tuning_single_search(
5050
to use different input datasets and also have a flag to fall back to default
5151
model architecture. See the documentation for more information:
5252
53-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
53+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
5454
5555
Args:
5656
model_package: The package containing the model to use for hyperparameter tuning.

databricks-production-qa-demo/steps/inference/inference_predict.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ def inference_predict(
4343
In this example, the step can be configured to use different input data.
4444
See the documentation for more information:
4545
46-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
46+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
4747
4848
Args:
4949
dataset_inf: The inference dataset.

databricks-production-qa-demo/steps/promotion/compute_performance_metrics_on_current_data.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ def compute_performance_metrics_on_current_data(
4444
and target environment stage for promotion.
4545
See the documentation for more information:
4646
47-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
47+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
4848
4949
Args:
5050
dataset_tst: The test dataset.

databricks-production-qa-demo/steps/promotion/promote_with_metric_compare.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ def promote_with_metric_compare(
4646
and target environment stage for promotion.
4747
See the documentation for more information:
4848
49-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
49+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
5050
5151
Args:
5252
latest_metric: Recently trained model metric results.

databricks-production-qa-demo/steps/training/model_trainer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,7 @@ def model_trainer(
7272
hyperparameters to the model constructor. See the documentation for more
7373
information:
7474
75-
https://docs.zenml.io/how-to/build-pipelines/use-pipeline-step-parameters
75+
https://docs.zenml.io/how-to/pipeline-development/use-configuration-files
7676
7777
Args:
7878
dataset_trn: The preprocessed train dataset.

0 commit comments

Comments
 (0)