llm-complete-guide/README.md
Lines changed: 47 additions & 1 deletion
@@ -116,7 +116,7 @@ Note that Claude will require a different API key from Anthropic. See [the
 `litellm` docs](https://docs.litellm.ai/docs/providers/anthropic) on how to set
 this up.
 
-### Run the evaluation pipeline
+### Run the LLM RAG evaluation pipeline
 
 To run the evaluation pipeline, you can use the following command:
 
@@ -127,6 +127,52 @@ python run.py --evaluation
 You'll need to have first run the RAG pipeline to have the necessary assets in
 the database to evaluate.
 
+## Embeddings finetuning
+
+For embeddings finetuning we first generate synthetic data and then finetune the
+embeddings. Both of these pipelines are described in [the LLMOps guide](https://docs.zenml.io/v/docs/user-guide/llmops-guide/finetuning-embeddings) and
+instructions for how to run them are provided below.
+
+### Run the `distilabel` synthetic data generation pipeline
+
+To run the `distilabel` synthetic data generation pipeline, you can use the following commands:
+
+```shell
+pip install -r requirements-argilla.txt # special requirements
+python run.py --synthetic
+```
+
+You will also need to have set up and connected to an Argilla instance for this
+to work. Please follow the instructions in the [Argilla