Skip to content

Commit 4025330

Browse files
authored
Added embedding dimensions by model (#307)
1 parent b893eb7 commit 4025330

File tree

3 files changed

+34
-13
lines changed

3 files changed

+34
-13
lines changed

platform/embedding.mdx

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -59,9 +59,16 @@ on Hugging Face:
5959

6060
## Generate embeddings
6161

62-
To generate embeddings, choose one of the following embedding providers in the **Providers** section of an **Embedder** node in a workflow:
62+
To generate embeddings, choose one of the following embedding providers and models in the **Providers** section of an **Embedder** node in a workflow:
6363

6464
<Note>You can change a workflow's predefined provider only through [Custom](/platform/workflows#create-a-custom-workflow) workflow settings.</Note>
6565

66-
- **OpenAI**: Use [OpenAI](https://openai.com) to generate embeddings.
67-
- **Vertex AI**: Use [Vertex AI](https://cloud.google.com/vertex-ai) to generate embeddings.
66+
- **OpenAI**: Use [OpenAI](https://openai.com) to generate embeddings. Also, choose the model to use:
67+
68+
- **text-embedding-3-small**, with 1536 dimensions.
69+
- **text-embedding-3-large**, with 3072 dimensions.
70+
- **Ada 002 (Text)**, with 1536 dimensions.
71+
72+
[Learn more](https://platform.openai.com/docs/guides/embeddings).
73+
74+
- **Vertex AI**: Use [Vertex AI](https://cloud.google.com/vertex-ai) to generate embeddings by using the [textembedding-gecko@001](https://cloud.google.com/vertex-ai/generative-ai/docs/embeddings/get-text-embeddings) model, with 768 dimensions.

platform/workflows.mdx

Lines changed: 18 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -171,8 +171,15 @@ There are two ways to create a custom workflow:
171171
16. In the **Embed** area, for **Provider**, choose one of the following:
172172

173173
- **None**: Do not generate embeddings.
174-
- **OpenAI**: Use OpenAI to generate embeddings.
175-
- **Vertex AI**: Use Vertex AI to generate embeddings.
174+
- **OpenAI**: Use OpenAI to generate embeddings. Also, choose the model to use:
175+
176+
- **text-embedding-3-small**, with 1536 dimensions.
177+
- **text-embedding-3-large**, with 3072 dimensions.
178+
- **Ada 002 (Text)**, with 1536 dimensions.
179+
180+
[Learn more](https://platform.openai.com/docs/guides/embeddings).
181+
182+
- **Vertex AI**: Use Vertex AI to generate embeddings by using the `textembedding-gecko@001` model, with 768 dimensions. [Learn more](https://cloud.google.com/vertex-ai/generative-ai/docs/embeddings/get-text-embeddings).
176183

177184
Learn more:
178185

@@ -299,8 +306,15 @@ There are two ways to create a custom workflow:
299306
<Accordion title="Embedder node">
300307
For **Providers**, select one of the following:
301308

302-
- **OpenAI**: Use OpenAI to generate embeddings.
303-
- **Vertex AI**: Use Vertex AI to generate embeddings.
309+
- **OpenAI**: Use OpenAI to generate embeddings. Also, choose the model to use:
310+
311+
- **text-embedding-3-small**, with 1536 dimensions.
312+
- **text-embedding-3-large**, with 3072 dimensions.
313+
- **Ada 002 (Text)**, with 1536 dimensions.
314+
315+
[Learn more](https://platform.openai.com/docs/guides/embeddings).
316+
317+
- **Vertex AI**: Use Vertex AI to generate embeddings by using the `textembedding-gecko@001` model, with 768 dimensions. [Learn more](https://cloud.google.com/vertex-ai/generative-ai/docs/embeddings/get-text-embeddings).
304318

305319
Learn more:
306320

snippets/ingest-configuration-shared/embedding-configuration.mdx

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -31,16 +31,16 @@ A common embedding configuration is a critical component that allows for dynamic
3131

3232
* `aws-bedrock`: None
3333

34-
* `huggingface`: `sentence-transformers/all-MiniLM-L6-v2`
34+
* `huggingface`: `sentence-transformers/all-MiniLM-L6-v2`, with 384 dimensions
3535

36-
* `mixedbread-ai`: `mixedbread-ai/mxbai-embed-large-v1`
36+
* `mixedbread-ai`: `mixedbread-ai/mxbai-embed-large-v1`, with 1024 dimensions
3737

38-
* `octoai`: `thenlper/gte-large`
38+
* `octoai`: `thenlper/gte-large`, with 1024 dimensions
3939

40-
* `openai`: `text-embedding-ada-002`
40+
* `openai`: `text-embedding-ada-002`, with 1536 dimensions
4141

42-
* `togetherai`: `togethercomputer/m2-bert-80M-8k-retrieval`
42+
* `togetherai`: `togethercomputer/m2-bert-80M-8k-retrieval`, with 768 dimensions
4343

44-
* `vertexai`: `textembedding-gecko@001`
44+
* `vertexai`: `textembedding-gecko@001`, with 768 dimensions
4545

4646
* `voyageai`: None

0 commit comments

Comments
 (0)