Replies: 6 comments
-
@chenyuz3 Ollama has model-specific settings, which you can set using
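For illustration, one way to set a model-specific context window in Ollama is via a Modelfile (a sketch only; the base model name and context size below are placeholders, not something stated in this thread):

```
# Modelfile — derive a variant of an embedding model with a larger context window
FROM nomic-embed-text     # placeholder base model
PARAMETER num_ctx 8192    # raise the context window to 8192 tokens
```

Then build the derived model with `ollama create my-embed-8k -f Modelfile` and point Copilot at `my-embed-8k`.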
-
Thanks @logancyang. I have created a new model based on the original model with a larger context length, but I can still see errors stating that the context window is not enough for some notes. How does Copilot cope with this situation? Does it simply cut off the note, or something else?
-
@chenyuz3 Copilot does nothing when the input exceeds the context length; the error you see should fail the chunk entirely, so it won't appear in the index. It is up to the backend to decide whether to truncate. In the past, Ollama silently truncated the excess length without an error; now it seems it no longer does that, which is better. What embedding model are you using, and how long exactly is your context length? Copilot's chunk size is 4000 characters, i.e. roughly 1000 tokens, which should be short enough for most embedding models.
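The indexing behavior described above can be sketched as follows (a minimal illustration, not Copilot's actual code; the chars-per-token heuristic is an assumption, and CJK text typically yields far more tokens per character, which could matter here):

```python
CHUNK_SIZE_CHARS = 4000  # Copilot's chunk size, roughly 1000 tokens of English

def chunk_note(text: str, chunk_size: int = CHUNK_SIZE_CHARS) -> list[str]:
    """Split a note into fixed-size character chunks."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def embed_chunks(chunks, embed_fn, max_context_tokens=2048):
    """Embed each chunk; an oversized chunk fails entirely and is skipped,
    so it never appears in the index (no silent truncation)."""
    index = []
    for chunk in chunks:
        # Rough heuristic: ~4 chars per token for English. Chinese text can
        # tokenize to many more tokens per char, so the same 4000-char chunk
        # may overflow a context window that English chunks fit in.
        est_tokens = len(chunk) // 4
        if est_tokens > max_context_tokens:
            continue  # the whole chunk fails, mirroring the error behavior
        index.append(embed_fn(chunk))
    return index
```

With an 8192-token context window, a 4000-character chunk should normally fit; overflows suggest the effective window is smaller or the tokenizer produces more tokens than the character count implies.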
-
@logancyang I use jina-embeddings-v2-zh as the embedding model, with an 8192 context length. I used to use bge-m3 models, which also had an 8192 context length, though at that time no embedding errors were found.
-
@chenyuz3 I see. Could it be language related, since it's Chinese? I will test with
-
@logancyang The bge-m3 model I used to use is also multilingual (including Chinese). I should say there were one or two errors
-
I noticed that no matter what model is used for vault QA embedding, the default context length/chunk size is 2048 tokens, which may reduce retrieval performance, since any note longer than that is simply abandoned. Many embedding models have a larger context window by default, so maybe we could add an option in settings for that?
