not enough space in the scratch memory pool #373

ahmetax · 2023-08-15T17:44:30Z

ahmetax
Aug 15, 2023

I have a 6GB Graphics card. I am using:
EMBEDDING_MODEL_NAME = "hkunlp/instructor-large"
MODEL_ID = "TheBloke/orca_mini_3B-GGML"
MODEL_BASENAME = "orca-mini-3b.ggmlv3.q4_0.bin"

The first query is answered without a problem, but at the second query, I get similar errors as following:

Enter a query: what is the power of the congress
Llama.generate: prefix-match hit
ggml_new_tensor_impl: not enough space in the scratch memory pool (needed 271565568, available 268435456)
Segmentation fault

How can I prevent this error?

Answered by jnfarooq

Aug 22, 2023

The LLM will still use GPU for generation, its simply changing the embedding model. This will improve your GPUT utilization.

View full answer

PromtEngineer · 2023-08-16T19:37:14Z

PromtEngineer
Aug 16, 2023
Maintainer

One recommendation that I will have is to change the embedding model. For starters, try the all-MiniLM-L6-v2. The instructorEmbedding model also runs on GPU but the all-MiniLM-L6-v2 embedding doesn't need GPU (will have an impact on performance). This will reduce your GPU vRAM usages.

The issue you are facing is coming from llamacpp and seems to be a common (here, and here).

3 replies

ahmetax Aug 19, 2023
Author

Thanks. But I want to use GPU, because when I use GPU, I get answers 60 times faster than CPU.

jnfarooq Aug 22, 2023

The LLM will still use GPU for generation, its simply changing the embedding model. This will improve your GPUT utilization.

Answer selected by ahmetax

ahmetax Aug 23, 2023
Author

Thanks PromptEngineer and jnfarooq. When I used all-MiniLM-L6-v2 as the embedding model, my problem is resolved.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

not enough space in the scratch memory pool #373

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 3 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

not enough space in the scratch memory pool #373

Uh oh!

ahmetax Aug 15, 2023

Replies: 1 comment · 3 replies

Uh oh!

PromtEngineer Aug 16, 2023 Maintainer

Uh oh!

ahmetax Aug 19, 2023 Author

Uh oh!

jnfarooq Aug 22, 2023

Uh oh!

Uh oh!

ahmetax Aug 23, 2023 Author

ahmetax
Aug 15, 2023

Replies: 1 comment 3 replies

PromtEngineer
Aug 16, 2023
Maintainer

ahmetax Aug 19, 2023
Author

ahmetax Aug 23, 2023
Author