Replies: 1 comment 1 reply
-
Partial offloading is tracked in #1562. This being a feature request, it doesn't belong in "Discussions" anyway. I am close to disabling this tab altogether because it seems like nobody understands what it's for.
-
Most CPUs these days come with an iGPU. The question is: can it load your model partially or entirely onto the iGPU, regardless of whether the GPU has enough VRAM? For an iGPU, system RAM is the VRAM, so you could get GPU acceleration on a laptop. Just provide an option to load a selected number of layers onto the GPU, like llama.cpp does on the command line.
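For reference, llama.cpp exposes this as `--n-gpu-layers` (`-ngl`). The underlying idea the comment describes — pick how many layers fit in the available memory budget, where an iGPU's budget is just a slice of system RAM — can be sketched roughly like this (the function name and all sizes are hypothetical, not from any actual implementation):

```python
def layers_to_offload(total_layers: int, per_layer_bytes: int, budget_bytes: int) -> int:
    """Return how many model layers fit in the given memory budget.

    Hypothetical helper: for an iGPU the 'VRAM' budget is carved out
    of system RAM, so the same arithmetic applies either way.
    """
    if per_layer_bytes <= 0:
        raise ValueError("per_layer_bytes must be positive")
    return min(total_layers, budget_bytes // per_layer_bytes)

# e.g. a 32-layer model, ~400 MiB per layer, 8 GiB of shareable memory
print(layers_to_offload(32, 400 * 2**20, 8 * 2**30))  # → 20
```

The remaining layers would stay on the CPU, which is exactly the partial-offload behavior being requested.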