Auto determine how much of the model to load into RAM #9

@chelsea0x3b

Description

Use cases:

  1. You can fit the whole model into GPU RAM
  2. You can fit part of the model into GPU RAM
  3. You need to keep all the model weights on disk

In all these cases, we should be able to detect how much GPU RAM is available and use that to determine the maximum amount of the model to keep on the GPU. More advanced use cases, such as sharing the GPU with other applications, may need manual control over the memory budget, but that can be added later.
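The detect-then-allocate idea could be sketched as a small planning function: given the free GPU RAM and the per-layer weight sizes, greedily offload leading layers until the budget (minus a safety reserve) runs out. All names here (`plan_gpu_offload`, the reserve size) are hypothetical, not part of any existing API:

```python
def plan_gpu_offload(free_vram_bytes, layer_sizes_bytes, reserve_bytes=512 * 1024**2):
    """Return how many leading layers fit in GPU RAM, keeping a reserve free.

    Hypothetical sketch: layer_sizes_bytes lists the size of each weight
    layer; layers that do not fit stay on disk or in system RAM (cases 2
    and 3 above). reserve_bytes leaves headroom for activations, etc.
    """
    budget = free_vram_bytes - reserve_bytes
    n_offload = 0
    for size in layer_sizes_bytes:
        if size > budget:
            break  # this layer (and the rest) stays off the GPU
        budget -= size
        n_offload += 1
    return n_offload

# Example: 8 GiB free, ten 1 GiB layers, 512 MiB reserve -> 7 layers fit
layers = [1024**3] * 10
print(plan_gpu_offload(8 * 1024**3, layers))  # prints 7
```

Querying `free_vram_bytes` itself would be backend-specific (CUDA, Metal, etc.); the planner above stays the same regardless of how the number is obtained.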
