-
-
Notifications
You must be signed in to change notification settings - Fork 150
Closed
Labels
new featureNew feature or requestNew feature or requestrequires triageRequires triagingRequires triaging
Description
Feature Description
Vulkan buffer allocations cap out at 4GB, which isn't enough for some models. This was recently addressed upstream.
The Solution
It would be nice to get a new release on npm with those changes.
Considered Alternatives
Building locally works in the meantime. I might also add ROCm support if I find the time.
Additional Context
No response
Related Features to This Feature Request
- Metal support
- CUDA support
- Vulkan support
- Grammar
- Function calling
Are you willing to resolve this issue by submitting a Pull Request?
N/A
Metadata
Metadata
Assignees
Labels
new featureNew feature or requestNew feature or requestrequires triageRequires triagingRequires triaging