Skip to content

feat: Vulkan split allocations #512

@raftario

Description

@raftario

Feature Description

Vulkan buffer allocations cap out at 4GB, which isn't enough for some models. This was recently addressed upstream.

The Solution

It would be nice to get a new release on npm with those changes.

Considered Alternatives

Building locally works in the meantime. I might also add ROCm support if I find the time.

Additional Context

No response

Related Features to This Feature Request

  • Metal support
  • CUDA support
  • Vulkan support
  • Grammar
  • Function calling

Are you willing to resolve this issue by submitting a Pull Request?

N/A

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions