Skip to content

Do you support multiple GPUs to run pipeline speculative decoding? #1

@chenwenyan

Description

@chenwenyan

Hi, I am very interested in your work on PipeInfer!
However, the current implementation does not seem to support multiple GPUs. Are there any upcoming plans or suggestions for integrating support for GPUs with pipeline speculative decoding?
I have experimented with various approaches, but so far, none of them can work for me.
Thanks a lot!

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions