You're correct. KT's multi-GPU implementation is currently based on pipeline parallelism, which is designed for users who have multiple GPUs but limited VRAM on each device. At this stage, multi-GPU support doesn't provide any speedup; it only makes it possible to deploy a model across several smaller GPUs.
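
To illustrate why a pipeline split helps with VRAM but not with speed, here is a rough sketch. It is not KT's actual code: the `Stage` class, the helper functions, the device labels, and all the numbers (60 layers, 1 GB and 0.5 ms per layer) are made up for illustration, and it assumes a single request being decoded one token at a time, so each token has to pass through every layer in order.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Stage:
    device: str    # hypothetical device label, e.g. "cuda:0"
    layers: range  # contiguous block of layer indices placed on this device


def split_layers(num_layers: int, devices: list[str]) -> list[Stage]:
    """Assign contiguous blocks of layers to devices, as evenly as possible."""
    base, extra = divmod(num_layers, len(devices))
    stages, start = [], 0
    for i, device in enumerate(devices):
        size = base + (1 if i < extra else 0)
        stages.append(Stage(device, range(start, start + size)))
        start += size
    return stages


def per_device_weight_gb(stages: list[Stage], gb_per_layer: float) -> dict[str, float]:
    """Weight memory each device must hold under this split."""
    return {s.device: len(s.layers) * gb_per_layer for s in stages}


def token_latency_ms(stages: list[Stage], ms_per_layer: float,
                     transfer_ms: float = 0.0) -> float:
    """Latency of one token: stages run one after another, so the compute
    time is the sum over all layers, plus one hand-off between each pair
    of neighbouring stages."""
    compute = sum(len(s.layers) for s in stages) * ms_per_layer
    return compute + transfer_ms * (len(stages) - 1)


if __name__ == "__main__":
    one_gpu = split_layers(60, ["cuda:0"])
    two_gpu = split_layers(60, ["cuda:0", "cuda:1"])
    for name, stages in (("1 GPU", one_gpu), ("2 GPUs", two_gpu)):
        print(name,
              per_device_weight_gb(stages, gb_per_layer=1.0),
              token_latency_ms(stages, ms_per_layer=0.5, transfer_ms=0.1))
```

In this toy model, going from one device to two halves the weights each device has to hold, while the per-token latency stays the same or gets slightly worse because of the hand-off between stages. That matches the behaviour described above: the split lets the model fit, but it does not make it faster.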

We are actively working on improving this functionality to deliver performance enhancements in future releases.

Answer selected by yuliao0214