Replies: 2 comments
-
We currrently shard the model via |
Beta Was this translation helpful? Give feedback.
0 replies
-
But we are working on an internal sharding feature right now, so stay tuned! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I see that the
accelerate
library is in the dependencies. However, I cannot see any argument that enables model sharing for inference. I may be missing something tho.Beta Was this translation helpful? Give feedback.
All reactions