The LLM will still use the GPU for generation; this simply changes the embedding model. This will improve your GPU utilization.

@ahmetax
@jnfarooq
Answer selected by ahmetax
@ahmetax
Category
Q&A
3 participants