-
Hi, I have used the HuggingFacePipeline with different models such as flan-t5 and stablelm-7b, and it works with local inference. I tried using the HuggingFaceHub as well, but it constantly gives a timeout error for basically every model. Now I have created an Inference Endpoint on HF, but how do I use that with LangChain? The HuggingFaceHub class only accepts a text parameter, which is the repo_id or model name, but the Inference Endpoint only gives me a URL. I can get individual text samples with a simple API request, but how do I integrate this with LangChain?
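For context, the local setup that works for me looks roughly like this (the model name and kwargs here are just examples):

```python
from langchain.llms import HuggingFacePipeline

# Local inference: the model is downloaded and run on this machine
llm = HuggingFacePipeline.from_model_id(
    model_id="google/flan-t5-large",  # example model; stablelm works the same way
    task="text2text-generation",      # flan-t5 is a seq2seq model
    model_kwargs={"max_length": 128},
)

print(llm("Translate English to German: How old are you?"))
```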
-
It seems this functionality was added here. Here is the class that allows the integration. I have not tried it yet, but you can give it a try. Unfortunately, this integration is not in the official documentation.
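If the class works the way that change suggests, usage might look something like the sketch below. I'm assuming it is exported as `HuggingFaceEndpoint` from `langchain.llms`, and the URL and token are placeholders:

```python
from langchain.llms import HuggingFaceEndpoint

# Point LangChain at the dedicated Inference Endpoint URL instead of a repo_id
llm = HuggingFaceEndpoint(
    endpoint_url="https://your-endpoint.endpoints.huggingface.cloud",  # from the HF dashboard
    huggingfacehub_api_token="hf_...",  # your HF access token
    task="text-generation",
    model_kwargs={"temperature": 0.7, "max_new_tokens": 100},
)

print(llm("What is the capital of France?"))
```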
-
Looks like it's very new functionality! Some docs are needed :)
-
How to get the URL: https://huggingface.co/docs/inference-endpoints/guides/test_endpoint. You have to deploy the model you want as an Inference Endpoint; basically, you have to pay HF for inference compute time.
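Once the endpoint is running, you can sanity-check it with a plain request before wiring it into LangChain (URL and token below are placeholders):

```python
import requests

API_URL = "https://your-endpoint.endpoints.huggingface.cloud"  # copy from the endpoint's page
headers = {
    "Authorization": "Bearer hf_...",  # your HF access token
    "Content-Type": "application/json",
}

response = requests.post(API_URL, headers=headers, json={"inputs": "Hello, my name is"})
response.raise_for_status()
print(response.json())
```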
-
In the provided class, the `if self.task == "text-generation":` branch (not the current class) will likely just return a 0.
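If the response parsing in that branch really does misbehave for your endpoint, a minimal workaround is a custom LLM wrapper that makes the request and parses the JSON itself. This is only a sketch: the class name is made up, and the `[0]["generated_text"]` response shape is an assumption that holds for standard text-generation endpoints but should be checked against what yours actually returns:

```python
from typing import List, Optional

import requests
from langchain.llms.base import LLM


class EndpointLLM(LLM):
    """Hypothetical minimal wrapper around a dedicated HF Inference Endpoint."""

    endpoint_url: str
    hf_token: str

    @property
    def _llm_type(self) -> str:
        return "hf_inference_endpoint"

    def _call(self, prompt: str, stop: Optional[List[str]] = None) -> str:
        headers = {
            "Authorization": f"Bearer {self.hf_token}",
            "Content-Type": "application/json",
        }
        response = requests.post(self.endpoint_url, headers=headers, json={"inputs": prompt})
        response.raise_for_status()
        # Assumed response shape for text-generation: [{"generated_text": "..."}]
        return response.json()[0]["generated_text"]
```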
-
@syeminpark did you test it? It seems to work fine.