tokens and usage cost #3397
I am using Hugging Face endpoints for my LLM and have set up LangChain and Langfuse callbacks to get traces, but I want to know the tokens used and the cost for each trace.
Replies: 2 comments
Hey @Karanaiinfox! I'm here to assist you with any bugs, questions, or contributions you have. Just let me know how I can help! To determine the tokens used and the cost for each trace when using Hugging Face endpoints with LangChain and Langfuse callbacks, you can use explicit cost and token ingestion. Here's how you can do it:
Here is a sketch of how you might implement this, using the Langfuse low-level SDK to report usage explicitly. The model name, API keys, and the token/cost values are placeholders — adapt them to your endpoint:

```python
# Sketch: explicit token and cost ingestion with the Langfuse low-level SDK.
from langchain_huggingface import HuggingFaceEndpoint
from langfuse import Langfuse

langfuse = Langfuse(public_key="pk-...", secret_key="sk-...")

# Initialize your Hugging Face endpoint
llm = HuggingFaceEndpoint(
    repo_id="your_model_name",
    huggingfacehub_api_token="your_hugging_face_api_key",
)
completion = llm.invoke("your prompt")

# Report token usage and cost explicitly on a generation observation
trace = langfuse.trace(name="hf-endpoint-call")
trace.generation(
    name="generation",
    model="your_model_name",
    input="your prompt",
    output=completion,
    usage={
        "input": 120,          # prompt tokens (from your endpoint or a tokenizer)
        "output": 80,          # completion tokens
        "total": 200,
        "unit": "TOKENS",
        "total_cost": 0.0004,  # USD, computed from your endpoint's pricing
    },
)
langfuse.flush()
```

This sketch assumes you can obtain token counts from your endpoint's response, or by tokenizing the prompt and completion yourself, since Hugging Face endpoints do not always return usage information. Additionally, Langfuse offers automated token counts for OpenAI and Anthropic models, which can simplify the process: Langfuse automatically calculates token counts based on the ingested prompts and completions, reducing the integration effort. For more detailed information and examples, please refer to the cost and tokens page in the Langfuse documentation [1][2][3][4][5].
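If your endpoint only returns token counts (or none at all), you can derive the cost yourself before ingesting it into Langfuse. A minimal sketch, assuming hypothetical per-million-token prices — substitute your endpoint's actual rates:

```python
# Hypothetical per-million-token prices -- replace with your endpoint's real rates.
INPUT_PRICE_PER_M = 0.50   # USD per 1M prompt tokens (assumed)
OUTPUT_PRICE_PER_M = 1.50  # USD per 1M completion tokens (assumed)

def compute_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the total USD cost for one generation from its token counts."""
    return (
        input_tokens * INPUT_PRICE_PER_M / 1_000_000
        + output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    )

# Example: 1200 prompt tokens and 400 completion tokens
print(round(compute_cost(1200, 400), 6))  # 0.0012
```

The resulting number is what you would pass as the cost value in the usage object when ingesting the generation.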
If this does not work out of the box via the LangChain integration, please open a bug report issue. You should see token counts in Langfuse.