Conversation


@Arui1122 commented on Oct 1, 2025

Related Issues or Context

Fixes a token counting issue for OpenAI-compatible APIs in streaming mode.

OpenAI-compatible providers (like LiteLLM) do not return token usage by default in streaming responses. This causes Dify to fall back to token estimation instead of using the actual count from the provider.
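For context: the OpenAI Chat Completions API, and compatible servers that implement it, only emit a final usage chunk in a streaming response when the request sets stream_options to {"include_usage": true}. Below is a minimal sketch of reading that usage from a compatible endpoint; the base URL, API key, and model name are placeholders for illustration, not values from this PR.

```python
import json

import requests

# Placeholder endpoint and key; any OpenAI-compatible provider
# (LiteLLM, vLLM, etc.) exposes the same /chat/completions route.
API_BASE = "http://localhost:4000/v1"
API_KEY = "sk-placeholder"

resp = requests.post(
    f"{API_BASE}/chat/completions",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "model": "gpt-4o-mini",
        "messages": [{"role": "user", "content": "Hello"}],
        "stream": True,
        # Without this, OpenAI-compatible providers typically omit token
        # usage from streaming responses entirely.
        "stream_options": {"include_usage": True},
    },
    stream=True,
    timeout=60,
)

usage = None
for line in resp.iter_lines():
    if not line or not line.startswith(b"data: "):
        continue
    payload = line[len(b"data: "):]
    if payload == b"[DONE]":
        break
    chunk = json.loads(payload)
    # The usage chunk arrives last, after all content deltas, with an
    # empty "choices" list.
    if chunk.get("usage"):
        usage = chunk["usage"]

print(usage)  # e.g. {"prompt_tokens": 9, "completion_tokens": 12, "total_tokens": 21}
```

When the flag is absent, no chunk ever carries a usage object, which is exactly why Dify falls back to estimation.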

This PR contains Changes to Non-Plugin Files

  • Documentation
  • Other

This PR contains Changes to Non-LLM Models Plugin

  • I have Run Comprehensive Tests Relevant to My Changes

This PR contains Changes to LLM Models Plugin

  • My Changes Affect Message Flow Handling (System Messages and User→Assistant Turn-Taking)

  • My Changes Affect Tool Interaction Flow (Multi-Round Usage and Output Handling, for both Agent App and Agent Node)

  • My Changes Affect Multimodal Input Handling (Images, PDFs, Audio, Video, etc.)

  • My Changes Affect Multimodal Output Generation (Images, Audio, Video, etc.)

  • My Changes Affect Structured Output Format (JSON, XML, etc.)

  • My Changes Affect Token Consumption Metrics

  • My Changes Affect Other LLM Functionalities (Reasoning Process, Grounding, Prompt Caching, etc.)

  • Other Changes (Add New Models, Fix Model Parameters etc.)

Version Control

  • I have Bumped Up the Version in Manifest.yaml (Top-Level Version Field, Not in Meta Section)

Dify Plugin SDK Version

  • I have Ensured dify_plugin>=0.3.0,<0.5.0 is in requirements.txt (SDK docs)

Environment Verification

Local Deployment Environment

  • Dify Version is: 1.7.1; I have Tested My Changes on a Local Dify Deployment with a Clean Environment That Matches the Production Configuration

SaaS Environment

  • I have Tested My Changes on cloud.dify.ai with a Clean Environment That Matches the Production Configuration

@dosubot (bot) added the size:L (This PR changes 100-499 lines, ignoring generated files) and bug (Something isn't working) labels on Oct 1, 2025
@crazywoola requested a review from Copilot on October 8, 2025 02:22

Copilot AI left a comment


Pull Request Overview

This PR fixes token counting for OpenAI-compatible APIs in streaming mode by requesting usage data from the provider via stream_options and collecting it from the streamed response.

Key changes:

  • Overrides the _generate method to request usage data in streaming mode via stream_options (a sketch follows this list)
  • Adds comprehensive OpenAI-compatible API handling with proper headers, authentication, and response formatting
  • Updates the manifest version to reflect the fix
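To illustrate the first point, here is a minimal sketch of such an override. The import path, class name, and _generate signature below are assumptions based on the Dify plugin SDK's OpenAI-compatible base class, not the PR's verbatim code; the actual change also reimplements headers, authentication, and response formatting, per the notes above.

```python
# Assumed import path and signature, following dify_plugin SDK conventions;
# not the PR's verbatim code.
from dify_plugin.interfaces.model.openai_compatible.llm import (
    OAICompatLargeLanguageModel,
)


class OpenAICompatibleLargeLanguageModel(OAICompatLargeLanguageModel):
    def _generate(self, model, credentials, prompt_messages,
                  model_parameters, tools=None, stop=None,
                  stream=True, user=None):
        if stream:
            # Copy before mutating so the caller's dict is untouched, then
            # ask the provider to append a final usage chunk so Dify can
            # record actual token counts instead of estimating them.
            model_parameters = dict(model_parameters)
            model_parameters["stream_options"] = {"include_usage": True}
        return super()._generate(
            model, credentials, prompt_messages, model_parameters,
            tools=tools, stop=stop, stream=stream, user=user,
        )
```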

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

Files:

  • models/openai_api_compatible/models/llm/llm.py: Implements a custom _generate method with streaming usage data collection and complete OpenAI API compatibility
  • models/openai_api_compatible/manifest.yaml: Bumps the version from 0.0.22 to 0.0.23


