fix: respect USE_USER_TOKEN config in OpenAI endpoint by balex42 · Pull Request #2187 · huggingface/chat-ui

balex42 · 2026-03-16T18:19:53Z

Description

This PR fixes an authentication bug where the USE_USER_TOKEN environment variable is ignored across both standard and MCP OpenAI-compatible endpoint calls.

Currently, if a user session exists (e.g., via a custom OIDC provider), locals.token is unconditionally injected as the Authorization: Bearer header for LLM inference calls. This overwrites the configured OPENAI_API_KEY, causing a 401 Unauthorized error when interacting with external OpenAI-compatible backends (like Ollama Cloud or vLLM) that do not recognize the OIDC token.

Motivation and Context

The .env file explicitly documents that the user token should only be used for inference if USE_USER_TOKEN=true:
# When set to true, user token will be used for inference calls
USE_USER_TOKEN=false

However, the implementation was unconditionally applying locals.token if it existed. This PR updates the headers payload to respect the USE_USER_TOKEN configuration for both standard generation and tool use.

Changes

Modified the following files to check config.USE_USER_TOKEN === "true" before injecting locals.token into the authorization header:

src/lib/server/endpoints/openai/endpointOai.ts (Covers standard streaming and non-streaming completions)
src/lib/server/textGeneration/mcp/runMcpFlow.ts (Covers MCP tool-enabled streaming and fallback non-streaming completions)

Before:

...(locals?.token ? { Authorization: `Bearer ${locals.token}` } : {}),

After:

...(config.USE_USER_TOKEN === "true" && locals?.token ? { Authorization: `Bearer ${locals.token}` } : {}),

How Has This Been Tested?

Configured a custom OIDC provider (OPENID_CLIENT_ID, OPENID_PROVIDER_URL).
Configured an external OpenAI-compatible backend (OPENAI_BASE_URL and OPENAI_API_KEY).
Set USE_USER_TOKEN=false.
Logged in via OIDC.
Attempted to send a standard chat message, and a chat message requiring tool (MCP) use.

Without this PR: The API calls fail with 401 Unauthorized and [mcp] flow failed, falling back to default endpoint because the backend receives the OIDC token instead of the API key.
With this PR: Both API calls succeed because the OPENAI_API_KEY is correctly utilized.

Previously, the OpenAI endpoint wrapper unconditionally injected the user's OIDC token into the Authorization header if a session existed. This overwrote the OPENAI_API_KEY, causing 401 Unauthorized errors when using external providers alongside OIDC authentication. This commit updates the header injection to properly check if USE_USER_TOKEN is set to "true" before passing the user token, restoring compatibility with external OpenAI-compatible backends.

balex42 added 2 commits March 16, 2026 17:08

fix: respect USE_USER_TOKEN in MCP MCP flow

0e7cfc0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: respect USE_USER_TOKEN config in OpenAI endpoint#2187

fix: respect USE_USER_TOKEN config in OpenAI endpoint#2187
balex42 wants to merge 2 commits intohuggingface:mainfrom
balex42:main

balex42 commented Mar 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

balex42 commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Changes

How Has This Been Tested?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

balex42 commented Mar 16, 2026 •

edited

Loading