File upload and RAG Changes and Improvements #3367

NDolensek · 2024-07-17T16:36:06Z

NDolensek
Jul 17, 2024

Currently, file upload functionality significantly differs from what users learned to expect based on the web chat interfaces. Librechat enforces local RAG even for text files that could easily fit in the chat context and, hence, the behavior of the models when uploading files is often sub-par and unexpected. This results in many user complaints and basically either forces users to return to the 2022 ChatGPT-era copying and pasting of text into the chat or even abandoning the Librechat interface and settling for the standard web interfaces.
I understand that e.g. OpenAI API doesn't make it easy to support file uploads, but I suggest some effort is put into aligning Librechat behavior with the web chat interfaces it is (at least stylistically) replicating. At the minimum, I suggest the local RAG to be toggleable. I believe a text extractor functionality could alleviate much of these issues in a relatively simple manner.

for a text file attachment that is significantly shorter than the model context length, the entire text should be dumped into the chat context. RAG is completely unnecessary here and will only result in a significantly decreased response quality. For a pdf file, a standard extractor could be used to extract text or convert to xml and pass this directly to the chat API. This should happen in the background (users chat window shouldn't be filled with the extracted text, but the text should be passed to the API as-is). Effectively, the extracted text should be appended to the users message in-context. Possibly, the user could be able to inspect the extracted text by clicking, but this is minor.
if a file is very large, say >50% or even >80% of the context, warn the user/refuse/offer RAG/offer Assistants API for an OpenAI request. The refusal option would replicate the standard chat web interface behavior, which would be expected behavior for most users.
some models (e.g. Gemini API, with others presumably reaching parity soon) now support direct file upload for text, images, videos. Leverage direct file upload when possible. This option would probably require some work if multiple files are uploaded, since the files have to be referred to directly in the chat requests. Possibly, only a single file could be supported initially.

avimar · 2025-07-24T11:54:29Z

avimar
Jul 24, 2025

+1 I definitely would like this control. To choose if my text files, docx files, PDF files are included/OCR'd and included, or just sent to RAG.

And I would like to see this for every message moving forward (e.g. if we're still pulling RAG for this thread) with the ability to toggle it. Control, options, visibility.

0 replies

onestardao · 2025-08-01T01:57:11Z

onestardao
Aug 1, 2025

yo @NDolensek — you’re spot on.
i’ve seen this file upload > forced RAG > degraded output pattern over and over in local LLM setups.

what you’re describing is basically a form of “involuntary vector ingestion” — you drop a file in, expecting smart context… but instead get meaningless chunk injection, or worse, broken reasoning.

i ran into the same issue and ended up building an open-source patch that:

splits file ingestion from actual RAG calls

lets you extract → preview → manually or auto-attach text to prompt

supports fallback logic: e.g. if chunking fails, don't break the whole flow

handles what i call “semantic drift”: chunks look good but break reasoning downstream

MIT license, and stable in real workloads.
can walk through setup if helpful — this is one of those pain points that looks minor but kills UX if left unsolved.

cheers and big +1 for this kind of granular control request.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

File upload and RAG Changes and Improvements #3367

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

File upload and RAG Changes and Improvements #3367

Uh oh!

NDolensek Jul 17, 2024

Replies: 2 comments

Uh oh!

avimar Jul 24, 2025

Uh oh!

onestardao Aug 1, 2025

NDolensek
Jul 17, 2024

avimar
Jul 24, 2025

onestardao
Aug 1, 2025