Project-level shared knowledge / mini-RAG for conversations #2193
tessaherself started this conversation in Ideas
Problem
ChatGPT and Claude both let you upload files to a project that become available as context in every conversation within that project. chat-ui currently has no equivalent — each conversation is isolated.
With the Projects feature in #2192, we now have named containers for conversations with shared custom instructions. The natural next question is: how should projects share knowledge/files?
How competitors do it
Proposed approach for chat-ui
#2192 includes an implementation of a two-tier hybrid system:
Tier 1 — Context stuffing (zero extra infra)
Upload small files (< 50k chars) to a project → store in GridFS → extract text at upload time → prepend full text to system prompt on each turn.
Pros: Works with any model, no extra infrastructure, immediate availability.
Cons: Token-expensive for larger knowledge bases.
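The Tier 1 flow above can be sketched roughly as follows. This is an illustrative sketch, not the actual #2192 implementation; `ProjectFile`, `buildSystemPrompt`, and `CONTEXT_STUFFING_LIMIT` are hypothetical names, and the GridFS storage / text-extraction steps are assumed to have already run at upload time.

```typescript
// Hypothetical sketch of Tier 1 context stuffing: prepend extracted file
// text to the system prompt on each turn. Names are illustrative only.

interface ProjectFile {
  name: string;
  text: string; // extracted at upload time
}

const CONTEXT_STUFFING_LIMIT = 50_000; // chars, per the proposal

function buildSystemPrompt(basePrompt: string, files: ProjectFile[]): string {
  const total = files.reduce((n, f) => n + f.text.length, 0);
  if (total > CONTEXT_STUFFING_LIMIT) {
    // Beyond this size the proposal switches to Tier 2 retrieval.
    throw new Error("Knowledge base too large for context stuffing");
  }
  const knowledge = files.map((f) => `## ${f.name}\n${f.text}`).join("\n\n");
  return `${basePrompt}\n\n# Project knowledge\n${knowledge}`;
}
```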
Tier 2 — Chunk + retrieve (needs TEI)
For larger knowledge bases: chunk the extracted text, embed the chunks via TEI (async pipeline), and retrieve the most similar chunks per query.
Pros: Scales to large knowledge bases, token-efficient.
Cons: Requires running TEI, async embedding pipeline.
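A minimal sketch of the Tier 2 retrieval side, under stated assumptions: character-based chunking on paragraph boundaries (as described in the "Chunk strategy" question below) and manual cosine similarity (as in the "Vector search" question). The TEI embedding call is omitted; `chunkText`, `cosineSimilarity`, and `topK` are illustrative names, not the actual chat-ui API.

```typescript
// Illustrative Tier 2 sketch: paragraph-aware character chunking plus
// manual cosine-similarity ranking over precomputed chunk embeddings.

function chunkText(text: string, maxLen = 1000): string[] {
  const chunks: string[] = [];
  let current = "";
  for (const para of text.split(/\n\s*\n/)) {
    if (current && current.length + para.length + 2 > maxLen) {
      chunks.push(current); // current chunk is full; start a new one
      current = para;
    } else {
      current = current ? `${current}\n\n${para}` : para;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Rank stored chunks against a query embedding and keep the top k.
function topK(
  query: number[],
  chunks: { text: string; embedding: number[] }[],
  k = 5,
): { text: string; score: number }[] {
  return chunks
    .map((c) => ({ text: c.text, score: cosineSimilarity(query, c.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}
```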
Automatic tier selection
The system auto-selects based on total project knowledge size vs a configurable threshold. Falls back to Tier 1 if no TEI endpoint is configured.
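The selection logic described above amounts to something like the following sketch. `selectTier` and its parameters are hypothetical names; the actual threshold and config key in #2192 may differ.

```typescript
// Hypothetical tier selection: fall back to context stuffing when no TEI
// endpoint is configured, otherwise pick by total knowledge size.

type Tier = "context-stuffing" | "retrieval";

function selectTier(
  totalChars: number,
  threshold: number,
  teiEndpoint?: string,
): Tier {
  if (!teiEndpoint) return "context-stuffing"; // no embedding service available
  return totalChars > threshold ? "retrieval" : "context-stuffing";
}
```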
Questions for the community
TEI integration — Should this use HuggingFace's hosted inference infrastructure, or require self-hosted TEI? What embedding models should be default?
Storage limits — What's a reasonable limit per project? (Currently: 20 files, 10MB each)
Supported file types — Currently: PDF, TXT, MD, CSV, JSON, XML, HTML, YAML. Should we add DOCX, PPTX, or other formats?
Interaction with web search — chat-ui previously had web search / RAG infrastructure that was removed. How does project knowledge interact with (or replace) that functionality?
Chunk strategy — Simple character-based chunking with paragraph/sentence boundary awareness. Is this sufficient, or should we support semantic chunking?
Vector search — Currently using manual cosine similarity (works with MongoMemoryServer in dev). Should we use MongoDB Atlas Vector Search for production deployments?
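For concreteness, the current limits mentioned in the questions above (20 files, 10 MB each, the listed extensions) could be enforced with a validation helper along these lines. `validateUpload` and the constant names are illustrative, not the actual chat-ui code.

```typescript
// Sketch of upload validation for the limits discussed above.

const MAX_FILES = 20;
const MAX_FILE_BYTES = 10 * 1024 * 1024; // 10 MB
const ALLOWED_EXTENSIONS = new Set([
  "pdf", "txt", "md", "csv", "json", "xml", "html", "yaml",
]);

// Returns an error message, or null if the upload is acceptable.
function validateUpload(
  existingCount: number,
  name: string,
  sizeBytes: number,
): string | null {
  if (existingCount >= MAX_FILES) return "Project file limit reached";
  if (sizeBytes > MAX_FILE_BYTES) return "File exceeds 10 MB limit";
  const ext = name.split(".").pop()?.toLowerCase() ?? "";
  if (!ALLOWED_EXTENSIONS.has(ext)) return `Unsupported file type: .${ext}`;
  return null;
}
```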
Related
The `Project` data model is designed to be extensible, so adding more knowledge features is straightforward.
Would love to hear thoughts from maintainers and the community on the right approach here!