Get different retrievers' context and just compress some of the retrievers' context #6226
gon-martinam asked this question in Q&A (unanswered)
To achieve your goal of creating a chain that returns the answer along with the sources (documents) from three different retrievers, while ensuring the final output schema contains the `filteredContext` that was actually used, here's how you can refactor your code:

```ts
import { CheerioWebBaseLoader } from "@langchain/community/document_loaders/web/cheerio";
import { RecursiveCharacterTextSplitter } from "langchain/text_splitter";
import { MemoryVectorStore } from "langchain/vectorstores/memory";
import { OpenAIEmbeddings, ChatOpenAI } from "@langchain/openai";
import { pull } from "langchain/hub";
import { ChatPromptTemplate } from "@langchain/core/prompts";
import { formatDocumentsAsString } from "langchain/util/document";
import { RunnableSequence, RunnablePassthrough, RunnableMap } from "@langchain/core/runnables";
import { StringOutputParser } from "@langchain/core/output_parsers";
import { CohereRerank } from "@langchain/cohere";
import type { Document } from "@langchain/core/documents";
// Load documents for parks, food, and beverages
const loaderParks = new CheerioWebBaseLoader("URL_FOR_PARKS");
const loaderFood = new CheerioWebBaseLoader("URL_FOR_FOOD");
const loaderBeverages = new CheerioWebBaseLoader("URL_FOR_BEVERAGES");
const docsParks = await loaderParks.load();
const docsFood = await loaderFood.load();
const docsBeverages = await loaderBeverages.load();
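// Split each topic's documents into 500-character chunks with no overlap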
const textSplitter = new RecursiveCharacterTextSplitter({ chunkSize: 500, chunkOverlap: 0 });
const splitsParks = await textSplitter.splitDocuments(docsParks);
const splitsFood = await textSplitter.splitDocuments(docsFood);
const splitsBeverages = await textSplitter.splitDocuments(docsBeverages);
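// Embed each set of chunks and index it in its own in-memory vector store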
const vectorStoreParks = await MemoryVectorStore.fromDocuments(splitsParks, new OpenAIEmbeddings());
const vectorStoreFood = await MemoryVectorStore.fromDocuments(splitsFood, new OpenAIEmbeddings());
const vectorStoreBeverages = await MemoryVectorStore.fromDocuments(splitsBeverages, new OpenAIEmbeddings());
const retrieverParks = vectorStoreParks.asRetriever();
const retrieverFood = vectorStoreFood.asRetriever();
const retrieverBeverages = vectorStoreBeverages.asRetriever();
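// Pull a standard RAG prompt from the LangChain Hub and set up the chat model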
const prompt = await pull<ChatPromptTemplate>("rlm/rag-prompt");
const llm = new ChatOpenAI({ model: "gpt-3.5-turbo", temperature: 0 });
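// Cohere reranker used as a document compressor; it keeps the top 3 documents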
const compressor = new CohereRerank({ topN: 3 });
// Compress the combined food and beverage documents against the query, then
// prepend the uncompressed park documents and format everything as one string.
async function buildContextString(
  query: string,
  documentsParks: Document[],
  ...documents: Document[][]
): Promise<string> {
  const finalDocuments = await compressor.compressDocuments(documents.flat(), query);
  return formatDocumentsAsString([...documentsParks, ...finalDocuments]);
}
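// Chain that builds the context string, fills the prompt, calls the LLM,
// and parses the result into a plain string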
const chainFromDocs = RunnableSequence.from([
  RunnablePassthrough.assign({
    context: async (input) => buildContextString(
      input.question as string,
      input.contextParks as Document[],
      input.contextFood as Document[],
      input.contextBeverages as Document[]
    ),
  }),
  prompt,
  llm,
  new StringOutputParser(),
]);
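// Run the three retrievers in parallel and pass the question through unchanged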
const chainMap = RunnableMap.from({
  contextParks: retrieverParks,
  contextFood: retrieverFood,
  contextBeverages: retrieverBeverages,
  question: new RunnablePassthrough(),
});
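// Attach the compressed context and the final answer to the parallel map's output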
const chainWithSource = chainMap.assign({
  filteredContext: async (input) => {
    const combinedDocs = [
      ...(input.contextFood as Document[]),
      ...(input.contextBeverages as Document[]),
    ];
    const compressedDocs = await compressor.compressDocuments(combinedDocs, input.question as string);
    return [...(input.contextParks as Document[]), ...compressedDocs];
  },
  answer: chainFromDocs,
});
await chainWithSource.invoke("Your question here");
```

This refactored code runs the three retrievers in parallel, compresses only the combined output of `retrieverFood` and `retrieverBeverages`, leaves the `retrieverParks` documents uncompressed, and returns the final `answer` together with the `filteredContext` that was actually passed to the prompt. This approach aligns with LangChain's guidelines and philosophy, ensuring modularity and reusability of components [1][2][3].
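For reference, here's a minimal sketch of what the final invocation returns with the setup above (the field values are illustrative; the keys come from the `RunnableMap` plus the two `assign` entries):

```ts
const result = await chainWithSource.invoke("Your question here");
// result is roughly:
// {
//   contextParks: Document[],      // raw output of retrieverParks
//   contextFood: Document[],       // raw output of retrieverFood
//   contextBeverages: Document[],  // raw output of retrieverBeverages
//   question: "Your question here",
//   filteredContext: Document[],   // park docs + compressed food/beverage docs
//   answer: string                 // the parsed LLM answer
// }
console.log(result.answer);
console.log(result.filteredContext.length);
```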
Description
In my use case I want a RAG chain that returns not only the answer but also the sources (documents). It is important to note that I have three different retrievers: `retrieverParks`, `retrieverFood`, and `retrieverBeverages`.

What I would like is a final output schema from the chain that contains the answer together with a `filteredContext` field. For this I need to add `filteredContext` to the code I currently have. `filteredContext` will be the documents that are actually used as the context, not everything that has been retrieved by the different retrievers.

As seen in the code, I currently execute the three retrievers in parallel using a `RunnableMap`, and I apply compression to the output of just two of them: `retrieverFood` and `retrieverBeverages`. That means the context I pass to the prompt and LLM is the compressed combination of the `retrieverFood` and `retrieverBeverages` outputs, together with the `retrieverParks` output without any compression, i.e. straight out of the retriever.

I included the compression step in the `formatDocumentsAsString()` function because, if I created a `ContextualCompressionRetriever` for each of these retrievers (`retrieverFood` and `retrieverBeverages`), I'd compress each of their outputs separately, whereas what I want is to compress the combination of their outputs (a sketch of the per-retriever setup I'm trying to avoid is shown below).

How can I accomplish what I've described? You can also suggest how I should refactor my code if I'm not following LangChain code guidelines or philosophy, or suggest modifying the code completely if my approach is wrong from the beginning.
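For illustration, here is a sketch of the per-retriever setup I'm trying to avoid; wrapping each retriever in its own `ContextualCompressionRetriever` reranks each source in isolation, so `topN` applies per retriever rather than across the combined food and beverage results:

```ts
import { ContextualCompressionRetriever } from "langchain/retrievers/contextual_compression";
import { CohereRerank } from "@langchain/cohere";

// Each wrapped retriever compresses only its own results, so food and
// beverage documents are never reranked against each other in one pass.
const compressedFoodRetriever = new ContextualCompressionRetriever({
  baseCompressor: new CohereRerank({ topN: 3 }),
  baseRetriever: retrieverFood,
});
const compressedBeveragesRetriever = new ContextualCompressionRetriever({
  baseCompressor: new CohereRerank({ topN: 3 }),
  baseRetriever: retrieverBeverages,
});
```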
System Info
LangChain (TypeScript)