Documentation for RAGStack TS (#372)

nicoloboschi · web-flow · commit bec60078f3a2 · 2024-05-31T13:06:16.000+02:00
diff --git a/docs/modules/ROOT/nav.adoc b/docs/modules/ROOT/nav.adoc
@@ -5,6 +5,11 @@
 * xref:ROOT:packages.adoc[]
 * xref:ROOT:migration.adoc[]
 
+UNCOMMENT WHEN READY TO PUBLISH
+.Get Started (TS/JS)
+* xref:ragstack-ts:quickstart.adoc[]
+* xref:ragstack-ts:migration.adoc[]
+
 .RAG Default architecture
 * xref:default-architecture:index.adoc[]
 * xref:default-architecture:loading.adoc[]
@@ -51,4 +56,5 @@
 * xref:ROOT:tests.adoc[]
 
 .Release notes
-* xref:ROOT:changelog.adoc[]
+* xref:ROOT:changelog.adoc[]
+* xref:ragstack-ts:changelog.adoc[]
diff --git a/docs/modules/ragstack-ts/pages/changelog.adoc b/docs/modules/ragstack-ts/pages/changelog.adoc
@@ -0,0 +1,88 @@
+= Changelog (TS/JS)
+
+The RAGStack changelog provides information about new features, bug fixes, dependency versions and breaking changes in each release.
+
+For more, see:
+
+* https://github.com/datastax/ragstack-ai-ts/releases[GitHub releases^]{external-link-icon}
+
+* https://www.npmjs.com/package/@datastax/ragstack-ai[NPM^]{external-link-icon}
+
+
+== 1.0.0
+
+[caption=]
+.Dependencies
+[%autowidth]
+[cols="2*",options="header"]
+|===
+| Package | Version
+
+
+| @datastax/astra-db-ts
+| 1.0.1
+
+| @langchain/azure-openai
+| 0.0.2
+
+| @langchain/community
+| 0.2.4
+
+| @langchain/core
+| 0.2.2
+
+| @langchain/google-vertexai
+| 0.0.17
+
+| @langchain/openai
+| 0.0.34
+
+| cassandra-driver
+| 4.7.2
+
+| langchain
+| 0.2.3
+
+| langsmith
+| 0.1.8
+
+
+|===
+
+
+== 0.1.1
+
+[caption=]
+.Dependencies
+[%autowidth]
+[cols="2*",options="header"]
+|===
+| Package | Version
+
+
+| @datastax/astra-db-ts
+| 0.1.4
+
+| @langchain/azure-openai
+| 0.0.2
+
+| @langchain/community
+| 0.0.33
+
+| @langchain/core
+| 0.1.36
+
+| @langchain/openai
+| 0.0.26
+
+| cassandra-driver
+| 4.7.2
+
+| langchain
+| 0.1.23
+
+| langsmith
+| 0.1.8
+
+
+|===
diff --git a/docs/modules/ragstack-ts/pages/migration.adoc b/docs/modules/ragstack-ts/pages/migration.adoc
@@ -0,0 +1,30 @@
+= Migrate to RAGStack
+
+Migrating existing LangChain applications to RAGStack is very easy.
+RAGStack comes with a set of pinned, tested versions of the LangChain libraries and integrations.
+
+The RAGStack CLI is the recommended way to manage your RAGStack projects.
+With the `install` command you can safely add or change the RAGStack version without worrying about transitive dependencies versions.
+This is especially important because RAGStack is a stack of multiple packages that are tested together for compatibility, performance, and security.
+
+You don't need to install the CLI, using `npx` is the recommended way to run it.
+
+Move your terminal to the project you want to install RAGStack in and run the following command:
+[source,bash]
+----
+npx @datastax/ragstack-ai-cli install
+----
+
+This command will modify the `package.json`, install `@datastax/ragstack-ai` and refresh your local dependencies.
+The supported package managers are `npm` and `yarn` (both classic and berry).
+
+The CLI automatically detects the package manager you are using and installs the correct version of RAGStack.
+However, if you never built the project before, it's recommended to force a specific package manager by setting the `--use-npm` or `--use-yarn` option.
+
+
+`@datastax/ragstack-ai` only includes a subset of the LangChain libraries, therefore you might want to keep some `@langchain/*` packages that are not included in RAGStack.
+To check what packages are included in RAGStack, you can run the following command:
+[source,bash]
+----
+npm show @datastax/ragstack-ai dependencies
+----
diff --git a/docs/modules/ragstack-ts/pages/quickstart.adoc b/docs/modules/ragstack-ts/pages/quickstart.adoc
@@ -0,0 +1,164 @@
+= Quickstart with RAGStack for TS
+
+This quickstart demonstrates a basic RAG pattern using RAGStack TS and the vector-enabled {db-serverless} database to retrieve context and pass it to a language model for generation.
+
+1. <<Construct information base>>
+2. <<Basic retrieval>>
+3. <<Generation with augmented context>>
+
+== Setup
+
+RAGStack TS includes all the standard libraries you need for the RAG pattern, including the vector database, embeddings pipeline, and retrieval.
+
+. Create a new project using NPM or Yarn:
++
+[tabs]
+======
+NPM::
++
+[source,bash]
+----
+npm init
+----
+
+Yarn::
++
+[source,console]
+----
+yarn init
+----
+======
+
+. Then add the RAGStack package via the CLI:
++
+[tabs]
+======
+NPM::
++
+[source,bash]
+----
+npx @datastax/ragstack-ai-ts install --use-npm
+----
+
+Yarn::
++
+[source,console]
+----
+npx @datastax/ragstack-ai-ts install --use-yarn
+----
+======
++
+. Set the AstraDB vector credentials. If you don't have a vector database, create one at https://astra.datastax.com/.
++
+[source,bash]
+----
+export ASTRA_DB_APPLICATION_TOKEN=AstraCS:xx
+export ASTRA_DB_API_ENDPOINT=https://xx.apps.astra.datastax.com
+----
+The {db-serverless} application token is associated automatically with the Database Administrator permission. An auth token example: `AstraCS:WSnyFUhRxsrg...`).
++
+Both the endpoint and the token are available in the {astra-ui}.
++
+. Create an OpenAI key at https://platform.openai.com/ and set it as an environment variable:
++
+[source,bash]
+----
+export OPENAI_API_TOKEN=sk-xx
+----
+
+== RAG workflow
+
+With your environment set up, you're ready to create a RAG workflow in Javascript.
+Create a new file, `index.js`, and copy the following code:
+
+[source,javascript]
+----
+const { OpenAIEmbeddings, ChatOpenAI } = require("@langchain/openai")
+const { AstraDBVectorStore } = require("@langchain/community/vectorstores/astradb")
+const { ChatPromptTemplate } = require("@langchain/core/prompts")
+const { RunnableSequence, RunnablePassthrough } = require("@langchain/core/runnables")
+const { StringOutputParser } = require("@langchain/core/output_parsers")
+
+
+async function main() {
+    // create the embeddings object with the OpenAI API key
+    const embeddings = new OpenAIEmbeddings()
+
+    // AstraDB connection parameters
+    const astra = {
+        token: process.env.ASTRA_DB_APPLICATION_TOKEN,
+        endpoint: process.env.ASTRA_DB_API_ENDPOINT,
+        collection: "demo",
+        collectionOptions: {
+            vector: {
+                dimension: 1536, /** 1536 for OpenAI embeddings */
+                metric: "cosine",
+            },
+        }
+    }
+
+    /** Index some text into the Astra Vector Store */
+
+    const vectorStore = await AstraDBVectorStore.fromTexts(
+        [
+            "RAGStack is a framework for building RAG applications",
+            "RAGStack has first-class support for AstraDB and Cassandra",
+        ],
+        [{source: "documentation"}, {source: "documentation"}],
+        embeddings,
+        astra
+    )
+    /** Now prepare the retrieval  */
+    const prompt = ChatPromptTemplate.fromMessages([
+        ["system", "You're an helpful assistant. Help the user to understand what is RAGStack. Use only information provided in the CONTEXT.\nCONTEXT:\n{context}"],
+        ["human", "{question}"],
+    ])
+
+    const docParser = (docs) => {
+        const formatted = docs.map((doc, i) => {
+            return `<doc id='${i}'>${doc.pageContent}</doc>`
+        }).join("\n")
+        return formatted
+    }
+
+    const chain = RunnableSequence.from([
+        {
+            context: vectorStore.asRetriever().pipe(docParser),
+            question: new RunnablePassthrough(),
+        },
+        prompt,
+        new ChatOpenAI({}),
+        new StringOutputParser()
+    ]);
+    /** Finally ask a question about RAGStack to the chatbot */
+    const answer = await chain.invoke("What is RAGStack?")
+    console.log("Answer:", answer)
+}
+main()
+----
+
+After that, you can run the script with Node.js:
+[source,bash]
+----
+node index.js
+>Connected to Astra DB collection
+>Answer:  RAGStack is a framework for building RAG applications. It also has first-class support for AstraDB and Cassandra.
+----
+
+== Upgrade RAGStack version
+After you have installed the RAGStack package, you can upgrade it to the latest version using the re-running the cli command:
+[source,bash]
+----
+npx @datastax/ragstack-ai-ts install
+----
+or you can upgrade to a specific version:
+[source,bash]
+----
+npx @datastax/ragstack-ai-ts install x.y.z
+----
+
+
+== What's next?
+
+* xref:what-is-rag.adoc[]: Learn more about the RAG pattern.
+