GitHub - NguyenVietAn1824/KDN_langchain_chatbot_2024

Build a RAG application using Langchain with advanced skills

Introduction

This project implements the Retrieval-Augmented Generation (RAG) model using Langchain. The goal of this application is to provide an application that improves the retrieval performance of a dataset of law questions, contributing to providing answers quickly and efficiently. In this project, I have used some advanced techniques such as semantic router to analyze the semantics of questions, adding historical reflection to improve the flow of user conversations, and combining retrieval techniques to improve retrieval efficiency. The LLM model used in this project is google-gemma2b to perform answer generation.

Features

Langchain: A framework that simplifies working with language models and integrates multiple components like retrieval and generation.
Retrieval-Augmented Generation (RAG): A machine learning technique that enhances text generation by retrieving relevant information from an external knowledge base.
Semantic router : With Semantic Router, I use a set of available samples, representing the topic of the rule, from which I calculate the similarity of the question with this data set, and from there make a decision about the question type, reduce the need to include distracting questions in the LLM.
History reflection : Improving query answering.
Streamlit framework : A Framework for building interfaces.
Google-gemma2b : Model to perform answer generation.

Installation

Clone the repository:

git clone https://github.com/yourusername/project-name.git

Install dependencies:
```
pip install -r requirements.txt
```
Run Project
```
streamlit run app.py
```

PipeLine

User Query: Input is the user's question.
Chat History: Summarizes the previous history of the conversation and adds it to the current question.
Semantic Router: Provides semantic orientation for user questions.
RAGs System: Uses semantic and keyword retrieval to increase searchability.
LLM: Uses gemma:2b to generate the final answer.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
data		data
src		src
vector_store		vector_store
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Build a RAG application using Langchain with advanced skills

Introduction

Features

Installation

PipeLine

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Build a RAG application using Langchain with advanced skills

Introduction

Features

Installation

PipeLine

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages