Warning
Please be aware that, for development purposes, these experiments use experimental Large Language Models (LLMs) not intended for production. They can produce inaccurate information, hallucinated statements, and offensive text, either by random chance or through malicious prompts.
- Under development / Experimental
- Tested on macOS & Windows only
- Peer-reviewed
- Depends on external APIs
This is an experimental application for semantic search of statistical publications. It uses LangChain and HuggingFace to implement a fairly simple Retrieval Augmented Generation (RAG) pipeline that combines embedding-based search with a question-answering information retrieval step.
Upon receiving a query, documents are returned as search results, with embedding similarity used to score relevance. The relevant text is then passed to a Large Language Model (LLM), which is prompted to answer the original question, where possible, using only the information contained within those documents.
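As a rough sketch of that retrieve-then-answer flow, the example below uses LangChain with a HuggingFace embedding model and a FAISS index; the model name, the toy corpus, and the prompt wording are illustrative assumptions rather than this project's actual configuration.

```python
# Minimal retrieve-then-answer sketch (illustrative only, not the project's code).
# Assumes langchain-community, sentence-transformers and faiss-cpu are installed.
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS

embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")

# Toy corpus standing in for the scraped statistical publications.
docs = [
    "Inflation rose by 2.1% in the twelve months to March.",
    "The employment rate was estimated at 75.7% for the latest quarter.",
]
vector_store = FAISS.from_texts(docs, embeddings)

# Step 1: embedding similarity search returns the most relevant passages.
query = "What was the employment rate last quarter?"
results = vector_store.similarity_search_with_score(query, k=2)
context = "\n".join(doc.page_content for doc, _score in results)

# Step 2: the retrieved text is handed to an LLM, which is prompted to answer
# using only the supplied context (the LLM call itself is omitted for brevity).
prompt = (
    "Answer the question using only the context below. "
    "If the answer is not in the context, say so.\n\n"
    f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
)
print(prompt)
```

In the real application the assembled prompt would be sent to the configured LLM, but the retrieval-then-prompting pattern is the same.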
To use this application, you will need to set up a vector store with the relevant documents, which can be done by scraping the relevant websites and processing the PDF documents into JSON files, along the lines sketched below.
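The sketch below shows one way such a vector store could be populated from the processed JSON files; the `data/json` directory, the `text`/`url` field names, and the save path are hypothetical and will differ from the project's actual schema.

```python
# Hedged sketch of building and persisting a FAISS vector store from JSON files.
import json
from pathlib import Path

from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS

embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")

# Each JSON file is assumed to hold a list of {"text": ..., "url": ...} records
# produced by the scraping and PDF-processing steps.
texts, metadatas = [], []
for path in Path("data/json").glob("*.json"):
    for record in json.loads(path.read_text()):
        texts.append(record["text"])
        metadatas.append({"source": record.get("url", path.name)})

vector_store = FAISS.from_texts(texts, embeddings, metadatas=metadatas)
vector_store.save_local("vector_store")  # reload later with FAISS.load_local
```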
To get started with the project, please refer to the following documentation in the docs folder, reading it in the order given:
The tool can be interacted with in several ways, and the following guides will help you get started:
To tailor the tool to your use case and deploy it effectively, the following documentation is also available:
The code, unless otherwise stated, is released under the MIT License.