LLM with RAG BaseLine

LLM : "QuantFactory/Meta-Llama-3-8B-Instruct-GGUF" with llama_cpp(0.2.64)

Embedding model : "BAAI/bge-m3"

VectorDB : milvus(2.4.0)

Web demo : gradio(4.28)

RAG data : 한국어 대학 강의 데이터
(https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=data&dataSetSn=71627)

Data path

1. USER -> Gradio : Question

2. Gradio : Embedding question uing BGE-M3

3. Gradio -> Milvus server : Search context most related with question

4. Milvus server -> Gradio : context related with question

5. Gradio -> LLaMA3 server : Question with RAG context

6. LLaMA3 server -> Gradio : Answer

7. Gradio -> USER : Answer

Deploy path

1. create milvus standalone server with docker : "wget https://github.com/milvus-io/milvus/releases/download/v2.4.0/milvus-standalone-docker-compose.yml -O docker-compose.yml"

2. 한국어 대학 강의 데이터(JSON->CSV) -> embedded by bge-m3 -> create milvus collection & store the data in milvus collection

3. Create OpenAI Compatible server with llama-cpp & Llama-3

4. Create a Gradio server with BGE-M3 to create embedding vectors

** llama-cpp-python 설치 시, gpu 사용을 위해서 환경변수 설정이 필요할 수 있습니다.

** config로 llama-cpp서버 올리는 것은 https://llama-cpp-python.readthedocs.io/en/latest/server/#configuration-and-multi-model-support 를 참고하세요.

** LLaMA-3의 한국어 능력에 문제가 있어 한국어에 대한 성능을 기대하고 사용할 수 없습니다.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
JSON_to_CSV.py		JSON_to_CSV.py
README.MD		README.MD
data_to_milvus.py		data_to_milvus.py
gradio_llama3_4bit.py		gradio_llama3_4bit.py
llama_cpp_config		llama_cpp_config
push_test.MD		push_test.MD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM with RAG BaseLine

Data path

Deploy path

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM with RAG BaseLine

Data path

Deploy path

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages