RAG Performance & Fairness Evaluation Toolkit (OpenVINO + LangChain) #3114

pkhara31 · 2025-11-12T06:56:38Z

This toolkit enables developers to build, evaluate, and optimize Retrieval-Augmented Generation (RAG) applications with comprehensive quality metrics including accuracy, bias detection, and perplexity analysis plus a racial-bias indicator. This uses RAG pipeline optimized with Intel OpenVINO for enhanced performance on CPU, GPU, and NPU. The pipeline leverages:

Optimum-Intel’s OVModelForCausalLM with the OpenVINO backend for efficient inference.
LangChain for orchestration of document loading, chunking, embedding, retrieval, reranking, and generation.
Goal: Provide a portable notebook-driven workflow for rapid experimentation, model comparison, and validation of RAG systems on custom/private corpora.

…. The toolkit computes standard metrics (BERT, BLEU, ROUGE, perplexity score) and a racial-bias indicator, and it is implemented using Optimum-Intel’s OVModelForCausalLM with the OpenVINO backend and LangChain for orchestration.

review-notebook-app · 2025-11-12T06:56:43Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

sbalandi · 2025-11-19T17:36:32Z

Hi @pkhara31 , the notebook https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/llm-rag-langchain/llm-rag-langchain.ipynb seems to cover the same tasks you described in comment. Could you please check ? Does this match your idea of the notebook you wanted to add ?
Let's discuss this point and leave only something new in this notebook or update the existing notebook

pkhara31 · 2025-11-20T05:15:38Z

Hi @sbalandi , This NB covers the methodology to evaluate the performance of RAG pipeline by computing the BERT, BLEU, ROGUE, perplexity, racial bias scores.
This notebook also provides ability to scrap web URLs and implement RAG pipeline on the web content as well.
These are the key deltas & additions to the existing notebook on RAG.
Let me know if any more questions!

openvino-dev-samples · 2025-11-20T05:25:58Z

Thanks for your contribution @pkhara31 Do you think if we can add this NB to RAG notebook, as a separate .ipynb, so users can directly evaluate their RAG system with selected models in your pipeline ?

openvino-dev-samples · 2025-11-20T05:41:17Z

Another idea is to change this .ipynb to an evaludation_helper.py which can be called inside current NB, and users can evaluate the RAG pipeline without loading the model again. Whats your thought ? @pkhara31

pkhara31 · 2025-11-20T05:46:29Z

The model loading redundancy can be avoided, but I think the evaluation and the ability to apply RAG pipeline on web docs (scraping) should still remain in the helper.

pkhara31 · 2025-12-01T10:46:40Z

May I know the latest update on this?

sbalandi · 2025-12-01T18:51:30Z

May I know the latest update on this?

Hi, sorry for the delay, I'm in the process of reviewing notebook, I'll write comments within a week

sbalandi · 2025-11-28T15:03:37Z

notebooks/llm-rag-ov-langchain/requirements.txt

+langchain_community
+msoffcrypto-tool
+docx2txt
+urllib


I don't think we need urllib , os , typing here as os/typing are build in modules
also, please, move requiremetns to notebook and use pip_install(), please, check example in https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/llm-rag-langchain/llm-rag-langchain.ipynb , Prerequisites section

sbalandi · 2025-12-05T19:25:12Z

notebooks/llm-rag-ov-langchain/ov_rag_evaluator.ipynb

@@ -0,0 +1,662 @@
+{


Why these modules needed ?
Please, add openvino, "optimum[openvino,nncf,onnxruntime]"

Reply via ReviewNB

sbalandi · 2025-12-05T19:25:12Z

notebooks/llm-rag-ov-langchain/ov_rag_evaluator.ipynb

@@ -0,0 +1,662 @@
+{


we can use pre-converted model here https://huggingface.co/OpenVINO/Phi-3-mini-4k-instruct-int4-ov
also, please, save model via model.save_pretrained("ov_model")

Reply via ReviewNB

sbalandi · 2025-12-05T19:25:12Z

notebooks/llm-rag-ov-langchain/ov_rag_evaluator.ipynb

@@ -0,0 +1,662 @@
+{


embedding_model_name = "BAAI/bge-small-en-v1.5"
It would be also great to save the model so you don't have to download it every time

embedding = OpenVINOBgeEmbeddings( model_name_or_path=embedding_model_name, model_kwargs=embedding_model_kwargs, encode_kwargs=encode_kwargs, )

Reply via ReviewNB

sbalandi · 2025-12-05T19:26:22Z

notebooks/llm-rag-ov-langchain/ov_rag_evaluator.ipynb

@@ -0,0 +1,662 @@
+{


Line #11. response = urlopen(req)
I haven't managed to load it on linux, but looks like on windows works well. Did you check it on Linux ?

Reply via ReviewNB

pkhara31 · 2025-12-08T09:14:08Z

@sbalandi I have made the suggested changes, can you please have a look once? I did not get to check it on Linux. Please let me know.

openvino-dev-samples · 2025-12-12T05:55:04Z

Another idea is to change this .ipynb to an evaludation_helper.py which can be called inside current NB, and users can evaluate the RAG pipeline without loading the model again. Whats your thought ? @pkhara31

Hi @pkhara31 any feedback on this idea? Would you mind to consolidate this notebook into RAG notebook. Please let me know your concern.

pkhara31 · 2025-12-15T04:34:26Z

Another idea is to change this .ipynb to an evaludation_helper.py which can be called inside current NB, and users can evaluate the RAG pipeline without loading the model again. Whats your thought ? @pkhara31

Hi @pkhara31 any feedback on this idea? Would you mind to consolidate this notebook into RAG notebook. Please let me know your concern.

Yes, I think it should be OK.

pkhara31 · 2025-12-22T04:19:25Z

Any updates here? @sbalandi

github-actions · 2026-01-06T00:09:31Z

This PR will be closed in a week because of 2 weeks of no activity.

pkhara31 · 2026-01-06T04:41:20Z

Do we have any updates on this PR/next steps ? @sbalandi

sbalandi · 2026-01-15T14:31:16Z

Hi, @pkhara31 thank you for the changes ! they look good for me, but further checks and review are transferred to @openvino-dev-samples , please, contact him for any questions

FYI @aleksandr-mokrov

openvino-dev-samples · 2026-01-16T00:33:40Z

hi @pkhara31 thanks for your update, and how about moving this notebook into [notebooks/llm-rag-langchain] instead of [notebooks/llm-rag-ov-langchain]. Then you can also merge the README into existing one.

pkhara31 · 2026-01-16T04:26:10Z

Hi @openvino-dev-samples , I am already aligned to move this notebook into llm-rag-langchain, but may I know if that is something you would help out?

Thanks.

openvino-dev-samples · 2026-01-16T05:01:53Z

Hi @openvino-dev-samples , I am already aligned to move this notebook into llm-rag-langchain, but may I know if that is something you would help out?

Thanks.

Thanks. It generally looks good to me, but I would like to test it with my local environment if you could help to move it, then I can give you detailed feedback.

Added a notebook (.ipynb) code to evaluate the effectiveness of the RAG by computing standard metrics to evaluate the RAG performance.

Update README.md with the steps to execute the RAG evaluation notebook.

openvino-dev-samples · 2026-01-17T06:06:15Z