Commit f52befd

Merge pull request #52 from sciknoworg/dev: documentation update

2 parents 6459fbf + 3d8b40b commit f52befd

File tree

24 files changed

+1379
-621
lines changed


docs/source/_static/custom.css

Lines changed: 12 additions & 4 deletions
@@ -5,9 +5,9 @@
     height: auto;
 }

-.nav-item:nth-child(-n+5) {
-    display: none; /* Hides the first four nav items */
-}
+/*.nav-item:nth-child(-n+4) {*/
+/*    display: none; !* Hides the first four nav items *!*/
+/*}*/

 .full-width {
     display: block;
@@ -61,7 +61,7 @@
 /* Medium screens (e.g. tablets) */
 @media (max-width: 1024px) {
     .content:not(.custom) {
-        max-width: 90%;
+        max-width: 100%;
     }
 }

@@ -71,3 +71,11 @@
         max-width: 100%;
     }
 }
+
+.project-vision {
+    background-color: #e6f7ff;
+    padding: 1em;
+    border-left: 5px solid #ffbe18;
+    margin-bottom: 1.2em;
+    font-size: 1.1em;
+}

docs/source/_static/custom.js

Lines changed: 70 additions & 29 deletions
Some generated files are not rendered by default.

docs/source/aligner/lightweight.rst

Lines changed: 203 additions & 178 deletions
Large diffs are not rendered by default.

docs/source/aligner/llm.rst

Lines changed: 231 additions & 125 deletions
Large diffs are not rendered by default.

docs/source/aligner/rag.rst

Lines changed: 250 additions & 10 deletions
@@ -1,7 +1,10 @@
-Retrieval Augmented Generation
+Retrieval-Augmented Generation
 ================================

-This tutorial walks you through the process of ontology matching using the OntoAligner library, leveraging retrieval-augmented generation (RAG) techniques. Starting with the necessary module imports, it defines a task and loads source and target ontologies along with reference matchings. The tutorial then encodes the ontologies using a specialized encoder, configures a retriever and an LLM, and generates predictions. Finally, it demonstrates two postprocessing techniques—heuristic and hybrid—followed by saving the matched alignments in XML format, ready for use or further analysis.
+Usage
+----------------
+
+This guide walks you through ontology matching with the OntoAligner library using **retrieval-augmented generation (RAG)** techniques. It begins with the necessary module imports, then defines a task and loads the source and target ontologies along with the reference matchings. Next, it encodes the ontologies with a specialized encoder, configures a retriever and an LLM, and generates predictions. Finally, it demonstrates two postprocessing techniques, heuristic and hybrid, and saves the matched alignments in XML format, ready for use or further analysis.

 .. code-block:: python

@@ -70,13 +73,84 @@ In this tutorial, we demonstrated:
 * Refining results with heuristic and hybrid postprocessing
 * Saving results in XML format

-You can customize the configurations and thresholds based on your specific dataset and use case. For more details, refer to the :doc:`../package_reference/postprocess`
+.. hint::
+
+   You can customize the configurations and thresholds based on your specific dataset and use case. For more details, refer to :doc:`../package_reference/postprocess`.
+
+Embedded RAG aligners within OntoAligner:
+
+.. list-table::
+   :widths: 30 60 10
+   :header-rows: 1
+
+   * - RAG Aligner
+     - Description
+     - Link
+
+   * - ``FalconLLMAdaRetrieverRAG``
+     - Uses Falcon LLM with Ada-based dense retrieval.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L85-L94>`__
+
+   * - ``FalconLLMBERTRetrieverRAG``
+     - Uses Falcon LLM with BERT-based retrieval for contextual matching.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L95-L102>`__
+
+   * - ``GPTOpenAILLMAdaRetrieverRAG``
+     - Uses OpenAI GPT (e.g., GPT-4) with an Ada-based retriever.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L65-L73>`__
+
+   * - ``GPTOpenAILLMBERTRetrieverRAG``
+     - Combines OpenAI GPT models with BERT-based retrieval.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L75-L83>`__
+
+   * - ``LLaMALLMAdaRetrieverRAG``
+     - Wraps LLaMA models with an Ada retriever for hybrid RAG-based alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L25-L33>`__
+
+   * - ``LLaMALLMBERTRetrieverRAG``
+     - Uses LLaMA models with BERT for semantic retrieval.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L35-L43>`__
+
+   * - ``MPTLLMAdaRetrieverRAG``
+     - Utilizes MPT models with an Ada retriever for alignment generation.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L125-L132>`__
+
+   * - ``MPTLLMBERTRetrieverRAG``
+     - MPT model with BERT-based retrieval for enhanced context grounding.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L135-L142>`__
+
+   * - ``MambaLLMAdaRetrieverRAG``
+     - Uses Mamba LLM with an Ada retriever for token-efficient alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L145-L152>`__
+
+   * - ``MambaLLMBERTRetrieverRAG``
+     - Mamba LLM paired with a BERT retriever for structured knowledge alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L155-L162>`__
+
+   * - ``MistralLLMAdaRetrieverRAG``
+     - Mistral model with an Ada retriever for compact and fast RAG workflows.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L45-L52>`__
+
+   * - ``MistralLLMBERTRetrieverRAG``
+     - Mistral model enhanced with BERT-based retrieval.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L55-L63>`__
+
+   * - ``VicunaLLMAdaRetrieverRAG``
+     - Vicuna model using Ada retrieval for alignment generation.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L105-L112>`__
+
+   * - ``VicunaLLMBERTRetrieverRAG``
+     - Vicuna model with BERT retriever for high-accuracy RAG-based alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/rag/models.py#L115-L122>`__
+
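To make the retrieve-then-judge loop described above concrete, here is a minimal, self-contained Python sketch of the RAG alignment pattern. It is not OntoAligner's API: the string-similarity retriever and the threshold-based ``judge_match`` are illustrative stand-ins for the dense (Ada/BERT) retriever and the LLM's yes/no decision.

```python
from difflib import SequenceMatcher

def retrieve_candidates(source_concept, target_concepts, top_k=3):
    """Stand-in retriever: rank target concepts by surface similarity.
    (OntoAligner's RAG aligners use dense retrievers such as Ada or BERT.)"""
    scored = [(t, SequenceMatcher(None, source_concept.lower(), t.lower()).ratio())
              for t in target_concepts]
    return sorted(scored, key=lambda x: x[1], reverse=True)[:top_k]

def judge_match(source_concept, candidate, score, threshold=0.8):
    """Stand-in for the LLM's yes/no judgment over a retrieved candidate."""
    return score >= threshold

def align(source_concepts, target_concepts):
    """Retrieve-then-judge loop: the core RAG alignment pattern."""
    matches = []
    for s in source_concepts:
        for t, score in retrieve_candidates(s, target_concepts):
            if judge_match(s, t, score):
                matches.append((s, t, round(score, 2)))
    return matches

matches = align(["Conference", "Author"], ["conference", "writer", "paper"])
# e.g. [('Conference', 'conference', 1.0)]
```

In the real pipeline, the retriever narrows the candidate space so the LLM only judges a handful of plausible pairs per source concept, which is what keeps RAG-based alignment tractable on large ontologies.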

-FewShot RAG
+
+
+FewShot-RAG Aligner
 ------------------------
-This tutorial works based on FewShot RAG matching, an extension of the RAG model, designed for few-shot learning tasks. The FewShot RAG workflow is the same as RAG but with two differences:
+The FewShot-RAG aligner extends the RAG aligner for few-shot-learning-based alignment. The FewShot-RAG workflow is the same as RAG, with two differences:

-1. You only need to use FewShot encoders as follows, and since a fewshot model uses multiple examples you might also provide only specific examples from reference or other examples as a fewshot samples.
+1. You only need to use ``FewShotEncoder`` encoders as follows; since a few-shot model consumes multiple examples, you may also supply specific examples from the reference alignment (or from other sources) as few-shot samples.

 .. code-block:: python

@@ -95,8 +169,80 @@ This tutorial works based on FewShot RAG matching, an extension of the RAG model

     model = MistralLLMBERTRetrieverFSRAG(positive_ratio=0.7, n_shots=5, **config)
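The constructor above mixes positive and negative demonstration pairs into the prompt. The following is a hypothetical sketch of how ``positive_ratio`` and ``n_shots`` could interact; ``sample_fewshot``, ``reference``, and ``negatives`` are illustrative names, not OntoAligner API.

```python
import random

def sample_fewshot(reference, negatives, n_shots=5, positive_ratio=0.7, seed=0):
    """Mix positive (reference) and negative concept pairs into an n-shot sample.
    Hypothetical helper illustrating the positive_ratio / n_shots interaction."""
    rng = random.Random(seed)
    n_pos = round(n_shots * positive_ratio)   # demonstrations labeled "yes"
    n_neg = n_shots - n_pos                   # demonstrations labeled "no"
    shots = [(s, t, "yes") for s, t in rng.sample(reference, n_pos)]
    shots += [(s, t, "no") for s, t in rng.sample(negatives, n_neg)]
    rng.shuffle(shots)  # avoid ordering bias in the prompt
    return shots

shots = sample_fewshot(
    reference=[("Paper", "Article"), ("Author", "Writer"),
               ("Venue", "Location"), ("Review", "Assessment")],
    negatives=[("Paper", "Writer"), ("Author", "Location")],
    n_shots=5, positive_ratio=0.7)
```

With ``n_shots=5`` and ``positive_ratio=0.7`` this yields four "yes" and one "no" demonstration per prompt.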
-In-Context Vectors RAG
-------------------------
+Embedded FewShot-RAG aligners within OntoAligner:
+
+.. list-table::
+   :widths: 30 60 10
+   :header-rows: 1
+
+   * - FewShot-RAG Aligner
+     - Description
+     - Link
+
+   * - ``FalconLLMAdaRetrieverFSRAG``
+     - Falcon LLM with Ada retriever and few-shot examples for enhanced alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L87-L95>`__
+
+   * - ``FalconLLMBERTRetrieverFSRAG``
+     - Falcon LLM with BERT-based retrieval in a few-shot setup.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L97-L105>`__
+
+   * - ``GPTOpenAILLMAdaRetrieverFSRAG``
+     - OpenAI GPT with Ada retriever for few-shot RAG alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L67-L75>`__
+
+   * - ``GPTOpenAILLMBERTRetrieverFSRAG``
+     - Combines OpenAI GPT and a BERT retriever with few-shot prompting.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L77-L84>`__
+
+   * - ``LLaMALLMAdaRetrieverFSRAG``
+     - LLaMA model with Ada retriever for prompt-efficient few-shot alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L27-L34>`__
+
+   * - ``LLaMALLMBERTRetrieverFSRAG``
+     - LLaMA with BERT retriever in a few-shot reasoning framework.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L37-L44>`__
+
+   * - ``MPTLLMAdaRetrieverFSRAG``
+     - MPT LLM with Ada-based retrieval for few-shot alignment generation.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L127-L134>`__
+
+   * - ``MPTLLMBERTRetrieverFSRAG``
+     - MPT model using BERT retriever and few-shot prompting for improved accuracy.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L137-L144>`__
+
+   * - ``MambaLLMAdaRetrieverFSRAG``
+     - Mamba LLM integrated with Ada retriever for low-latency few-shot alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L147-L154>`__
+
+   * - ``MambaLLMBERTRetrieverFSRAG``
+     - Mamba model paired with BERT-based retrieval and few-shot capabilities.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L157-L164>`__
+
+   * - ``MistralLLMAdaRetrieverFSRAG``
+     - Mistral LLM with Ada retriever and few-shot support.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L47-L54>`__
+
+   * - ``MistralLLMBERTRetrieverFSRAG``
+     - Mistral model with BERT retrieval, enhanced by few-shot prompting.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L57-L64>`__
+
+   * - ``VicunaLLMAdaRetrieverFSRAG``
+     - Vicuna model with Ada retriever for fast few-shot alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L107-L114>`__
+
+   * - ``VicunaLLMBERTRetrieverFSRAG``
+     - Vicuna with BERT retriever in a few-shot setting for high-precision alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/fewshot/models.py#L117-L124>`__
+
+ICV-RAG Aligner
+---------------------------------
+
+.. sidebar:: Citation
+
+   [1] Liu, S., Ye, H., Xing, L., & Zou, J. (2023). `In-context vectors: Making in context learning more effective and controllable through latent space steering <https://arxiv.org/abs/2311.06668>`_. arXiv preprint arXiv:2311.06668.
+
+
 This RAG variant performs ontology matching using ``ConceptRAGEncoder`` only. The in-context vectors (ICV) approach introduced by [1] (https://github.com/shengliu66/ICV) recasts in-context learning as latent-space steering vectors. OntoAligner applies this perspective to LLMs within the RAG module. The workflow is the same as RAG or FewShot-RAG, with the following differences:


@@ -108,7 +254,7 @@ This RAG variant performs ontology matching using ``ConceptRAGEncoder`` only. Th
     encoder_model = ConceptRAGEncoder()
     encoded_ontology = encoder_model(source=dataset['source'], target=dataset['target'], reference=dataset['reference'])

-2. Next, import an ICVRAG model, here we use Falcon model:
+2. Next, import an ICV-RAG aligner; here we use the Falcon model:

 .. code-block:: python

@@ -118,4 +264,98 @@ This RAG variant performs ontology matching using ``ConceptRAGEncoder`` only. Th
     model.load(llm_path="tiiuae/falcon-7b", ir_path="all-MiniLM-L6-v2")

-[1] Liu, S., Ye, H., Xing, L., & Zou, J. (2023). `In-context vectors: Making in context learning more effective and controllable through latent space steering <https://arxiv.org/abs/2311.06668>`_. arXiv preprint arXiv:2311.06668.
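The in-context vectors of Liu et al. [1] steer the LLM by shifting its hidden activations instead of prepending demonstrations to the prompt. The following toy sketch illustrates only the steering operation, under the simplifying assumption that a hidden state is a plain list of floats; ``steer`` and ``alpha`` are illustrative names, not part of OntoAligner or the ICV reference implementation, which injects vectors into transformer layer activations.

```python
def steer(hidden_state, icv, alpha=0.1):
    """Shift a hidden state along an in-context vector (ICV).

    Toy sketch of latent-space steering: add a scaled task vector,
    then renormalize to preserve the original activation norm.
    """
    shifted = [h + alpha * v for h, v in zip(hidden_state, icv)]
    norm = sum(h * h for h in hidden_state) ** 0.5
    snorm = sum(s * s for s in shifted) ** 0.5
    return [s * norm / snorm for s in shifted]

# Steering a unit vector fully toward an orthogonal ICV direction
h = steer([1.0, 0.0], [0.0, 1.0], alpha=1.0)
```

Because the demonstrations are distilled into a single vector, ICV-RAG avoids spending prompt tokens on examples, which is why only the ``ConceptRAGEncoder`` is needed.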
+Embedded ICV-RAG aligners within OntoAligner:
+
+.. list-table::
+   :widths: 30 60 10
+   :header-rows: 1
+
+   * - ICV-RAG Aligner
+     - Description
+     - Link
+
+   * - ``FalconLLMAdaRetrieverICVRAG``
+     - Falcon LLM with Ada retriever for in-context-vector (ICV) alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L47-L54>`__
+
+   * - ``FalconLLMBERTRetrieverICVRAG``
+     - Falcon LLM combined with a BERT-based retriever for ICV-guided alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L57-L65>`__
+
+   * - ``LLaMALLMAdaRetrieverICVRAG``
+     - LLaMA model with Ada retriever optimized for ICV-based reasoning.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L15-L31>`__
+
+   * - ``LLaMALLMBERTRetrieverICVRAG``
+     - LLaMA model paired with a BERT retriever for ICV-driven alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L27-L34>`__
+
+   * - ``MPTLLMAdaRetrieverICVRAG``
+     - MPT model with Ada retrieval for ICV-steered RAG alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L87-L94>`__
+
+   * - ``MPTLLMBERTRetrieverICVRAG``
+     - MPT LLM with BERT retriever in an ICV pipeline for robust alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L97-L104>`__
+
+   * - ``VicunaLLMAdaRetrieverICVRAG``
+     - Vicuna LLM with Ada retriever for ICV-RAG tasks.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L67-L74>`__
+
+   * - ``VicunaLLMBERTRetrieverICVRAG``
+     - Vicuna model paired with BERT-based retrieval for in-context-vector alignment.
+     - `Source <https://github.com/sciknoworg/OntoAligner/blob/main/ontoaligner/aligner/icv/models.py#L77-L84>`__
+
+
+Customized-RAG Aligner
+-----------------------
+
+.. sidebar:: Useful links:
+
+   * `OntoAlignerPipeline Experimentation <https://github.com/sciknoworg/OntoAligner/blob/main/examples/OntoAlignerPipeline-Exp.ipynb>`_
+
+You can use custom LLMs with RAG for alignment. Below, we define two classes, each combining a retrieval mechanism with an LLM to implement RAG aligner functionality.
+
+.. code-block:: python
+
+    from ontoaligner.aligner import (
+        TFIDFRetrieval,
+        SBERTRetrieval,
+        AutoModelDecoderRAGLLM,
+        AutoModelDecoderRAGLLMV2,
+        RAG
+    )
+
+    class QwenLLMTFIDFRetrieverRAG(RAG):
+        Retrieval = TFIDFRetrieval
+        LLM = AutoModelDecoderRAGLLMV2
+
+    class MinistralLLMBERTRetrieverRAG(RAG):
+        Retrieval = SBERTRetrieval
+        LLM = AutoModelDecoderRAGLLM
+
+As you can see, **QwenLLMTFIDFRetrieverRAG** utilizes ``TFIDFRetrieval`` as a lightweight retriever with a Qwen LLM, while **MinistralLLMBERTRetrieverRAG** employs ``SBERTRetrieval``, a sentence-transformers retriever, with a Ministral LLM.
+
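The subclassing pattern above works because the base ``RAG`` class instantiates whatever classes its ``Retrieval`` and ``LLM`` class attributes point to. Here is a minimal, self-contained sketch of that composition pattern; the class bodies are illustrative stubs, not OntoAligner's actual implementations.

```python
class Retrieval:
    """Stub retriever: returns candidate targets for a query."""
    def retrieve(self, query):
        return f"candidates-for-{query}"

class LLM:
    """Stub LLM: generates an answer from a prompt."""
    def generate(self, prompt):
        return f"answer({prompt})"

class RAG:
    """Base class wiring a retriever and an LLM via class attributes,
    mirroring the composition pattern used by OntoAligner's RAG aligners."""
    Retrieval = Retrieval
    LLM = LLM

    def __init__(self):
        # Instantiate whatever the subclass plugged in.
        self.retriever = self.Retrieval()
        self.llm = self.LLM()

    def run(self, query):
        return self.llm.generate(self.retriever.retrieve(query))

class KeywordRetrieval(Retrieval):
    def retrieve(self, query):
        return f"keyword-candidates-for-{query}"

class MyRAG(RAG):
    Retrieval = KeywordRetrieval  # swap the retriever by overriding the class attribute

print(MyRAG().run("Paper"))  # → answer(keyword-candidates-for-Paper)
```

Swapping a component is therefore a one-line change in the subclass, which is exactly how ``QwenLLMTFIDFRetrieverRAG`` and ``MinistralLLMBERTRetrieverRAG`` differ only in their ``Retrieval`` and ``LLM`` attributes.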
+
+**AutoModelDecoderRAGLLMV2 and AutoModelDecoderRAGLLM differences:**
+
+The primary distinction between ``AutoModelDecoderRAGLLMV2`` and ``AutoModelDecoderRAGLLM`` lies in the enhanced functionality of the former: ``AutoModelDecoderRAGLLMV2`` adds methods (shown below) for yes/no classification and token validation. Overall, these classes enable seamless integration of retrieval mechanisms with LLM-based generation, making them powerful tools for ontology alignment and other domain-specific applications.
+
+.. code-block:: python
+
+    def get_probas_yes_no(self, outputs):
+        """Retrieves the probabilities for the "yes" and "no" labels from model output."""
+        probas_yes_no = (outputs.scores[0][:, self.answer_sets_token_id["yes"] +
+                                              self.answer_sets_token_id["no"]].float().softmax(-1))
+        return probas_yes_no
+
+    def check_answer_set_tokenizer(self, answer: str) -> bool:
+        """Checks if the tokenizer produces a single token for a given answer string."""
+        return len(self.tokenizer(answer).input_ids) == 1
+
+
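Conceptually, ``get_probas_yes_no`` slices out the logits of the "yes" and "no" tokens and softmaxes over just those two. Stripped of the tensor machinery, the computation reduces to a two-way softmax; this pure-Python sketch (``yes_no_probability`` is an illustrative name, not OntoAligner API) shows the numerically stable form.

```python
import math

def yes_no_probability(yes_logit, no_logit):
    """Two-way softmax over the 'yes' and 'no' token logits,
    mirroring what get_probas_yes_no computes on tensors."""
    m = max(yes_logit, no_logit)  # subtract the max for numerical stability
    ey = math.exp(yes_logit - m)
    en = math.exp(no_logit - m)
    return ey / (ey + en), en / (ey + en)

p_yes, p_no = yes_no_probability(2.0, 0.0)
```

The two probabilities always sum to 1, so a single threshold on ``p_yes`` suffices to accept or reject a candidate mapping.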
+
+.. note::
+
+   Consider reading the following section next:
+
+   * `Package Reference > Aligners <../package_reference/aligners.html>`_
