Glareone/AI-RAG-In-Depth

Structured approach in AI and ML. Fundamentals and Advanced topics. RAG, Scoring & Profiling, LangChain & LangGraph, Certified Azure AI Engineer materials.

OpenAI and ChatGPT repo

My Workshops and Posts

My LinkedIn Posts & Presentations

  1. GenAI. Where could be applied. Post 1.pdf
  2. GenAI in Application Refactoring field, Slides.pdf
  3. Legal problems with AI.pdf
  4. Paradigms: Rag, Self-RAG, Re-Ranking RAG, FLARE v.2.pdf
  5. Working with opinionated requests. S2A, RLHF, RLAIF.pdf
  6. Multi-Modal RAG and its features.pdf
  7. Measuring the GenAI Quality.pdf
  8. LLM leveraging RLHF in code review
  9. Everything of Thoughts (XoT). All modern techniques in one place
  10. Non-deterministic embedding results
  11. AI Search vs PostgreSQL with pgvector in PROD
  12. Prod-Ready LLM Solutions. Cook Book.
  13. Quality Framework For RAG Applications.pdf
  14. Crew.AI. Agents in LLM Applications (In Progress)
  15. Pydantic data classes and how to manage the output format (In Progress)
  16. XML vs Markdown vs Json for tagging in prompting and metaprompting (In Progress)
  17. Crawlers for LLMs
  18. Table extraction in RAG systems (In Progress)
  19. Choosing the right programming language for your next AI LLM project
  20. Misjudgements using LogProbs (In Progress)

My Workshops

  1. June 2023. My Workshop Presentation. Run 1.pptx
  2. Online Workshop. ChatGPT -> Azure Function -> PowerAutomate. Run 2.pptx
  3. Online Workshop. Run 3. Deep Learning -> Prompting -> ChatGPT -> Azure Function -> PowerAutomate
  4. Online+Offline Workshop for EHU University
  5. Talk #3. RAG, FLARE, S2A, RLHF, RLAIF, Self-RAG, Re-Ranking. Common approaches and their pros & cons

Theoretical Part

  1. Six Principles of responsible AI
  2. Responsible AI. Trusted AI Framework. Content Filters. Harmful Content. Prerelease Reviews
  3. What Is ChatGPT Doing … and Why Does It Work?
  4. LLM UseCase in Google. Sorting Optimization
  5. Embeddings. Words to Vector. Useful in Search Scenarios and for Cognitive Search
  6. Cognitive Search. Video
  7. Cognitive Search. From Zero to Hero
  8. Cognitive Search. Indexers. AI Enrichment. Built-in Skills
  9. Transformers. Embeddings. Foundational Model
  10. Computer Vision. Cognitive. AI Face. Custom Vision
  11. Document Intelligence
  12. Azure AI Speech. Speech To Text. Text To Speech. Azure Services
  13. Natural Language Processing (NLP). Text Meaning and analysis. General ways how to
  14. Azure Language Service. Commands interpretation
  15. Azure Language Service. Question-Answer Knowledge base for bots. Question Answering service.
  16. Regression. Logistic and Linear Regression. Multiclass regression

Azure AI-102 Learn Materials useful for exam

  1. AI Search. Debug Search Issues
  2. AI Search. Performance and Monitoring
  3. AI Search. Search and Scoring
  4. AI Search. Implement Advanced Search Features. Scoring Profiles, Fuzzy Search, Term Boosting, Term Proximity
  5. AI Search. Scoring profile lab. Add Different Language descriptions
  6. AI Search. Enhance Index by translation using skills
  7. AI Search. Custom Skill using Azure Function
  8. AI Search. Use Custom Analyzers (not default Microsoft Lucene)
  9. AI Search. Geo-spatial functions
  10. AI Search. Knowledge Mining. Lab
  11. AI Search -> PowerBI Table Projection from OCR Document Intelligence
  12. Composed Document Intelligence Models. For cases where you need to analyze several document types
  13. Vision. Train a Custom Model using COCO
  14. Containers. AI Services in Containers, in AKS, ACI, or even locally
  15. Containers. Run services in Isolated Environment disconnected from the internet
  16. Analyze Video Indexer. Widgets Integration and API
  17. Semantic Ranking configuration in AI Search Index
  18. Knowledge Store & Knowledge Mining with AI Search
  19. Integrate OpenAI into App. Useful Lab
  20. Host Mistral and other models in AI Hub
  21. AI Language. Multi-turn multi-step conversation
  22. AI Language. Conversation Language understanding. Classical way to build AI-assistant. Utterances: Turn-on Turn-off & Smart home
  23. AI Language. Custom Named Entities Recognition. Laws, Business Cases
  24. Key Phrases Extraction from text, Sentiment Analysis, Linked Entities
  25. Translate speech to text. Materials
  26. Translate speech to text and synthesize the output if needed. Example
  27. AI Speech. Speech Synthesis
  28. Run Cognitive Services in Docker
  29. Custom Vision. Deploy Custom Vision on the edge devices (phones) using compact models
  30. Custom Vision. Recognize issues in factory. Upload images, Tag Images, and Train the Model
  31. Anomaly Detector for IoT. Univariate Detector (multi-stream)

Azure Search & Document Intelligence

  1. Cognitive Search. Video
  2. Cognitive Search. From Zero to Hero
  3. Cognitive Search. Indexers. AI Enrichment. Built-in Skills
  4. Document Intelligence

Machine Learning Materials

  1. Machine Learning
    a. Machine Learning lab by Microsoft
  2. How Deep Learning Works

Extra materials

  1. Vector Database selection & comparison. VectorDB
  2. Transformer Explainer - an interactive visualization tool designed to help anyone learn how Transformer-based models like GPT work
  3. Table extraction in RAG systems

Practical Part. Table of Content

  1. Example: ConsoleApp CommandGuess
  2. Example: Azure Function with ChatGPT (completion and chat-completion)
  3. Example: Integration with PowerAutomate
  4. Example: Integration with PowerApp
  5. Integration with Outlook (In progress)
  6. OpenAI + PowerAutomate Workshop by me.pptx
  7. Example: OpenAI + Redis
  8. BMW Dealer assistant. ChatGPT Chat + Startup + Redis + Context
  9. Get Embedding
  10. Form Recognizer Cognitive Service
  11. Content Filters (in progress)
  12. OpenAI straightforward examples
  13. Azure Bot Service & Chatbot Framework
  14. LangChain meets Go
  15. TensorZero Framework (In progress)
  16. Key Phrases Extraction. AI Language. Sentiment Analysis. Extracted Linked Entities
  17. AI Search and Custom Skill using Azure Function
  18. Document Intelligence, Best Practices (In progress)
  19. MCP Server example using FastMCP

Advanced Topics. Theory and Practice.

1. Advanced Evaluation Metrics & Methodologies

  1. Document Retrieval Metrics
    a. NDCG@K (Normalized Discounted Cumulative Gain) - Ranking quality with relevance grades
    b. Mean Reciprocal Rank (MRR) - First relevant document positioning. How quickly users find their first relevant result. Critical for RAG user experience. (See the sketch after this list.)
    c. Contextual Relevancy - How relevant is the retrieved context to the user's question?
    d. Expected Reciprocal Rank (ERR) - User behavior modeling with graded relevance
    e. Rank-Biased Precision (RBP) - Early result weighting strategies
    f. Embedding Quality Metrics - Intra-cluster vs inter-cluster distance analysis. Quality of your vector space - are similar documents close together
  2. Document Retrieval Metrics 2
    a. Fidelity - Measures recall quality - what percentage of all relevant documents in your dataset were actually retrieved in the top-n results.
    b. XDCG - Ranking quality within your retrieved top-k chunks, ignoring the rest of your document collection
    c. XDCG vs NDCG
    d. Max Relevance N - highest relevance score among your top-k retrieved chunks
    e. Holes - Counts missing ground truth data
  3. Response Quality Metrics
    a. F1, Recall, Precision. Fundamental metrics
    b. BLEU Score - N-gram overlap evaluation
    c. ROUGE (L/1/2) - Recall-oriented summarization metrics
    d. G-Eval (LLM as a judge) - Sophisticated evaluation framework that uses LLMs themselves to evaluate outputs based on detailed criteria.
    e. BERTScore - Semantic similarity using contextualized embeddings
    f. BLEURT - BERT-based learned evaluation metric
    g. SacreBLEU - Standardized BLEU with proper tokenization
    h. METEOR - Synonym and paraphrase consideration
    i. CIDEr - Consensus-based evaluation
    j. CHRF - Character-level F-score for multilingual evaluation
  4. Human-Correlation Metrics
    a. Preference-Based Ranking - Win/loss ratios in A/B testing
    b. Pearson/Spearman Correlation - Human judge alignment
    c. Likert Scale Rating Systems - Multi-point evaluation frameworks
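
To make the ranking metrics above concrete, here is a minimal plain-Python sketch of MRR and NDCG@K. The relevance lists are illustrative toy data, not from any benchmark.

```python
import math

def mrr(results: list[list[int]]) -> float:
    """Mean Reciprocal Rank: average of 1/rank of the first
    relevant document (relevance > 0) across queries."""
    total = 0.0
    for ranking in results:
        for i, rel in enumerate(ranking, start=1):
            if rel > 0:
                total += 1.0 / i
                break
    return total / len(results)

def ndcg_at_k(relevances: list[int], k: int) -> float:
    """NDCG@K with graded relevance: DCG of the ranking as retrieved,
    normalized by the DCG of the ideal (descending-sorted) ranking."""
    def dcg(rels: list[int]) -> float:
        return sum((2 ** rel - 1) / math.log2(i + 2) for i, rel in enumerate(rels))
    ideal = dcg(sorted(relevances, reverse=True)[:k])
    return dcg(relevances[:k]) / ideal if ideal > 0 else 0.0

# Toy example: binary relevance per query for MRR, graded relevance for NDCG.
print(mrr([[0, 0, 1, 0], [1, 0, 0, 0]]))  # (1/3 + 1/1) / 2 = 0.666...
print(ndcg_at_k([3, 2, 0, 1], k=4))       # ranking close to ideal -> ~0.99
```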

2. RAG Evaluation Frameworks and Libraries. Agentic Application Evaluation

  1. RAG System assessment and quality control:
    a. Arize Phoenix
    b. LangWatch
    c. LangFuse
    d. Galileo
    e. Ragas
    f. DeepEval
    g. TruLens
    h. HuggingFace
    i. AI Foundry
  2. Agentic Application Evaluation
    a. General Agentic Application Evaluations
    b. Monitoring
    c. Trajectory Evaluation
    d. Structure of the Evaluation
    e. Application Improvements using G-Eval (LLM-as-a-Judge). See the sketch below.
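
A minimal sketch of the G-Eval / LLM-as-a-Judge idea referenced above. `call_llm` is a hypothetical stand-in for your model client (OpenAI, Azure AI Foundry, etc.); the criteria and the 1-5 scale are illustrative.

```python
# Minimal LLM-as-a-judge (G-Eval-style) sketch. `call_llm` is a
# hypothetical callable that takes a prompt and returns the model's text.
JUDGE_PROMPT = """You are evaluating a RAG answer.
Criteria: faithfulness to the retrieved context and relevance to the question.
Question: {question}
Context: {context}
Answer: {answer}
Return only an integer score from 1 (poor) to 5 (excellent)."""

def judge(question: str, context: str, answer: str, call_llm) -> int:
    raw = call_llm(JUDGE_PROMPT.format(question=question, context=context, answer=answer))
    try:
        score = int(raw.strip())
    except ValueError:
        raise ValueError(f"Judge returned a non-numeric verdict: {raw!r}")
    if not 1 <= score <= 5:
        raise ValueError(f"Score out of range: {score}")
    return score
```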

3. Advanced ML Architecture & Training

  1. Neural Network Fundamentals
    a. ReLU vs Advanced Activations (GELU, Swish/SiLU)
    b. Layer Normalization vs Batch Normalization - Training stability techniques
    c. Gradient Clipping - Exploding gradient prevention
    d. Mixed Precision Training - FP16/BF16 memory optimization
  2. CNN Advanced Concepts
    a. Kernel Size Impact - Local vs global feature extraction (3x3 vs 7x7)
    b. Parameter Sharing Benefits - Translation invariance principles
    c. Hierarchical Feature Learning - Low-level to high-level progression
    d. CNN vs MLP Scalability - O(k×c×f) vs O(n×m) parameter complexity
  3. Advanced Training Techniques
    a. Learning Rate Scheduling - Cosine annealing, linear decay (see the sketch after this list)
    b. Warmup Steps - Training stability (typically 10% of total steps)
    c. Checkpoint Averaging - Model stability improvement
    d. Gradient Accumulation - Simulating larger batch sizes
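
A minimal sketch of the warmup + cosine-annealing schedule described above. `base_lr` and the 10% warmup fraction are illustrative defaults, not prescriptions.

```python
import math

def lr_at_step(step: int, total_steps: int, base_lr: float = 3e-4,
               warmup_frac: float = 0.10) -> float:
    """Linear warmup (here 10% of total steps, as above) followed by
    cosine annealing down to zero."""
    warmup_steps = int(total_steps * warmup_frac)
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))

# Sanity check: LR ramps up, peaks at the end of warmup, then decays.
print([round(lr_at_step(s, 100), 6) for s in (0, 9, 10, 55, 99)])
```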
SFT = What you're doing (the training objective/paradigm)
  - Training on prompt-response pairs in a supervised manner
  - The goal is to teach the model to follow instructions or perform specific tasks

PeFT/LoRA = How you're doing it (the training technique/method)
  - A more efficient way to update model weights
  - Instead of updating all billions of parameters, you only update a small subset or add small adapter layers

PeFT is orthogonal to the training objective.

Training Objective (WHAT):        Implementation Method (HOW):
├─ Pretraining                   ├─ Full fine-tuning (update all params)
├─ Supervised Fine-Tuning (SFT)  └─ PeFT (LoRA, QLoRA, etc.)
├─ RLHF/Preference Tuning             └─ Only update small adapters
└─ Continued Pretraining

PeFT/LoRA/QLoRA can be used in:
  * Stage 2 - Supervised Fine-Tuning. You can use PeFT/LoRA when doing supervised fine-tuning with prompt→hand-written answer pairs
  * Stage 3 - Reinforcement Learning. RLHF/preference tuning. You can use PeFT/LoRA during RLHF when training with human preferences
  * Stage 4 - Fine-tuning. Additional fine-tuning. This general fine-tuning step (creating the fine-tuned base LLM) could also use PeFT/LoRA.
  * Stage 1 - Even continued pretraining.
  1. LoRA (Low-Rank Adaptation). A configuration sketch follows this list.
    a. Rank Parameter (r) - 8-64 range, efficiency vs capacity trade-off
    b. Alpha Scaling Factor - Typically 16-32
    c. Target Module Selection - Query, value, key, output projections
    d. AdaLoRA - Adaptive rank allocation
    e. QLoRA - 4-bit quantized LoRA for memory efficiency
  2. Training Parameters
    a. Learning Rate Ranges - 1e-5 to 5e-4 for LLMs with warmup
    b. Batch Size Optimization - 8-32 full fine-tuning, 64-128 LoRA
    c. Sequence Length Limits - 512-4096 tokens task dependency
    d. Weight Decay (L2 Regularization) - λ||w||² with λ = 1e-4 to 1e-2
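
A minimal configuration sketch of the LoRA parameters above, using Hugging Face's peft library (assumes `pip install peft transformers`). The base model and target modules are illustrative; pick modules that actually exist in your architecture.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder model

lora_config = LoraConfig(
    r=16,                       # rank: 8-64 efficiency vs capacity trade-off
    lora_alpha=32,              # scaling factor, typically 16-32
    target_modules=["c_attn"],  # fused query/key/value projection in GPT-2
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

peft_model = get_peft_model(model, lora_config)
peft_model.print_trainable_parameters()  # only a tiny fraction is trainable
```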

5. Advanced Retrieval & Re-ranking

  1. Re-ranking Algorithms
    a. Reciprocal Rank Fusion (RRF) - RRF_score = Σ(1/(k + rank_i)). See the sketch after this list.
    b. Cross-encoder vs Bi-encoder - Accuracy vs speed trade-offs
    c. Neural Re-rankers - BERT/T5-based cross-attention models
    d. Learning to Rank (LTR) - ML-based ranking optimization
    e. Score Normalization Techniques - Min-max, z-score, sigmoid

  2. Advanced Retrieval Concepts
    a. Semantic Similarity Scoring - Cosine similarity between embeddings
    b. Context Preservation - Chunk coherence maintenance
    c. Window Size Optimization - Re-ranking candidate selection (100-1000)
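
A minimal sketch of Reciprocal Rank Fusion as defined above, fusing a keyword (BM25) ranking with a vector-search ranking. The document IDs are illustrative; k=60 is the commonly used constant.

```python
from collections import defaultdict

def rrf(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """RRF_score(doc) = sum over rankers of 1 / (k + rank_in_that_ranker)."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

bm25 = ["doc_a", "doc_b", "doc_c"]
vector = ["doc_c", "doc_a", "doc_d"]
print(rrf([bm25, vector]))  # docs ranked well by both lists rise to the top
```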

6. MLOps & Production Platforms

  1. Evaluation Platforms
    a. AI Foundry (Microsoft) - Model testing and evaluation
    b. Weights & Biases (W&B) - Experiment tracking
    c. Neptune.ai - MLOps platform capabilities
    d. LangSmith (LangChain) - LLM application testing
    e. Phoenix (Arize AI) - LLM observability and evaluation

  2. Model Management
    a. MLFlow - Model lifecycle management
    b. DVC (Data Version Control) - Data and model versioning
    c. BentoML - Model serving framework architecture

7. Advanced LLM Frameworks. LangChain. LangGraph. Semantic Kernel.


  1. LangChain. Demo examples with pipelines
    a. LangChain using Golang (In Progress)
  2. LangGraph Basics. A minimal StateGraph sketch follows this section's list.
    a. When to Use What (Decision Framework)
    b. Core LangGraph Primitives: StateGraph & MessageGraph, Compilation model, Checkpointers, Thread/Run concepts
    c. Graph Execution Model: how LangGraph executes iteratively. StateGraph & MessageGraph
    d. LangGraph Checkpointers: MemorySaver, SqliteSaver, PostgresSaver
    e. LangGraph composition: START, END, Conditional Edge. Parallel node execution. Cycle limit (recursion_limit), infinite loops
    f. Subgraphs & Composition: when to use subgraphs vs separate graphs
    g. Error Handling & Interrupts (Critical for production)
  3. LangGraph. Patterns. Examples
    a. ReAct. Using LangGraph. In Progress
    b. ReAct. Simple LangGraph Prototype
    c. ReAct. Pre-coded loop + LLM to calculate the total weight of dogs
  4. LangGraph Advanced Topics
    a. State Management - Persistent conversation state
    b. Graph Architecture - Nodes and edges for complex workflows
    c. Conditional Routing - Dynamic flow based on LLM decisions
    d. Human-in-the-Loop - Approval gates and manual interventions
    e. Parallel Processing - Concurrent graph branch execution
  5. LangGraph Examples and Prototypes
  6. LangGraph System Prompt Techniques
    a. Decision-Tree Prompts & Pattern
    b. Multi-Agent Prompt & Pattern. Primitive Version
    c. Plan-Execute Prompt & Pattern
    d. ReAct. Prompts & Ideas
    e. Prompt-Reflection Pattern. Idea
  7. Semantic Kernel (Microsoft)
    a. Kernel Architecture - Central orchestration engine
    b. Plugin System - Reusable functions (native C# or prompt-based)
    c. Planners - Automatic workflow generation
    d. Memory Management - Vector-based semantic memory patterns
  8. Magentic One + Semantic Kernel
  9. Advanced Framework Concepts
    a. Multi-Agent Systems - Collaborative AI agent coordination
    b. Error Recovery Strategies - Retry logic, fallback mechanisms
    c. Async Execution - Resource management at scale
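
A minimal StateGraph sketch covering the primitives listed above: typed state, nodes, START/END edges, a conditional edge, a MemorySaver checkpointer, and thread config. Assumes `pip install langgraph`; the nodes are stubs and the routing rule is illustrative.

```python
from typing import TypedDict
from langgraph.graph import StateGraph, START, END
from langgraph.checkpoint.memory import MemorySaver

class State(TypedDict):
    question: str
    answer: str

def retrieve(state: State) -> dict:
    # Stub retrieval node: returns a partial state update.
    return {"answer": f"context for: {state['question']}"}

def generate(state: State) -> dict:
    return {"answer": state["answer"] + " -> final answer"}

def needs_retrieval(state: State) -> str:
    # Conditional routing: skip retrieval for trivial questions (toy rule).
    return "retrieve" if len(state["question"]) > 10 else "generate"

builder = StateGraph(State)
builder.add_node("retrieve", retrieve)
builder.add_node("generate", generate)
builder.add_conditional_edges(START, needs_retrieval)
builder.add_edge("retrieve", "generate")
builder.add_edge("generate", END)

graph = builder.compile(checkpointer=MemorySaver())
config = {"configurable": {"thread_id": "demo"}}  # thread/run concept
print(graph.invoke({"question": "what is RRF in hybrid search?", "answer": ""}, config))
```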

8. Structured Output & Schema Design

  1. Pydantic Advanced Usage. See the sketch after this list.
    a. Field Validation - Custom validators, constraints (min/max, regex)
    b. JSON Schema Generation - Automatic API documentation
    c. Error Handling - Detailed validation error message design
    d. Schema Compliance Monitoring - Production tracking metrics
  2. Best Practices
    a. Schema Complexity vs Success Rates - Optimization strategies
    b. Retry Logic Implementation - Parse failure handling
    c. Validation Feedback Loops - Error correction workflows
  3. Structured Output & outlines
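
A minimal sketch of the practices above, assuming Pydantic v2: field constraints, a custom validator, automatic JSON schema generation, and a parse-retry loop. `call_llm` and the `Invoice` schema are hypothetical.

```python
import json
from pydantic import BaseModel, Field, ValidationError, field_validator

class Invoice(BaseModel):
    invoice_id: str = Field(pattern=r"^INV-\d{4}$")  # regex constraint
    total: float = Field(ge=0)                       # min constraint
    currency: str

    @field_validator("currency")
    @classmethod
    def currency_is_iso(cls, v: str) -> str:
        if len(v) != 3 or not v.isupper():
            raise ValueError("currency must be a 3-letter ISO code like USD")
        return v

print(json.dumps(Invoice.model_json_schema(), indent=2))  # auto JSON schema

def parse_with_retries(call_llm, prompt: str, max_attempts: int = 3) -> Invoice:
    """Feed validation errors back to the model until the output parses."""
    for _ in range(max_attempts):
        raw = call_llm(prompt)
        try:
            return Invoice.model_validate_json(raw)
        except ValidationError as err:
            prompt += f"\nYour previous output was invalid: {err}. Return valid JSON."
    raise RuntimeError("Model never produced schema-compliant output")
```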

9. Massive Parallel Training (Enterprise Scale)

  1. Distributed Training Strategies
    a. Data Parallelism - Batch distribution across GPUs (see the DDP sketch after this list)
    b. Model Parallelism - Layer splitting across devices
    c. Pipeline Parallelism - Sequential processing stages
    d. Gradient Synchronization - AllReduce, parameter servers
    e. Mixed Precision Training - Memory efficiency optimization
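
A minimal data-parallelism sketch with PyTorch DistributedDataParallel: one process per GPU, gradients synchronized via AllReduce on backward(). The model and batch are toy placeholders; launch with `torchrun --nproc_per_node=<gpus> script.py`.

```python
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="nccl")      # one process per GPU
    local_rank = int(os.environ["LOCAL_RANK"])   # set by torchrun
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(128, 10).cuda(local_rank)
    ddp_model = DDP(model, device_ids=[local_rank])
    optimizer = torch.optim.AdamW(ddp_model.parameters(), lr=1e-4)

    # Each rank should see a different shard of the data (use a real
    # DataLoader with DistributedSampler); backward() triggers AllReduce.
    inputs = torch.randn(32, 128, device=local_rank)
    targets = torch.randint(0, 10, (32,), device=local_rank)
    loss = torch.nn.functional.cross_entropy(ddp_model(inputs), targets)
    loss.backward()
    optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```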

10. Advanced Overfitting Prevention

  1. Regularization Techniques
    a. Early Stopping - Validation loss plateau detection (see the sketch after this list)
    b. Dropout Rates - 0.1-0.3 optimal ranges
    c. Training/Validation Loss Curves - Overfitting gap analysis
    d. Cross-validation Strategies - 5-10 fold robust evaluation
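
A minimal early-stopping sketch implementing plateau detection on validation loss; the patience and min_delta values are illustrative.

```python
class EarlyStopping:
    """Stop when validation loss has not improved by at least
    `min_delta` for `patience` consecutive epochs."""
    def __init__(self, patience: int = 5, min_delta: float = 1e-4):
        self.patience = patience
        self.min_delta = min_delta
        self.best = float("inf")
        self.stale_epochs = 0

    def should_stop(self, val_loss: float) -> bool:
        if val_loss < self.best - self.min_delta:
            self.best = val_loss
            self.stale_epochs = 0
        else:
            self.stale_epochs += 1
        return self.stale_epochs >= self.patience

stopper = EarlyStopping(patience=3)
for epoch, val_loss in enumerate([0.9, 0.7, 0.69, 0.70, 0.71, 0.72]):
    if stopper.should_stop(val_loss):
        print(f"Stopping at epoch {epoch}: plateau detected")
        break
```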
  1. Logits Masking (Text which adheres to style and rules, without regeneration)
  2. Grammar
  3. Style Transfer
  4. Reverse Neutralization
  5. Content Optimization
  6. Indexing. RAPTOR. Hierarchical Indexing
Homomorphic Encryption

  1. Main Topics. What, Why, How
  2. Q&A. Quick references
    a. Whether or not. Use Cases
    b. How to Start
    c. Parameters
    d. Performance and Trade-offs
  3. Examples
    a. TenSeal, Concrete-ML, Microsoft SEAL (a TenSEAL sketch follows)
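
A minimal homomorphic-encryption sketch using TenSEAL's CKKS scheme (assumes `pip install tenseal`). The encryption parameters follow TenSEAL's public examples; the vectors are toy data.

```python
import tenseal as ts

context = ts.context(
    ts.SCHEME_TYPE.CKKS,
    poly_modulus_degree=8192,
    coeff_mod_bit_sizes=[60, 40, 40, 60],
)
context.global_scale = 2**40
context.generate_galois_keys()

# Encrypt two vectors and compute directly on the ciphertexts.
enc_a = ts.ckks_vector(context, [1.0, 2.0, 3.0])
enc_b = ts.ckks_vector(context, [10.0, 20.0, 30.0])
enc_sum = enc_a + enc_b      # element-wise add, still encrypted
enc_dot = enc_a.dot(enc_b)   # encrypted dot product

print(enc_sum.decrypt())  # ~[11.0, 22.0, 33.0] (CKKS is approximate)
print(enc_dot.decrypt())  # ~[140.0]
```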

AI System monitoring, evaluation and tracking. LangWatch vs LangFuse vs Phoenix vs AI Foundry

  1. LangWatch vs LangFuse vs AI Foundry comparison
  2. Examples. Arize Phoenix for Python
    a. Phoenix. Trajectory analysis
    b. Phoenix. Evaluations
    c. Phoenix. Tool and Span tracing
  3. Examples. Arize Phoenix for LangGraph (In progress)

Advanced Topics. Practice. Semantic Kernel Knowledge base

  1. Semantic kernel and AI Assistant
  2. Creative Writing Assistant with Semantic Kernel and .Net Aspire

Advanced Topics. Practice. SemanticKernel.

  1. Initial Example
  2. Interactive Chat with Chat History
  3. Model Switching. Hugging Face
  4. Semantic Function for Conversational Chat
  5. Semantic Kernel Pipeline

RAG. Cheatsheet for .Net


