Generative AI (GenAI) is a branch of artificial intelligence that focuses on creating new, original content. Instead of just analyzing or predicting outcomes, it generates text, images, music, video, or even code.
-
Example:
- ChatGPT โ generates human-like text.
- MidJourney/DALLยทE โ generates images.
- GitHub Copilot โ generates code.
The key strength of GenAI is its ability to learn from vast datasets and then produce creative outputs that mimic human intelligence.
| Term | Meaning | Example |
|---|---|---|
| AI (Artificial Intelligence) | The broad field of machines simulating human intelligence. | Self-driving cars, chess-playing bots |
| ML (Machine Learning) | Subset of AI where systems learn from data and improve over time. | Spam email filter |
| DL (Deep Learning) | Subset of ML using neural networks with multiple layers to process complex data. | Face recognition, speech-to-text |
| Generative AI | Subset of DL/ML where models create new data/content based on what they learned. | ChatGPT (text), DALLยทE (images) |
๐ Think of it as:
AI = Big umbrella
ML = Learning from data
DL = Complex ML with deep neural networks
Generative AI = AI that creates
Large Language Models (LLMs) like ChatGPT (OpenAI) or LLaMA 3 (Meta) follow a multi-step training process:
-
Pretraining
- The model is fed with massive amounts of raw text data (books, articles, websites, code).
- Goal: Learn grammar, facts, and reasoning.
- Technique: Self-supervised learning (predict the next word in a sentence).
-
Fine-tuning
- Model is further trained on high-quality, domain-specific data.
- Helps adapt the model for real-world applications.
-
RLHF (Reinforcement Learning with Human Feedback)
- Human reviewers rank model responses.
- AI learns what's helpful, safe, and human-like.
-
Deployment
- Optimized for speed, scalability, and efficiency (through quantization, distributed GPUs, etc.).
๐ Difference between ChatGPT & LLaMA 3:
- ChatGPT โ Trained on massive private datasets, optimized for conversation.
- LLaMA 3 โ Open-source model, lighter but highly efficient, designed for research & customization.
- 2017 โ Introduction of Transformers (Google's "Attention is All You Need").
- 2018 โ BERT (Bidirectional Encoder Representations from Transformers) โ focused on understanding.
- 2020 โ GPT-3 โ true generative power with 175B parameters.
- 2022 โ ChatGPT (GPT-3.5) โ made conversational AI mainstream.
- 2023 โ GPT-4, LLaMA 2, Claude, PaLM โ more efficient, safer, multimodal.
- 2024โ2025 โ LLaMA 3, GPT-4.1/5, Gemini, Mistral โ smaller, faster, open-source models dominating research & industry.
๐ Trend: Models are becoming smarter, safer, multimodal (text+image+audio), and more open-source.
(Reference: artificialanalysis.ai)
| Model | Developer | Strengths | Use Case |
|---|---|---|---|
| GPT-4/ChatGPT | OpenAI | Best reasoning, creative writing, coding | Business, research, education |
| LLaMA 3 | Meta | Open-source, lightweight, fine-tuning friendly | Research, startups, custom apps |
| Claude | Anthropic | Strong safety & long context window | Enterprises, compliance |
| Gemini (Google) | Google DeepMind | Multimodal (text + image + video) | Search, content, enterprise |
| Mistral / Mixtral | Mistral AI | Efficient, small models with high performance | On-device AI, startups |
| Grok (xAI/Elon Musk) | xAI | Integrated with X (Twitter) | Social + conversational AI |
This repository contains comprehensive implementations and documentation for various Generative AI concepts. Each topic has been organized into separate folders with detailed README files:
Comprehensive guide to the LangChain framework, including:
- Framework overview and ecosystem
- Integration capabilities with various LLM providers (OpenAI, Ollama, Hugging Face)
- Core components: chains, agents, memory, prompts
- Complete RAG implementation pipeline
- Data processing: ingestion, transformation, embeddings
- Vector databases: FAISS, Chroma, Pinecone
- Advanced search techniques and hybrid search
- Practical implementations and production-ready examples
LangChain Expression Language project featuring:
- Simple translation application implementation
- Multi-LLM integration (OpenAI, Groq)
- Declarative workflow composition
- Production-ready patterns and examples
Graph database integration with LangChain featuring:
- Natural language to Cypher query translation
- Neo4j integration with LangChain's GraphCypherQAChain
- Movie database with actors, directors, and genres
- Few-shot prompting for improved query accuracy
- Integration with Groq's Gemma2-9b-It model
- Support for complex graph queries and relationships
Advanced machine learning techniques for model customization:
- Custom dataset preparation for Q&A tasks
- Model finetuning with configurable hyperparameters
- LoRA (Low-Rank Adaptation) implementation
- QLoRA (Quantized LoRA) support
- Quantization techniques for memory optimization
- Secure API key management using environment variables
- Support for various optimization strategies
Enterprise-grade Generative AI applications on AWS Cloud:
- AWS Bedrock integration with Claude, Llama2, and Stable Diffusion XL
- RAG-based PDF document analysis and querying
- Multi-modal AI capabilities (text and image generation)
- Enterprise security with AWS IAM and VPC integration
- Scalable cloud-native architecture for production
- Cost optimization strategies and performance monitoring
- Comprehensive AWS services integration (S3, Lambda, CloudWatch)
Advanced graph-based AI applications with stateful workflows:
- Multi-actor systems with coordinated AI agents
- Graph-based reasoning and multi-hop query capabilities
- Neo4j integration with LangChain's GraphCypherQAChain
- Stateful applications with conversation memory
- Complex relationship analysis and network insights
- Interactive graph exploration and visualization
- Production-ready graph database applications
- Simple_LLm_LCEL Repository: GitHub Link
- LLM Model Analysis: artificialanalysis.ai
- Text to Math Problem Solver: AI-powered mathematical reasoning with step-by-step solutions
- Multi-modal Intelligence: Combines calculation with Wikipedia research for comprehensive answers
- Interactive Chat Interface: User-friendly mathematical problem-solving experience
- PDF Query with Astra DB: Enterprise-grade document querying using Cassandra vector store
- Relevance Scoring: Intelligent document ranking with accuracy measurement
- Scalable Architecture: Cloud-native database integration for production environments
- Multi-language Code Assistant: Cross-programming language AI assistance
- Code Generation & Analysis: Real-time programming support and optimization
- Language-specific Templates: Tailored assistance for different programming paradigms
This project leverages a comprehensive stack of modern AI and machine learning technologies to build robust Generative AI applications:
This section showcases practical implementations and projects that demonstrate the technologies and concepts covered in this repository.
| Name | Description | Tech Stack | Link |
|---|---|---|---|
| Q&A Chatbot | Enhanced Q&A chatbot with switchable model backends (OpenAI/Ollama) using LangChain. Features Streamlit UI, configurable parameters, and LangSmith tracing for observability. | Q-A-CHATBOT | |
| Simple LLM LCEL | Language translation application using LangChain Expression Language. Demonstrates declarative workflow composition with multi-LLM integration (OpenAI, Groq) and production-ready patterns. | Simple_LLm_LCEL | |
| RAG Document Q&A | Interactive RAG app for querying research papers (PDFs) using FAISS vector search and Groq-hosted Llama 3.1 model. Features document embedding, similarity search, and conversational Q&A. | RAG-Document-Q-A | |
| RAG Document Q&A2 | Advanced RAG implementation with HuggingFace embeddings, Chroma vector store, and conversation history. Features PDF upload, chunking, and multi-turn dialogue with context-aware retrieval. | RAG-Document-Q-A2 | |
| Chatbot Search Engine | Intelligent chatbot with search engine capabilities combining conversational AI with web search functionality for comprehensive information retrieval and response generation. | chatbot_SearchEngine | |
| LangChain SQL DB Chat Agent | Intelligent conversational interface for querying SQL databases using natural language. Features AI-powered SQL generation, multi-database support (SQLite/MySQL), and real-time query execution. | langchain_with_sqldb | |
| Text Summarizer LangChain | Advanced text summarization application using LangChain with chunking strategies, prompt templates, and multiple AI models. Features customizable summarization parameters and batch processing capabilities. | TextSummarizer_langchain | |
| YouTube URL Summarizer | Intelligent YouTube video summarization tool that extracts and summarizes video content using LangChain. Features URL processing, content extraction, and AI-powered summarization with customizable output formats. | Youtube-Website-Url-Summarizer | |
| Text to Math Problem Solver | AI-powered application that transforms complex mathematical word problems into step-by-step solutions using Google's Gemma 2 model. Features multi-modal intelligence, Wikipedia research integration, and interactive chat interface. | Text_TO_Math | |
| PDF Query with Astra DB | Advanced RAG system for querying PDF documents using Cassandra Astra DB vector store. Features OpenAI embeddings, character text splitting, and intelligent document retrieval with relevance scoring. | PDFQuery_LAngchain_Astradb | |
| Multi-language Code Assistant | Intelligent code assistant supporting multiple programming languages with AI-powered code generation, debugging, and optimization. Features language-specific templates and real-time code analysis. | multi_language_code-assisstant | |
| NVIDIA NIM Document Q&A | Advanced document Q&A system using NVIDIA NIM inference services with Llama models. Features optimized RAG pipeline, real-time document processing, and scalable architecture for enterprise document analysis. | NVIDIA-NIM-Document-Q-A-System | |
| Multi-AI Agent CrewAI | Collaborative AI agent system using CrewAI framework for complex task orchestration. Features multi-agent coordination, specialized role assignments, and intelligent workflow management for enterprise automation. | Multi_AIAgent_Crewai | |
| AWS Bedrock GenAI Project | Comprehensive AWS Bedrock application showcasing RAG-based PDF chat, text generation with Claude/Llama2, and image generation with Stable Diffusion XL. Features multi-model architecture, enterprise-grade security, and scalable cloud-native design. | Genai_Aws_BedRock | |
| LangGraph Neo4j Integration | Advanced graph-based AI application using LangGraph and Neo4j for stateful multi-actor systems. Features graph-based reasoning, multi-hop queries, and complex relationship analysis with movie database integration. | langchain_grapghdb |
This section showcases projects specifically built with Google's Gemini AI model, demonstrating advanced conversational AI capabilities and modern web application development.
| Name | Description | Tech Stack | Live Demo | Repository |
|---|---|---|---|---|
| Gemini AI Chat Application | Modern, responsive web application integrating Google's Gemini AI for intelligent conversations with image analysis capabilities. Features beautiful gradient UI, real-time typing animations, and glassmorphism effects. | ๐ Live Demo | Q-A-chatbot-with-gemini | |
| Gemini Pro Q&A with Chat History | Interactive Streamlit web application providing real-time Q&A with Gemini AI, featuring persistent conversation memory and clean responsive interface. Demonstrates advanced chat functionality with session management. | No Live Demo Available | Q-A_WITH_CHAT_HISTORY_GEMINI | |
| Multi-Language Invoice Extractor | AI-powered multilingual invoice data extraction tool using Google's Gemma AI models. Features multiple model options (Gemma 3 27B/12B/4B IT), up to 95% accuracy, and supports various languages for extracting invoice data like amounts, vendor names, and dates. | No Live Demo Available | MultiLanguage_Invoice_Extractor | |
| DocuAI | Advanced document analysis and Q&A system using Google Gemini AI with FAISS vector search. Features document upload, intelligent chunking, semantic search, and conversational AI for comprehensive document understanding and information retrieval. | No Live Demo Available | DocuAI | |
| Text to SQL Generator | Intelligent text-to-SQL conversion tool powered by Google Gemini AI. Transforms natural language queries into SQL statements with database schema understanding, query optimization, and error handling for efficient database interactions. | No Live Demo Available | Text_to-SQl | |
| ATS Resume Expert | Intelligent resume analysis platform powered by Google's Gemini AI for ATS optimization. Features comprehensive analysis with section-wise scoring, technical skills matching, experience level assessment, and professional enhancement recommendations. | ๐ Live Demo | ATS-Resume-Expert | |
| Calories Advisor | AI-powered nutrition and calorie tracking application using Google Gemini AI. Features intelligent meal analysis, calorie calculation, nutritional insights, and personalized dietary recommendations with modern React frontend and FastAPI backend. | ๐ Live Demo | calories-advisor | |
| YouTube Video Summarizer | Enhanced AI-powered YouTube video summarization tool using Google's Gemini AI. Features multiple summary types (bullet points, detailed, key insights, timeline), batch processing, sentiment analysis, keyword extraction, and comprehensive video metadata analysis. | No Live Demo Available | YouTube_Video_summarizer | |
| AI News Research Writing Crew | Multi-agent AI system using CrewAI framework with Google Gemini AI for automated news research and article writing. Features Senior Researcher and Writer agents that collaborate to uncover trends, analyze data, and generate compelling tech articles with real-time search integration. | No Live Demo Available | AI-News-Research-Writing-Crew |
- Udemy Course: Complete Generative AI Course with LangChain and HuggingFace
- Comprehensive course covering LangChain framework
- Hands-on projects with HuggingFace models
- Real-world applications and best practices
- Advanced techniques for production deployment
- LangChain Documentation: https://python.langchain.com/
- LangChain Expression Language (LCEL): https://python.langchain.com/docs/expression_language/
- LangGraph Documentation: https://langchain-ai.github.io/langgraph/
- OpenAI API Documentation: https://platform.openai.com/docs
- Hugging Face Documentation: https://huggingface.co/docs
- Google AI Studio (Gemini): https://ai.google.dev/
- Anthropic Claude API: https://docs.anthropic.com/
- Groq API Documentation: https://console.groq.com/docs
- FAISS Documentation: https://faiss.ai/
- Chroma Documentation: https://docs.trychroma.com/
- Pinecone Documentation: https://docs.pinecone.io/
- Astra DB Documentation: https://docs.datastax.com/en/astra/
- AWS Bedrock Documentation: https://docs.aws.amazon.com/bedrock/
- NVIDIA NIM Documentation: https://docs.nvidia.com/nim/
- Streamlit Documentation: https://docs.streamlit.io/
- FastAPI Documentation: https://fastapi.tiangolo.com/
- React Documentation: https://react.dev/
- Python Documentation: https://docs.python.org/
- Transformers Library: https://huggingface.co/docs/transformers
- CrewAI Documentation: https://docs.crewai.com/
- Neo4j Documentation: https://neo4j.com/docs/
- PyPDF Documentation: https://pypdf.readthedocs.io/