reddy-nithin/README.md

Hi, I'm Nithin Songala 👋

Data Scientist · ML Engineer · Data & Business Analyst
MS in Data Science · University of Missouri–Kansas City · Graduating May 2026


🧠 About Me

I'm a graduate student specializing in Data Science with a passion for turning raw, messy data into decisions that matter. I build end-to-end data pipelines, machine learning applications, and analytical dashboards — from ingestion to insight.

  • 🎓 Graduating May 2026 with an MS in Data Science from UMKC
  • 🔭 Currently building AI-powered tools in the healthcare & regulatory space
  • 🌱 Exploring LLMs, RAG pipelines, and MLOps practices
  • 🎯 Actively seeking roles in Data Science · ML Engineering · Data/Business Analytics

🛠️ Tech Stack

Languages & Querying

Python SQL JavaScript HTML5 CSS3

Data Engineering & Cloud

Google BigQuery GitHub Actions ETL Pipelines Excel

Machine Learning & AI

Scikit-Learn FAISS BM25 Google Gemini RAG Jupyter

Visualization & BI

Tableau Streamlit


🚀 Featured Projects

RAG-powered drug label Q&A system using FDA data

  • Built a Retrieval-Augmented Generation (RAG) pipeline combining FAISS + BM25 hybrid search with Google Gemini LLM
  • Delivers evidence-based answers to drug label questions via a clean Streamlit interface
  • Foundation for a comprehensive pharmaceutical intelligence platform
  • Python FAISS BM25 Google Gemini Streamlit Jupyter
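
For readers curious how a hybrid of dense (FAISS) and keyword (BM25) retrieval can be merged into one result list, here is a minimal sketch using reciprocal rank fusion. This is an illustration only: the fusion method, the `k=60` constant, and the `label_*` document IDs are assumptions, not details taken from the project, which may combine scores differently.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default, not a value
    taken from this project.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical retriever outputs for a single query.
dense_hits = ["label_12", "label_07", "label_33"]   # e.g. from a FAISS index
sparse_hits = ["label_07", "label_41", "label_12"]  # e.g. from BM25

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
print(fused)
```

Rank-based fusion avoids having to normalize FAISS distances and BM25 scores onto a common scale, which is why it is a common starting point for hybrid search.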

Bank-grade regulatory risk reporting pipeline

  • End-to-end data pipeline covering ingestion, validation, reconciliation, and reporting
  • Interactive risk dashboards built in Streamlit for loan portfolio analysis
  • Demonstrates production-level data engineering patterns used in financial services
  • Python Streamlit Data Validation Risk Analytics
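
As a rough illustration of the validation and reconciliation stages, the sketch below rejects malformed loan records and then checks the remaining balances against an independent control total. The field names (`loan_id`, `balance`), the rules, and the tolerance are hypothetical and chosen for the example; they are not taken from the repository.

```python
from decimal import Decimal


def validate_loans(loans):
    """Split records into valid ones and (record, reason) rejects."""
    valid, rejected = [], []
    for loan in loans:
        if not loan.get("loan_id"):
            rejected.append((loan, "missing loan_id"))
        elif loan.get("balance") is None or loan["balance"] < 0:
            rejected.append((loan, "missing or negative balance"))
        else:
            valid.append(loan)
    return valid, rejected


def reconcile(loans, control_total, tolerance=Decimal("0.01")):
    """Compare the summed balance against an independent control total."""
    total = sum((loan["balance"] for loan in loans), Decimal("0"))
    difference = total - control_total
    return {
        "total": total,
        "difference": difference,
        "matched": abs(difference) <= tolerance,
    }


# Hypothetical input records; Decimal avoids float rounding on money.
loans = [
    {"loan_id": "L-001", "balance": Decimal("1200.50")},
    {"loan_id": "L-002", "balance": Decimal("-5.00")},
    {"loan_id": "", "balance": Decimal("300.00")},
    {"loan_id": "L-003", "balance": Decimal("799.50")},
]

valid, rejected = validate_loans(loans)
result = reconcile(valid, control_total=Decimal("2000.00"))
print(len(valid), len(rejected), result["matched"])
```

Keeping rejected records alongside a reason, rather than silently dropping them, makes it possible to explain any gap between the source system and the report.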

Automated ETL pipeline for Kansas City 311 service requests

  • Python ETL → Google BigQuery → Tableau dashboard, fully automated
  • Scheduled with GitHub Actions for hands-free, recurring data refresh
  • Demonstrates real-world data engineering + BI integration
  • Python BigQuery Tableau GitHub Actions ETL

Automated archival metadata extraction tool

  • Extracts, cleans, and organizes date/metadata from structured Excel files for library and archival use
  • Designed for real-world archival workflows at UMKC
  • Python Excel Data Cleaning Automation
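
To give a feel for the date-cleaning step, here is a small sketch that pulls a date out of free-text cell values and records how precise it is. The supported formats, the `extract_date` helper, and the sample strings are assumptions made for illustration; reading the Excel workbook itself is left out, and the actual tool may handle dates differently.

```python
import re
from datetime import date

MONTHS = {
    name: number
    for number, name in enumerate(
        ["january", "february", "march", "april", "may", "june", "july",
         "august", "september", "october", "november", "december"],
        start=1,
    )
}

ISO_DATE = re.compile(r"\b(\d{4})-(\d{2})-(\d{2})\b")
LONG_DATE = re.compile(r"\b([A-Za-z]+)\s+(\d{1,2}),\s*(\d{4})\b")
YEAR_ONLY = re.compile(r"\b(1[89]\d{2}|20\d{2})\b")


def extract_date(text):
    """Return (value, precision) for the first date found, else (None, None)."""
    match = ISO_DATE.search(text)
    if match:
        year, month, day = map(int, match.groups())
        try:
            return date(year, month, day).isoformat(), "day"
        except ValueError:
            pass  # e.g. month 13; fall through to looser patterns
    match = LONG_DATE.search(text)
    if match and match.group(1).lower() in MONTHS:
        month = MONTHS[match.group(1).lower()]
        try:
            full = date(int(match.group(3)), month, int(match.group(2)))
            return full.isoformat(), "day"
        except ValueError:
            pass  # e.g. February 30
    match = YEAR_ONLY.search(text)
    if match:
        return match.group(1), "year"
    return None, None


# Hypothetical cell values as they might appear in an archival spreadsheet.
for cell in ["Letter dated March 5, 1947, to the dean",
             "Photograph, circa 1923",
             "Received 2021-11-03",
             "undated"]:
    print(cell, "->", extract_date(cell))
```

Returning the precision alongside the value keeps a year-only record from being mistaken for a full date later in the workflow.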

📫 Let's Connect

I'm actively looking for full-time opportunities in Data Science, ML Engineering, Data Analytics, and Business Analytics starting May 2026. Feel free to reach out!

LinkedIn · Email · Portfolio

Pinned

  1. TruPharma-Clinical-Intelligence · Public

    A RAG application that answers drug-label questions using official FDA data with evidence-based responses. Built with Streamlit, FAISS, BM25, and Google Gemini LLM. Foundation for building a comprehe…

    Python 1 1

  2. TruPharma-MVP · Public

    A RAG application that answers drug-label questions using official FDA data with evidence-based responses. Built with Streamlit, FAISS, BM25, and Google Gemini LLM. Foundation for building a comprehe…

    Jupyter Notebook 1

  3. ReguCheck-Risk-Engine · Public

    End-to-end regulatory risk data pipeline demonstrating bank-grade loan risk reporting: data ingestion, validation, reconciliation, and interactive Streamlit dashboards.

    Python 1

  4. -311-KC-Dashboard · Public

    Automated data pipeline for Kansas City 311 service requests. Python ETL → Google BigQuery → Tableau dashboards. Scheduled via GitHub Actions.

    Python 1

  5. UMKC-Archives-data-extractor · Public

    This Python tool automates the process of extracting, cleaning, and organizing date-related information and metadata from structured Excel files. It is designed for use in archival, library, or his…

    Python 1