Welcome to my GitHub profile!
I'm Avinash Pawar, a Software Development Engineer & Data Engineer from Maharashtra, India
,
with a Master's in Data Science from
Indiana University, USA .
I build scalable backend services, high-throughput data pipelines, and ML-powered applications. I’m passionate about using **Python, SQL, AWS**, and modern data/ML tools to turn complex problems into reliable, production-ready solutions.
- Languages: Python, SQL, C++, JavaScript, Java, Shell
- Backend & APIs: Flask, Django, FastAPI, Node.js, REST APIs
- Databases: PostgreSQL, MySQL, SQL Server, MongoDB, Snowflake
- Data & ML: Pandas, NumPy, PyTorch, Scikit-learn, LightGBM
- Cloud: AWS (S3, EC2, RDS, Lambda, CloudWatch), GCP
- DevOps: GitHub/GitLab, Jenkins, Docker, CI/CD pipelines
- PragyaYantra – Generative AI Mini-Apps Multi-module web app (Text, Chat, Attachments, Docs, Code) using Google Gemini API and LangChain. View on GitHub
- Data Pipeline Automation Python + Apache Airflow + AWS S3/RDS pipeline processing 4TB/day with 60% faster execution and fault-tolerant recovery.
- Anomaly Detection in ETL PyTorch & Scikit-learn models for anomaly detection in data ingestion workflows, improving early error detection by 30%.
If you'd like to collaborate or discuss opportunities, feel free to reach out: