AI / ML Engineer working on LLMs, multimodal AI, agent systems, and large-scale data platforms.
Work includes fine-tuning, RLHF training, quantization, agent architectures, and distributed ML systems.
Currently building enterprise NLP and document AI systems at John Snow Labs.
Interested in distributed systems, AI agents, multimodal models, and unusual side projects.
AI Systems
- LLM fine-tuning (LoRA / QLoRA / PEFT)
- RLHF and reinforcement learning pipelines
- agent architectures and tool-using LLM systems
- RAG pipelines and knowledge retrieval
- multimodal AI (Vision-Language Models, OCR, document AI)
Infrastructure
- distributed GPU training
- Spark / big-data pipelines
- model quantization and inference optimization
- large-scale ML deployment
Other Systems
- blockchain analytics and DeFi intelligence
- AI automation pipelines
- agent-driven workflows and AI experiments
Projects
| Project | Role / Contribution |
|---|---|
| Dijkstra Labs | CTO — AI, Big Data, and distributed systems consulting |
| John Snow Labs Ecosystem | Core development across enterprise NLP / AI libraries |
| NLU Library | Creator and lead developer of a high-level ML/NLP abstraction layer |
| NLP Server | Creator of a scalable NLP inference server |
| Spark NLP for Finance | Lead development |
| Spark NLP for Legal | Lead development |
| Spark NLP Healthcare | Core contributions & development |
| Spark NLP | Core contributor |
| CODA – Cognitive Data Analytics Framework | Distributed big-data analytics platform |
Enterprise Multimodal Platforms
Large-scale NLP and vision ecosystems used in healthcare, finance, and enterprise AI.
- Spark NLP
- enterprise clinical NLP systems
- multimodal document AI pipelines
- large model inference infrastructure
WalletMarketCap
Blockchain analytics platform tracking trader behavior across chains.
Features include:
- wallet profit simulation
- MEV detection
- on-chain data pipelines
- DeFi trading analytics
AI Agents & Automation
Experimental systems combining LLM reasoning with tools and workflows.
Examples include:
- Text2SQL agents
- RAG assistants
- agentic coding workflows
- Telegram communities powered by LLM agents
- automated AI content pipelines
Talks & Training
- Data Science NLP Training — Big Data & NLP course
- Automated Text Generation & Data Augmentation
- Graph + AI Summit — Spark NLP + TigerGraph
- Python Web Dev Conf — NLU and 5000+ models
- Healthcare NLP Summit — Biomedical NLP models
- NYC/DC NLP Meetup — GPT-2 / T5 text generation
- NLP Summit — Spark NLP with NLU
Areas of Interest
- multimodal model benchmarking
- agentic coding workflows
- RLHF training methods
- document AI evaluation
- large-scale AI system architectures
Got a hard problem or a strange idea? Let's build it.





