Skip to content
View malikrohail's full-sized avatar

Highlights

  • Pro

Block or report malikrohail

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
malikrohail/README.md

Hi, I’m Malik 👋

AI/ML Software Engineer • LLM Systems • Real-time Inference • Mobile + Cloud

I build production-grade AI systems end-to-end — from model training & optimization to inference APIs, mobile apps, and cloud deployment.

LinkedIn Email


🚀 About Me

  • B.A. Computer Science & Mathematics — Bennington College (May 2025)
  • Focus: Applied ML, AI Infrastructure, Full-Stack Engineering
  • Interests: LLM systems (RAG/agents), multimodal AI, speech/vision, mobile-first AI products, scalable platforms

🧩 What I Build

  • LLM-powered products: RAG, agents, tool-calling, embeddings, evals
  • Real-time inference APIs: streaming speech/text, low-latency pipelines
  • Mobile + web apps: AI-native UX, voice/chat/search flows
  • Optimization: latency/cost tuning, batching, quantization, GPU/CPU tradeoffs
  • Deployment: serverless, containers, GPU-backed services, CI/CD

🛠️ Tech Stack

Languages
Python TypeScript JavaScript Rust C++

ML / AI
PyTorch TensorFlow HuggingFace OpenCV

LLM Systems / Data
Postgres pgvector Redis Pinecone Weaviate

Frontend / Mobile
React Next.js React Native Expo

Cloud / DevOps
Docker Kubernetes AWS GCP Terraform GitHub Actions


⭐ Featured Project

🗣️ Speech-to-Text & Speaker Intelligence Platform

Production-ready voice AI system combining:

  • Whisper ASR + speaker diarization + speaker verification (embeddings)
  • Streaming + batch inference, real-time APIs for web/mobile clients
  • Containerized GPU deployment with autoscaling
  • Lower cost + faster inference via quantization & mixed precision

🧠 Also Built: Spec-Driven Form / Document Engine (Excel → UI → PDF → XML)

I built a spec-driven rendering engine that turns an Excel-based schema into a dynamic, validated UI and exports PDF + MISMO/XML.

What it does

  • Parses an Excel spec (UID/xPath bindings, containers, enums/formats, cardinality, rules)
  • Generates a Section Tree + Field Registry to drive the UI
  • Uses a Rule Engine (required/visible/validate) enforced at runtime
  • Centralized XMLStore as the “source of truth”
  • React Context + hooks for read/write bindings across inputs
  • PDF renderer mirrors the same UI state
  • XML builder exports MISMO nodes while honoring R/CR + repeatable sections

🏆 Hackathons

🥈 2nd Place — MIT BTT-AI AJL 2025 (Kaggle)

  • Built dermatology models fair across Fitzpatrick I–VI
  • Used group-aware sampling, reweighted losses, calibration + ensembling
  • Placed 2nd / 300+ teams

Pinned Loading

  1. Chat-App Chat-App Public

    JavaScript

  2. Rusty-News Rusty-News Public

    Developed a Rust-based web application that fetches and displays real-time news articles from NewsAPI. The project features secure API key handling, smooth JSON data flow, and provides advanced art…

    Rust

  3. travelitinerary travelitinerary Public

    Python 1

  4. crypto-forecast crypto-forecast Public

    Developing a website that forecasts profits based on standardized data sets

    Python

  5. gameoflife gameoflife Public

    JavaScript

  6. Python-File-Mover Python-File-Mover Public

    Forked from AustinCGomez/Open-File-Mover-CLI

    A simple command line utility to move massive amounts of files on Windows PC

    Python