- GPU kernel optimization and low-level performance engineering
- Model quantization and precision-efficient inference
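As a flavor of the quantization work listed above, here is a minimal, hypothetical sketch of symmetric per-tensor int8 quantization (not code from any pinned repository): weights are mapped to int8 via a single scale, and dequantization recovers them to within half a quantization step.

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Symmetric per-tensor int8 quantization: w ~= scale * q."""
    scale = np.abs(w).max() / 127.0          # one scale for the whole tensor
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximate float32 tensor from int8 codes."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.27, 0.003, 1.0], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# reconstruction error is bounded by half a quantization step (scale / 2)
assert np.max(np.abs(w - w_hat)) <= s / 2 + 1e-8
```

Per-channel scales and asymmetric zero-points are common refinements of this per-tensor scheme, trading a little metadata for lower error.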
GPU poor guy | MLSys | Open to work
- Boston (UTC -05:00)
- https://ramshankar07.github.io/portfoliov3/index.html
- in/ramshankarb
- https://ramshankar07.substack.com/
Pinned
- CUDA-llama3.1-inference — Public. A CUDA implementation of the Llama 3.1 open models. (Cuda, 2 stars)
- qwen600-ROCm-inference — Public, forked from yassa9/qwen600. A static, suckless, single-batch Qwen3-0.6B mini inference engine. (C++, 1 star)
- Fintech-Data-Processing-ETL-Platform — Public. Assignment 02 for the DAMG7245 coursework (Spring 2025). (Jupyter Notebook)
- Parallelizing-Text-to-Image-Generation — Public. Explores the feasibility and performance characteristics of using multiple CPUs versus GPUs for preprocessing tasks in text-to-image generation pipelines, comparing speedup, efficiency, and cost to id… (Jupyter Notebook)