kimmaru/README.md

AI Engineer

Computer Vision & Machine Learning Specialist

Portfolio Email LinkedIn


About

AI Engineer focused on Computer Vision applications. Experienced in building end-to-end ML pipelines from research to production deployment.

Core Expertise: Deep Learning, Computer Vision, MLOps, Model Optimization


Tech Stack

Languages
Python, JavaScript, C++

ML/AI
PyTorch, TensorFlow, OpenCV, scikit-learn

DevOps
AWS, Docker, MLflow


Featured Projects

Vision Transformer for Sketch Classification

GitHub

Advanced classification system for 500-class sketch images using Vision Transformers with data-centric optimization.

| Metric | Baseline | Final | Improvement |
| --- | --- | --- | --- |
| Accuracy | 50.3% | 90.3% | +40.0pp |
| Training Speed | 1x | 4-6x | AMP Optimization |
| Memory Usage | 100% | 80% | Attention Freezing |

Tech: DeiT3, ViT, PyTorch, Label Smoothing, TTA, AMP
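
As a rough illustration of the label smoothing listed above, here is a minimal pure-Python sketch. It is not the project's training code: the function names and the smoothing factor `epsilon=0.1` are assumptions, and the targets follow the common convention of mixing the one-hot label with a uniform distribution over all classes.

```python
import math


def smoothed_targets(true_class: int, num_classes: int, epsilon: float = 0.1) -> list[float]:
    """Mix a one-hot label with a uniform distribution over all classes."""
    off_value = epsilon / num_classes
    targets = [off_value] * num_classes
    targets[true_class] += 1.0 - epsilon
    return targets


def cross_entropy(logits: list[float], targets: list[float]) -> float:
    """Cross-entropy between softmax(logits) and a soft target distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    log_sum_exp = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return -sum(t * (x - log_sum_exp) for x, t in zip(logits, targets))


# 500 classes, as in the sketch dataset; class index 3 is an arbitrary example.
targets = smoothed_targets(true_class=3, num_classes=500)
loss = cross_entropy([0.0] * 500, targets)
```

With smoothing, the true class keeps most of the probability mass while every other class receives a small share, which discourages over-confident predictions.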

Medical Image Segmentation

GitHub

Pixel-level semantic segmentation for 29 bone structures in hand X-ray images.

| Metric | Result | Performance |
| --- | --- | --- |
| Dice Score | 97.64% | Clinical-grade accuracy |
| Training Time | 50% faster | AMP implementation |
| Reliability | Consistent | Cross-patient validation |

Tech: U-Net++, SegFormer, Swin-Transformer, Medical AI
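
For reference, the Dice score reported above measures the overlap between a predicted mask and a ground-truth mask. The sketch below is a minimal pure-Python version on flattened binary masks; the function names and the `eps` smoothing term are assumptions, and the project's own evaluation code may differ.

```python
def dice_score(pred: list[int], target: list[int], eps: float = 1e-7) -> float:
    """Dice = 2 * |A ∩ B| / (|A| + |B|) for flat binary masks (0 or 1 per pixel)."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # eps keeps the ratio defined when both masks are empty
    return (2.0 * intersection + eps) / (total + eps)


def mean_dice(preds: list[list[int]], targets: list[list[int]]) -> float:
    """Average the per-class Dice score, one mask per class (e.g. 29 bone classes)."""
    scores = [dice_score(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


# Toy example: two predicted pixels, one of which overlaps the single true pixel.
example = dice_score([1, 1, 0, 0], [1, 0, 0, 0])
```

In the toy example the intersection is 1 pixel and the masks contain 3 foreground pixels in total, so the score is approximately 2/3.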

Object Detection for Waste Classification

GitHub

Environmental AI system for identifying and categorizing 10 types of recyclable waste materials.

| Achievement | Impact | Technology |
| --- | --- | --- |
| +5% Performance | TTA on Swin Transformer | Weighted Boxes Fusion |
| Real Deployment | Pilot recycling facilities | Edge optimization |
| Data Innovation | Diffusion model augmentation | Class imbalance solution |

Tech: YOLO, Swin-Transformer, Diffusion Models, WBF Ensemble
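
To give a feel for how Weighted Boxes Fusion combines detections from several models or TTA passes, here is a simplified single-class sketch in pure Python. It is an illustration under assumptions, not the project's pipeline: the function names, the `iou_thr=0.55` threshold, and the `(x1, y1, x2, y2)` box format are all hypothetical choices.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def fuse(cluster):
    """Confidence-weighted average of box coordinates; mean confidence as score."""
    total = sum(score for _, score in cluster)
    box = tuple(sum(b[i] * s for b, s in cluster) / total for i in range(4))
    return box, total / len(cluster)


def weighted_boxes_fusion(boxes, scores, num_models, iou_thr=0.55):
    """Simplified single-class WBF over boxes pooled from `num_models` predictors."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    clusters, fused = [], []
    for i in order:
        for k, (fused_box, _) in enumerate(fused):
            if iou(fused_box, boxes[i]) > iou_thr:
                clusters[k].append((boxes[i], scores[i]))
                fused[k] = fuse(clusters[k])  # update the running fused box
                break
        else:
            # No existing cluster overlaps enough: start a new one.
            clusters.append([(boxes[i], scores[i])])
            fused.append((tuple(boxes[i]), scores[i]))
    # Down-weight boxes that only a few of the models agreed on.
    return [
        (box, conf * min(len(c), num_models) / num_models)
        for (box, conf), c in zip(fused, clusters)
    ]


result = weighted_boxes_fusion(
    boxes=[(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)],
    scores=[0.9, 0.7, 0.8],
    num_models=2,
)
```

Unlike non-maximum suppression, which discards all but the top-scoring box in each overlapping group, this approach averages the group, so every prediction contributes to the final coordinates.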

Data-Centric OCR

GitHub

Purely data-centric approach to receipt text detection with a fixed EAST architecture.

| Phase | F1 Score | Improvement | Innovation |
| --- | --- | --- | --- |
| Baseline | 0.20 | - | Raw dataset |
| Optimized | 0.8321 | +315% | Data-centric AI |
| Efficiency | Same quality | 50% faster | Pipeline optimization |

Tech: EAST, Data-Centric AI, Albumentations, Pipeline Optimization

Multimodal LLM Optimization

GitHub

SALMONN-based multimodal large language model, optimized to balance efficiency and performance.

| Optimization | Before | After | Improvement |
| --- | --- | --- | --- |
| Memory Usage | 9.18GB | 5.96GB | -35% |
| Audio Captioning | 0.20 | 0.32 | +58.8% |
| Speech Recognition | 15.2% WER | 14.0% WER | -7.7% error |

Tech: SALMONN, Llama-3, 4-bit Quantization, Flash Attention 2, VB-LoRA
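
The speech recognition row above is reported as word error rate (WER). As a reminder of what that metric measures, here is a minimal pure-Python sketch based on word-level edit distance. The function name and the whitespace tokenization are assumptions; real evaluations usually also normalize casing and punctuation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Levenshtein distance over words, keeping only the previous row in memory.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

Lower is better: a WER of 14.0% means roughly 14 word-level errors per 100 reference words.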


GitHub Analytics

[Images: contribution stats, language distribution, contribution calendar]


Contact

Email LinkedIn Portfolio


Popular repositories

  1. PyTorch (Public)

    Forked from deeplearningzerotoall/PyTorch

    Deep Learning Zero to All - Pytorch

    Jupyter Notebook

  2. cv-21-collaboration (Public)

    Forked from boyamie/cv-21-collaboration

    naver boostcamp cv-21team github study

    HTML

  3. machine-learning (Public)

    Forked from teddylee777/machine-learning

    A repository created to help machine learning beginners and people preparing for a study group.

    Jupyter Notebook

  4. llm-engineer-toolkit (Public)

    Forked from KalyanKS-NLP/llm-engineer-toolkit

    A curated list of 120+ LLM libraries category wise.

  5. prompt-gallery (Public)

    Forked from GENEXIS-AI/prompt-gallery

  6. kimmaru (Public)