wenhuach21/README.md

Wenhua Cheng is a Software Engineer at Intel, with a strong background in large language model (LLM) quantization, model compression, and computer vision. Previously, he worked at Alibaba Cloud as a software engineer and at Intel Labs as a researcher. Wenhua holds a Master’s degree from Zhejiang University and a Bachelor’s degree from Nanjing University of Science and Technology. Wenhua’s expertise spans two main domains:

LLM Compression: As first author, he has contributed methods such as SignRound and SignRoundV2 (rounding optimizations for LLM quantization based on signed gradient descent) and TEQ (Trainable Equivalent Transformation).

Computer Vision: As a co-first author, he won two championships in the DawnBench competition, outperforming teams from Huawei, Google, and others. As the first author, he also ranked 4th and 6th in two tracks of the 2017 ICDAR Scene Text Detection Competition.

Wenhua has filed 21 patents, 11 of which have been granted. Over the past four years, he has contributed to 300+ merged PRs.

Pinned

  1. intel/auto-round Public

    SOTA rounding quantization for high-accuracy low-bit LLM inference. Seamlessly optimized for vLLM, sglang, and CPU/GPU/CUDA with multi-datatype support.

    Python · 919 stars · 90 forks

  2. wenhuach21.github.io Public

    Forked from RayeRen/acad-homepage.github.io

    AcadHomepage: A Modern and Responsive Academic Personal Homepage

    SCSS