
Commit 7d5f636

[Bug fix 5528642] [Bug fix 5528695] VLM NVBug fix (#355)
Signed-off-by: Yue <[email protected]>
1 parent 8b1cedf commit 7d5f636

2 files changed: +2 -1 lines changed

examples/llm_ptq/hf_ptq.py

Lines changed: 1 addition & 0 deletions
@@ -316,6 +316,7 @@ def main(args):
             mtq.quantize(child, disabled_quant_cfg, forward_loop=None)
 
         model = model.language_model
+        model_type = get_model_type(model)
 
     if args.sparsity_fmt != "dense":
         if args.batch_size == 0:

examples/vlm_ptq/scripts/huggingface_example.sh

Lines changed: 1 addition & 1 deletion
@@ -73,7 +73,7 @@ if [ -n "$KV_CACHE_QUANT" ]; then
     PTQ_ARGS+=" --kv_cache_qformat=$KV_CACHE_QUANT "
 fi
 
-if [ "${MODEL_TYPE}" = "vila" ]; then
+if [[ "${MODEL_NAME,,}" == *"vila"* ]]; then
     # Install required dependency for VILA
     pip install -r ../vlm_ptq/requirements-vila.txt
     # Clone original VILA repo
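
The replaced condition in huggingface_example.sh changes from an exact comparison on MODEL_TYPE to a case-insensitive substring match on MODEL_NAME. A minimal sketch of that match in isolation, assuming bash 4.0 or later (needed for the `${var,,}` lowercase expansion). The `is_vila` helper and the sample model names are hypothetical and used only for illustration; they are not part of the script.

```shell
#!/usr/bin/env bash

# Hypothetical helper that wraps the new condition from the diff.
is_vila() {
    local model_name="$1"
    # ${model_name,,} lowercases the value (bash 4.0+).
    # Inside [[ ]], an unquoted * on the right-hand side is a glob,
    # so *"vila"* matches any string containing "vila".
    [[ "${model_name,,}" == *"vila"* ]]
}

# Illustrative names only: mixed case, lowercase, and a non-VILA model.
for name in "VILA1.5-3b" "my-vila-checkpoint" "llava-1.5-7b-hf"; do
    if is_vila "$name"; then
        echo "$name: VILA-specific setup"
    else
        echo "$name: default setup"
    fi
done
```

The old test, `[ "${MODEL_TYPE}" = "vila" ]`, is true only when the variable is exactly the lowercase string `vila`, so a name such as `VILA1.5-3b` would skip the dependency install.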
