Skip to content
This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit e733606

Browse files
authored
Update pull.yml
Remove fp16 and fp32 int4 quantized models for now. @jerryzh168 Not sure why these dtypes are not compatible with int4 quantization?
1 parent cb24aec commit e733606

File tree

1 file changed

+8
-8
lines changed

1 file changed

+8
-8
lines changed

.github/workflows/pull.yml

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -340,10 +340,10 @@ jobs:
340340
fi
341341
echo "::endgroup::"
342342
343-
echo "::group::Run inference with quantize file"
344-
python3 torchchat.py export --output-snap model.tc --dtype float32 --quantize torchchat/quant_config/cuda-32.json --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
345-
python3 torchchat.py generate --snap model.tc --dtype float32 --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
346-
echo "::endgroup::"
343+
# echo "::group::Run inference with quantize file"
344+
# python3 torchchat.py export --output-snap model.tc --dtype float32 --quantize torchchat/quant_config/cuda-32.json --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
345+
# python3 torchchat.py generate --snap model.tc --dtype float32 --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
346+
# echo "::endgroup::"
347347
348348
test-gpu-aoti-float16:
349349
permissions:
@@ -390,10 +390,10 @@ jobs:
390390
fi
391391
echo "::endgroup::"
392392
393-
echo "::group::Run inference with quantize file"
394-
python3 torchchat.py export --output-snap model.tc --dtype float16 --quantize torchchat/quant_config/cuda-32.json --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
395-
python3 torchchat.py generate --snap model.tc --dtype float16 --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
396-
echo "::endgroup::"
393+
# echo "::group::Run inference with quantize file"
394+
# python3 torchchat.py export --output-snap model.tc --dtype float16 --quantize torchchat/quant_config/cuda-32.json --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
395+
# python3 torchchat.py generate --snap model.tc --dtype float16 --checkpoint "./checkpoints/${REPO_NAME}/model.pth"
396+
# echo "::endgroup::"
397397
398398
test-gpu-eval-sanity-check:
399399
permissions:

0 commit comments

Comments
 (0)