fix: Enable quantization and compilation in the same optimization job via ModelBuilder and add validations to prevent compilation for Llama-3.1 on TRTLLM. #482
Triggered via pull request on September 30, 2024 18:36
Status: Success
Total duration: 3m 19s
Artifacts: –