fix: Enable quantization and compilation in the same optimization job via ModelBuilder and add validations to prevent compilation for Llama-3.1 on TRTLLM. #454
Triggered via pull request: September 18, 2024 00:45
Status: Success
Total duration: 3m 41s
Artifacts: none