fix: Enable quantization and compilation in the same optimization job via ModelBuilder and add validations to prevent compilation for Llama-3.1 on TRTLLM. #1730
codebuild-ci.yml
on: pull_request_target
Annotations
1 error
wait-for-approval
Canceling since a higher priority waiting request for 'PR Checks-4875' exists