2 files changed
+2
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
21 | | - | |
| 21 | + | |
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
| |||
Submodule tensorrt_llm updated 95 files
- .github/CODEOWNERS+3
- .github/pull_request_template.md+1
- .github/tava_architecture_diagram.md+108
- .pre-commit-config.yaml+2-2
- ATTRIBUTIONS-CPP-aarch64.md+15.1k
- ATTRIBUTIONS-CPP-x86_64.md+15.0k
- ATTRIBUTIONS-Python.md+45.9k
- constraints.txt+3
- cpp/tensorrt_llm/kernels/communicationKernels/mnnvlTwoShotAllreduceKernels.cu+2
- cpp/tensorrt_llm/kernels/cutlass_kernels/fp8_blockscale_gemm/fp8_blockscale_tma_utils.cuh-4
- cpp/tensorrt_llm/kernels/recoverFromRingAtten.cu+4-5
- cpp/tensorrt_llm/kernels/trtllmGenKernels/gemm/KernelRunner.cpp+6-3
- docker/Dockerfile.multi+4
- docker/common/install_base.sh+1-1
- docs/source/commands/trtllm-serve/run-benchmark-with-trtllm-serve.md+43-24
- docs/source/developer-guide/perf-benchmarking.md+33-10
- docs/source/developer-guide/perf-overview.md+246-229
- docs/source/examples/dynamo_k8s_example.rst+4-14
- docs/source/features/quantization.md+6-5
- docs/source/legacy/reference/support-matrix.md+2-1
- docs/source/models/supported-models.md+2-2
- docs/source/overview.md+12-12
- docs/source/quick-start-guide.md+2
- docs/source/release-notes.md+107-11
- examples/auto_deploy/README.md+3-3
- examples/llm-api/llm_mgmn_llm_distributed.sh+40-2
- examples/llm-api/llm_mgmn_trtllm_bench.sh+38-2
- examples/llm-api/llm_mgmn_trtllm_serve.sh+38-2
- jenkins/current_image_tags.properties+4-4
- requirements.txt+3-2
- security_scanning/docs/poetry.lock+16-16
- security_scanning/docs/pyproject.toml+1-1
- security_scanning/examples/apps/poetry.lock+10-11
- security_scanning/examples/auto_deploy/poetry.lock+19-31
- security_scanning/examples/draft_target_model/poetry.lock+12-24
- security_scanning/examples/eagle/poetry.lock+12-24
- security_scanning/examples/llm-eval/lm-eval-harness/poetry.lock+12-24
- security_scanning/examples/lookahead/poetry.lock+12-24
- security_scanning/examples/medusa/poetry.lock+12-24
- security_scanning/examples/models/contrib/baichuan/poetry.lock+161-51
- security_scanning/examples/models/contrib/bloom/poetry.lock+12-24
- security_scanning/examples/models/contrib/chatglm-6b/poetry.lock+24-36
- security_scanning/examples/models/contrib/chatglm-6b/pyproject.toml+1-1
- security_scanning/examples/models/contrib/chatglm2-6b/poetry.lock+24-36
- security_scanning/examples/models/contrib/chatglm2-6b/pyproject.toml+1-1
- security_scanning/examples/models/contrib/chatglm3-6b-32k/poetry.lock+24-36
- security_scanning/examples/models/contrib/chatglm3-6b-32k/pyproject.toml+1-1
- security_scanning/examples/models/contrib/dbrx/poetry.lock+12-24
- security_scanning/examples/models/contrib/deepseek_v1/poetry.lock+12-24
- security_scanning/examples/models/contrib/deepseek_v2/poetry.lock+12-24
- security_scanning/examples/models/contrib/falcon/poetry.lock+8-8
- security_scanning/examples/models/contrib/gptj/poetry.lock+12-24
- security_scanning/examples/models/contrib/gptneox/poetry.lock+12-24
- security_scanning/examples/models/contrib/grok/poetry.lock+27-39
- security_scanning/examples/models/contrib/hyperclovax/poetry.lock+11-23
- security_scanning/examples/models/contrib/internlm/poetry.lock+12-24
- security_scanning/examples/models/contrib/jais/poetry.lock+12-24
- security_scanning/examples/models/contrib/mmdit/poetry.lock+112-18
- security_scanning/examples/models/contrib/mpt/poetry.lock+12-24
- security_scanning/examples/models/contrib/opt/poetry.lock+12-24
- security_scanning/examples/models/contrib/skywork/poetry.lock+12-24
- security_scanning/examples/models/contrib/smaug/poetry.lock+12-24
- security_scanning/examples/models/contrib/stdit/poetry.lock+168-167
- security_scanning/examples/models/contrib/stdit/pyproject.toml+2-2
- security_scanning/examples/models/core/commandr/poetry.lock+12-24
- security_scanning/examples/models/core/gemma/poetry.lock+29-41
- security_scanning/examples/models/core/glm-4-9b/poetry.lock+24-36
- security_scanning/examples/models/core/glm-4-9b/pyproject.toml+1-1
- security_scanning/examples/models/core/gpt/poetry.lock+12-24
- security_scanning/examples/models/core/llama/poetry.lock+8-8
- security_scanning/examples/models/core/mamba/poetry.lock+8-8
- security_scanning/examples/models/core/mixtral/poetry.lock+9-9
- security_scanning/examples/models/core/mllama/poetry.lock+12-12
- security_scanning/examples/models/core/nemotron/poetry.lock+12-24
- security_scanning/examples/models/core/phi/poetry.lock+12-24
- security_scanning/examples/models/core/qwen/poetry.lock+244-253
- security_scanning/examples/models/core/qwen/pyproject.toml+2-2
- security_scanning/examples/models/core/qwen2audio/poetry.lock+8-8
- security_scanning/examples/models/core/qwenvl/poetry.lock+273-171
- security_scanning/examples/models/core/qwenvl/pyproject.toml+1-1
- security_scanning/examples/models/core/recurrentgemma/poetry.lock+19-19
- security_scanning/examples/models/core/whisper/poetry.lock+57-57
- security_scanning/examples/models/core/whisper/pyproject.toml+1-1
- security_scanning/examples/ngram/poetry.lock+12-24
- security_scanning/examples/quantization/poetry.lock+161-51
- security_scanning/examples/redrafter/poetry.lock+12-24
- security_scanning/examples/trtllm-eval/poetry.lock+12-24
- security_scanning/poetry.lock+248-226
- security_scanning/pyproject.toml+7-6
- security_scanning/tests/integration/defs/perf/poetry.lock+112-120
- security_scanning/tests/integration/defs/perf/pyproject.toml+1-1
- security_scanning/triton_backend/poetry.lock+9-9
- tensorrt_llm/_torch/modules/fused_moe/fused_moe_trtllm_gen.py+18-10
- tensorrt_llm/_torch/pyexecutor/py_executor_creator.py+2-1
- tests/integration/test_lists/waives.txt+1-1
0 commit comments