Skip to content

Commit 4d1eb0c

Browse files
Major improvement of READMEs and documentation
Signed-off-by: Keval Morabia <[email protected]>
1 parent 9e68994 commit 4d1eb0c

File tree

39 files changed

+1875
-1547
lines changed

39 files changed

+1875
-1547
lines changed

README.md

Lines changed: 60 additions & 108 deletions
Large diffs are not rendered by default.
284 KB
Loading

docs/source/deployment/3_unified_hf.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -95,7 +95,7 @@ Deployment with Selected Inference Frameworks
9595

9696
.. tab:: TensorRT-LLM
9797

98-
Follow the `TensorRT-LLM installation instructions. <https://nvidia.github.io/TensorRT-LLM/installation/linux.html>`_
98+
Follow the `TensorRT-LLM installation instructions. <https://nvidia.github.io/TensorRT-LLM/quick-start-guide.html#installation>`_
9999

100100
Currently we support fp8 and nvfp4 quantized models for TensorRT-LLM deployment, you need v0.17.0 or later version of TensorRT-LLM.
101101

docs/source/getting_started/3_quantization.rst

Lines changed: 0 additions & 70 deletions
This file was deleted.

docs/source/getting_started/4_quantization_windows.rst

Lines changed: 0 additions & 65 deletions
This file was deleted.

docs/source/getting_started/5_pruning.rst

Lines changed: 0 additions & 101 deletions
This file was deleted.

docs/source/getting_started/6_distillation.rst

Lines changed: 0 additions & 115 deletions
This file was deleted.

0 commit comments

Comments
 (0)