1 | 1 | nav: |
2 | 2 | - Home: index.md |
3 | | - - Why use LLM Compressor?: getting-started/why-llmcompressor.md |
4 | | - - Choosing the right compression scheme: getting-started/choosing-scheme.md |
5 | | - - Choosing the right compression algorithm: getting-started/choosing-algo.md |
| 3 | + - Why use LLM Compressor?: steps/why-llmcompressor.md |
| 4 | + - Compressing your model, step-by-step: |
| 5 | + - Choosing your model: steps/choosing-model.md |
| 6 | + - Choosing the right compression scheme: steps/choosing-scheme.md |
| 7 | + - Choosing the right compression algorithm: steps/choosing-algo.md |
| 8 | + - Choosing a dataset: steps/choosing-dataset.md |
| 9 | + - Compressing your model: steps/compress.md |
| 10 | + - Deploying with vLLM: steps/deploy.md |
6 | 11 | - Getting started: |
7 | 12 | - getting-started/index.md |
8 | 13 | - Installing LLM Compressor: getting-started/install.md |
9 | | - - Compressing your Model: getting-started/compress.md |
10 | | - - Deploying with vLLM: getting-started/deploy.md |
11 | | - - FAQ: getting-started/faq.md |
12 | 14 | - Key Models: |
13 | 15 | - key-models/index.md |
14 | 16 | - Llama 4: |
|
24 | 26 | - key-models/mistral-large-3/index.md |
25 | 27 | - FP8 Example: key-models/mistral-large-3/fp8-example.md |
26 | 28 | - Guides: |
| 29 | + - Big Models and Distributed Support: |
| 30 | + - Model Loading: guides/big_models_and_distributed/model_loading.md |
| 31 | + - Sequential Onloading: guides/big_models_and_distributed/sequential_onloading.md |
| 32 | + - Distributed Oneshot: guides/big_models_and_distributed/distributed_oneshot.md |
27 | 33 | - Compression Schemes: guides/compression_schemes.md |
28 | 34 | - Saving a Model: guides/saving_a_model.md |
29 | | - - Observers: observers.md |
| 35 | + - Observers: guides/observers.md |
| 36 | + - Memory Requirements: guides/memory.md |
| 37 | + - Runtime Performance: guides/runtime.md |
30 | 38 | - Examples: |
31 | | - - examples/index.md |
| 39 | + - examples/README.md |
32 | 40 | - examples/* |
| 41 | + - Experimental: |
| 42 | + - experimental/README.md |
| 43 | + - experimental/* |
33 | 44 | - Developer: |
34 | 45 | - developer/index.md |
35 | 46 | - developer/* |
36 | 47 | - API Reference: |
37 | 48 | - api/* |
| 49 | + - FAQ: |
| 50 | + - faq/faq.md |