You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|[Activation Sparsity](./src/nncf/experimental/torch/sparsify_activations/ActivationSparsity.md)| Not supported | Experimental | Not supported| Not supported| Not supported |
49
49
50
50
### Training-Time Compression Algorithms
@@ -409,9 +409,9 @@ A list of notebooks demonstrating OpenVINO conversion and inference together wit
|[Activation Sparsity](https://github.com/openvinotoolkit/nncf/blob/develop/src/nncf/experimental/torch/sparsify_activations/ActivationSparsity.md)| Not supported | Experimental | Not supported| Not supported| Not supported |
Copy file name to clipboardExpand all lines: docs/usage/post_training_compression/weights_compression/Usage.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@
17
17
18
18
The Weights Compression algorithm is aimed at compressing the weights of the models and can be used to optimize the model footprint and performance of large models where the size of weights is relatively larger than the size of activations, for example, Large Language Models (LLM). The algorithm compresses weights for Linear, Convolution and Embedding layers.
19
19
20
-
[OpenVINO](https://github.com/openvinotoolkit/openvino) is the preferred backend to run Weights Compression with. PyTorch and Torch FX are also supported.
20
+
[OpenVINO](https://github.com/openvinotoolkit/openvino) is the preferred backend to run Weights Compression with. PyTorch, ONNX and Torch FX are also supported.
21
21
22
22
### Supported modes
23
23
@@ -696,7 +696,7 @@ Accuracy/footprint trade-off for `microsoft/Phi-3-mini-4k-instruct`:
696
696
697
697
### Limitations
698
698
699
-
- The algorithm is supported for OpenVINO, PyTorch and Torch FX models.
699
+
- The algorithm is supported for OpenVINO, PyTorch, ONNX and Torch FX models.
700
700
- The compression applies in-place.
701
701
- The compressed model is not trainable.
702
702
- INT4_SYM, INT4_ASYM, NF4 and E2M1 modes, grouped quantization and mixed precision selection is available for OpenVINO backend only.
0 commit comments