You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-4Lines changed: 6 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,7 +10,7 @@
10
10
11
11
## Introduction
12
12
13
-
FMS Model Optimizer is a framework for developing reduced precision neural network models. Quantization techniques, such as [quantization-aware-training (QAT)](https://arxiv.org/abs/2407.11062), [post-training quantization (PTQ)](https://arxiv.org/abs/2102.05426), and several other optimization techniques on popular deep learning workloads are supported.
13
+
FMS Model Optimizer is a framework for developing reduced precision neural network models. [Quantization](https://www.ibm.com/think/topics/quantization) techniques, such as [quantization-aware-training (QAT)](https://arxiv.org/abs/2407.11062), [post-training quantization (PTQ)](https://arxiv.org/abs/2102.05426), and several other optimization techniques on popular deep learning workloads are supported.
14
14
15
15
## Highlights
16
16
@@ -37,10 +37,11 @@ FMS Model Optimizer is a framework for developing reduced precision neural netwo
37
37
38
38
1.**🐧 Linux system with Nvidia GPU (V100/A100/H100)**
39
39
2. Python 3.9 to Python 3.11
40
+
40
41
📋 Python 3.12 is currently not supported due to PyTorch Dynamo constraint
41
42
3. CUDA >=12
42
43
43
-
*Optional packages based on optimization functionalities required:*
44
+
*Optional packages based on optimization functionality required:*
44
45
45
46
-**GPTQ** is a popular compression method for LLMs:
46
47
-[auto_gptq](https://pypi.org/project/auto-gptq/) or build from [source](https://github.com/AutoGPTQ/AutoGPTQ)
@@ -53,15 +54,16 @@ FMS Model Optimizer is a framework for developing reduced precision neural netwo
- To enable compute graph plotting function (mostly for troubleshooting purpose):
57
+
-[matplotlib](https://matplotlib.org/)
56
58
-[graphviz](https://graphviz.org/)
57
59
-[pygraphviz](https://pygraphviz.github.io/)
58
60
59
61
> [!NOTE]
60
-
> PyTorch version should be < 2.4 if you would like to experiment deployment with external INT8 kernel.
62
+
> PyTorch version should be < 2.5 if you would like to experiment deployment with external INT8 kernel.
61
63
62
64
### Installation
63
65
64
-
We recommend using a Python virtual environment with Python 3.10+. Here is how to setup a virtual environment using [Python venv](https://docs.python.org/3/library/venv.html):
66
+
We recommend using a Python virtual environment with Python 3.9+. Here is how to setup a virtual environment using [Python venv](https://docs.python.org/3/library/venv.html):
0 commit comments