Skip to content

Commit 5e8d499

Browse files
authored
Merge pull request #40 from hickeyma/doc/readme-update
Small updates to the docs
2 parents e53751d + dc663eb commit 5e8d499

File tree

3 files changed

+23
-6
lines changed

3 files changed

+23
-6
lines changed

.spellcheck-en-custom.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -50,6 +50,7 @@ lossy
5050
LSTM
5151
matmul
5252
matmuls
53+
matplotlib
5354
maxperCh
5455
maxpertoken
5556
Miniforge

README.md

Lines changed: 21 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010

1111
## Introduction
1212

13-
FMS Model Optimizer is a framework for developing reduced precision neural network models. Quantization techniques, such as [quantization-aware-training (QAT)](https://arxiv.org/abs/2407.11062), [post-training quantization (PTQ)](https://arxiv.org/abs/2102.05426), and several other optimization techniques on popular deep learning workloads are supported.
13+
FMS Model Optimizer is a framework for developing reduced precision neural network models. [Quantization](https://www.ibm.com/think/topics/quantization) techniques, such as [quantization-aware-training (QAT)](https://arxiv.org/abs/2407.11062), [post-training quantization (PTQ)](https://arxiv.org/abs/2102.05426), and several other optimization techniques on popular deep learning workloads are supported.
1414

1515
## Highlights
1616

@@ -37,10 +37,11 @@ FMS Model Optimizer is a framework for developing reduced precision neural netwo
3737

3838
1. **🐧 Linux system with Nvidia GPU (V100/A100/H100)**
3939
2. Python 3.9 to Python 3.11
40+
4041
📋 Python 3.12 is currently not supported due to PyTorch Dynamo constraint
4142
3. CUDA >=12
4243

43-
*Optional packages based on optimization functionalities required:*
44+
*Optional packages based on optimization functionality required:*
4445

4546
- **GPTQ** is a popular compression method for LLMs:
4647
- [auto_gptq](https://pypi.org/project/auto-gptq/) or build from [source](https://github.com/AutoGPTQ/AutoGPTQ)
@@ -50,9 +51,10 @@ FMS Model Optimizer is a framework for developing reduced precision neural netwo
5051
- Clone the [CUTLASS](https://github.com/NVIDIA/cutlass) repository
5152
- `PyTorch 2.3.1` (as newer version will cause issue for the custom CUDA kernel used in these examples)
5253
- **FP8** is a reduced precision format like **INT8**:
53-
- Nvidia H100 family or higher
54+
- Nvidia A100 family or higher
5455
- [llm-compressor](https://github.com/vllm-project/llm-compressor)
5556
- To enable compute graph plotting function (mostly for troubleshooting purpose):
57+
- [matplotlib](https://matplotlib.org/)
5658
- [graphviz](https://graphviz.org/)
5759
- [pygraphviz](https://pygraphviz.github.io/)
5860

@@ -61,7 +63,7 @@ FMS Model Optimizer is a framework for developing reduced precision neural netwo
6163
6264
### Installation
6365

64-
We recommend using a Python virtual environment with Python 3.10+. Here is how to setup a virtual environment using [Python venv](https://docs.python.org/3/library/venv.html):
66+
We recommend using a Python virtual environment with Python 3.9+. Here is how to setup a virtual environment using [Python venv](https://docs.python.org/3/library/venv.html):
6567

6668
```
6769
python3 -m venv fms_mo_venv
@@ -71,7 +73,21 @@ source fms_mo_venv/bin/activate
7173
> [!TIP]
7274
> If you use [pyenv](https://github.com/pyenv/pyenv), [Conda Miniforge](https://github.com/conda-forge/miniforge) or other such tools for Python version management, create the virtual environment with that tool instead of venv. Otherwise, you may have issues with installed packages not being found as they are linked to your Python version management tool and not `venv`.
7375
74-
To install `fms_mo` package from source:
76+
There are 2 ways to install the FMS Model Optimizer as follows:
77+
78+
#### From Release
79+
80+
To install from release ([PyPi package](https://pypi.org/project/fms-model-optimizer/)):
81+
82+
```shell
83+
python3 -m venv fms_mo_venv
84+
source fms_mo_venv/bin/activate
85+
pip install fms-model-optimizer
86+
```
87+
88+
#### From Source
89+
90+
To install from source(GitHub Repository):
7591

7692
```shell
7793
python3 -m venv fms_mo_venv

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ classifiers=[
2121
]
2222
dynamic = ["version"]
2323
dependencies = [
24-
"torch >=2.1, <2.5",
24+
"torch >=2.1, <2.4",
2525
"numpy",
2626
"tqdm",
2727
"tensorboard",

0 commit comments

Comments
 (0)