Commit 5062a7e

Update after review
Review comments:
- #40 (review)

Signed-off-by: Martin Hickey <[email protected]>
1 parent ed79ecd commit 5062a7e

3 files changed, +4 -4 lines changed

README.md

Lines changed: 2 additions & 2 deletions
@@ -51,15 +51,15 @@ FMS Model Optimizer is a framework for developing reduced precision neural netwo
 - Clone the [CUTLASS](https://github.com/NVIDIA/cutlass) repository
 - `PyTorch 2.3.1` (as newer version will cause issue for the custom CUDA kernel used in these examples)
 - **FP8** is a reduced precision format like **INT8**:
-- Nvidia H100 family or higher
+- Nvidia A100 family or higher
 - [llm-compressor](https://github.com/vllm-project/llm-compressor)
 - To enable compute graph plotting function (mostly for troubleshooting purpose):
 - [matplotlib](https://matplotlib.org/)
 - [graphviz](https://graphviz.org/)
 - [pygraphviz](https://pygraphviz.github.io/)

 > [!NOTE]
-> PyTorch version should be < 2.5 if you would like to experiment deployment with external INT8 kernel.
+> PyTorch version should be < 2.4 if you would like to experiment deployment with external INT8 kernel.

 ### Installation

examples/FP8_QUANT/README.md

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ This is an example of mature FP8, which under the hood leverages some functional
 ## Requirements

 - [FMS Model Optimizer requirements](../../README.md#requirements)
-- Nvidia H100 family or higher
+- Nvidia A100 family or higher
 - The [llm-compressor](https://github.com/vllm-project/llm-compressor) library can be installed using pip:

 ```bash

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ classifiers=[
 ]
 dynamic = ["version"]
 dependencies = [
-"torch >=2.1, <2.5",
+"torch >=2.1, <2.4",
 "numpy",
 "tqdm",
 "tensorboard",
