Skip to content

Commit 51140cf

Browse files
Update README.md
Signed-off-by: chichun-charlie-liu <[email protected]>
1 parent e8cc8e8 commit 51140cf

File tree

1 file changed

+17
-23
lines changed

1 file changed

+17
-23
lines changed

examples/GPTQ/README.md

Lines changed: 17 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -62,10 +62,10 @@ This end-to-end example utilizes the common set of interfaces provided by `fms_m
6262
6363
```
6464
layer mem (MB)
65-
dtype
66-
torch.float16 224 109.051904
67-
torch.float32 67 4203.757568
68-
torch.int32 672 3521.904640
65+
dtype
66+
torch.bfloat16 67 2101.878784
67+
torch.float16 224 109.051904
68+
torch.int32 672 3521.904640
6969
```
7070
7171
4. **Evaluate the quantized model**'s performance on a selected task using `lm-eval` library, the command below will run evaluation on [`lambada_openai`](https://huggingface.co/datasets/EleutherAI/lambada_openai) task and show the perplexity/accuracy at the end.
@@ -82,29 +82,23 @@ This end-to-end example utilizes the common set of interfaces provided by `fms_m
8282
## Example Test Results
8383
8484
- Unquantized Model
85-
```bash
86-
|Model | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
87-
|------------|--------------|------:|------|-----:|----------|---|-----:|---|-----:|
88-
| LLAMA3-8B |lambada_openai| 1|none | 5|acc |↑ |0.7103|± |0.0063|
89-
| | | |none | 5|perplexity|↓ |3.7915|± |0.0727|
90-
```
85+
|Model | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
86+
|------------|--------------|------:|------|-----:|----------|---|-----:|---|-----:|
87+
| LLAMA3-8B |lambada_openai| 1|none | 5|acc |↑ |0.7103|± |0.0063|
88+
| | | |none | 5|perplexity|↓ |3.7915|± |0.0727|
9189
9290
- Quantized model with the settings showed above (`desc_act` default to False.)
93-
```bash
94-
|Model | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
95-
|------------|--------------|------:|------|-----:|----------|---|------:|---|-----:|
96-
| LLAMA3-8B |lambada_openai| 1|none | 5|acc ||0.4271 |± |0.0069|
97-
| | | |none | 5|perplexity||39.2316|± |2.2090|
98-
```
99-
91+
|Model | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
92+
|------------|--------------|------:|------|-----:|----------|---|------:|---|-----:|
93+
| LLAMA3-8B |lambada_openai| 1|none | 5|acc |↑ |0.6365 |± |0.0067|
94+
| | | |none | 5|perplexity|↓ |5.9307 |± |0.1830|
10095
10196
- Quantized model with `desc_act` set to `True` (could improve the model quality, but at the cost of inference speed.)
102-
```bash
103-
|Model | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
104-
|------------|--------------|------:|------|-----:|----------|---|------:|---|-----:|
105-
| LLAMA3-8B |lambada_openai| 1|none | 5|acc ||0.6193 |± |0.0068|
106-
| | | |none | 5|perplexity||5.8879 |± |0.1546|
107-
```
97+
|Model | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
98+
|------------|--------------|------:|------|-----:|----------|---|------:|---|-----:|
99+
| LLAMA3-8B |lambada_openai| 1|none | 5|acc |↑ |0.6193 |± |0.0068|
100+
| | | |none | 5|perplexity|↓ |5.8879 |± |0.1546|
101+
108102
> [!NOTE]
109103
> There is some randomness in generating the model and data, the resulting accuracy may vary ~$\pm$ 0.05.
110104

0 commit comments

Comments
 (0)