Conversation

@TedThemistokleous
Contributor

Add an MIGraphX-based quantization test to the inference examples repo

Ted Themistokleous and others added 15 commits January 22, 2024 22:43
Added additional pieces with argparse to select which version of SQuAD we want to test, the batch size, the sequence length, and some other useful options.
…runs

Seeing stall on larger batch sizes. Adding flags for debugging.
- Allow for mixed precision via an fp16 flag
- Allow for a variable batch size
- Allow for a variable calibration data size
Running into an issue with shapes in the calibration tools if I break up this calibration read. It needs a large amount of memory to create the histogram.
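A minimal sketch of the knobs the commits above describe: a SQuAD version selector, batch size, sequence length, an fp16 flag, and a cap on how many samples are fed to the histogram-based calibrator so it does not exhaust memory. The flag names and the reader class are assumptions for illustration, not the exact script in this PR.

```python
import argparse
from onnxruntime.quantization import CalibrationDataReader

parser = argparse.ArgumentParser(description="BERT SQuAD quantization example (illustrative)")
parser.add_argument("--version", choices=["1.1", "2.0"], default="1.1",
                    help="SQuAD dataset version to test against")
parser.add_argument("--batch_size", type=int, default=1)
parser.add_argument("--sequence_length", type=int, default=384)
parser.add_argument("--fp16", action="store_true",
                    help="run the unquantized portions in mixed precision")
parser.add_argument("--calibration_samples", type=int, default=100,
                    help="number of samples fed to the calibrator; histogram "
                         "calibration can need a large amount of memory")
args = parser.parse_args()


class BertCalibrationReader(CalibrationDataReader):
    """Feeds a bounded number of pre-tokenized SQuAD samples to the calibrator."""

    def __init__(self, features, limit):
        # `features` is assumed to be a list of dicts keyed by model input name.
        self.data = iter(features[:limit])

    def get_next(self):
        # Return None when exhausted, as the calibrator expects.
        return next(self.data, None)
```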
@TedThemistokleous
Contributor Author

ping @cloudhan @PeixuanZuo @ytaous


@cloudhan cloudhan left a comment

Some unresolved conflicts have leaked into your commit.

@cloudhan cloudhan requested a review from tianleiwu June 12, 2024 03:22
@cloudhan

This comment was marked as resolved.

@cloudhan cloudhan requested a review from faxu June 19, 2024 13:49
@tianleiwu tianleiwu requested a review from yufenglee June 24, 2024 18:12
TedThemistokleous and others added 4 commits July 3, 2024 18:11
Useful if we want to use another flavor of BERT for now.

TODO: need to handle/fix some of the input/output argument maps versus the input data and the model inputs/outputs.
Another knob to tune and play with in perf runs. For now, just leave this at the default.
…o handle features in each example

Our MIGraphX EP requires recompiling the model if we keep changing the input dimensions or the batch size. Without this, the larger batch-size runs actually slow down, since we tend to go above the feature index.

A workaround is to keep the batch size constant as we feed data into the model under test: repeat the same sample until we have enough data to fill a full batch, then collect inference timing and accuracy results.
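A minimal sketch of that workaround, assuming a plain list of samples and names chosen for illustration: every batch is padded to the same fixed size by repeating the last sample, so the MIGraphX EP never sees a new input shape and never has to recompile.

```python
def make_fixed_size_batches(samples, batch_size):
    """Yield lists of exactly `batch_size` samples, padding the tail by repetition."""
    batch = []
    for sample in samples:
        batch.append(sample)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Repeat the last sample so the final (partial) batch keeps the same shape.
        batch.extend([batch[-1]] * (batch_size - len(batch)))
        yield batch
```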
@cloudhan cloudhan removed their request for review July 5, 2024 00:58
@TedThemistokleous TedThemistokleous force-pushed the add_migx_bert_squad_quant_example branch from feec95d to a258158 on August 9, 2024 18:42
@tianleiwu

@cloudhan, please help run the example end-to-end to see whether it works.

@cloudhan

@tianleiwu We don't work on AMD related EPs anymore.

@tianleiwu tianleiwu dismissed cloudhan’s stale review October 24, 2024 17:31

cloudhan doesn't work on the EP now.

@tianleiwu tianleiwu enabled auto-merge (squash) October 24, 2024 17:31
@TedThemistokleous
Copy link
Contributor Author

> @tianleiwu We don't work on AMD related EPs anymore.

That's news to me. When did that occur? Is this something we need to also bring up with our other devs?

@tianleiwu tianleiwu merged commit daefee3 into microsoft:main Oct 25, 2024
4 checks passed