
Conversation

daniil-lyakhov (Owner)

Summary

[PLEASE REMOVE] See CONTRIBUTING.md's Pull Requests for ExecuTorch PR guidelines.

[PLEASE REMOVE] If this PR closes an issue, please add a Fixes #<issue-id> line.

[PLEASE REMOVE] If this PR introduces a fix or feature that should be in the upcoming release notes, please add a "Release notes: " label. For a list of available release notes labels, check out CONTRIBUTING.md's Pull Requests.

Test plan

[PLEASE REMOVE] How did you test this PR? Please write down any manual commands you used and note down tests that you have written if applicable.

Enable model quantization. Default is False.

- **`--dataset`** (optional):
Path to the calibration dataset. TODO: decide in what form the dataset should be supported. For the experiment, tiny-imagenet is used; it can be downloaded from http://cs231n.stanford.edu/tiny-imagenet-200.zip, and the path to the extracted directory is passed via this option (see the loading sketch below).


?

daniil-lyakhov (Owner Author)


Fixed

default="CPU",
help="Target device for compiling the model (e.g., CPU, GPU). Default is CPU.",
)


What do you think about adding a --quantization_flow argument that would take the values "pt2e" and "nncf" to specify which flow should be used during quantization?

daniil-lyakhov (Owner Author)


Done, please check
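For reference, the resulting option might look roughly like this (a sketch; the parser variable and the default value are assumptions, not confirmed by the PR):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--quantization_flow",
    choices=["pt2e", "nncf"],
    default="nncf",  # assumed default, not confirmed by the PR
    help="Quantization flow to use: 'pt2e' or 'nncf'.",
)
```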

if suite == "torchvision":
transform = torchvision_models.get_model_weights(model_name).DEFAULT.transforms()
else:
transform = create_transform(**resolve_data_config(model.pretrained_cfg, model=model))


Does it work for the huggingface suite?

daniil-lyakhov (Owner Author)


No, an exception is added for that case.
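The guard presumably looks something like this (a sketch; the suite name check and the message are assumptions):

```python
if suite == "huggingface":
    msg = "Calibration transforms are not supported for the huggingface suite."
    raise ValueError(msg)
```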

elif isinstance(input_shape, list):
    input_shape = tuple(input_shape)
else:
    msg = "Input shape must be a list or tuple."


It looks like if input_shape is a tuple, we end up in this error branch.

daniil-lyakhov (Owner Author)


Done
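The fixed branching presumably accepts tuples as-is (a sketch of the fix; variable names follow the excerpt above):

```python
if isinstance(input_shape, tuple):
    pass  # already in the expected form
elif isinstance(input_shape, list):
    input_shape = tuple(input_shape)
else:
    msg = "Input shape must be a list or tuple."
    raise ValueError(msg)
```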

# Export the model to the ATen dialect
aten_dialect: ExportedProgram = export(model, example_args)

if quantize:


Could you implement the quantization flow in a separate function?

daniil-lyakhov (Owner Author)


Done
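One way the extracted helper could look, using the standard PT2E entry points (a sketch; the PR may route through NNCF's quantize_pt2e instead, and the function name, arguments, and calibration loop below are assumptions):

```python
from torch.ao.quantization.quantize_pt2e import convert_pt2e, prepare_pt2e


def quantize_model(captured_model, quantizer, calibration_loader):
    """Post-training quantization of a captured graph with a PT2E quantizer."""
    prepared = prepare_pt2e(captured_model, quantizer)
    for sample, _target in calibration_loader:
        prepared(sample)  # run calibration on representative inputs
    return convert_pt2e(prepared)
```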

print(f"Model exported and saved as {model_name}.pte on {device}.")
print(f"Model exported and saved as {model_file_name} on {device}.")

if validate:


Could you implement validation in a separate function?

daniil-lyakhov (Owner Author)


Done
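Similarly, a standalone validation helper could compute top-1 accuracy (a sketch; the helper name and the metric are assumptions):

```python
import torch


def validate_model(model, val_loader) -> float:
    """Top-1 accuracy of `model` over `val_loader`."""
    correct = total = 0
    with torch.no_grad():
        for images, targets in val_loader:
            predictions = model(images).argmax(dim=1)
            correct += (predictions == targets).sum().item()
            total += targets.numel()
    return correct / total
```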

from nncf.common.graph.graph import NNCFGraph

QUANT_ANNOTATION_KEY = "quantization_annotation"


I suggest introducing the following class:

class QuantizationMode(StrEnum):
    """
    Defines special quantization modes.
 
    - INT8_SYM: INT8 symmetric quantization for both activations and weights.
    - INT8_MIXED: INT8 asymmetric quantization for activations, symmetric for weights.
    - INT8_TRANSFORMER: Optimized INT8 quantization for transformer-based models.
    """
 
    INT8_SYM = "int8_sym"
    INT8_MIXED = "int8_mixed"
    INT8_TRANSFORMER = "int8_transformer"

daniil-lyakhov (Owner Author)


Done

Comment on lines 36 to 54
def __init__(
    self,
    *,
    mode: Optional[p.QuantizationMode] = None,
    preset: Optional[q.structs.QuantizationPreset] = None,
    target_device: p.TargetDevice = p.TargetDevice.ANY,
    transformer_model: bool = False,
    ignored_scope: Optional[nncf.IgnoredScope] = None,
    overflow_fix: Optional[advanced_p.OverflowFix] = None,
    quantize_outputs: bool = False,
    activations_quantization_params: Optional[advanced_p.QuantizationParameters] = None,
    weights_quantization_params: Optional[advanced_p.QuantizationParameters] = None,
):


What do you think?

Suggested change
def __init__(
    self,
    *,
    mode: Optional[p.QuantizationMode] = None,
    preset: Optional[q.structs.QuantizationPreset] = None,
    target_device: p.TargetDevice = p.TargetDevice.ANY,
    transformer_model: bool = False,
    ignored_scope: Optional[nncf.IgnoredScope] = None,
    overflow_fix: Optional[advanced_p.OverflowFix] = None,
    quantize_outputs: bool = False,
    activations_quantization_params: Optional[advanced_p.QuantizationParameters] = None,
    weights_quantization_params: Optional[advanced_p.QuantizationParameters] = None,
):

def __init__(
    self,
    *,
    mode: QuantizationMode = QuantizationMode.INT8_MIXED,
    ignored_scope: Optional[nncf.IgnoredScope] = None,
    **kwargs
):

daniil-lyakhov (Owner Author)


I like the constructor parameter subset, refactored.

daniil-lyakhov (Owner Author)


Also, I set the default QuantizationMode to INT8_SYM to align with the default MinMax parameters (which were the defaults before): https://github.com/openvinotoolkit/nncf/blob/develop/nncf/quantization/algorithms/min_max/algorithm.py#L211-L215

daniil-lyakhov force-pushed the dl/openvino/model_enabling branch from ddcbb11 to 01b88f8 on February 11, 2025 17:01
self,
*,
mode: Optional[QuantizationMode] = QuantizationMode.INT8_SYM,
ignored_scope: Optional[nncf.IgnoredScope] = None,


Ignored scope is model-specific information. What do you think about introducing an additional method to set the ignored scope?

def set_ignored_scope(
    names: Optional[List[str]] = None,
    patterns: Optional[List[str]] = None,
    types: Optional[List[str]] = None,
    subgraphs: Optional[List[Tuple[List[str], List[str]]]] = None,
    validate: bool = True,
)

daniil-lyakhov (Owner Author)


Done
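Hypothetical usage of the new method (the quantizer construction and the ignored-scope values below are illustrative assumptions, not taken from the PR):

```python
quantizer = OpenVINOQuantizer(mode=QuantizationMode.INT8_SYM)
quantizer.set_ignored_scope(
    types=["mul", "sub"],  # skip these op types entirely
    names=["conv2d_3"],    # skip a specific node by name
)
```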

daniil-lyakhov force-pushed the dl/openvino/model_enabling branch 2 times, most recently from 573f316 to c499e3c on February 12, 2025 14:08
daniil-lyakhov force-pushed the dl/openvino/model_enabling branch 2 times, most recently from fa6744b to c7e0758 on February 12, 2025 14:31
alexsu52 pushed a commit to openvinotoolkit/nncf that referenced this pull request Feb 24, 2025
### Changes

Mark quantizer and quantize_pt2e as API

### Reason for changes

To introduce `OpenVINOQuantizer` and `quantize_pt2e` in the API docs:
https://openvinotoolkit.github.io/nncf/index.html

### Related tickets

daniil-lyakhov/executorch#2
daniil-lyakhov pushed a commit that referenced this pull request Jun 4, 2025
Differential Revision: D75034439

Pull Request resolved: pytorch#11011
daniil-lyakhov pushed a commit that referenced this pull request Jun 11, 2025
Differential Revision: D75982351

Pull Request resolved: pytorch#11456